Multi-Threaded Integer Performance

How do the new Xeon E7 v2 chips compare to the existing Xeon servers when it comes to some multi-threaded workloads?

LZMA performance: compression

When it comes to raw integer processing, the new Xeon delivers up 70% better performance than the previous generation and up to 2.3x better performance than the Xeon X7560. With only 12 cores active to simulate performance of the 12-core models, e.g. E7-8857 v2 and E7-4860 v2, we can get a rough idea how the interesting 3GHz 12-core E7-8857 v2 performs, which has the same TDP (130W) as the previous generation. In those circumstances the new Xeon E7 v2 is 50% faster than the previous generation and twice as fast as the Xeon 7560 (and the actual E7-8857 v2 will be clocked slightly higher).

LZMA performance: decompression

Decompression gives a similar performance landscape, though the E5-2680 now drops below the X7560.

Integer Performance Application Development: Linux Kernel Compile
Comments Locked

125 Comments

View All Comments

  • Kevin G - Friday, February 21, 2014 - link

    And a quick addition:

    There will indeed be a quick adoption to Haswell-EX not because of AVX2 or DDR4 but rather transactional memory support (TSX). For the large databases and applications these systems are targeted at, TSX should prove to be helpful.
  • TiGr1982 - Friday, February 21, 2014 - link

    I agree, TSX should make a lot of sense for these E7's - they have a huge core count and huge shared memory at the same time.
  • Schmide - Friday, February 21, 2014 - link

    I think your L3 latency numbers are off. I think typical Intel L3 latencies are 30-40 clocks ~3-4ns.
  • Schmide - Friday, February 21, 2014 - link

    Oops my bad i miss used the calculator. Ignore.
  • dylan522p - Friday, February 21, 2014 - link

    No power consumption numbers?
  • JohanAnandtech - Saturday, February 22, 2014 - link

    Coming...we had to run lots of test in parallel, so it was not possible to make sure all systems were similar. Also we should test with workloads that require a lot more memory to get an idea.
  • mslasm - Friday, February 21, 2014 - link

    Note that E7-8857 v2 has 12 cores but no HT, so only has 12 threads as well (see http://ark.intel.com/products/75254/Intel-Xeon-Pro... Thus it is not equivalent to a 3Ghz E7-4860V2, as 4860 has HT for a total of 24 threads

    Also, there must be a typo either in the graph or in the text on the "single thread" integer performance test: "Opteron ... at 2.4GHz would deliver about 2481 MIPs", while - according to the graph - it already delivers 2636 @ 2.3Ghz.
  • JohanAnandtech - Saturday, February 22, 2014 - link

    Good point. There is little gain from HT in OpenFoam, but it will influence the LZMA benchmarks. So the Openfoam findings are still valid, but not the LZMA. The kernel compile is somewhat in between.
  • JohanAnandtech - Saturday, February 22, 2014 - link

    I will rerun the benchmarks without HT to check.
  • mslasm - Saturday, February 22, 2014 - link

    Thanks! I did not mean to imply HT matters "a lot", but it may influence some (and I admit I don't know much about how your benchmarks behave, other than parallel LZMA which I worked a lot with) - so it just does not sound right to outright call it equivalent, and I wish AT only has statements anyone can just trust :)

Log in

Don't have an account? Sign up now