Single-Threaded Integer Performance

The LZMA compression benchmark only measures a part of the performance of some real-world server applications (file server, backup, etc.). The reason why we keep using this benchmark is that it allows us to isolate the "hard to extract instruction level parallelism (ILP)" and "sensitive to memory parallelism and latency" integer performance. That is the kind of integer performance you need in most server applications.

One more reason to test performance in this manner is that the 7-zip source code is available under the GNU LGPL license. That allows us to recompile the source code on every machine with the -O2 optimization with gcc 4.8.2.

LZMA Single-Threaded Performance: Compression

The Xeon E5-2650L Haswell core is only able to boost to 2.5 GHz, while the Xeon D has a newer core (Broadwell) and is capable of 2.6 GHz. Still, the Xeon E5 is 6% faster. The most likely explanation is that the Xeon E5-2650L (65W TDP) keeps turboboost higher for a longer time than the Xeon D (45W TDP). 

The Xeon D and Atom C2750 run at the same clockspeed in this single threaded task (2.6 GHz), but you can see how much difference a wide complex architecture makes. The Broadwell Core is able to run about twice as many instructions in parallel as the Silvermont core. The Haswell/Broadwell core results clearly show that well designed wide architectures remain quite capable, even in "low ILP" (Instruction Level Parallelism) code.

Let's see how the chips compare in decompression. Decompression is an even lower IPC (Instructions Per Clock) workload, as it is pretty branch intensive and depends on the latencies of the multiply and shift instructions.

LZMA Single-Threaded Performance: Decompression

The Xeon E5 runs at 2.5 GHz, the Xeon D at 2.6 GHz, the Xeon E3-1230L at 2.8 GHz, The Xeon E3-1265L can reach 3.7 GHz. The decompression results follow the same logic. There does not seem to be a difference between a Broadwell, Haswell or Ivy Bridge core: performance is almost linear with (turboboost) clockspeed. The only exception is the Xeon E3-1240 which turboboost to 3.8 GHz, but outperforms the other by a larger than expected. The explanation is pretty simple: the higher TDP (80 W) allows the chip to sustain turbo boost clock speeds for much longer. 

Memory Subsystem: Latency Multi-Threaded Integer Performance
Comments Locked

90 Comments

View All Comments

  • julianb - Saturday, October 31, 2015 - link

    Thanks for the reply, man.
    And sorry for my late reply, totally forgot about this thread :)
  • eva2000 - Tuesday, June 23, 2015 - link

    Nice... Xeon D-1540 is awesome, but I wish it was clocked 0.2Ghz higher across the board would be just enough to tip that scale versus E5. Did my own benchmarks at https://community.centminmod.com/threads/2864/ :)
  • extide - Wednesday, June 24, 2015 - link

    Thats probably exactly why it ISNT clocked 0.2Ghz higher across the board ;)

    I'm sure Intel wants to see some space between this and E5.
  • boogerlad - Tuesday, June 23, 2015 - link

    If this was marketed for the consumer market with the ability to overclock, this would outsell everything completely. This is what the enthusiast needs!!!
  • Refuge - Tuesday, June 23, 2015 - link

    I don't think this is going to do much of anything for an enthusiast.

    Unless they are interested in building a server for some experiment or project.
  • JohanAnandtech - Wednesday, June 24, 2015 - link

    I still think the i7 59xx series is a better match for consumers: higher clocks and thus ST performance. The Xeon D most interesting features such as integrated 10 GBe and low power don't interest most performance consumers. Most people will have a hard time saturating a 1 GBe line and power savings are not a priority.
  • tspacie - Wednesday, June 24, 2015 - link

    Seems to tick all the boxes for a software development machine. Very good at compilation. Reasonably priced for the performance. Low power. ECC memory. I'm tempted
  • extide - Wednesday, June 24, 2015 - link

    EXACTLY what I was thinking!
  • MrSpadge - Saturday, June 27, 2015 - link

    I would be very tempted by such a chip as well, using it for BOINC. However, Broadwell looses some of the power efficiency advantage if you push it harder, i.e. the largest gains are at low and moderate frequency. Perfect for such server chips and mobile ones, but not so much for people aiming for 4+ GHz.
  • MaxKreimerman - Tuesday, June 23, 2015 - link

    Sounds impresive in just 45w package, but imposible to find in the retail sites such as newegg or wiredzone

Log in

Don't have an account? Sign up now