Single-Threaded Integer Performance

The LZMA compression benchmark only measures a part of the performance of some real-world server applications (file server, backup, etc.). The reason why we keep using this benchmark is that it allows us to isolate the "hard to extract instruction level parallelism (ILP)" and "sensitive to memory parallelism and latency" integer performance. That is the kind of integer performance you need in most server applications.

One more reason to test performance in this manner is that the 7-zip source code is available under the GNU LGPL license. That allows us to recompile the source code on every machine with the -O2 optimization with gcc 4.8.2.

LZMA Single-Threaded Performance: Compression

The Xeon E5-2650L Haswell core is only able to boost to 2.5 GHz, while the Xeon D has a newer core (Broadwell) and is capable of 2.6 GHz. Still, the Xeon E5 is 6% faster. The most likely explanation is that the Xeon E5-2650L (65W TDP) keeps turboboost higher for a longer time than the Xeon D (45W TDP). 

The Xeon D and Atom C2750 run at the same clockspeed in this single threaded task (2.6 GHz), but you can see how much difference a wide complex architecture makes. The Broadwell Core is able to run about twice as many instructions in parallel as the Silvermont core. The Haswell/Broadwell core results clearly show that well designed wide architectures remain quite capable, even in "low ILP" (Instruction Level Parallelism) code.

Let's see how the chips compare in decompression. Decompression is an even lower IPC (Instructions Per Clock) workload, as it is pretty branch intensive and depends on the latencies of the multiply and shift instructions.

LZMA Single-Threaded Performance: Decompression

The Xeon E5 runs at 2.5 GHz, the Xeon D at 2.6 GHz, the Xeon E3-1230L at 2.8 GHz, The Xeon E3-1265L can reach 3.7 GHz. The decompression results follow the same logic. There does not seem to be a difference between a Broadwell, Haswell or Ivy Bridge core: performance is almost linear with (turboboost) clockspeed. The only exception is the Xeon E3-1240 which turboboost to 3.8 GHz, but outperforms the other by a larger than expected. The explanation is pretty simple: the higher TDP (80 W) allows the chip to sustain turbo boost clock speeds for much longer. 

Memory Subsystem: Latency Multi-Threaded Integer Performance
Comments Locked

90 Comments

View All Comments

  • Krysto - Tuesday, June 23, 2015 - link

    Betteridge law.
  • Metaluna - Tuesday, June 23, 2015 - link

    ...fails in this case. Did you read the review?
  • CajunArson - Tuesday, June 23, 2015 - link

    While desktop Broadwell isn't all that great, these server parts really show off Intel's accomplishments in improving power efficiency and performance-per-watt with 14nm.

    ARM has a huge hill to climb to really compete with these parts, and we've already seen AMD effectively skip its first iteration of an ARM product because they probably got wind of the Xeon D and decided they would have to do both a die-shrink and completely customized ARM core just to keep up.
  • The_Assimilator - Tuesday, June 23, 2015 - link

    I very much doubt whether we'll ever see another server CPU from AMD, regardless of ARM cores or not. If they even manage to get Zen out the door, *and* it's not another massive flop, I will be impressed.
  • Refuge - Tuesday, June 23, 2015 - link

    I root for them everyday, but lets not give them too big of a hill to climb with a broken leg now. lol
  • extide - Tuesday, June 23, 2015 - link

    Take it easy man, AMD is not going down the drain any time soon, and we WILL see some future server oriented parts come from them. But how fast will they be? That's the question and we wont know for a while...
  • Kjella - Tuesday, June 23, 2015 - link

    Really? Last quarter they had a $187 million total comprehensive loss on $1030 million in revenue, even if you exclude the restructuring cost they lost $100 million for a -10% deficit. The stockholder's equity is almost gone with $17 million left, after that getting funding or a credit limit will become much harder.

    And Q2 is probably going to be another bloody quarter with no major CPU or GPU launches and firesales of old Win8 stock in preparation for Win10. The console ramp-up is usually in Q3 in preparation for Christmas, not before the summer. Last quarter's loss they took almost entirely from their cash reserves, they're now in the lower end of what they need to operate, if they lose this quarter too they must cut where it hurts bad.
  • Guspaz - Tuesday, June 23, 2015 - link

    When we needed a low-power and low-cost server solution, we went with a desktop i3, because for some reason Intel supports ECC RAM on the i3 and lower, but not in the i5 and higher.
  • julianb - Tuesday, June 23, 2015 - link

    Very interested in this SOC.

    If possible could we see how the Xeon D deal with Cinebench Multithreaded test?
    I am into 3D CPU rendering and would like to know how does the Xeon D-1540 compare to say i7-3930K or i7-4790K.
    I realize the purpose of Xeon D-1540's existence is different but still...
    Thank you.
  • MrSpadge - Saturday, June 27, 2015 - link

    An eco-tuned 5820K seem better. I don't suppose you're going to render 24/7 all the time, so the electricity savings from the 14 nm Broadwell will have a hard time making up for the massive difference in initial cost.

Log in

Don't have an account? Sign up now