CPU Tests: SPEC Performance

SPEC2017 is a suite of standardized tests used to compare overall performance across different systems, architectures, microarchitectures, and setups. The code has to be compiled, and the results can then be submitted to an online database for comparison. The suite covers a range of integer and floating point workloads and can be heavily optimized for each CPU, so it is important to check how the benchmarks are being compiled and run.

For compilers, we use LLVM for both the C/C++ and Fortran tests, with Flang as the Fortran compiler. The rationale for using LLVM over GCC is better cross-platform comparison with platforms that only have LLVM support, as well as future articles where we’ll investigate this aspect more. We’re not considering closed-source compilers such as MSVC or ICC.

clang version 10.0.0
clang version 7.0.1 (ssh://git@github.com/flang-compiler/flang-driver.git 24bd54da5c41af04838bbe7b68f830840d47fc03)

-Ofast -fomit-frame-pointer
-march=x86-64
-mtune=core-avx2
-mfma -mavx -mavx2

Our compiler flags are straightforward, with basic -Ofast and the relevant ISA switches to allow for AVX2 instructions. We decided to build our SPEC binaries on AVX2, which makes Haswell the oldest platform we can test before the binaries fail to run. This also means we don’t have AVX-512 binaries, primarily because getting the best performance requires the AVX-512 intrinsics to be packed by a proper expert, as with our AVX-512 benchmark. All of the major vendors (AMD, Intel, and Arm) support the way in which we are testing SPEC.
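As a rough illustration of the AVX2 floor described above, the Python sketch below checks whether a CPU's feature flags cover what these binaries need. The function name and the sample flag strings are hypothetical. The flag names `fma`, `avx`, and `avx2` are written as they appear in the flags line of /proc/cpuinfo on Linux.

```python
# ISA extensions enabled by -mfma -mavx -mavx2, written the way they
# appear in the "flags" line of /proc/cpuinfo on Linux.
REQUIRED_FLAGS = frozenset({"fma", "avx", "avx2"})


def missing_isa_flags(cpu_flags_line):
    """Return the required flags absent from a space-separated flags string."""
    present = set(cpu_flags_line.lower().split())
    return set(REQUIRED_FLAGS - present)


# Hypothetical, heavily shortened flag strings for illustration only.
haswell_like = "fpu sse sse2 ssse3 sse4_1 sse4_2 avx fma avx2 bmi1 bmi2"
older_core = "fpu sse sse2 ssse3 sse4_1 sse4_2 avx"

# An empty result means the AVX2 binaries should be able to run.
print(sorted(missing_isa_flags(haswell_like)))
# A non-empty result lists the extensions the CPU lacks.
print(sorted(missing_isa_flags(older_core)))
```

A check like this only says whether the binaries can execute. It says nothing about how well -mtune=core-avx2 suits a given microarchitecture.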

To note, the requirements of the SPEC licence state that any benchmark results from SPEC have to be labeled ‘estimated’ until they are verified on the SPEC website as a meaningful representation of the expected performance. This is most often done by the big companies and OEMs to showcase performance to customers; however, it is quite over the top for what we do as reviewers.

SPEC2017 Rate-1 Estimated Total

In the single-threaded test, the jump over the regular Zen 3 Ryzen mobile variant (5980HS) at the same power is quite substantial: +9.6% on integer performance and +14.1% on floating point. The move from DDR4 to DDR5 plays a large part in that, and it shows up in a lot of our upcoming benchmarks.
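SPEC derives each overall score as the geometric mean of the per-benchmark ratios, and an uplift figure like the ones quoted here is the relative difference between two such totals. The Python sketch below shows that arithmetic. The per-benchmark values and function names are made up for illustration and are not our measured results.

```python
import math


def geometric_mean(ratios):
    """Geometric mean of positive per-benchmark ratios, computed via logs."""
    if not ratios or any(r <= 0 for r in ratios):
        raise ValueError("ratios must be a non-empty list of positive numbers")
    return math.exp(sum(math.log(r) for r in ratios) / len(ratios))


def uplift_percent(new_total, old_total):
    """Relative change of new_total over old_total, in percent."""
    return (new_total / old_total - 1.0) * 100.0


# Hypothetical per-benchmark scores; the second set is uniformly 10% higher.
old_scores = [5.0, 6.0, 7.2]
new_scores = [5.5, 6.6, 7.92]

old_total = geometric_mean(old_scores)
new_total = geometric_mean(new_scores)
print(f"old total {old_total:.2f}, new total {new_total:.2f}")
print(f"uplift {uplift_percent(new_total, old_total):+.1f}%")
```

Because the total is a geometric mean, a uniform gain across all sub-tests moves the total by the same percentage, while a large gain in a single sub-test is damped.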

We didn’t see any change from 35 W to 45 W to 65 W in our AMD testing, as the power consumption of the chip in single-threaded workloads did not exceed 24 W. However, we did see a performance difference in Intel’s Alder Lake going from 45 W to 65 W, showcasing how much power the core can consume.

But if we compare that to Intel’s latest Alder Lake offerings, there’s a deficit in both categories, even though our lowest data point there is at 45 W. We can also see that the 45 W testing of the previous generation Intel beats the 6900HS in SPECint (but AMD wins in SPECfp). This is something that carries through to multi-threaded performance.

SPEC2017 Rate-N Estimated Total

For multi-threaded performance, we only saw the slightest improvement from AMD moving up to 65 W, perhaps showcasing that the hardware is limited in ways other than just power and the uplift from DDR4 to DDR5. In any event, at 35 W, AMD still surpasses what the previous generation Intel i9-11980HK can provide at 65 W.

But if we compare it to Intel’s latest Alder Lake processors, featuring 6 performance cores and 8 efficiency cores, we now have 20 threads up against AMD’s 16 threads. If we compare 45 W to 45 W, Intel has a +14.0% lead in integer and a +13.3% lead in floating point, against a 25% advantage in thread count (20 versus 16). With Intel introducing this dual-tier performance with hybrid SoCs, multi-threaded performance is going to be a combination of fast and slow cores, and it all comes down to how the system can divide up the work.
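To put the thread counts and the leads side by side, here is a small Python sketch of the arithmetic. It assumes two threads per performance core and one per efficiency core on the Intel part, and eight two-thread cores on the AMD part. The function names are hypothetical, and dividing by thread count is only a crude normalisation, since an efficiency-core thread and an SMT thread are not equivalent.

```python
def thread_count(p_cores, e_cores, threads_per_p_core=2):
    """Hardware threads for a hybrid chip: SMT on P-cores, one per E-core."""
    return p_cores * threads_per_p_core + e_cores


def per_thread_relative(lead_percent, thread_ratio):
    """Throughput per thread relative to the rival, given an overall lead."""
    return (1.0 + lead_percent / 100.0) / thread_ratio


intel_threads = thread_count(p_cores=6, e_cores=8)  # 6*2 + 8
amd_threads = thread_count(p_cores=8, e_cores=0)    # 8*2
thread_ratio = intel_threads / amd_threads

print(f"{intel_threads} vs {amd_threads} threads, ratio {thread_ratio:.2f}")
for name, lead in (("SPECint", 14.0), ("SPECfp", 13.3)):
    rel = per_thread_relative(lead, thread_ratio)
    print(f"{name}: +{lead}% overall, {rel:.3f}x per thread")
```

Under these assumptions the per-thread figure lands a little above 0.9x, which is one way of seeing that the overall lead is smaller than the thread-count advantage.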


