SPEC2006 & 2017: Industry Standard - ST Performance

One big talking point around the new Ryzen 3000 series is the improved single-threaded performance of the Zen 2 core. To investigate the topic in a more controlled manner with better documented workloads, we’ve fallen back on the industry standard SPEC benchmark suite.

We’ll be investigating the previous generation SPEC CPU2006 test suite, which gives us better context against past platforms, as well as introducing the new SPEC CPU2017 suite. We have to note that SPEC2006 has been deprecated in favour of 2017, and that the scores posted today are estimates, as they’re not officially submitted to the SPEC organisation.

For SPEC2006, we’re still using the same setup as on our mobile suite, meaning all the C/C++ benchmarks, while for SPEC2017 I’ve also gone ahead and prepared all the Fortran tests for a near-complete suite for desktop systems. I say near-complete because, due to time constraints, we’re running the suite via WSL on Windows. I’ve checked that there are no noticeable performance differences to native Linux (we’re also compiling statically); however, one bug on WSL is that it has a fixed stack size, so we’ll be missing 521.wrf_r from the SPECfp2017 collection.
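As a rough way to see what stack limit a WSL or Linux session actually imposes, the sketch below queries it through Python's standard `resource` module. It only reports the limit: the stack size that 521.wrf_r needs is not stated in this article, and the helper names `format_limit` and `stack_limits` are illustrative choices rather than part of our test harness.

```python
import resource


def format_limit(limit):
    """Render an rlimit value; RLIM_INFINITY means no cap is enforced."""
    if limit == resource.RLIM_INFINITY:
        return "unlimited"
    return f"{limit // 1024} KiB"


def stack_limits():
    """Return the (soft, hard) stack size limits of the current process, in bytes."""
    return resource.getrlimit(resource.RLIMIT_STACK)


soft, hard = stack_limits()
print(f"stack soft limit: {format_limit(soft)}, hard limit: {format_limit(hard)}")
```

On a native Linux system the soft limit can usually be raised up to the hard limit before launching a workload; whether a given WSL build honours that is exactly the kind of thing this check helps to reveal.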

In terms of compilers, I’ve opted to use LLVM for both the C/C++ and Fortran tests. For Fortran, we’re using the Flang compiler. The rationale for using LLVM over GCC is better cross-platform comparisons with platforms that only have LLVM support, as well as future articles where we’ll investigate this aspect more. We’re not considering closed-source compilers such as MSVC or ICC.

clang version 8.0.0-svn350067-1~exp1+0~20181226174230.701~1.gbp6019f2 (trunk)
clang version 7.0.1 (ssh://git@github.com/flang-compiler/flang-driver.git 24bd54da5c41af04838bbe7b68f830840d47fc03)

-Ofast -fomit-frame-pointer
-march=x86-64
-mtune=core-avx2 
-mfma -mavx -mavx2

Our compiler flags are straightforward, with basic -Ofast and the relevant ISA switches to allow for AVX2 instructions.
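To make the flag set concrete, here is a minimal sketch that assembles the switches listed above into a single compiler invocation. SPEC itself is built through its own config files, so this is not how the suite is actually driven; the file names `example.c` and `example` are hypothetical, and the `-static` switch is an assumption standing in for the static compilation mentioned earlier.

```python
import shlex

# Flags exactly as listed in the article.
CFLAGS = [
    "-Ofast", "-fomit-frame-pointer",
    "-march=x86-64",
    "-mtune=core-avx2",
    "-mfma", "-mavx", "-mavx2",
]


def build_command(compiler, source, output, static=True):
    """Assemble an argument list for a single-file build (illustrative only)."""
    cmd = [compiler, *CFLAGS]
    if static:
        # Assumption: static linking is requested with the usual -static switch.
        cmd.append("-static")
    cmd += [source, "-o", output]
    return cmd


# Hypothetical file names, printed as a copy-pasteable shell line.
print(shlex.join(build_command("clang", "example.c", "example")))
```

Keeping the flags in one list makes it easy to reuse the same set across compilers or to drop the ISA switches when comparing against a platform without AVX2.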

The Ryzen 3900X system was run in the same way as the rest of our article with DDR4-3200 CL16, the same as the i9-9900K, whilst the Ryzen 2700X had DDR4-2933 with similar CL16 16-16-16-38 timings.

SPECint2006 Speed Estimated Scores

In terms of the int2006 benchmarks, the improvements of the new Zen2 based Ryzen 3900X are quite even across the board when compared to the Zen+ based Ryzen 2700X. We do note, however, somewhat larger performance increases in 403.gcc and 483.xalancbmk. It’s not immediately clear why, as the benchmarks don’t have one particular characteristic that would fit Zen2’s design improvements, but I suspect it’s linked to the larger L3 cache.

445.gobmk in particular is a branch-heavy workload, and the 35% increase in performance here would be better explained by Zen2’s new additional TAGE branch predictor which is able to reduce overall branch misses.

It’s also interesting that although the Ryzen 3900X posted worse memory latency results than the 2700X, it’s still able to outperform the latter in memory sensitive workloads such as 429.mcf, although the increase for 471.omnetpp is amongst the smallest in the suite.

However, we still see that AMD has an overall larger disadvantage to Intel in these memory sensitive tests, as the 9900K has a large advantage in 429.mcf and posts a large lead in the very memory bandwidth intensive 462.libquantum, the two tests that put the most pressure on the caches and memory subsystem.

SPECfp2006(C/C++) Speed Estimated Scores

In the fp2006 benchmarks, we again see some larger jumps on the part of the Ryzen 3900X, particularly in 482.sphinx3. This test, along with 450.soplex, is characterized by higher data cache misses, so Zen2’s 16MB L3 cache should definitely be part of the reason we see such large jumps.

I found it interesting that we’re not seeing much improvement in 470.lbm even though this is a data store heavy test, so I would have expected Zen2’s additional store AGU to greatly benefit this workload. There must be some higher level memory limitation that is bottlenecking the test.

453.povray is neither data heavy nor branch heavy, as it’s one of the simpler workloads in the suite. Here the bottlenecks are mostly the execution backend throughput and the ability of the front-end to feed it fast enough. So while the Ryzen 3900X provides a big boost over the 2700X, it’s still largely lagging behind the 9900K, a characteristic we’re also seeing in the similarly execution-bottlenecked 456.hmmer of the integer suite.

SPEC2006 Speed Estimated Total

Overall, the 3900X is 25% faster in the integer and floating point tests of the SPEC2006 suite, which corresponds to a 17% IPC increase, above AMD's officially published figures for IPC increases.
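As a quick sanity check on how those two figures relate, single-threaded performance can be approximated as IPC multiplied by clock frequency. Under that simplification (it ignores memory and platform effects, and the clocks actually sustained during the runs are not listed here), the 25% and 17% figures imply the clock advantage sketched below. The function name is illustrative.

```python
def implied_clock_ratio(speedup, ipc_gain):
    """Clock ratio implied by performance = IPC x frequency."""
    return speedup / ipc_gain


# Figures from the text: 25% faster overall, 17% higher IPC.
ratio = implied_clock_ratio(1.25, 1.17)
print(f"implied clock advantage: {(ratio - 1) * 100:.1f}%")
```

In other words, roughly 7% of the uplift would come from frequency and the rest from per-clock gains.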

Moving on to the 2017 suite, we have to clarify that we’re using the Rate benchmark variations. The 2017 suite’s speed and rate benchmarks differ from each other in terms of workloads. The speed tests were designed for single-threaded testing and have large memory demands of up to 11GB, while the rate tests were meant for multi-process tests. We’re using the rate variations of the benchmarks because we don’t see any large differentiation between the two variations in terms of their characterisation, and thus the performance scaling between the two should be extremely similar. On top of that, the rate benchmarks take up to 5x less time (+1 hour vs +6 hours), and we're able to run them on more memory limited platforms (which we plan to do in the future).

SPECint2017 Rate-1 Estimated Scores

In the int2017 suite, we’re seeing similar performance differences and improvements, although this time around there are a few workloads that are a bit more limited in terms of their performance boosts on the new Ryzen 3900X.

Unfortunately I’m not quite as familiar with the exact characteristics of these tests as I am with the 2006 suite, so a more detailed analysis should follow in the next few months as we delve deeper into microarchitectural counters.

SPECfp2017 Rate-1 Estimated Scores

In the fp2017 suite, things are also quite even. Interestingly enough, here in particular AMD is able to leapfrog Intel’s 9900K in a lot more workloads, sometimes winning in terms of absolute performance and sometimes losing.

SPEC2017 Rate-1 Estimated Total

As for the overall performance scores, the new Ryzen 3900X improves by 23% over the 2700X. Although it closes the gap almost completely, it’s just a hair's width shy of actually beating the 9900K’s absolute single-threaded performance.
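For readers wondering how a single overall figure comes out of many subtests: SPEC's official composite metrics are geometric means of the per-benchmark ratios, and the sketch below assumes our estimated totals follow the same convention. The per-benchmark ratios in the example are hypothetical placeholders, not measured results.

```python
import math


def geometric_mean(ratios):
    """Geometric mean of positive per-benchmark ratios."""
    if not ratios:
        raise ValueError("need at least one ratio")
    if any(r <= 0 for r in ratios):
        raise ValueError("ratios must be positive")
    return math.exp(sum(math.log(r) for r in ratios) / len(ratios))


# Hypothetical per-benchmark speedups of one chip over another (placeholders).
placeholder_ratios = [1.10, 1.20, 1.35, 1.25]
print(f"overall speedup: {geometric_mean(placeholder_ratios):.3f}x")
```

A geometric mean keeps one outlier subtest from dominating the total, which is why a big jump in a single benchmark moves the overall score less than an arithmetic average would suggest.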

SPEC2017 Rate-1 Estimated Performance Per GHz

Normalising the scores for frequency, we see that AMD has achieved something that the company hasn’t been able to claim in over 15 years: It has beaten Intel in terms of overall IPC. Overall here, the IPC improvements over Zen+ are 15%, which is a bit lower than the 17% figure for SPEC2006.
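The normalisation itself is just score divided by clock. The sketch below reproduces the Zen+ to Zen2 comparison using the 23% overall uplift from the text and, as an assumption, the chips' nominal maximum boost clocks (4.6GHz for the 3900X and 4.3GHz for the 2700X); the frequencies actually used for the chart are not listed here, so treat the inputs as illustrative.

```python
# Assumption: nominal maximum boost clocks, not measured frequencies.
NOMINAL_BOOST_GHZ = {"Ryzen 2700X": 4.3, "Ryzen 3900X": 4.6}

# Relative overall scores, with the 2700X as the 1.00 baseline (23% uplift from the text).
RELATIVE_SCORE = {"Ryzen 2700X": 1.00, "Ryzen 3900X": 1.23}


def score_per_ghz(chip):
    """Frequency-normalised score for one chip."""
    return RELATIVE_SCORE[chip] / NOMINAL_BOOST_GHZ[chip]


def per_ghz_uplift(new, old):
    """Ratio of frequency-normalised scores between two chips."""
    return score_per_ghz(new) / score_per_ghz(old)


uplift = per_ghz_uplift("Ryzen 3900X", "Ryzen 2700X")
print(f"per-GHz uplift: {(uplift - 1) * 100:.0f}%")
```

With these inputs the per-GHz uplift lands at about 15%, in line with the figure quoted above.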

We already know about Intel’s upcoming Sunny Cove microarchitecture, which should undoubtedly be able to regain the IPC crown with relative ease, but the question for Intel is whether they’ll still be able to maintain the single-thread absolute performance crown and continue to see 5GHz or similar clock speeds with the new core design.


447 Comments

  • beginning - Thursday, July 11, 2019 - link

    I noticed that at the E3 2019 tech day, AMD recommended DDR4-3600 CL16 RAM. I see that 3200 MHz RAM has been used in the AMD testbench. I read the description about avoiding overclocking but 3600 MHz RAMs come with a factory clock of 3600 MHz, right? I know I am missing something. What am I missing?
  • sknaumov - Thursday, July 11, 2019 - link

    Do you plan to make some tests of these CPUs on older, cheaper and colder motherboards? It would be very interesting to see results of b450 chipset and whether it is possible to use DDR4-3600MHz with tight timings on these older boards. Or at least provide more info about what has more priority for memory speed and timings on AMD platform - CPU or chipset.
  • viperswhip - Thursday, July 11, 2019 - link

    I am going to wait to build a PC for a bit, however, I am super excited by this launch and disappointed by the video card launch. I expect to have an AMD chip since Intel has no answer for this, and we shall see on the video cards, but if I was building today I'd probably get a 2070 RTX super.
  • PProchnow - Friday, July 12, 2019 - link

    Here's is Jus' a good ol' boy trying out. No OC off stock Multi but 3333Mhz RAM
    #1
    https://browser.geekbench.com/v4/cpu/13863634

    Rather a new rig and it is X470 up to the A.A BIOS and it is MSI Gaming Plus.
    OK link #2 is here and I stroked the DDR$ up top 3333Mhz. I also stroked the fan
    to stay sub 70C. Wild OCs will take water at least "in The Home" versus LiqN2 Lab.

    https://browser.geekbench.com/v4/cpu/13865361

    BTW where is the Bragging Thread? My MOBO is the MSI X470 Gaming Plus BIOS A.A makes Ryzen 9 go BTW.
    I have yet to up the MULTI in case you want to know. I wonder what good Ocers will get with the right stuff.

    Single-Core Performance
    Memory Score 6431
    Floating Point Score 5409
    Integer Score 5190
    Crypto Score 6888
    Single-Core Score 5589

    You understand that RAM set at 1672 is 1/2 the common referred to speed. 3344Mhz is the common nomenclature.

    ***Single-Core Score ***Multi-Core Score
    5589 47755
    Geekbench 4.3.4 Tryout for Windows x86 (64-bit)
    Result Information
    Upload Date July 12 2019 08:16 PM
    Views 2
    System Information
    System Information
    Operating System Microsoft Windows 10 Pro (64-bit)
    Model Micro-Star International Co., Ltd. MS-7B79
    Motherboard Micro-Star International Co., Ltd. X470 GAMING PLUS (MS-7B79)
    Memory 32768 MB DDR4 SDRAM 1672MHz
    Northbridge AMD Ryzen SOC 00
    Southbridge AMD X470 51
    BIOS American Megatrends Inc. A.A0
    Processor Information
    Name AMD Ryzen 9 3900X
    Topology 1 Processor, 12 Cores, 24 Threads
    Identifier AuthenticAMD Family 23 Model 113 Stepping 0
    Base Frequency 3.80 GHz
    Maximum Frequency 4.53 GHz
  • Maxiking - Tuesday, July 23, 2019 - link

    Why would anyone brag about something if

    You can't reach 5.0ghz +
    You can't reach even the boost frequency on a single core
    You can't beat consistently competitor's older 14nm cpu architecture which has been on the market since 2016...
    You can't beat RAM OC'ing records either because over 3733mhz IF gets actually downlocked and due tu that, "faster" ram performs worse unless you OC 7400mhz, which is not possible even with liquid nitrogen.
  • PProchnow - Friday, July 12, 2019 - link

    These are my scores with my Ryzen 9 3900X.

    Now you can cross ref with others.
  • Meteor2 - Monday, July 15, 2019 - link

    Nice!
  • willis936 - Wednesday, July 17, 2019 - link

    The editor's choice awards are a bit strange to me. Zen 1 didn't receive one even though it was the largest CPU performance increase from a company this century. The i7-4950HQ received an editor's choice silver award even though it had little importance to the industry. And the 3700X, which offers comparable SP performance to competing intel products at a huge discount and smaller power budget gets the same editor's choice level as the i7-4950HQ?
  • willis936 - Wednesday, July 17, 2019 - link

    I know it was a different editor at the time, but the selective excitement is a bit of a bummer. eDRAM was exciting to see at the time and then nothing ever came of it. The enthusiasm of chiplets under the new editor comes through much less. That too is fine. However if the rating system is what it is then I don't think it's much to argue that chiplets are much more disruptive than eDRAM and is already making much larger waves.
  • Maxiking - Monday, July 22, 2019 - link

    AMD fraund getting finally the attention it deserves

    https://www.youtube.com/watch?v=x03FyPQ3a3E

    check at 05m25s
