Translating to IPC: All This for 3%?

Contrary to popular belief, increasing IPC is difficult. Attempt to ensure that each execution port is fed every cycle requires having wide decoders, large out-of-order queues, fast caches, and the right execution port configuration. It might sound easy to pile it all on, however both physics and economics get in the way: the chip still has to be thermally efficient and it has to make money for the company. Every generational design update will go for what is called the ‘low-hanging fruit’: the identified changes that give the most gain for the smallest effort. Usually reducing cache latency is not always the easiest task, and for non-semiconductor engineers (myself included), it sounds like a lot of work for a small gain.

For our IPC testing, we use the following rules. Each CPU is allocated four cores, without extra threading, and power modes are disabled such that the cores run at a specific frequency only. The DRAM is set to what the processor supports, so in the case of the new CPUs, that is DDR4-2933, and the previous generation at DDR4-2666. I have recently seen threads which dispute if this is fair: this is an IPC test, not an instruction efficiency test. The DRAM official support is part of the hardware specifications, just as much as the size of the caches or the number of execution ports. Running the two CPUs at the same DRAM frequency gives an unfair advantage to one of them: either a bigger overclock/underclock, and deviates from the intended design.

So in our test, we take the new Ryzen 7 2700X, the first generation Ryzen 7 1800X, and the pre-Zen Bristol Ridge based A12-9800, which is based on the AM4 platform and uses DDR4. We set each processors at four cores, no multi-threading, and 3.0 GHz, then ran through some of our tests.

For this graph we have rooted the first generation Ryzen 7 1800X as our 100% marker, with the blue columns as the Ryzen 7 2700X. The problem with trying to identify a 3% IPC increase is that 3% could easily fall within the noise of a benchmark run: if the cache is not fully set before the run, it could encounter different performance. Shown above, a good number of tests fall in that +/- 2% range.

However, for compute heavy tasks, there are 3-4% benefits: Corona, LuxMark, CineBench and GeekBench are the ones here. We haven’t included the GeekBench sub-test results in the graph above, but most of those fall into the 2-5% category for gains.

If we take out Cinebench R15 nT result and the Geekbench memory tests, the average of all of the tests comes out to a +3.1% gain for the new Ryzen 2700X. That sounds bang on the money for what AMD stated it would do.

Cycling back to that Cinebench R15 nT result that showed a 22% gain. We also had some other IPC testing done at 3.0 GHz but with 8C/16T (which we couldn’t compare to Bristol Ridge), and a few other tests also showed 20%+ gains. This is probably a sign that AMD might have also adjusted how it manages its simultaneous multi-threading. This requires further testing.

AMD’s Overall 10% Increase

With some of the benefits of the 12LP manufacturing process, a few editors internally have questioned exactly why AMD hasn’t redesigned certain elements of the microarchitecture to take advantage. Ultimately it would appear that the ‘free’ frequency boost is worth just putting the same design in – as mentioned previously, the 12LP design is based on 14LPP with performance bump improvements. In the past it might not have been mentioned as a separate product line. So pushing through the same design is an easy win, allowing the teams to focus on the next major core redesign.

That all being said, AMD has previously already stated its intentions for the Zen+ core design – rolling back to CES at the beginning of the year, AMD stated that they wanted Zen+ and future products to go above and beyond the ‘industry standard’ of a 7-8% performance gain each year.

Clearly 3% IPC is not enough, so AMD is combining the performance gain with the +250 MHz increase, which is about another 6% peak frequency, with better turbo performance with Precision Boost 2 / XFR 2. This is about 10%, on paper at least. Benchmarks to follow.

Improvements to the Cache Hierarchy: Lower Latency = Higher IPC Precision Boost 2 and XFR2: Ensuring It Hertz More
Comments Locked

545 Comments

View All Comments

  • techguymaxc - Thursday, April 19, 2018 - link

    Either you don't have a fast enough GPU to remove the GPU bottleneck or there's something wrong with your data because there is NO chance Ryzen is faster than *lake in GTA V, with lower IPC and clocks.

    Don't get me wrong, Ryzen 2 looks like a good product family and I wouldn't discourage anyone from buying.
  • SaturnusDK - Thursday, April 19, 2018 - link

    As everyone else that are misreading the results. Tests are done at stock speeds and no overclocking.
  • LurkingSince97 - Thursday, April 19, 2018 - link

    Yes there is.

    Stock CPU and RAM speeds. Fully spectre / meltdown patched on both sides. Who is re-using old results? This review re-uses old results for the older generation Ryzen, and so some of the performance boost could be false (new drivers, OS patches, firmware, bios....).

    More investigation is needed on all sides. Many other review sites are significantly more lazy than AT and are likely recycling old results for the Intel side.

    As for your GPU bottleneck.... um no. Look at the results, as the resolution goes up, THEN you get GPU bottlenecked and all CPUs look the same. At low resolutions, it is clearly not GPU bottlenecked as there is a big FPS difference by CPU.
  • jaydee - Thursday, April 19, 2018 - link

    Great review. Curious to see how things scale down for a 35W TDP part compared to Intel's latest 35W TDP CPUs.
  • SaturnusDK - Thursday, April 19, 2018 - link

    Gamers Nexus have tested the 2700X to work at 1.175V locked to 4.1GHz where it consumes 129W compared to stock frequency and stock voltage where it consumes 200W. Performance is generally the same on average.
  • Flunk - Thursday, April 19, 2018 - link

    Wow, that single-thread performance delta sure has shrunk hasn't it? Between meltdown and higher core clocks on the Zen+.
  • mapesdhs - Saturday, April 21, 2018 - link

    Wonder whether it won't be that much longer until AMD launches something which actually beats Intel in IPC. Atm, people keep saying Intel wins on IPC, but it's only because Intel has punched its clock rates through the roof (it's like the old P4 days again), something they could have done years ago but never bothered because there was no competition, just as they could have released a consumer 8-core long ago but didn't (the 3930K was a crippled 8-core, but back then AMD couldn't even beat mainstream SB, never mind SB-E).
  • mkaibear - Monday, April 23, 2018 - link

    You know IPC is "instructions per clock", yeah? So saying Intel wins on IPC because their clock rate is faster doesn't make sense, it's like saying UK cars have a higher mpg then US cars because their gallons are bigger.

    Intel wins (won?) on IPC because they executed more instructions per MHz of the clock rate. When you couple that with a faster clock rate you get a double whammy of performance. It does appear that AMD has almost closed the door on IPC but is still not operating on as high a clock rate.
  • Targon - Monday, April 23, 2018 - link

    This is why many are looking forward to Zen 2 in 2019, which will have true design improvements compared to Zen and Zen+. Zen+ is a small and incremental improvement over Zen(first generation Ryzen chips). Combined with 7nm, we may very well see AMD get very close to Intel clock speeds while having very similar, if not better IPC and a higher core count.
  • MajGenRelativity - Thursday, April 19, 2018 - link

    Looks like a good review. Glad to see AMD closing the performance gap even further!

Log in

Don't have an account? Sign up now