Power Results (15W and 25W)

Based on the SKU table, Intel was very keen to point out that all of the Y-series processors for Ice Lake and all the 15W U-series processors have cTDP up modes. This means that OEMs, if they build for it, can take advantage of a higher base power of a processor which leads to longer turbo periods and a higher frequency during sustained performance levels.

While cTDP is a good idea, one of the issues we have with the concept is that Intel’s OEM partners that design the laptops and notebooks for these processors don’t ever advertise or publicise if they’re using a CPU in cTDP up or down mode. I could understand why a vendor might not want to advertise using a down mode, but an up mode means extra performance, and it’s hard to tell from the outside what is going on.

For what it is worth, most users cannot change between these modes anyway. They are baked into the firmware and the operating system. However there are a few systems that do expose this to the user, as I recently found out with my Whiskey Lake-U platform, where the OS power plan has advanced options to set the TDP levels. Very interesting indeed.

Also, for Ice Lake-U, Intel is introducing a feature called Intel Dynamic Tuning 2.0.

We covered this in our architecture disclosure article, but the short and simple of it is that it allows OEMs to implement a system whereby the PL1/TDP of a system can change based on an algorithm over time. So it allows for higher strict turbo, and then adjusts the turbo budget over time.

This feature will be branded under Intel’s Adaptix brand, which covers all these CPU optimizations. However, it should be noted, that this feature is optional for the OEM. It requires the OEM to actually do the work to characterize the thermal profile of the system. We suspect that it will be mostly on premium devices, but as the chips roll out into cheaper systems, this will not be there. Intel is not making this feature standard.

Testing Power

Based on the time available, we weren’t able to do much power testing. What I was able to do was run a power profile during the start of our 3DPM AVX512 test in both 15W and 25W modes for the Core i7-1065G7.

The test here runs for 20 seconds, then rests for 10 seconds. Here are the first four sub-tests, and there are a lot of interesting points to note.

The peak power in these systems is clearly the PL2 mode, which on the Intel SDS platform seems to be around the 50W mode. Given that the functional test system is a bit of a chonk, with a strong thermal profile and the fan on all the time, this is perhaps to be expected. The suggested PL2 for Kaby Lake-R was 44W, so this might indicate a small jump in strategy. Of course, with the Kaby Lake-R designs, we never saw many devices that actually had a PL2 of 44W – most OEMs chose something smaller, like 22W or 35W.

The fact that the CPU can sustain a 50W PL2 means that Intel could easily release Ice Lake into the desktop market at the 35W range. Easy. Please do this Intel.

Second to note is the AVX-512 frequency. Not listed here, but under the 15W mode we saw the AVX-512 frequency around 1.0-1.1 GHz, while at 25W it was around 1.4-1.5 GHz. That’s quite a drop from non AVX-512 code, for sure.

Third, we come to the turbo window. Increasing the base TDP means that the turbo window has more budget to turbo, and we can see that this equates to more than 2x on all the sub-tests. In the 15W mode, on the first test, we blow through the budget within 5 seconds, but on the 25W mode, we can actually turbo all the way through the 20 seconds of the first test. This means that there is still technically budget on the table by the time we start the second test under the 25W mode.

Also, that third test – if you are wondering why that graph looks a little light on the data points compared to the others, it is because the AVX-512 instructions took so much of the time on the CPU, that our power software didn’t get any for itself to update the power values. We still got enough to make a graph, but that just goes to show what hammering the CPU can do.

For the base power consumption, we actually have an issue here with the observer effect. Our polling software is polling too often and spiking up the power a little bit. However, if we take the average power consumption between 25-30 seconds, under 25W this is 2.96W, and under 15W this is 2.87W, which is similar.

For users interested in the score differential between the two:

For 3DPM without AVX instructions, the 15W mode scored 816, and 25W mode scored 1020 (+25%).
For 3DPM with AVX-512, the 15W mode scored 7204, and 25W mode scored 9242 (+28%).

SPEC2017 and SPEC2006 Results (15W) System Results (15W)
Comments Locked

261 Comments

View All Comments

  • jospoortvliet - Friday, August 2, 2019 - link

    Sometimes people have insightful additions or questions. That is never you so I wouldn’t miss your ‘input’.
  • Phynaz - Friday, August 2, 2019 - link

    But yet you replied. Doh!
  • Korguz - Friday, August 2, 2019 - link

    and so did you !!! :-)
  • Phynaz - Saturday, August 3, 2019 - link

    Your comprehension skills aren’t that great, are they. Maybe that’s why you can’t afford a good cpu. Did you finish school?
  • Korguz - Saturday, August 3, 2019 - link

    yep.. but you obliviously havent as only children resort to insults, like you do. and again.. grow up
  • POlaris1983 - Thursday, August 1, 2019 - link

    Thermals and TDP are a test for UNdervolting and OCing on THICC laptops using ai windows OS GUI interface apps for easy one button flipping on and off for these CPUs and GPUs and RAM Timings customizations. Even for desktop towers soon using keyboard functions in special keys like on a laptop once they solve the luqid cooling issues on the THICC laptops.
  • thetrashcanisfull - Thursday, August 1, 2019 - link

    Ian,
    In this and the Ryzen 3000 review, I noticed that the 3DPM benchmarks with AVX enabled seem to benefit from AVX-512 much more than I would anticipate.

    If I'm understanding things correctly, the AVX-512 parts are capable of 2x512b FMAC / cycle in the case of Skylake-server or 1x512b FMAC + 1x512b ALU / cycle in the case of Sunny Cove, with both handling 2x512b load + 1x512b store / cycle. This would suggest to me that their vector FP performance/cycle ought to be around double that of Skylake-client or Zen 2, both of which do 2x256b FMAC / cycle and 2x256b loads + 1x256b store / cycle. However, in the 3DPM benchmark we see AVX-512 CPUs outpace the performance/cycle of AVX2 CPUs by a factor of 4 - possibly even more than 4, once we account for the frequency penalties associated with AVX-512!

    Am I misunderstanding some critical piece of the AVX-512 extension that explains this boost, or is there something wrong with the AVX2 codepath for this benchmark? Only using xmm instructions? Not using FMA instructions?
  • Mysticial - Friday, August 2, 2019 - link

    A while back, Ian sent me the non-vectorized and AVX512-vectorized binaries for 3DPM for me to analyze. (I never looked at the AVX2 version since this was before it was made.)

    Based on what I saw, I'm not at all surprised by the result. While I can't say that it fully explains such a large difference between AVX2 and AVX512, there are at least two things I noticed in the AVX512 binary that would contribute towards it.

    1. There are 64-bit integer multiplies. AVX512 has the vpmullq instruction. AVX2 does not. Emulating this instruction in AVX2 is *extremely* costly.
    2. The ratio of "heavy" to "light" AVX512 instructions is very low. Therefore, the 2nd FMA isn't needed to gain on AVX2.

    I've never analyzed the AVX2 binary itself to see how that 64-bit multiply is being handled. It could be vectorized with extreme overhead, not vectorized at all, or worked-around at an algorithmic level.
  • thetrashcanisfull - Friday, August 2, 2019 - link

    ohhhh... That makes more sense. I assumed that the 3DPM benchmark was doing primarily floating point math. I also didn't realize that AVX2 didn't support packed 64b muls... Thanks for the info!
  • Alexvrb - Friday, August 2, 2019 - link

    "The suggested PL2 for Kaby Lake-R was 44W, so this might indicate a small jump in strategy."

    Yeah, whereby TDP is virtually meaningless and every machine is a complete mystery box until you buy it and discover what actual thermals/power/performance are like - again regardless of the TDP. This is all without overclocking, mind you.

Log in

Don't have an account? Sign up now