Power Consumption: Big Improvements to Video Playback

It was teased earlier in the review, but it makes sense at this stage to talk about power consumption.

With a system as complex as a modern APU or SoC, the initial plans for this review involved getting a development system with the right shunts and hooks to measure the core and graphics power separately in a thermally unconstrained environment for both Kaveri and Carrizo, but unfortunately the parts didn’t come together at the time they were needed. Instead we had access to a Watts Up PRO, a power outlet based monitor with some recording capabilities. While the hardware was not ideal for what we wanted to test, it provided a large chunk of interesting data.

We did a number of tests with data monitoring enabled on both the HP Elitebooks. When AMD released Carrizo, a lot of fuss was made about video playback for several reasons. Firstly, Carrizo implements an adjusted playback pathway for data so instead of moving data from the decoder to the GPU to the display controller, it moves data directly from decoder to display, saving power in the process.

AMD also listed the video playback power of Carrizo (as compared to Kaveri) as significantly reduced. In the example above in the top right, the Kaveri APU is consuming nearly 5W, whereas Carrizo will consume only 1.9W for 1080p content.

The other video playback optimization in Carrizo is the Unified Video Decoder. The bandwidth and capability of the UVD is increased four-fold, allowing the system to ‘sleep’ between completed frames, saving power.

Video Playback, 1080p30 h264

For the first test, we took a 1920x1080 resolution h264 video at 30 FPS (specifically Big Buck Bunny) and recorded the power consumption for playback.

The difference here is striking. The Carrizo system in this instance has sustained power consumption lower than that of the Kaveri system. Overall the Kaveri system draws 11W over idle to play back our test video while the Carrizo system only draws 6.8W over idle for the same task. Put another way, the load power cost at the wall for  watching 1080p video is about 4W lower on Carrizo as compared to Kaveri, which is close to what AMD claimed in the first slide above (and note we’re measuring at the wall, so chances are there are other chipset optimizations being done under the hood).

Video Playback, 2160p30 h264

The same video but in 4K format was also tested on both systems. It is at this point I should say that the Kaveri system was unable to play the 4K video properly (Kaveri doesn't officially support 4K decoding to begin with), and would only show about 20% of the frames. Audio was also affected.

In this case power consumption is above that of the 1080p video, and both systems require around 11 watts from idle to sustained performance. The added benefit with Carrizo though is that you can actually watch the video.

Other Power Benchmarks

We also ran power tests on a set of our regular benchmarks to see the results.

Three-Dimensional Particle Movement

In our 3DPM test, we typically script up a batch of six runs and take the average score. For this we did it to the single thread and multithreaded environments.

In single threaded mode, two interesting things occurred. First, as we expected, the Carrizo system can idle lower than the Kaveri. Second is that the Carrizo system actually goes into a higher power state at load by almost 4W. This means that the delta (Load to Idle) is 8W higher for Carrizo than Kaveri.

It is easy to take away from this that Carrizo, as an APU, uses more power. But that is not what is happening. Carrizo, unlike Kaveri, integrates the chipset onto the same die as the APU (better integration, saves power), but it also means that it is essentially shut off at idle. Part of Carrizo’s optimizations is power management, so the ability to shut something down and fire it back up again gives a larger low-to-high delta automatically. Essentially, more things are turning on. The fact that the Carrizo power numbers are higher than Kaveri during the benchmark is correlated by the performance, despite Kaveri having the higher TDP.

For the multithreaded test, both systems settle to similar power consumptions as the single threaded test, although the Carrizo system has a much more varied power profile, which also finishes the benchmark earlier than the Kaveri.

Octane and Kraken

For the web tests, we expect them to be partially threaded but because they probe a number of real-world and synthetic tests, there should be some power variation.

Octane is actually relatively flat, instigating similar power profiles to both. Again, it looks like that Carrizo expends more energy to do the same amount of work, however it is easy to forget that the Carrizo idle power state is lower due to optimizations.

With Kraken we also get a flat profile, although one could argue that we’re seeing a classic case of running quick and finishing the benchmark sooner vs. a more sedate path.

WebXPRT

This graph was shown earlier in the review, but let’s look at it again, as it is a good example of a bursty workload:

With average power numbers only a few watts above the idle numbers, both systems do a good job on overall power though again it is easy to think that the larger delta of the Carrizo numbers means that the APU is consuming more power. This is where if you try and calculate the actual energy consumed for each system, you get stupid numbers: 1208.7 joules for the Kaveri and 1932.8 joules for the Carrizo. Without starting from the same platform (or without taking numbers direct from the cores), there are obviously other things at play (such as Carrizo’s capability to control more power planes).

WinRAR

Our final power test is WinRAR, which is characterized as a variable threaded load involving lots of little compressible web files and a handful of uncompressible videos.

In this instance I was surprised to see both systems perform similarly. The HP Elitebook G2 actually has the upper hand here, as it is equipped with dual channel memory. WinRAR is a very memory bandwidth affected benchmark, so the G2 has an upper hand in performance but will also balance between drawing more power for two modules or running in a more efficient mode if there is sufficient data at the CPU.

How Hot is too Hot? Temperatures and Thermal Results Negative Feedback Loops: How To Escape the Pit
Comments Locked

175 Comments

View All Comments

  • basicmath - Tuesday, February 9, 2016 - link

    No it's really not, this laptop came from the factory with dual channel capability but that capability was not utilised because that would have shown the platform in a much better light, he even states that he checked the chips in the G2 to confirm that it was single channel. Upgrading the RAM on a laptop is a simple process that any end user can perform. The only discernible difference between the APU in the G2 & G3 is the number of GPU cores so why did he even bother testing the G3 without using dual channel configuration?
  • Intel999 - Sunday, February 7, 2016 - link

    @Ian

    I look forward to that R-series test as it will provide a sneak peek at how much DDR4 relieves the bottleneck on integrated graphics when Bristol Ridge comes out.

    That $70 Athlon X4 845 is intriguing as well.
  • AS118 - Saturday, February 6, 2016 - link

    Good article (although to be fair, I mostly skimmed it), and I agree with the conclusions. AMD should try harder to make sure their high-end products are paired with good components. Single-channel ram, bad screens, and slow hard drives with an A10 or mobile FX defeats the purpose of having those higher end APU's.

    Plus, people will get a bad impression of AMD if a lot of them have poor trackpads, etc. I wish they'd make their own "signature" brand of laptops, and find someone to help make them a thing.
  • TheinsanegamerN - Thursday, February 11, 2016 - link

    Both clevo and MSI have treated AMD well before, Im sure either would love to have exclusive rights to the high end AMD notebook.

    That being said, I doubt AMD has the intelligence to pull it off. They seem to be run by monkeys 90% of the time.
  • Cryio - Saturday, February 6, 2016 - link

    I'm really sorry for AMD. Kaveri and Carizzo on mobile, when configured to the proper ram, cooling and when using the highest performing part ... would've provided awesome performance, compared to Haswell and Broadwell. But no one bothered.

    Bristol Ridge will basically be Carizzo with DDR4 support and since it will be even better binned 28 nm CPUs, maybe we'll get even higher frequency out of Excavator. As for GCN 2.0 GPUs ... it will be interesting to see.

    I love my Surface Pro 4, even given the disaster that is Skylake drivers and Windows 10 horrible efficiency compared to W8.1. But MAN. I would've loved a proper Carrizo based Surface Pro/Book.
  • Gadgety - Saturday, February 6, 2016 - link

    A confirmation, with in depth detail. Nice write up.
  • Khenglish - Sunday, February 7, 2016 - link

    I would have really liked to see some dual channel results, or at least pulling a memory stick from the Kaveri and Intel systems to get a fair comparison. AMD says Zen brings a 40% IPC improvement. It'd be great to have a baseline to see if that 40% improvement is enough. In the dual channel intel to single channel AMD comparisons it does not appear to be enough, but we don't know how big of a factor memory was.
  • Jon Irenicus - Sunday, February 7, 2016 - link

    I want to buy an amd part for my next notebook but as was mentioned in the article, oems only choose bargain basement platforms to put the amd inside. The elitebook is the one exception, along with the lenovo if you don't mind the bulk.

    But the elitebooks are super overpriced for what you get. They need to release an hp spectre version of a notebook with a zen apu, a dell xps notebook variant with an apu. Ideally, the models that include a discreet gpu should allow the apu to work in tandem.

    In 2017 dx12 will be in full effect with games, and having two gpus working together by default could give a lot of amd equipped systems a larger edge, especially if the oversized ipc deficits between excavator and intel parts is minimized with zen.

    The future really does rest on Zen, amd needs to laser focus on performance per watt and ipc, and equip the 2017 apus with polaris gpu parts or vega or whatever the first iterations will be called. That has to be the minimum. Put those in nice chassis with solid battery life and that is all they need.
  • Intel999 - Sunday, February 7, 2016 - link

    In DX12 dual graphics will be automatic. Even an Intel Igpu combined with a discrete Nvidia or AMD GPU will "merge" the graphic capabilities in the laptop.

    Theoretically, an AMD APU combined with an AMD GPU might have an advantage as all graphics would be from the same underlying graphic architecture. Time will tell if this bares out.
  • Ryan Smith - Monday, February 8, 2016 - link

    Note that it's only going to be as automatic as the game developer makes it, as devs will be responsible for implementing it. For the moment game devs are going to be the wildcard.

Log in

Don't have an account? Sign up now