Power Measurements using Trepn

Measuring power draw is an interesting unique capability of Qualcomm's MDPs. Using their Trepn Profiler software and measurement hardware integrated into the MDP, we can measure a number of different power rails on the device, including power draw from each CPU core, the digital core (including video decoder and modem) and a bunch of other measures.

Measuring and keeping track of how different SoCs consumer power is something we've wanted to do for a while, and at least under the Qualcomm MDP umbrella at this point it's possible to measure right on the device.

The original goal was to compare power draw on 45nm MSM8660 versus 28nm MSM8960, however we encountered stability issues with Trepn profiler on the older platform that are still being resolved. Thankfully it is possible to take measures on MSM8960, and for this we turned to a very CPU intensive task that would last long enough to get a good measure, and also load both cores so we can see how things behave. That test is the Moonbat Benchmark, which is a web-worker wraper of the sunspider 0.9 test suite. We fired up a test consisting of 4 workers and 50 runs inside Chrome beta (which is web-worker enabled), and profiled using Trepn.

If you squint at the graph, you can see that one Krait core can use around 750 mW at maximum load. I didn't enable the CPU frequency graph (just to keep things simple above) but is 750 mW number happens right at 1.5 GHz. The green spikes from battery power are when we're drawing more than the available current from USB - this is also why you see devices sometimes discharge even when plugged in. There's an idle period at the end that I also left visible - you can see how quickly Qualcomm's governor suspends the second core completely after our moonbat test finishes running.

Here's another run of moonbat on Chrome Beta where we can see the same behavior, but zoomed in a bit better - each Krait core will consume anywhere between 450 mW and 750 mW depending on the workload, which does change during our run while V8 does its JIT compilation and Chrome dispach things to each CPU.

The next big question is obviously - well how much does GPU contribute to power drain? The red "Digital Core Rail Power" lines above include the Adreno 225 GPU, video decode, and "modem digital" blocks. Cellular is disabled on the MDP MSM8960, and we're not decoding any video, so in the right circumstances we can somewhat isolate out the GPU. To find out, I profiled a run of GLBenchmark Egypt on High settings (which is an entirely GPU compute bound test) and let it run to completion. You can see how the digital rail bounces between 800 mW and 1.2 W while the test is running. Egypt's CPU portions are pretty much single-threaded as well, as shown by the yellow and green lines above.

Another interesting case is what this looks like when browsing the web. I fired up the analyzer and loaded the AnandTech homepage followed by an article, and scrolled the page in the trace above. Chrome and "Browser" on Android now use the GPU for composition and rendering the page, and you can see the red line in the plot spike up when I'm actively panning and translating around on the page. In addition, the second CPU core only really wakes up when either loading the page and parsing HTML.

One thing we unfortunately can't measure is how much power having the baseband lit up on each different air interface (CDMA2000 1x, EV-DO, WCDMA, LTE, etc.) consumes, as the MDP MSM8960 we were sampled doesn't have cellular connectivity enabled. This is something that we understand in theory (at least for the respective WCDMA and LTE radio resource states), but remains to be empirically explored. It's unfortunate that we also can't compare to the MDP MSM8660 quite yet, but that might become possible pretty quickly.

GPU Performance - Adreno 225 Final Thoughts
Comments Locked

86 Comments

View All Comments

  • Denithor - Wednesday, February 22, 2012 - link

    Probably very much like desktop performance here: going from 1 to 2 cores is a huge upgrade, even for single-threaded apps, because it off-loads background chores to the second core and you get a full core dedicated to your running app. Going from there to 3/4/6/8 cores is really only helpful if your apps are truly multi-threaded or you're heavily multitasking.

    Now, the increase in IPC is definitely going to help everything go faster.
  • vision33r - Wednesday, February 22, 2012 - link

    Then, what's the point of Nvidia doing the quad core and even planning for greater than 4 cores.
  • Wishmaster89 - Wednesday, February 22, 2012 - link

    IMHO its only pure marketing. Eventually we'll get to a point where we'll have more cores in our phones than in most desktop workstations, only because it'll sell better. Sad but true.
  • Smart hero - Friday, February 24, 2012 - link



    I think an optimizes performance for Smart phone win over high-end end one ( I believe pod’s need high-end performance )

    Reason
    1.Less game can be played in a small screen .
    2.smart phone is not a cloud computer .
    3.Traditionally , still we use win-32 bit more than 64 bit …are we changing with CPU???????????
  • R1V4L - Tuesday, April 3, 2012 - link

    How can You compare these video cards when the hardware is running different versions of software ?
    Let me tell you that I've tested my Samsung Galaxy S2 running official Android 4.0.3 with Vsync on and on Egypt test I got 60fps !!! - evidently this result is influenced by the framelimit imposed by Samsung drivers.
    So these benchmarks You did are not showing us the truth. Sorry to raise this problem - but I dare You to do these tests again :)))
  • R1V4L - Tuesday, April 3, 2012 - link

    Galaxy S2 - official android 4.0.3 : 45.67 FPS
    Funny, huh ?!
    Redo the tests, please - otherwise we will think this is a commercial crappy presentation for new products - not a professional benchmark.

Log in

Don't have an account? Sign up now