SPECing Denver's Performance

Finally, before diving into our look at Denver in the real world on the Nexus 9, let’s take a look at a few performance considerations.

With so much of Denver’s performance riding on the DCO, starting with the DCO we have a slide from NVIDIA profiling the execution of SPECInt2000 on Denver. In it NVIDIA showcases how much time Denver spends on each type of code execution – native ARM code, the optimizer, and finally optimized code – along with an idea of the IPC they achieve on this benchmark.

What we find is that as expected, it takes a bit of time for Denver’s DCO to kick in and produce optimized native code. At the start of the benchmark execution with little optimized code to work with, Denver initially executes ARM code via its ARM decoder, taking a bit of time to find recurring code. Once it finds that recurring code Denver’s DCO kicks in – taking up CPU time itself – as the DCO begins replacing recurring code segments with optimized, native code.

In this case the amount of CPU time spent on the DCO is never too great of a percentage of time, however NVIDIA’s example has the DCO noticeably running for quite some time before it finally settles down to an imperceptible fraction of time. Initially a much larger fraction of the time is spent executing ARM code on Denver due to the time it takes for the optimizer to find recurring code and optimize it. Similarly, another spike in ARM code is found roughly mid-run, when Denver encounters new code segments that it needs to execute as ARM code before optimizing it and replacing it with native code.

Meanwhile there’s a clear hit to IPC whenever Denver is executing ARM code, with Denver’s IPC dropping below 1.0 whenever it’s executing large amounts of such code. This in a nutshell is why Denver’s DCO is so important and why Denver needs recurring code, as it’s going to achieve its best results with code it can optimize and then frequently re-use those results.

Also of note though, Denver’s IPC per slice of time never gets above 2.0, even with full optimization and significant code recurrence in effect. The specific IPC of any program is going to depend on the nature of the code, but this serves as a good example of the fact that even with a bag full of tricks in the DCO, Denver is not going to sustain anything near its theoretical maximum IPC of 7. Individual VLIW instructions may hit 7, but over any period of time if a lack of ILP in the code itself doesn’t become the bottleneck, then other issues such as VLIW density limits, cache flushes, and unavoidable memory stalls will. The important question is ultimately whether Denver’s IPC is enough of an improvement over Cortex A15/A57 to justify both the power consumption costs and the die space costs of its very wide design.

NVIDIA's example also neatly highlights the fact that due to Denver’s favoritism for code reuse, it is in a position to do very well in certain types of benchmarks. CPU benchmarks in particular are known for their extended runs of similar code to let the CPU settle and get a better sustained measurement of CPU performance, all of which plays into Denver’s hands. Which is not to say that it can’t also do well in real-world code, but in these specific situations Denver is well set to be a benchmark behemoth.

To that end, we have also run our standard copy of SPECInt2000 to profile Denver’s performance.

SPECint2000 - Estimated Scores
  K1-32 (A15) K1-64 (Denver) % Advantage
164.gzip
869
1269
46%
175.vpr
909
1312
44%
176.gcc
1617
1884
17%
181.mcf
1304
1746
34%
186.crafty
1030
1470
43%
197.parser
909
1192
31%
252.eon
1940
2342
20%
253.perlbmk
1395
1818
30%
254.gap
1486
1844
24%
255.vortex
1535
2567
67%
256.bzip2
1119
1468
31%
300.twolf
1339
1785
33%

Given Denver’s obvious affinity for benchmarks such as SPEC we won’t dwell on the results too much here. But the results do show that Denver is a very strong CPU under SPEC, and by extension under conditions where it can take advantage of significant code reuse. Similarly, because these benchmarks aren’t heavily threaded, they’re all the happier with any improvements in single-threaded performance that Denver can offer.

Coming from the K1-32 and its Cortex-A15 CPU to K1-64 and its Denver CPU, the actual gains are unsurprisingly dependent on the benchmark. The worst case scenario of 176.gcc still has Denver ahead by 17%, meanwhile the best case scenario of 255.vortex finds that Denver bests A15 by 67%, coming closer than one would expect towards doubling A15's performance entirely. The best case scenario is of course unlikely to occur in real code, though I’m not sure the same can be said for the worst case scenario. At the same time we find that there aren’t any performance regressions, which is a good start for Denver.

If nothing else it's clear that Denver is a benchmark monster. Now let's see what it can do in the real world.

The Secret of Denver: Binary Translation & Code Optimization CPU Performance
Comments Locked

169 Comments

View All Comments

  • rpmrush - Wednesday, February 4, 2015 - link

    I find the 4:3 aspect ratio a turn off. Why change now. There are zero apps natively designed for this in the Android ecosystem. Why would a developer make a change for one device? It just seems like more fragmentation for no reason. I'm picking up a Shield Tab soon.
  • kenansadhu - Wednesday, February 4, 2015 - link

    One example to drive my point: I bought kingdom rush and found out that on my widescreen tablet, the game won't fit the screen properly. If any, this will fit apps previously designed for ipads well. Hate to admit it, but apple has such a huge lead in the tablet market it's just reasonable for developers to focus on them first.
  • melgross - Wednesday, February 4, 2015 - link

    Well, there are almost no tablet apps at all for Android. One reason is because of the aspect ratio being the same for phones and tablets. Why bother writing g a tablet app when the phone app can stretch to fit the screen exactly? Yes, they're a waste of time, but hey, it doesn't cost developers anything either.

    Maybe goi g to the much more useful 4:3 ratio for tablets will force new, real tablet apps.

    It's one reason why there are so many real iPad apps out there.
  • retrospooty - Wednesday, February 4, 2015 - link

    You sound like you are stuck in 2012. Update your arguments ...
  • UtilityMax - Sunday, February 8, 2015 - link

    There will be more tablets coming with 4:3 screen. Samsung's next flagship tablet will be 4:3. As much as I like watching movies on a wide screen, I think it's not the killer tablet application for most users, and most people will benefit from having a more balanced 4:3 screen. It works better for web browsing, ebooks, and productivity apps.
  • Impulses - Wednesday, February 4, 2015 - link

    Most simpler apps just scale fine one way or the other... I think 4:3 makes a ton of sense for larger tablets, it remains almost exactly as tall in landscape mode (which a lot of people seem to favor, and I find bizarre) and more manageable in portrait since it's shorter.

    7-8" & 16:9 is still my personal preference, since I mostly use it for reading in portrait. Try to think outside of your personal bubble tho... I bought the Nexus 9 for my mother who prefers a larger tablet, never watches movies on it, yet almost always uses it in landscape.

    It'll be perfect for her, shoot, it even matches the aspect ratio of her mirrorless camera so photos can be viewed full screen, bonus.
  • UtilityMax - Sunday, February 8, 2015 - link

    I personally think about the reverse. Big tablets with 9-11 screens are often bought for media consumption. Because of that, it makes sense for them to come with a wide screen. For me, having wide screen for watching movies on the flights and in the gym was one of the prime reasons to buy a Samsung Galaxy Tab S 10.5, even though its benchmarks look only so so.

    However, a 9 to 11 inch tablet is too bulky to hold in one hand and type with another. It almost begs for a stand. So for casual use, like casual web browsing or ebook reading, a smaller tablet with a 4:3 screen works better. And so I went ahead and got a tablet with 4:3 screen for that purpose.
  • Impulses - Monday, February 9, 2015 - link

    Valid points, obviously usage cases can differ a lot, that's the nice thing about Android tho... It doesn't have to conform to any one aspect ratio that won't fit everyone's taste.
  • LordConrad - Wednesday, February 4, 2015 - link

    I love the 4:3 aspect ratio. I primarily use tablets in Portrait Mode, and have always disliked the "tall and thin" Portrait Mode of traditional android tablets. This is the main area where Google has always fallen behind Apple, IMHO. This is the main reason I gave my Nexus 7 (2013) to my nephew and bought a Nexus 9, and I have no regrets.
  • R. Hunt - Thursday, February 5, 2015 - link

    Agreed. I understand that YMMV and all that, but to me, large widescreen tablets are simply unusable in portrait. I'd love to have the choice of a 3:2 Android tablet though.

Log in

Don't have an account? Sign up now