Driving the Retina Display: A Performance Discussion

As I mentioned earlier, there are quality implications of choosing the higher-than-best resolution options in OS X. At 1680 x 1050 and 1920 x 1200 the screen is drawn off-screen with 4x the number of pixels (3360 x 2100 and 3840 x 2400, respectively), elements are scaled appropriately, and the result is downscaled to 2880 x 1800. The quality impact is negligible, however, especially if you actually need the added real estate. As you’d expect, there is also a performance penalty.
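To make the arithmetic behind the scaled modes concrete, here is a minimal sketch of the pixel counts involved. It assumes every logical point is drawn as a 2 x 2 block of pixels before downscaling, and that the default "best" mode is 1440 x 900; the function names are hypothetical and only illustrate the math, not how OS X implements it.

```python
# Native panel resolution of the Retina MacBook Pro (from the article).
NATIVE = (2880, 1800)


def backing_store(logical_w, logical_h, scale=2):
    """Return the assumed off-screen render size for a scaled mode.

    Each logical point is assumed to become a scale x scale pixel block,
    which is where the "4x the number of pixels" figure comes from.
    """
    return logical_w * scale, logical_h * scale


def megapixels(width, height):
    """Pixel count expressed in millions of pixels."""
    return width * height / 1e6


# 1440 x 900 is assumed to be the default mode; the other two are the
# scaled modes discussed in the text.
for logical in [(1440, 900), (1680, 1050), (1920, 1200)]:
    w, h = backing_store(*logical)
    ratio = (w * h) / (NATIVE[0] * NATIVE[1])
    print(f"{logical[0]} x {logical[1]} -> rendered at {w} x {h} "
          f"({megapixels(w, h):.2f} MP, {ratio:.2f}x the panel's pixels)")
```

Under these assumptions the 1920 x 1200 setting works out to roughly 9.2MP per frame, about 1.78x what the panel itself can show, before the downscale to 2880 x 1800 even happens.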

At the default setting, either Intel’s HD 4000 or NVIDIA’s GeForce GT 650M already has to render and display far more pixels than either GPU was ever intended to handle. At the 1680 and 1920 settings, however, the GPUs are doing more work than even their high-end desktop counterparts are used to. In writing this article it finally dawned on me exactly what has been happening at Intel over the past few years.

Steve Jobs set a path to bringing high resolution displays to all of Apple’s products, likely beginning several years ago. There was a period of time when Apple kept hiring ex-ATI/AMD Graphics CTOs, first Bob Drebin and then Raja Koduri (although less public, Apple also hired chief CPU architects from AMD and ARM among other companies - but that’s another story for another time). You typically hire smart GPU guys if you’re building a GPU; the alternative is to hire them if you need to be able to work with existing GPU vendors to deliver the performance necessary to fulfill your dreams of GPU dominance.

In 2007 Intel promised to deliver a 10x improvement in integrated graphics performance by 2010.

In 2009 Apple hired Drebin and Koduri.

In 2010 Intel announced that the curve had shifted. Instead of 10x by 2010 the number was now 25x. Intel’s ramp was accelerated, and it stopped providing updates on just how aggressive it would be in the future. Paul Otellini’s keynote from IDF 2010 gave us all a hint of what’s to come (emphasis mine):

But there has been a fundamental shift since 2007. Great graphics performance is required, but it isn't sufficient anymore. If you look at what users are demanding, they are demanding an increasingly good experience, robust experience, across the spectrum of visual computing. Users care about everything they see on the screen, not just 3D graphics. And so delivering a great visual experience requires media performance of all types: in games, in video playback, in video transcoding, in media editing, in 3D graphics, and in display. And Intel is committed to delivering leadership platforms in visual computing, not just in PCs, but across the continuum.

Otellini’s keynote would set the tone for the next few years of Intel’s evolution as a company. Even after this keynote Intel made a lot of adjustments to its roadmap, heavily influenced by Apple. Mobile SoCs got more aggressive on the graphics front as did their desktop/notebook counterparts.

At each IDF I kept hearing about how Apple was the biggest motivator behind Intel’s move into the GPU space, but I never really understood the connection until now. The driving factor wasn’t just the demands of current applications, but rather a dramatic increase in display resolution across the lineup. It’s why Apple has been at the forefront of GPU adoption in its iDevices, and it’s why Apple has been pushing Intel so very hard on the integrated graphics revolution. If there’s any one OEM we can thank for having a significant impact on Intel’s roadmap, it’s Apple. And it’s just getting started.

Sandy Bridge and Ivy Bridge were both good steps for Intel, but Haswell and Broadwell are the designs that Apple truly wanted. As fond as Apple has been of using discrete GPUs in notebooks, it would rather get rid of them if at all possible. For many SKUs Apple has already done so. Haswell and Broadwell will allow Apple to bring integration to even some of the Pro-level notebooks.

To be quite honest, the hardware in the rMBP isn’t enough to deliver a consistently smooth experience across all applications. At 2880 x 1800 most interactions are smooth, but things like zooming windows or scrolling on certain web pages are clearly sub-30 fps. At the higher scaled resolutions, since the GPU has to render as much as 9.2MP, even UI performance can be sluggish. There’s simply nothing that can be done at this point - Apple is pushing the limits of the hardware we have available today, far beyond what any other OEM has done. Future iterations of the Retina Display MacBook Pro will have faster hardware with embedded DRAM that will help mitigate this problem. But there are other limitations: many elements of screen drawing are still done on the CPU, and because CPUs are largely serial architectures, their ability to scale performance with dramatically higher resolutions is limited.

Some elements of drawing in Safari, for example, aren’t handled by the GPU. Quickly scrolling up and down on the AnandTech home page will peg one of the four IVB cores in the rMBP at 100%.

The GPU has an easy time with its part of the process but the CPU’s workload is borderline too much for a single core to handle. Throw a more complex website at it and things get bad quickly. Facebook combines a lot of compressed images with text - every single image is decompressed on the CPU before being handed off to the GPU. Combine that with other elements that are processed on the CPU and you get a recipe for choppy scrolling.

To quantify exactly what I was seeing I measured frame rate while scrolling as quickly as possible through my Facebook news feed in Safari on the rMBP as well as my 2011 15-inch High Res MacBook Pro. While last year’s MBP delivered anywhere from 46 - 60 fps during this test, the rMBP hovered around 20 fps (18 - 24 fps was the typical range).


Scrolling in Safari on a 2011, High Res MBP - 51 fps


Scrolling in Safari on the rMBP - 21 fps
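
The frame rates above are easier to compare as time spent per frame. The short sketch below converts the two captioned results into milliseconds and compares them with a 60 Hz refresh budget; the 60 Hz figure is an assumption about the display, and the helper name is hypothetical.

```python
def frame_time_ms(fps):
    """Average time spent on each frame, in milliseconds."""
    return 1000.0 / fps


REFRESH_HZ = 60  # assumed display refresh rate
budget_ms = frame_time_ms(REFRESH_HZ)

# Frame rates taken from the two captioned scrolling results.
for label, fps in [("2011 High Res MBP", 51), ("rMBP", 21)]:
    t = frame_time_ms(fps)
    print(f"{label}: {fps} fps = {t:.1f} ms per frame "
          f"({t / budget_ms:.1f}x the {REFRESH_HZ} Hz budget)")
```

At 21 fps each frame takes nearly 48 ms, close to three refresh intervals, which is why the scrolling reads as choppy rather than merely a little slow.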

Remember that at 2880 x 1800 there are simply more pixels to push and more work to be done by both the CPU and the GPU. It’s even worse in those applications that have higher quality assets: the CPU now has to decode images with 4x the pixels of what it’s used to. Future CPUs will take this added workload into account, but it’ll take time to get there.

The good news is Mountain Lion provides some relief. At WWDC Apple mentioned the next version of Safari is ridiculously fast, but it wasn’t specific about why. It turns out that Safari leverages Core Animation in Mountain Lion and is more GPU accelerated as a result. Facebook is still a challenge because of the mixture of CPU decoded images and a standard web page, but the experience is a bit better. Repeating the same test as above I measured anywhere from 20 - 30 fps while scrolling through Facebook on ML’s Safari.

Whereas I would consider the rMBP experience under Lion to be borderline unacceptable, everything is significantly better under Mountain Lion. Don’t expect buttery smoothness across the board; you’re still asking a lot of the CPU and GPU, but it’s a lot better.


471 Comments


  • dannyboy153 - Saturday, June 23, 2012 - link

    The Sony Z is more of a "consumer" laptop than a creative laptop. Here's why:

    1) The 1080p (9x16) LCD is great for watching movies but the loss in 1" vertical height is annoying.

    2) No discrete GPU built into the Laptop.

    3) No high res output. VGA and HDMI doesn't cut it. I have no idea why their dock doesn't have DVI or display port even though it's equipped with a discrete GPU. Their implementation of the dock is admirable, but it's filled with bugs. Read the reviews.

    I'll have to admit the MBP is heavier by almost 2x the weight of the Sony Z. But at ~4.5 lbs, it's not overly heavy. MBP advantage:

    1) None of the disadvantages of 1-3 above.

    2) 15.4" screen is HUGE for me (coming from an X200). Also, it's like the best of both worlds for glossy and matte LCDs; beautiful and vastly reduced glare.

    3) The Sony Z has a quad core but the MBP is more powerful. Notice I didn't mention the weaker Quad core of the Sony Z as one of its disadvantages because I believe it's hard enough for them to even offer such power in their laptop.
  • ananduser - Saturday, June 23, 2012 - link

    The 2010 VaioZ had discrete video, 1080p screen, quad raid SSD option. blu ray, slim profile, etc.
    The 2012 VaioZ does not have discrete built in, only via external dock.

    So...considering what the Z was for 2010, Anand never sang such high praises for it. Why ? Because he's a macuser and couldn't care less about another company's efforts.
  • OCedHrt - Sunday, June 24, 2012 - link

    http://www.anandtech.com/show/5430/sony-vaio-z-wit...

    They did eventually do a review of the 2011/2012 Z, however they're not as tolerant of small faults as they are with apple products.

    I remember back in the day when Anand would wipe the floor about keyboards not having enough pitch. But on the macbook pro retina the reduced pitch is just "different" not terrible.
  • gstrickler - Sunday, June 24, 2012 - link

    The MBP keyboard doesn't have reduced pitch, it's a standard 19mm pitch. It has slightly reduced key travel.
  • OCedHrt - Monday, June 25, 2012 - link

    Sorry I meant key travel. Incorrect use of terms on my part.
  • dannyboy153 - Sunday, June 24, 2012 - link

    For 2010, there were plenty of laptops with 1080p. Name one laptop now with the Apple's display. The Z is a great laptop if you're a consumer of media. But for creators, the Apple is superior.
  • Spunjji - Monday, June 25, 2012 - link

    Name another *13"* laptop in 2010 with 1080p. Go on. We can play this game all day!
  • SanX - Saturday, June 23, 2012 - link

    hdmi can not handle 1080 output?
  • dannyboy153 - Sunday, June 24, 2012 - link

    I don't consider 1080p hi res. At a minimum it has to be at least 1200p in 10x16 format for 24" monitors. For the price of the Sony Z, not being able to do 2560x1600 is a shame.
  • OCedHrt - Monday, June 25, 2012 - link

    Reading on forums there doesn't seem to be any issues with 1920x1200 external output, but 2560x1600 does not work without a hack for reduced refresh rate.
