Gaming Performance

As I mentioned earlier, under OS X games have to be specifically written to use both GPUs in the new Mac Pro. Under Windows, however, it's just a matter of enabling CrossFire X. I ran the new Mac Pro with dual FirePro D700s through a few games from Ryan's 2014 GPU test suite. The key comparison here is AMD's Radeon R9 280X CF. I've put all of the relevant information about the differences between the GPUs in the table below:

Mac Pro (Late 2013) GPU Comparison

                             AMD Radeon R9 280X    AMD FirePro D700
SPs                          2048                  2048
GPU Clock (base)             850MHz                650MHz
GPU Clock (boost)            1000MHz               850MHz
Single Precision GFLOPS      4096 GFLOPS           3481 GFLOPS
Texture Units                128                   128
ROPs                         32                    32
Transistor Count             4.3 Billion           4.3 Billion
Memory Interface             384-bit GDDR5         384-bit GDDR5
Memory Datarate              6000MHz               5480MHz
Peak GPU Memory Bandwidth    288 GB/s              264 GB/s
GPU Memory                   3GB                   6GB

Depending on thermal conditions the 280X can be as little as 17% faster than the D700 or as much as 30% faster, assuming it's not memory bandwidth limited. In the case of a memory bandwidth limited scenario the gap can shrink to 9%.
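For anyone who wants to see where those percentages come from, here is a rough sketch of the arithmetic using only the figures in the table above. The function names are made up for illustration, and the 2 FLOPs per stream processor per clock (one fused multiply-add) is an assumption, although it is consistent with the GFLOPS numbers in the table.

```python
# Rough sketch of the peak-rate arithmetic behind the table.
# Assumption: each stream processor retires one fused multiply-add
# (2 FLOPs) per clock. All input figures come from the table above.

def peak_gflops(sps: int, clock_mhz: float) -> float:
    """Peak single precision throughput in GFLOPS."""
    return sps * 2 * clock_mhz / 1000.0


def advantage_pct(faster: float, slower: float) -> float:
    """How much larger `faster` is than `slower`, in percent."""
    return (faster / slower - 1.0) * 100.0


R9_280X = {"sps": 2048, "base_mhz": 850, "boost_mhz": 1000, "bw_gbs": 288}
D700 = {"sps": 2048, "base_mhz": 650, "boost_mhz": 850, "bw_gbs": 264}

if __name__ == "__main__":
    # Peak GFLOPS at boost clocks: 4096.0 for the 280X, 3481.6 for the D700.
    print(peak_gflops(R9_280X["sps"], R9_280X["boost_mhz"]))
    print(peak_gflops(D700["sps"], D700["boost_mhz"]))

    # Both GPUs have the same SP count, so the compute gap is just the
    # clock ratio: both at boost (best case for the D700) vs. both at base.
    print(advantage_pct(R9_280X["boost_mhz"], D700["boost_mhz"]))
    print(advantage_pct(R9_280X["base_mhz"], D700["base_mhz"]))

    # Bandwidth-limited case, using the table's peak bandwidth figures.
    print(advantage_pct(R9_280X["bw_gbs"], D700["bw_gbs"]))
```

The three ratios work out to roughly 17.6%, 30.8% and 9.1%, which line up with the 17%, 30% and 9% quoted above. These are theoretical peaks only; real games will land wherever clocks, drivers and CPU limits put them.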

All of the results below are using the latest Radeon WHQL drivers at the time of publication (13-12_win7_win8_64_dd_ccc_whql.exe) running 64-bit Windows 8.1. Keep in mind that the comparison cards are all run on our 2014 GPU testbed, which is a 6-core Ivy Bridge E (i7-4960X) running at 4.2GHz. In other words, the other cards will have a definite CPU performance advantage (20 - 30% depending on the number of active cores).

You'll notice that I didn't run anything at 4K for these tests. Remember, CrossFire at 4K is still broken on everything but the latest GCN 1.1 hardware from AMD.

Battlefield 3 - 2560x1440 - Ultra Quality + 4x MSAA

Battlefield 3 starts out telling the story I expected to see. A pair of 280Xes ends up being 16% faster than the dual FirePro D700 setup in the Mac Pro. You start to get an idea of where the Mac Pro's high-end GPU configuration really lands.

Bioshock Infinite - 2560x1440 - Ultra Quality + DDoF

Bioshock ends up at the extreme end of what we'd expect to see between the 280X and D700. I tossed in a score from Bioshock under OS X, which obviously doesn't have CF working and ends up at less than half of the performance of the D700s under Windows. If you're going to do any heavy 3D gaming, you'll still want to do it under Windows.

Company of Heroes 2 - 2560x1440 - Maximum Quality + Med. AA

Not all games will scale well across multiple GPUs: Company of Heroes 2 is one of them. There's no performance uplift from having two 280Xes and thus the D700 performs like a slower single GPU R9 280X.

Company of Heroes 2 - Min. Frame Rate - 2560x1440 - Maximum Quality + Med. AA

Metro: Last Light - 2560x1440 - High Quality

Metro is the one outlier in our test suite. Although CrossFire is clearly working under Windows, in Metro the D700 behaves as if it weren't. I'm not sure what's going on here, but this does serve as a reminder that relying on multi-GPU setups to increase performance does come with a handful of these weird cases - particularly if you're using non-standard GPU configurations.


267 Comments


  • FunBunny2 - Tuesday, December 31, 2013 - link

    Has everybody forgotten? This is just a Cube with one round corner. I suppose Tim will claim that's been patented too.
  • newrigel - Wednesday, March 1, 2017 - link

    Right.... with a unified core in it he he.... Mac's rule
  • Y0ssar1an22 - Tuesday, December 31, 2013 - link

    Off the Mac Pro topic but how come the 2013 13" rMBP scores significantly lower than the 2012 and various MBAs in the Cinebench 11.5? I'm personally interested as I have one on order :-) It scores better in later tests (so presumably not a typo?) Cinebench caught my eye as the first cross-benchmark in the review.

    Thanks for this review, and looking forward to the rMBPs in depth!

  • iwod - Tuesday, December 31, 2013 - link

    1. What are the likely chances of a Mac that does Desktop Class Gfx card with 2 x8 PCI-E and uses Desktop Haswell instead. Unless i miss anything surely this is a simple change in production line.
    2. SSD speed is slow, for a Peak rate of 2GB/s, it seems Apple firmware or Samsung Controller not capable of filling up the peak bandwidth? So which is likely the cause?
    3. GFx ECC Ram. How much of a problem is it? For Professional market? And why Apple decide to ditch this since the price difference are minor for the price of Mac Pro.
  • dwade123 - Tuesday, December 31, 2013 - link

    Who the **** put a trashcan here!?
  • e375ued - Wednesday, January 1, 2014 - link

    Is there some convenient reason Anand let the Mac Pro off easy by using Prime95 instead of Intel Burn Test or linpack?
  • Ryan Smith - Wednesday, January 1, 2014 - link

    It was my suggestion to try maxing out the Mac Pro, just to see if it would throttle (and if so, by how much). I picked Prime95 because it's good enough; not that there's anything wrong with IBT or Linpack, but all 3 of those are close enough that it shouldn't matter (and P95 is easy to use).
  • jrs77 - Wednesday, January 1, 2014 - link

    Good test that shows that the thermal core design works like a charm, even when applying very heavy and rather unrealistic loads to the system.

    Most people will run these new Mac Pros with only having a scene rendered or a video-filter applied etc and in this case the system is basically dead-silent and street-noise totally drowns the noise of the fan anyways.

    Just a tad too expensive for me tho.
  • Kevin G - Wednesday, January 1, 2014 - link

    The ‘mid range’ config is a far better value on the 2012 model since it is a 12 core model. The $200 savings can be put toward a better GPU.

    With regard to Cinebench, does it use AVX under OS X? I suspect that it does and that is where the majority of the single threaded CPU performance increase comes from. I strongly suspect that the single threaded performance advantage is far narrower in legacy code that doesn’t take advantage of AVX.

    I’m glad the 2012 model was tested with a Radeon 7950. The ability to upgrade GPU’s matters and it’ll keep the 2012 model competitive for awhile. The system will support future video cards that come in from the PC side of things. With UEFI on video cards now, there is little difference between a Mac and PC version. For what it is worth, I have stuck an EVGA GTX 770 into a 2012 Mac Pro without issue and no modification on the video card or OS X drivers. It just works.

    A bit of a random note is that the GPU connector used in the Mac Pro isn’t new to Apple: they used it for the G4 class daughter cards from 15 years back.

    The PLX chip doesn’t have to do any port switching as a single GPU can drive up to 6 surfaces. That would imply the six DP signals from one GPU are routed in pairs to each of the Falcon Ridge controllers for encapsulation.

    One shocking thing is that wall power draw exceeds that of the PSU’s DC rating. That is worrying as the system itself has only a 450W rated power supply. Due to the AC to DC conversion, there is an efficiency factor but the system has to be running close to its DC limit. Performing several file transfers over powered Thunderbolt devices could put the power draw beyond the rated DC limit. I wonder if Apple has implemented throttling based upon raw power consumption of the system as a whole in addition to temperature and power consumption of individual parts. Perhaps testing the system on a 240V AC circuit would alter things here as it is more efficient power delivery?

    One aspect not accounted for is memory expansion. The 2009/2010/2012 Mac Pro’s will work with registered ECC memory which brings their maximum capacity up to 128 GB. Memory bandwidth too is superior in the dual socket 2010/2012 models: six channels of 1333 Mhz memory does have more bandwidth than four channels at 1866 Mhz. Going multi-socket does carry some overhead but still a bit of a disappointment that the theoretical number didn’t improve.
  • Bill Thompson - Wednesday, January 1, 2014 - link

    My guess is the nVidia-based iMac is faster with After Effects and Premiere because of CUDA.

    Davinci Resolve has been updated for OpenCL, but I don't think Octane or Adobe apps have.

    BTW, FCP X 10.1 displays multiple 4K streams in real time without rendering. It's a serious app.
