The PCIe Layout

Ask anyone at Apple why they need Ivy Bridge EP vs. a conventional desktop Haswell for the Mac Pro and you’ll get two responses: core count and PCIe lanes. The first one is obvious. Haswell tops out at 4 cores today. Even though each of those cores is faster than what you get with an Ivy Bridge EP, for applications that can spawn more than 4 CPU intensive threads you’re better off taking the IPC/single threaded hit and going with an older architecture that supports more cores. The second point is a connectivity argument.

Here’s what a conventional desktop Haswell platform looks like in terms of PCIe lanes:

You’ve got a total of 16 PCIe 3.0 lanes that branch off the CPU, and then (at most) another 8 PCIe 2.0 lanes hanging off of the Platform Controller Hub (PCH). In a dual-GPU configuration those 16 PCIe 3.0 lanes are typically divided into an 8 + 8 configuration. The 8 remaining lanes are typically more than enough for networking and extra storage controllers.

Ivy Bridge E/EP on the other hand doubles the total number of PCIe lanes compared to Intel’s standard desktop platform:

Here the CPU has a total of 40 PCIe 3.0 lanes. That’s enough for each GPU in a dual-GPU setup to get a full 16 lanes, and to have another 8 left over for high-bandwidth use. The PCH also has another 8 PCIe 2.0 lanes, just like in the conventional desktop case.

I wanted to figure out how these PCIe lanes were used by the Mac Pro, so I set out to map everything out as best as I could without taking apart the system (alas, Apple tends to frown upon that sort of behavior when it comes to review samples). Here’s what I was able to come up with. Let’s start off of the PCH:

Here each Gigabit Ethernet port gets a dedicated PCIe 2.0 x1 lane, the same goes for the 802.11ac controller. All Mac Pros ship with a PCIe x4 SSD, and those four lanes also come off the PCH. That leaves a single PCIe lane unaccounted for in the Mac Pro. Here we really get to see how much of a mess Intel’s workstation chipset lineup is: the C600/X79 PCH doesn’t natively support USB 3.0. That’s right, it’s nearly 2014 and Intel is shipping a flagship platform without USB 3.0 support. The 8th PCIe lane off of the PCH is used by a Fresco Logic USB 3.0 controller. I believe it’s the FL1100, which is a PCIe 2.0 to 4-port USB 3.0 controller. A single PCIe 2.0 lane offers a maximum of 500MB/s of bandwidth in either direction (1GB/s aggregate), which is enough for the real world max transfer rates over USB 3.0. Do keep this limitation in mind if you’re thinking about populating all four USB 3.0 ports with high-speed storage with the intent of building a low-cost Thunderbolt alternative. You’ll be bound by the performance of a single PCIe 2.0 lane.

That takes care of the PCH, now let’s see what happens off of the CPU:

Of the 40 PCIe 3.0 lanes, 32 are already occupied by the two AMD FirePro GPUs. Having a full x16 interface to the GPUs isn’t really necessary for gaming performance, but if you want to treat each GPU as a first class citizen then this is the way to go. That leaves us with 8 PCIe 3.0 lanes left.

The Mac Pro has a total of six Thunderbolt 2 ports, each pair is driven by a single Thunderbolt 2 controller. Each Thunderbolt 2 controller accepts four PCIe 2.0 lanes as an input and delivers that bandwidth to any Thunderbolt devices downstream. If you do the math you’ll see we have a bit of a problem: 3 TB2 controllers x 4 PCIe 2.0 lanes per controller = 12 PCIe 2.0 lanes, but we only have 8 lanes left to allocate in the system.

I assumed there had to be a PCIe switch sharing the 8 PCIe input lanes among the Thunderbolt 2 controllers, but I needed proof. Our Senior GPU Editor, Ryan Smith, did some digging into the Mac Pro’s enumerated PCIe devices and discovered a very familiar vendor id: 10B5, the id used by PLX Technology. PLX is a well known PCIe bridge/switch manufacturer. The part used in the Mac Pro (PEX 8723) is of course not listed on PLX’s website, but it’s pretty close to another one that PLX is presently shipping: the PEX 8724. The 8724 is a 24-lane PCIe 3.0 switch. It can take 4 or 8 PCIe 3.0 lanes as an input and share that bandwidth among up to 16 (20 in the case of a x4 input) downstream PCIe lanes. Normally that would create a bandwidth bottleneck but remember that Thunderbolt 2 is still based on PCIe 2.0. The switch provides roughly 15GB/s of bandwidth to the CPU and 3 x 5GB/s of bandwidth to the Thunderbolt 2 controllers.

Literally any of the 6 Thunderbolt 2 ports on the back of the Mac Pro will give you access to the 8 remaining PCIe 3.0 lanes living off of the CPU. It’s pretty impressive when you think about it, external access to a high-speed interface located on the CPU die itself.

The part I haven’t quite figured out yet is how Apple handles DisplayPort functionality. All six Thunderbolt 2 ports are capable of outputting to a display, which means that there’s either a path from the FirePro to each Thunderbolt 2 controller or the PEX 8723 switch also handles DisplayPort switching. It doesn’t really matter from an end user perspective as you can plug a monitor into any port and have it work, it’s more of me wanting to know how it all works.

Mac Pro vs. Consumer Macs GPU Choices
Comments Locked

267 Comments

View All Comments

  • damianrobertjones - Thursday, January 2, 2014 - link

    "I like the new Mac Pro’s chassis a lot. It’s a risk, but one that absolutely must be taken if the desktop is to continue to exist and thrive."

    Absolute rubbish... Sorry. We simply DO NOT have to change the case. Sure, of course, the option of having a case like this is fantastic but simply changing the case DOES NOT enable this to 'thrive'.
  • AnnonymousCoward - Sunday, January 5, 2014 - link

    Agreed
  • platinumjsi - Thursday, January 2, 2014 - link

    What are you using to monitor the GPU usage? I have been looking for a app for OSX for a while without any joy?
  • hoboville - Thursday, January 2, 2014 - link

    Sigh, lots of fanboyism in the comments, without recognition that this is just a slower, more expensive PC, the only difference is that it can run OSX only programs. Here's some hardware facts:

    This machine is basically a dual-GPU Xeon workstation with 2x 7970 in Crossfire (D700). Nothing special. Ok, so each 7970 has 6 GB of RAM. Well, each 7970 is also underclocked...and the RAM isn't ECC, so if you want one of these workstations for serious GPU compute, you're going to be eating bit errors, and your data is going to be suspect. Real GPGPUs use ECC RAM, period. If ECC doesn't matter, then dual/triple/quad AMD GPUs of any stripe will do you fine. Even better now that R9 290(X) are out, and they have 4 GB of RAM.
    What if I need more local storage than 200 GB? Most raw video is bigger than that. So your files are stored on a NAS, but this machine only has gigabit NICs. If you want to take advantage of RAID throughput for massive files, you'll need 10 Gbit. But this machine can't use 10 Gbit NICs, as there's no place to put them.

    This workstation, then, isn't for serious compute, those who have big files, and it isn't for those who want to use the most powerful GPUs for rendering / modeling. That belongs to Nvidia, there are plenty of benchmarks out there attesting to that fact. You can't get Nvidia on this workstation, so what then? I guess you buy this machine for Mac-specific applications.

    And that's what this machine is for--Mac OS. If you want more power, UNIX/Linux/Windows boxes are where you go (not Apple-restricted Unix either). Are they bigger? Yes. Hotter? Yes. In fashion because small = sexy? Nope. And that's what this comes down to, looks, style, sleekness, and other metrics not relevant to performance. Sure, there's a niche for those who use Mac only software, but what if you want to do more? Apple has convinced people that style and a walled garden of software is more value than function, stop wasting your money and drop OSX!
  • pmhparis - Thursday, January 2, 2014 - link

    Snort, the ignorant NVidia hobo fanboy complains of Mac fanboys...

    Professionals don't store video projects on internal storage, they use DAS devices like Thunderbolt or USB3 disk enclosures.
  • Houston1 - Thursday, January 2, 2014 - link

    Incorrect.
  • Chirpie - Friday, January 3, 2014 - link

    No, it's pretty spot on. Every video environment I've worked in does not keep the project files and assets on the machine. It's a very normal/typical way of doing business with many terabytes worth of files. I'll go one step further though and say that it's not just USB and Thunderbolt but even duplexed gigabit ethernet or optical, or a number of other flavors as well.
  • FunBunny2 - Saturday, January 4, 2014 - link

    Steve was always the best snake oil salesman since Barnum. How Apple can contend that it spends billions and billions of dollars on R&D is baffling. It can't have cost that much to devise a square cornered rectangle, or single cornered Cube. The parts, 99.44% are off the shelf from suppliers.
  • DotFab - Thursday, January 2, 2014 - link

    Many thanks for this impressive review of the MacPro 2013!!
    You treat every point and more I had in mind!
    A huge and fine work, I really feel like I know what's the MP 2013 now.

    Happy new year to AnandTech and to everyone !
  • HisDivineOrder - Thursday, January 2, 2014 - link

    I love that Anand is discussing his well-known Apple addiction and the subsequent fanboyism he engages in. It is good. Admitting he has a problem means he can perhaps one day overcome it.

    One day. Today is not the day.

    How can anyone in their right mind suggest buying such a limited-expandability computer for anything NOT a low-power HTPC? If you pay this much money, you really ought to be able to easily change out the GPU(s).

    When you're so hooked on a company's products you're rubbing them like Gollum rubbing the Ring of Power, I think you've got to stop and take stock.

Log in

Don't have an account? Sign up now