One Card, or Two?

Our first look at the 7950 GX2 will be a direct comparison to its closest SLI relative: the 7900 GT SLI. The purpose of this investigation is to attempt to answer some questions about the how differences in this single-card/multi-GPU implementation affect performance relative to the two card approach. The 7950 GX2 employs faster core and slower memory clock speeds than the 7900 GT, but these differences should produce fairly consistent performance deltas. Each GPU on the 7950 GX2 has twice as much RAM as the 7900 GT cards, but in past investigations we haven't seen memory size make any difference at resolutions below 2048x1536. The attribute we are really interested in is the performance differences created by the onboard PCIe switch.

From our side by side comparison, we can only attribute a maximum of 7% performance gain to the increase core clock of the 7950 GX2 over the 7900 GT SLI. At the same time, with a 10% higher memory clock on the 7900 GT SLI, we should see better performance in memory bandwidth limited situations on the 7900 GT SLI. Under games and settings with a balanced compute and memory load, these differences should come out in the wash.

There are quite a few tests in which both the 7950 GX2 and the 7900 GT SLI configurations are CPU limited or perform very similarly, but at higher resolutions we do see some differentiation. Everything gets magnified at high resolution: shaders must be run on more pixels and memory is hit harder. We could see that performance is fairly similar at CPU limited resolutions, but as we push the limit up above 3 megapixels and enable AA, we do see an advantage in favor of the 7950 GX2. For example, Quake 4 at 2048x1536 with 4xAA shows an absolutely gigantic 32% performance advantage over 7900 GT SLI. This isn't the norm, but even BF2 indicates a 7950 GX2 advantage of 13% (which is more than simple clock speed advantage can account for). Even if we haven't seen it before, memory size could be contributing to this advantage, but it also seems likely that the onboard PCIe switch could be reducing the latency involved in sending the frame data from one GPU to another.

In order to test this theory, we went back and retested the 7950 GX2 with muti-GPU mode disabled. Thus, we are able to bring you a comparison of the performance scaling between 7900 GT and SLI and 7950 GX2 in single and multi-GPU mode. This should give us a better idea as to whether the performance advantage of the GX2 is due to memory size or the PCIe switch.

Battlefield 2 Scaling


Half-Life 2 Episode One Scaling


Black & White 2 Scaling


F.E.A.R. Scaling


Splinter Cell: Chaos Theory Scaling


Quake 4 Scaling


From the data we collected, it looks like the 7900 GT scales better at low resolutions in most cases. It doesn't look like there is a significant scaling advantage for the 7950 GX2 in any game but Quake 4 at 2048x1536 with 4xAA. At this point, we would say that Quake 4 appears to require more than 256MB of RAM when running HQ settings at 2048x1536 with 4xAA, resulting in the huge performance increase with 7950 GX2 over 7900 GT. In some cases, scaling does make a difference in where the performance falls between the 7950 GX2 and the 7900 GT SLI solutions, but it does look like the majority of the performance differences between the two solutions is due to clock speeds and other features which are constant between single and multi-GPU arrangements.

At the end of the day, regardless of how these two cards scale, the 7950 GX2 is consistently faster than a stock 7900 GT SLI setup. Even if there aren't any clear benefits in terms of efficiency on the 7950 GX2 platform, there aren't any drawbacks either. Let's take a closer look at performance.

The Card and The Test Battlefield 2 Performance
Comments Locked

60 Comments

View All Comments

  • JarredWalton - Monday, June 5, 2006 - link

    Yes, SLI profiles are used for full utilization of the GX2 card. (AFAIK - Derek can correct me if I'm wrong.)
  • DerekWilson - Monday, June 5, 2006 - link

    SLI profiles are used if availalbe, but SLI profiles are never required to enable multi-GPU support on NVIDIA hardware.

    there are some advanced options for enabling multi-GPU or single-GPU rendering in the control panel -- even down to the AFR or SFR mode type (and SLIAA modes as a fallback if nothing else will work for you).

    in short -- required: no, used: yes.
  • araczynski - Monday, June 5, 2006 - link

    haven't read the article yet as I didn't see reference to Oblivion benchmarks, and lets be honest, that's the only game out these days that's worth benchmarking (in terms of actually giving the high end cards an actual workout).
  • DigitalFreak - Monday, June 5, 2006 - link

    It's amazing all the cool stuff you can do with PCI Express.
  • Sniderhouse - Monday, June 5, 2006 - link

    quote:

    Not since Quantum3D introduced the Obsidian X24 have we seen such a beast (which, interestingly enough, did actual Scan Line Interleaving on a single card).


    The Voodoo5 5500 had two GPUs on a single card which did true SLI, not to mention the Voodoo5 6000 which had four GPUs, but never really made it to market.
  • shabby - Tuesday, June 6, 2006 - link

    The x24 was also a dual pcb video card, thats what he meant. Not dual chip or whatever.
  • timmiser - Monday, June 5, 2006 - link

    Exactly what I was thinking!
  • DerekWilson - Monday, June 5, 2006 - link

    Perhaps I should have said successful products ... or products that were availble in any real quantity :-)
  • photoguy99 - Monday, June 5, 2006 - link

    From page 1, what limitations are being referred to?

    quote:

    At lower resolutions, CPU overhead becomes a factor, and some limitations of DX9 come into play
  • Ryan Smith - Monday, June 5, 2006 - link

    DX9 itself has a good deal of overhead in some situations, something Microsoft is changing for DX10. We'll have more on that in our upcomming Vista article later this week.

Log in

Don't have an account? Sign up now