Civilization V

Our final game, Civilization V, gives us an interesting look at things that RTSes cannot match, with a much weaker focus on shading in the game world, and a much greater focus on creating the geometry needed to bring such a world to life. In doing so it uses a slew of DirectX 11 technologies, including tessellation for said geometry, driver command lists for reducing CPU overhead, and compute shaders for on-the-fly texture decompression.


Because Civilization V uses driver command lists (DCLs), we were originally not going to include it in this benchmark suite as a gaming benchmark. If it were solely a DCL test it would do a good job of highlighting the fact that AMD doesn’t currently support the feature, but a poor job of actually explaining any hardware/architectural differences. It was only after we saw AMD’s reviewer’s guide that we decided to go ahead and include it, because quite frankly we didn’t believe the numbers AMD had published.

When we compared the GTX 580 and the 6970, the 6970 routinely lost by large margins. We had long assumed this was solely due to NVIDIA’s inclusion of DCLs, as we’ve seen a moderate single-GPU performance deficit and equally moderate multi-GPU lead for AMD melt away when NVIDIA added DCL support. The 7970 required that we rethink this.

If Civilization V were solely a DCL test, then our 2560 results would be impossible – the 7970 is winning by 12% in a game NVIDIA previously won by a massive margin. NVIDIA only regains its lead at 1680, a resolution at which we’re not nearly as likely to be GPU-bound.

So what changed? AMD has yet to spill the beans, but short of a secret DCL implementation for just CivV we have to look elsewhere. Next to DCLs, CivV’s other killer feature is its use of compute shaders, and GCN is a compute architecture. To that end, we believe at this point that while AMD is still facing some kind of DCL bottleneck, they have completely opened the floodgates on whatever compute shader bottleneck was standing in their way before. This is particularly evident when comparing the 7970 to the 6970, where the 7970 enjoys a consistent 62% performance advantage. It’s simply an incredible turnabout to see the 7970 do so well when the 6970 did so poorly.

Of course, if this performance boost really was all about compute shaders, it raises a particularly exciting question: just how much higher could AMD go if they had DCLs? Hopefully one day that’s a question we get to answer.


292 Comments


  • Iketh - Thursday, December 22, 2011 - link

    As mentioned several times in the article and in the comments, time was an issue. You can rest assured that follow-up articles are in the works.
  • Ryan Smith - Thursday, December 22, 2011 - link

    Indeed it is.
  • Malih - Thursday, December 22, 2011 - link

    dude, awesome in-depth (emphasizing on depth) review, thank you very much for the excellent work Ryan.
  • Esbornia - Thursday, December 22, 2011 - link

    After reading a half ass misinforming review full of errors and typos, I think you didn't read it to say something like that.
  • Iketh - Thursday, December 22, 2011 - link

    It is full of typos, but that has nothing to do with in-depth. It was certainly in-depth and a joy to read despite the typos.

    I'd like to know what you believe is misinformation though.
  • SlyNine - Thursday, December 22, 2011 - link

    He probably couldn't understand alot of it and though they were all typo's.
  • WhoBeDaPlaya - Thursday, December 22, 2011 - link

    Sod off you wanker. Go and read Walmart reviews for this cart - they're probably more at your level ;)
  • Marburg U - Thursday, December 22, 2011 - link

    Does Eyefinity Technology 2.0 allow me to launch an application within Windows ON WHICH MONITOR I WANT?
  • NikosD - Thursday, December 22, 2011 - link

    It seems that nobody noticed but where is FP64 = 1/2 FP32 performance that AMD said back in June when they first introduced CGN architecture ?

    I copy from Ryan's June article:

    "One thing that we do know is that FP64 performance has been radically improved: the GCN architecture is capable of FP64 performance up to ½ its FP32 performance. For home users this isn’t going to make a significant impact right away, but it’s going to help AMD get into professional markets where such precision is necessary."

    The truth is that FP64 is 1/4 of FP32 eventually!

    Big Loss in GPGPU community even if 7970 is capable of 3.79Tflops of FP32 compared to 2.7Tflops of 6970
  • R3MF - Thursday, December 22, 2011 - link

    it says 1/2 in the architecture article, but 1/4 in the consumer product review, is this AMD taking a leaf from Nvidia's (shitty) book of using drivers to disable features in non-professional (price-tag) products?
