Compute & Tessellation

Moving on from our look at gaming performance, we have our customary look at compute performance, bundled with a look at theoretical tessellation performance.

Our first compute benchmark comes from Civilization V, which uses DirectCompute to decompress textures on the fly. Civ V includes a sub-benchmark that exclusively tests the speed of their texture decompression algorithm by repeatedly decompressing the textures required for one of the game’s leader scenes.

Unlike our gaming benchmarks, Civ V’s compute performance strongly favors NVIDIA’s Fermi family here, giving the GTX 550 a very strong showing. At 136fps it’s competitive with the much more expensive 6950 and even comes quite close to the GTX 460 768MB. The 6850 and 5770 meanwhile fall well behind.

What’s quite interesting is that with a bit more gas, the Zotac AMP really climbs up the charts. At 150fps it’s tailing the GTX 460 1GB and even AMD’s single-GPU king, the 6970. And ultimately while this benchmark is compute bound, it’s also memory bound at least part of the time, which is why the GTX 550 outperforms the GTS 450 by 21%.

Our second GPU compute benchmark is SmallLuxGPU, the GPU ray tracing branch of the open source LuxRender renderer. While it’s still in beta, SmallLuxGPU recently hit a milestone by implementing a complete ray tracing engine in OpenCL, allowing them to fully offload the process to the GPU. It’s this ray tracing engine we’re testing.

SmallLuxGPU also gives NVIDIA a solid standing here, however not quite as much as with Civ V. The GTX 550 beats the 5770 and even pulls ahead of the 6850 here. The benefit from the memory bandwidth increase is minimal however, leading to a lower 15% gain versus the GTS 450, and a larger 25% gap versus the GTX 460.

Our final compute benchmark is a Folding @ Home benchmark. Given NVIDIA’s focus on compute for Fermi, cards such as the GTX 560 Ti can be particularly interesting for distributed computing enthusiasts, who are usually looking for a compute card first and a gaming card second.

We do not normally classify Folding@Home to be very memory bandwidth sensitive on GPUs, which makes our results here more interesting than we expected. A 17% gain over the GTS 450 is in-line with the core clock increase, but achieving 81% of the GTX 460 is no small feat. With the AMP’s overclock, the card closes to within 10% of the GTX 460.

At the other end of the spectrum from GPU computing performance is GPU tessellation performance, used exclusively for graphical purposes. With Fermi NVIDIA bet heavily on tessellation, and as a result they do very well at very high tessellation factors. With 1 GPCs the GTX 560 Ti can only retire 1 triangle/clock however, which nullifies much of the architectural advantage on paper.

Even with extreme levels of tessellation, the lower geometry throughput of the GTX 550 keeps a lid on performance. It’s consistently around 80% of the performance of the GTX 460, however the hit for using extreme tessellation ultimately is greater here than on NVIDIA cards with more geometry throughput. As NVIDIA likes to note it has quite a bit more geometry throughput than older generation cards, but tying geometry performance to the number of SMs means parity between high and low-end cards is lost.

The DX11 Detail Tessellation sample reinforces those findings. NVIDIA does keep up their solid scaling from medium to high tessellation versus AMD though., including a very surprising score versus the 5870 and 6870 at high tessellation.

Wolfenstein Power, Temperature, & Noise
Comments Locked

79 Comments

View All Comments

  • vedye - Thursday, March 17, 2011 - link

    Thanks for pointing that out!! But the author will not respond to you. Anantech is already proven pro-Nvidia. If I were them, I would ignore ur post as well. Just keep pretending.
  • Demon-Xanth - Tuesday, March 15, 2011 - link

    From the consumer standpoint, why would I get a 550 over a 460? I read through this and can't come up with a single reason.
  • Gami - Tuesday, March 15, 2011 - link

    there's no point in getting it.. they need to eliminate the stock of the 400 series first, from all outlets, so that this thing would actually have a chance to be even thought of being bought.
  • Taft12 - Tuesday, March 15, 2011 - link

    Soon that reason will be "the GTX 460 768MB is not available", but that is not yet true, and indeed there is no reason to buy this card.
  • qwertymac93 - Tuesday, March 15, 2011 - link

    how did they get 1GB of memory with a 192-bit bus? Are you sure its not 768MB?
  • Demon-Xanth - Tuesday, March 15, 2011 - link

    There's a whole page on that. Plus many comments on other pages.
  • z3nny - Tuesday, March 15, 2011 - link

    Yeah, RTFA.
  • Lolimaster - Tuesday, March 15, 2011 - link

    So after near 1.5years HD5770 is the better buy for mid value customers. Right now in many places HD6850 costs less than the 460 1GB and performs better (even more with the new 11.4pre and future (mejolnir "driver" updates)
  • qwertymac93 - Tuesday, March 15, 2011 - link

    never mind, read the next page... And now i wish you guys had an edit function...
  • z3nny - Tuesday, March 15, 2011 - link

    It would have helped if you RTFA first before posting like a fool.

Log in

Don't have an account? Sign up now