Compute Performance

As always our final set of real-world benchmarks is composed of a look at compute performance. As we have seen with GTX 680 other Kepler cards, Kepler appears to be significantly less balanced between rendering and compute performance than GF110 or GF114/GF116 were, and as a result compute performance suffers.  On the other hand, relative to the GTX 660 the GTX 650 Ti sacrifices a smaller portion of its compute performance than its ROP/L2/memory performance, so this may bode better for computer performance.

Our first compute benchmark comes from Civilization V, which uses DirectCompute to decompress textures on the fly. Civ V includes a sub-benchmark that exclusively tests the speed of their texture decompression algorithm by repeatedly decompressing the textures required for one of the game’s leader scenes. Note that this is a DX11 DirectCompute benchmark.

There really isn’t a lot to say here. The GTX 650 Ti is only slightly ahead of the GTX 550 Ti, never mind the GTX 560. Worse, it’s tied with the 7770 and well behind the 7850. Given the nature of the test, with cards on memory busses this small I believe we’ve run into a proxy test for memory bandwidth rather than compute throughput. Which just goes to show that all of that compute throughput is meaningless without the memory bandwidth and cache to feed the best.

Our next benchmark is SmallLuxGPU, the GPU ray tracing branch of the open source LuxRender renderer. We’re now using a development build from the version 2.0 branch, and we’ve moved on to a more complex scene that hopefully will provide a greater challenge to our GPUs.

Not surprisingly, the GTX 650 Ti loses to just about everything. SmallLuxGPU’s OpenCL renderer just doesn’t mesh well with Kepler and NVIDIA’s drivers. The resulting lead for the 7850 is nothing short of massive.

For our next benchmark we’re looking at AESEncryptDecrypt, an OpenCL AES encryption routine that AES encrypts/decrypts an 8K x 8K pixel square image file. The results of this benchmark are the average time to encrypt the image over a number of iterations of the AES cypher.

Unlike our previous OpenCL benchmark the GTX 650 Ti’s showing isn’t nearly as bad, but neither is it great. Both the GTX 560 and the 7770 are in the lead, but at least the improvement over the GTX 550 Ti is nothing short of amazing. At times NVIDIA’s problem isn’t where GTX 650 Ti is compared to last-generation cards, but rather it is compared to AMD’s strong Radeon HD 7000 series lineup.

Our fourth benchmark is once again looking at compute shader performance, this time through the Fluid simulation sample in the DirectX SDK. This program simulates the motion and interactions of a 16k particle fluid using a compute shader, with a choice of several different algorithms. In this case we’re using an (O)n^2 nearest neighbor method that is optimized by using shared memory to cache data.

Once more the GTX 650 Ti is in trouble. It can beat the GeForce 500 series, but even the 7770 is faster.

Finally, we’ll take a look at one last benchmark to our compute run with the benchmarkable version of the Folding@Home client. Folding@Home and similar initiatives are still one of the most popular consumer compute workloads, so it’s something NVIDIA wants their GPUs to do well at.

Here’s another case where memory bandwidth and L2 cache appear to be a problem. The GTX 650 Ti is much farther behind the GTX 660 than we would have expected, and even the GTX 560 can take a lead here. On the other hand memory bandwidth bottlenecking isn’t so bad that EVGA’s GTX 650 Ti can’t still take the lead over the other factory overclocked cards.

Civilization V Synthetics
Comments Locked

91 Comments

View All Comments

  • chizow - Tuesday, October 9, 2012 - link

    I guess in years past this may and probably should've been branded GTS 650Ti along with the GTS 650 (they've used GTS 250, GTS 450 etc in the past), but I know for a fact Nvidia is trying to establish GTX as its own brand for gamers.

    They really emphasize it with all this "Green Light" business with regard to overclocking, overvolting etc.
  • Blazorthon - Tuesday, October 9, 2012 - link

    Nvidia has been cracking down on overvolting support, so IDK if I'd give them that much credit.
  • Pneumothorax - Tuesday, October 9, 2012 - link

    That will make the price/performance gap even worse as the 7850 is an oc beast.
  • goinginstyle - Tuesday, October 9, 2012 - link

    So why no reviews of the Asus or MSI offerings in these roundups?
  • silverblue - Tuesday, October 9, 2012 - link

    You need to be sent a card in order to review it. ;)
  • Shadowmaster625 - Tuesday, October 9, 2012 - link

    The 7850 1GB is $150. It IS is the direct competitor to the 650 ti. You can speculate all you want about the 1GB being some sort of ephemeral limiting factor, but I dont see it. All I see is that in one game, at BEYOND 1080p resolution, memory becomes a factor. But if you look at it, you can see that it is still so very close on the bell curve. I bet that if you actually tested Skyrim at 1920x1080 (not 1920x1200), there would be much less difference between 2GB and 1GB.

    Even 2 years from now the 7850 1GB is going to be a better performer than the 650ti. Even with the "latest and greatest" games.
  • Shark321 - Tuesday, October 9, 2012 - link

    Where is the 7850 1GB $150?
  • KineticHummus - Tuesday, October 9, 2012 - link

    yeah wish i could fine one of those haha. pretty sure there isnt a single card for 150
  • Marlin1975 - Tuesday, October 9, 2012 - link

    How about $159 AR shipped?

    http://www.newegg.com/Product/Product.aspx?Item=N8...
  • LordanSS - Tuesday, October 9, 2012 - link

    I'm a bit curious about this card's performance with PhysX enabled games.

    Personally, I am a 6970 owner and thus have no access to PhysX effects on games, but there are many workarounds out there where you can add a nVidia card to your machine to compute the physics.

    Borderlands 2 is an example, I know of people buying (cheap) nVidia cards so they can run physics on them. Problem is that, most of the time, these guys are buying really old or cheap hardware (like GT210 or 430 cards), causing a big drop in game performance. Others using more powerful (and expensive) cards, on the other hand, experience good results.

    So I'm thinking what's the cutting point for performance here. I know it's an extremely niche endeavor, but if anyone has experience or thoughts, I'd be glad to hear them.

Log in

Don't have an account? Sign up now