Titan’s Compute Performance, Cont

With Rahul having covered the basis of Titan’s strong compute performance, let’s shift gears a bit and take a look at real world usage.

On top of Rahul’s work with Titan, as part of our 2013 GPU benchmark suite we put together a larger number of compute benchmarks to try to cover real world usage, including the old standards of gaming usage (Civilization V) and ray tracing (LuxMark), along with several new tests. Unfortunately that got cut short when we discovered that OpenCL support is currently broken in the press drivers, which prevents us from using several of our tests. We still have our CUDA and DirectCompute benchmarks to look at, but a full look at Titan’s compute performance on our 2013 GPU benchmark suite will have to wait for another day.

For their part, NVIDIA of course already has OpenCL working on GK110 with Tesla. The issue is that somewhere between that and bringing up GK110 for Titan by integrating it into NVIDIA’s mainline GeForce drivers – specifically the new R314 branch – OpenCL support was broken. As a result we expect this will be fixed in short order, but it’s not something NVIDIA checked for ahead of the press launch of Titan, and it’s not something they could fix in time for today’s article.

Unfortunately this means that comparisons with Tahiti will be few and far between for now. Most significant cross-platform compute programs are OpenCL based rather than DirectCompute, so short of games and a couple other cases such as Ian’s C++ AMP benchmark, we don’t have too many cross-platform benchmarks to look at. With that out of the way, let’s dive into our condensed collection of compute benchmarks.

We’ll once more start with our DirectCompute game example, Civilization V, which uses DirectCompute to decompress textures on the fly. Civ V includes a sub-benchmark that exclusively tests the speed of their texture decompression algorithm by repeatedly decompressing the textures required for one of the game’s leader scenes.  While DirectCompute is used in many games, this is one of the only games with a benchmark that can isolate the use of DirectCompute and its resulting performance.

Note that for 2013 we have changed the benchmark a bit, moving from using a single leader to using all of the leaders. As a result the reported numbers are higher, but they’re also not going to be comparable with this benchmark’s use from our 2012 datasets.

With Civilization V having launched in 2010, graphics cards have become significantly more powerful since then, far outpacing growth in the CPUs that feed them. As a result we’ve rather quickly drifted from being GPU bottlenecked to being CPU bottlenecked, as we see both in our Civ V game benchmarks and our DirectCompute benchmarks. For high-end GPUs the performance difference is rather minor; the gap between GTX 680 and Titan for example is 45fps, or just less than 10%. Still, it’s at least enough to get Titan past the 7970GE in this case.

Our second test is one of our new tests, utilizing Elcomsoft’s Advanced Office Password Recovery utility to take a look at GPU password generation. AOPR has separate CUDA and OpenCL kernels for NVIDIA and AMD cards respectively, which means it doesn’t follow the same code path on all GPUs but it is using an optimal path for each GPU it can handle. Unfortunately we’re having trouble getting it to recognize AMD 7900 series cards in this build, so we only have CUDA cards for the time being.

Password generation and other forms of brute force crypto is an area  where the GTX 680 is particularly weak, thanks to the various compute aspects that have been stripped out in the name of efficiency. As a result it ends up below even the GTX 580 in these benchmarks, never mind AMD’s GCN cards. But with Titan/GK110 offering NVIDIA’s full compute performance, it rips through this task. In fact it more than doubles performance from both the GTX 680 and the GTX 580, indicating that the huge performance gains we’re seeing are coming from not just the additional function units, but from architectural optimizations and new instructions that improve overall efficiency and reduce the number of cycles needed to complete work on a password.

Altogether at 33K passwords/second Titan is not just faster than GTX 680, but it’s faster than GTX 690 and GTX 680 SLI, making this a test where one big GPU (and its full compute performance) is better than two smaller GPUs. It will be interesting to see where the 7970 GHz Edition and other Tahiti cards place in this test once we can get them up and running.

Our final test in our abbreviated compute benchmark suite is our very own Dr. Ian Cutress’s SystemCompute benchmark, which is a collection of several different fundamental compute algorithms. Rahul went into greater detail on this back in his look at Titan’s compute performance, but I wanted to go over it again quickly with the full lineup of cards we’ve tested.

Surprisingly, for all of its performance gains relative to GTX 680, Titan still falls notably behind the 7970GE here. Given Titan’s theoretical performance and the fundamental nature of this test we would have expected it to do better. But without additional cross-platform tests it’s hard to say whether this is something where AMD’s GCN architecture continues to shine over Kepler, or if perhaps it’s a weakness in NVIDIA’s current DirectCompute implementation for GK110. Time will tell on this one, but in the meantime this is the first solid sign that Tahiti may be more of a match for GK110 than it’s typically given credit for.

Titan’s Compute Performance (aka Ph.D Lust) Meet The 2013 GPU Benchmark Suite & The Test
Comments Locked

337 Comments

View All Comments

  • justaviking - Wednesday, February 27, 2013 - link

    On Feb 22, the review closed wtih this teaser:
    "Wrapping things up, on Monday we’ll be taking a look at the final piece of the puzzle"

    Monday was two days ago. Am I impatient? Yes. I am really looking forward to seeing what you have to say about Origin’s tri-SLI full tower Genesis PC.

    Did I miss it somehow?
  • avel - Wednesday, February 27, 2013 - link

    I've been thinking the same thing. While I was waiting I found that Tomshardware has a tri sli titan review up. Maybe Anand will have theirs up today.
  • Ryan Smith - Wednesday, February 27, 2013 - link

    Unfortunately it's going to be a few more days. I'm currently out of commission with the flu, so I haven't been able to finish my work on the Genesis system yet.
  • justaviking - Friday, March 1, 2013 - link

    Oh, sorry to hear about that.
    Get well soon.
  • CiccioB - Monday, March 4, 2013 - link

    It would be nice if you could also address TheJian's post.
    In particular on the reasons for choosing such games instead of those listed and, most of all, if in the future the list of games used will change with at least a part of those more modern ones.

    If you made a choice there must be a reason. It would be nice to let us know which it is. Avoiding giving reasons for your choices is a reason for many to have doubts on impartiality and/or professionalism.

    Thanks in advance
  • CeriseCogburn - Monday, March 4, 2013 - link

    Dream on, you play as many games as the person you have a problem with.

    The site is amd gpu biased out the wazoo, and every blind pig knows it. They failed to get a card from nVidia years ago (a certain G92) and it's been open hatred ever since. Same thing happened to Tom's.

    I'm sure there are other reasons - I've seen some stated - "the confident and arrogant nVidia reps" was one theme.
    The intense "hatred" right now for anyone profitable, especially above and beyond the pined for "take down the giants (Intel and nVidia)" AMD underdog dream of these fantasy activists.

    The desire for the "competitive war" to continue so this site has a reason to exist and do video card reviews, thus the failing piece of crap company AMD must be propped continuously, it is after all fully compliant with "self interest" even if it is, and it is, extremely unethical and completely immoral.

    So don't expect any answers, and there's exactly ZERO chance fair and equitable is the answer.
  • CeriseCogburn - Monday, March 4, 2013 - link

    Don't get me wrong, the site is great, I've been reading it forever, before it was even on the map, and of course people are human and have certain pressures and personal tastes.
    That won't ever change.

    They have many sections, the podcasts are a recent cool addition for some added flavor, and like anything, especially evaluating tow competing entities, perfection is impossible.
  • CiccioB - Monday, March 4, 2013 - link

    I like this site for GPU reviews. I have always found its review better than those done by many other sites.
    They are rich in technical description and give many answers many other sites don't even imagine to question.
    Or ask and answer only by doing a copy & paste from here, and sometimes even without understanding much of what they are C&P.
    The computational tests done here, even in the past years, have not been found anywhere. Others use stupid synthetic benchmark mostly based on OpenCL that require two minutes hack to double their performances or are biased depending on who has sponsored the tests (see AMD and SiSandra Benchmark Suite).

    However I have been thinking that the game choice was always "random".
    Review after review some good games suddenly disappeared to leave space to others that have not real meaning (i.e. games that do 150+ FPS on high end systems are quite ridiculous to bench). Same for very old games recently superseded by new release. And some games never reviewed at all.
    For example, I would like to know games like StarCraft 2, that had big problems with SLI/Crossfire at the time it was published, run now on the latest GPUs with latest drivers. Or games like Arma2 that were unplayable. But I still see Crysis Warhead benches, which is not exacly interesting nor indicative of anything while others already have Crysis 3 benches.
    It would also be a good option to add Physx option when possible. For example, with such a beast like Titan many games have enough room to run Physx at high levels. How does that compare with a SLI solution? Or with no Physx at all? How that impact on these GPUs rather than GK104 or older Fermi?

    But apart these requests, it would really be nice to understand the choice of reviewed games. Because it is well known that games are good or bad on certain architectures more than others, and choosing only most of those that adapt to one or to another with no apparent reason really makes these test quite cheap with respect to others, like for example those done by Techpowerup like it has been addressed before.

    Not answering rally means feeding the doubts. Which for many may change in not being doubts anymore.

    Sorry for my English, it is not my native language
  • clickonflick - Thursday, March 7, 2013 - link

    i agree that the price of this GPU is really high , one could easily assemble a fully mainstream laptop online with dell at this price tag or a desktop, but for gamers, to whom performance is above price. then it is a boon for them

    for more pics check this out

    http://clickonflick/nvidia-geforce-gtx-titan/

    so check the above link for specifications of titan
  • enigz - Thursday, March 7, 2013 - link

    CeriseCogBurn, you shit from your mouth, don't you? I've owned both nvidia and amd cards, I go for performance and I most certainly do not care about spending. It is not about the company. I don't go around slamming the other team online like the bloody ball-less keyboard warrior you are. Do you not realise that that your comments make you look like those "fanboys" which you go around insulting? Go grab a paper towel to clean off all that shit dripping down your chin, then sit down and try to absorb what I've just said while I'll be off to get my Titan. At least AMD and NVIDIA are capable of producing graphics and computing solutions for consumers worldwide while you, Sir, are just capable of being an asshole right here at Anandtech.

Log in

Don't have an account? Sign up now