Out of nowhere, NVIDIA has revealed the NVIDIA Titan V today at the 2017 Neural Information Processing Systems conference, with CEO Jen-Hsun Huang showing off the card on stage. A mere 7 months after Volta was announced with the Tesla V100 accelerator and the GV100 GPU inside it, NVIDIA continues its breakneck pace by releasing the GV100-powered Titan V, available for sale today. Aimed at a decidedly more compute-oriented market than ever before, the 815 mm² behemoth die that is GV100 is now available to the broader public.

NVIDIA Compute Accelerator Specification Comparison

|                                    | Titan V               | Tesla V100    | Tesla P100       | Titan Xp         |
|------------------------------------|-----------------------|---------------|------------------|------------------|
| CUDA Cores                         | 5120                  | 5120          | 3584             | 3840             |
| Tensor Cores                       | 640                   | 640           | N/A              | N/A              |
| Core Clock                         | 1200MHz               | ?             | ?                | 1485MHz          |
| Boost Clock                        | 1455MHz               | 1370MHz       | 1300MHz          | 1582MHz          |
| Memory Clock                       | 1.7Gbps HBM2          | 1.75Gbps HBM2 | 1.4Gbps HBM2     | 11.4Gbps GDDR5X  |
| Memory Bus Width                   | 3072-bit              | 4096-bit      | 4096-bit         | 384-bit          |
| Memory Bandwidth                   | 653GB/sec             | 900GB/sec     | 720GB/sec        | 547GB/sec        |
| VRAM                               | 12GB                  | 16GB          | 16GB             | 12GB             |
| L2 Cache                           | 4.5MB                 | 6MB           | 4MB              | 3MB              |
| Single Precision                   | 13.8 TFLOPS           | 14 TFLOPS     | 9.3 TFLOPS       | 12.1 TFLOPS      |
| Double Precision                   | 6.9 TFLOPS (1/2 rate) | (1/2 rate)    | (1/2 rate)       | (1/32 rate)      |
| Tensor Performance (Deep Learning) |                       |               |                  |                  |
| Transistor Count                   | 21.1B                 | 21.1B         | 15.3B            | 12B              |
| TDP                                | 250W                  | 250W          | 250W             | 250W             |
| Form Factor                        | PCIe                  | PCIe          | PCIe             | PCIe             |
| Cooling                            | Active                | Passive       | Passive          | Active           |
| Manufacturing Process              | TSMC 12nm FFN         | TSMC 12nm FFN | TSMC 16nm FinFET | TSMC 16nm FinFET |
| Architecture                       | Volta                 | Volta         | Pascal           | Pascal           |
| Launch Date                        | 12/07/2017            | Q3'17         | Q4'16            | 04/07/2017       |
| Price                              | $2999                 | ~$10000       | ~$6000           | $1299            |

For the spec sheet we've gone ahead and lined it up against NVIDIA's other compute cards, and for good reason. While the Titan series may have started life as a prosumer line in 2013, since then NVIDIA's GPU designs have become increasingly divergent between compute and graphics. And even though the previous Titan Xp was based on the more graphics-focused GP102 GPU, the card itself was primarily (but not solely) pitched as an entry-level compute card, for customers who needed a (relatively) cheap way to do FP32 compute and neural network inferencing in workstations and small clusters.

The Titan V, by extension, sees the Titan lineup finally switch loyalties and start using NVIDIA’s high-end compute-focused GPUs, in this case the Volta-based GV100. The end result is that rather than being NVIDIA’s top prosumer card, the Titan V is decidedly more focused on compute, particularly due to the combination of the price tag and the unique feature set that comes from using the GV100 GPU. Which isn’t to say that you can’t do graphics on the card (this is still very much a video card, outputs and all), but NVIDIA is first and foremost promoting it as a workstation-level AI compute card, and by extension focusing on the GV100 GPU’s unique tensor cores and the massive neural networking performance advantages they offer over earlier NVIDIA cards.

In this sense the Titan V is a return to form of sorts for the Titan family, back to the professional side of prosumer. One of the original claims to fame for the original Titan was its high performance in specialized FP64 compute workloads, something that was lost on the later Titan X and Titan Xp. By switching to NVIDIA’s specialized high-end compute GPUs, the Titan V regains its formerly lost compute capabilities, all the while also gaining all of the compute capabilities NVIDIA has introduced since then. It’s no mistake that Jen-Hsun introduced the card at a neural networking conference, as this is a big chunk of the professional computing audience that NVIDIA is targeting with the card.

Interestingly, comparing it to the PCIe Tesla V100, I’m surprised by just how close the cards are in features and performance. NVIDIA has confirmed that the Titan V gets the GV100 GPU’s full, unrestricted FP64 compute and tensor core performance. To the best of our knowledge (and from what NVIDIA will comment on) it doesn’t appear that they’ve artificially disabled any of the GPU’s core features. What does separate the Titan from the Tesla from a performance standpoint is quite simple: memory capacity, memory bandwidth, and the lack of NVLink functionality. There are also a number of smaller differences that help to differentiate the cards between server and workstation, such as passive versus active cooling and the support policies. Otherwise, for customers who are running a small number of cards, the Titan V’s feature set is remarkably close to the much more expensive Tesla V100’s. That is a very interesting development, since it goes to show just how confident NVIDIA is that this won’t undermine Tesla sales.

Moving on and diving into the numbers, Titan V features 80 streaming multiprocessors (SMs) and 5120 CUDA cores, the same number as its Tesla V100 siblings. The differences come with the memory and ROPs. In what's clearly a salvage part for NVIDIA, one of the card's 4 memory partitions has been cut, leaving Titan V with 12GB of HBM2 attached via a 3072-bit memory bus. Each partition pairs an HBM2 stack with two memory controllers, and as each memory controller is associated with a ROP partition and 768 KB of L2 cache, this in turn brings L2 down from 6 MB to 4.5 MB, as well as cutting down the ROP count.
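As a quick sanity check on the memory figures, the arithmetic can be sketched in a few lines of Python. This is only a back-of-the-envelope sketch: the helper names are made up for illustration, peak bandwidth is assumed to be bus width times per-pin data rate, and L2 is assumed to scale linearly with the width of the active memory bus.

```python
# Back-of-the-envelope check of the memory figures in the spec table.
# Assumption: peak bandwidth (GB/sec) = bus width in bits / 8 * per-pin data rate in Gbps.
# Assumption: L2 capacity scales linearly with the active memory bus width.


def memory_bandwidth_gbs(bus_width_bits: int, data_rate_gbps: float) -> float:
    """Peak memory bandwidth in GB/sec for a given bus width and per-pin data rate."""
    return bus_width_bits / 8 * data_rate_gbps


def scaled_l2_mb(full_l2_mb: float, full_bus_bits: int, active_bus_bits: int) -> float:
    """L2 capacity left after cutting the memory bus down (hypothetical helper)."""
    return full_l2_mb * active_bus_bits / full_bus_bits


# (bus width in bits, data rate in Gbps), taken from the spec table
cards = {
    "Titan V": (3072, 1.7),
    "Tesla V100": (4096, 1.75),
    "Tesla P100": (4096, 1.4),
    "Titan Xp": (384, 11.4),
}

for name, (bus_bits, rate_gbps) in cards.items():
    print(f"{name}: {memory_bandwidth_gbs(bus_bits, rate_gbps):.1f} GB/sec")

# Full GV100 has 6MB of L2 on a 4096-bit bus; Titan V keeps 3072 bits of it.
print(f"Titan V L2: {scaled_l2_mb(6.0, 4096, 3072):.1f} MB")
```

The results land at roughly 653GB/sec for the Titan V and 547GB/sec for the Titan Xp, matching the spec table, while the two Tesla cards come out a hair under their listed 900GB/sec and 720GB/sec, which suggests those are rounded figures.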

In terms of clockspeeds, the HBM2 has been downclocked slightly to 1.7Gbps, while the 1455MHz boost clock actually matches the 300W SXM2 variant of the Tesla V100, though that accelerator is passively cooled. Notably, the number of tensor cores has not been touched, though the official 110 DL TFLOPS rating is lower than that of the 1370MHz PCIe Tesla V100, as it would appear that NVIDIA is using a clockspeed lower than their boost clock in these calculations.
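To put rough numbers on that, the sketch below works backwards from the official ratings. It assumes each Volta tensor core performs 64 fused multiply-adds per clock (128 FLOPs) and each CUDA core one FMA per clock (2 FLOPs); those per-clock figures come from NVIDIA's published Volta architecture material rather than from this announcement, and the function names are purely illustrative.

```python
# Rough peak-throughput arithmetic for GV100-based cards.
# Assumption: 64 FMAs per tensor core per clock, 2 FLOPs per FMA.
# Assumption: 1 FMA per CUDA core per clock for FP32.

TENSOR_FLOPS_PER_CLOCK = 64 * 2
FP32_FLOPS_PER_CLOCK = 2


def peak_tflops(units: int, flops_per_unit_per_clock: int, clock_mhz: float) -> float:
    """Theoretical peak throughput in TFLOPS at a given clock."""
    return units * flops_per_unit_per_clock * clock_mhz * 1e6 / 1e12


def implied_clock_mhz(rated_tflops: float, units: int, flops_per_unit_per_clock: int) -> float:
    """Clock (MHz) that a rated TFLOPS figure corresponds to."""
    return rated_tflops * 1e12 / (units * flops_per_unit_per_clock) / 1e6


# Titan V: 640 tensor cores, 5120 CUDA cores, 1455MHz boost
print(f"Tensor @ boost: {peak_tflops(640, TENSOR_FLOPS_PER_CLOCK, 1455):.1f} TFLOPS")
print(f"Clock implied by 110 TFLOPS: {implied_clock_mhz(110, 640, TENSOR_FLOPS_PER_CLOCK):.0f} MHz")
print(f"FP32 @ boost: {peak_tflops(5120, FP32_FLOPS_PER_CLOCK, 1455):.1f} TFLOPS")
print(f"Clock implied by 13.8 TFLOPS: {implied_clock_mhz(13.8, 5120, FP32_FLOPS_PER_CLOCK):.0f} MHz")

# PCIe Tesla V100: 640 tensor cores, 1370MHz boost
print(f"V100 PCIe tensor @ boost: {peak_tflops(640, TENSOR_FLOPS_PER_CLOCK, 1370):.1f} TFLOPS")
```

Under these assumptions the tensor cores would work out to about 119 TFLOPS at the full 1455MHz boost clock, so the 110 TFLOPS rating implies a clock of around 1343MHz, and the 13.8 TFLOPS FP32 figure points to a similar clock of roughly 1348MHz. The PCIe Tesla V100 at 1370MHz works out to about 112 TFLOPS on the same basis.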

For the card itself, it features a vapor chamber cooler with copper heatsink and 16 power phases, all for the 250W TDP that has become standard with the single GPU Titan models. Output-wise, the Titan V brings 3 DisplayPorts and 1 HDMI connector. And as for card-to-card communication, the PCB itself appears to have NVLink connections on the top, but these look to have been intentionally blocked by the shroud to prevent their use and are presumably disabled.

As mentioned earlier, NVIDIA is unsurprisingly pushing this as a compute accelerator card, especially considering that Titan V features tensor cores and keeps the TITAN branding as opposed to GeForce TITAN. But there are those of us who know better than to assume people won’t drop $3000 to use the latest Titan card for gaming, and while gaming is not the primary (or even secondary) focus of the card, you also won't see NVIDIA denying it. In that sense the Titan V is going to be treated as a jack-of-all-trades card by the company.

To that end, no gaming performance information has been disclosed, but NVIDIA has confirmed that the card uses the standard GeForce driver stack. And on that note, yesterday NVIDIA released driver version 388.59, bringing official Titan V support. Now, how much those drivers have actually been optimized for the GV100 is another matter entirely; Volta is a new architecture, markedly so at times. Speaking solely off the cuff here, for graphics workloads the card has more resources than the Titan Xp in almost every meaningful metric, but it's also a smaller difference on paper than you might think.

As for NVIDIA's intended market of compute and AI users, the Titan V will be supported by NVIDIA GPU Cloud, which includes TensorRT, a number of deep learning frameworks, and HPC-related tools.

If the golden shroud didn’t already suggest so, the Titan V is also carving out a new eye-watering price point, dropping in at $2999 and on sale now at the NVIDIA store. NVIDIA has, to date, been selling Tesla V100 products as fast as they can produce them, so I'm not going to be surprised if the Titan V sees a similar fate. The $3000 price tag is quite high, even by Titan standards, but with the rare Tesla V100 PCIe card going for around $10,000, the Titan V is markedly cheaper. In fact in some respects I'm surprised NVIDIA is selling a GV100 card for so little; these are GV100 salvage parts that don't make the cut for Tesla - so the alternative would be throwing them away - but it just goes to show how confident NVIDIA is that it won't undermine the Tesla family.

At any rate, for NVIDIA professional users who have been looking to dip their toes into Volta but didn't want a full-fledged Tesla card, the Titan V is clearly going to be a popular card. Over the last two years NVIDIA's AI efforts have been firing on all cylinders, and by bringing a GV100 card down to just $3000, expect to see them crack open the market that much further. I dare say the idea of the "prosumer" Titan has died with this card, but for the rapidly growing professional compute market, this looks to be exactly the kind of card that a lot of developers have been waiting for.

Update (12/8/17): Yesterday, NVIDIA also released driver version 388.59 WHQL, bringing product support to the Titan V, along with Fallout 4 VR support. NVIDIA has noted that the Titan V currently suffers from TDR errors and display blanking during Blu-ray disc playback at high resolutions, as well as from G-Sync display blanking when repeatedly switching between different memory overclocks. Lastly, this minor update features one bug fix, resolving flickering on GTX 1080-equipped G-Sync notebooks.




  • A5 - Friday, December 08, 2017

    "Charging more than I want them to" != "price fixing" or "price gouging".

    Words have meaning, and you apparently don't know them.
  • sonny73n - Friday, December 08, 2017

    A5, your argument about meanings of words is just lame. I will not write an essay on every topic so imbecile like you could understand
  • LordSojar - Friday, December 08, 2017

    Unfortunately... A5 is right in this instance. This isn't gouging and is actually a very reasonable price for the PURPOSE of this card. This isn't a gaming-centric card, despite it being absurdly capable in that regard.

    nVidia is doing what any company in their position would do: maximize profits and focus on markets specific to the strengths of their products.
  • mapesdhs - Monday, December 11, 2017

    Indeed, kudos for them for doing so. What a strange world where people think making a profit is evil. Must be the new generation of proto-communists I suppose, the rabid desire for free stuff.

    The free market has a wonderful metric for deciding what succeeds and what doesn't, it's called price. If it's too high, it won't sell. If it's too low, demand will spike and they'll fail to meet the demand, so raising the price slows the demand. This is money 101. Those who don't like this basic notion of trade just want the efforts of others without paying for it. No wonder we have a welfare state that's out of control.

    What's so funny to me is that even $10K is absolutely *nothing* in some pro markets. Some of the systems I used to work with were in the $500K to $1.5M range, and the companies that used them were able to do things that meant their investment paid for itself in just the first day of use. Hobbyists and gamers really have little idea of the true nature of industrial hardware; nothing wrong with that, they don't need to know, but the context of articles like this ought to be enough to convey that there's a deeper big money world beyond the narrow slice of life called gaming.
  • MrSpadge - Friday, December 08, 2017

    It's not "extreme capitalist" to believe in some basic freedom: they are free to offer products and you are free not to buy them, if you don't want to. Do you think that's wrong? Do you also imprison every Porsche or Lamborghini merchant as you drive by, because you think they should offer you their cars for less?
  • tamalero - Friday, December 08, 2017

    Incorrect analogy, because cars like Porshe and Lamborghinis do not have any real function other than "going fast" and "look good". Aka its a luxury unnecessary that can easily be replaced with multiple brands and for different costs according to your needs.
  • bji - Friday, December 08, 2017

    This is a really dumb comment.
  • Drumsticks - Friday, December 08, 2017

    Er, by that definition, pretty much any use of the computer other than directly supporting your societal functions (i.e. if it's for work, in which case you typically aren't paying for it), is unnecessary. Even if you do pay for your own hardware, faster hardware can quickly offset higher prices, and for any work related tasks, this looks to be at a decent price.
  • Jconsumer - Friday, December 08, 2017

    or a 1950's American Capitalist Automobile like they do in Cuba. Viva La Communism!!!
  • peevee - Friday, December 08, 2017

    I bet he does. Where do you think bolshevicks and maoists come from?
