At the very end of May we saw NVIDIA’s first effort to expand Fermi beyond the $300 space with the GeForce GTX 465, a further cut-down GF100 core priced at launch at $279. Unfortunately for NVIDIA, it wasn’t even a lackluster launch – while GF100 performs quite well with most of its functional units enabled (i.e. GTX 480), disabling additional units isn’t doing the GPU any favors. Furthermore disabling those units does little to temper the chip’s high power draw – something that’s only reasonable on the higher-end cards – resulting in a card that ate a lot of power while losing to AMD’s Radeon HD 5850.

In short, the GTX 465 is a lesson of how you can only cut down GPU so far. NVIDIA went too far, and ended up with a part that had GTX 285 performance and GTX 470 power consumption.

Today NVIDIA is back in the saddle with something entirely new: GF104 and the GTX 460. The second member of the Fermi family is ready for its day in the sun, and in many ways it’s nothing like we expected. Designed from the start as a smaller chip than GF100, GF104 is the basis of the GTX 460 line of products which fix the GTX 465’s ills while delivering the GTX 465’s performance. It’s what the GTX 465 should have been, and it’s priced as low as $199. And as we’ll see, it’s the first NVIDIA card in a long time that we can give a glowing review for.

  GTX 480 GTX 465 GTX 460 1GB GTX 460 768MB GTX 285
Stream Processors 480 352 336 336 240
Texture Address / Filtering 60/60 44/44 56/56 56/56 80 / 80
ROPs 48 32 32 24 32
Core Clock 700MHz 607MHz 675MHz 675MHz 648MHz
Shader Clock 1401MHz 1215MHz 1350MHz 1350MHz 1476MHz
Memory Clock 924MHz (3696MHz data rate) GDDR5 802MHz (3208MHz data rate) GDDR5 900MHz (3.6GHz data rate) GDDR5 900MHz (3.6GHz data rate) GDDR5 1242MHz (2484MHz data rate) GDDR3
Memory Bus Width 384-bit 256-bit 256-bit 192-bit 512-bit
Frame Buffer 1.5GB 1GB 1GB 768MB 1GB
FP64 1/8 FP32 1/8 FP32 1/12 FP32 1/12 FP32 1/12 FP32
Transistor Count 3B 3B 1.95B 1.95B 1.4B
Manufacturing Process TSMC 40nm TSMC 40nm TSMC 40nm TSMC 40nm TSMC 55nm
Price Point $499 $249 $229 $199 N/A

GF104, the heart of the GTX 460 series being launched today, is the first waterfall part of the Fermi family. As we saw with AMD’s Radeon HD 5000 series last year and NVIDIA’s GeForce 9000 series before that, NVIDIA is in the process of taking the base GF100 design and reducing it for the construction of smaller, lower performing GPUs suitable for use in video cards at lower prices for the larger markets.

The final tally for GF104 is 1.95 billion transistors, which occupies a die space slightly more than that of AMD’s Cypress in the 5800 series. To put this in comparison, this is about 200 million fewer transistors than AMD’s Cypress, or 550 million more than NVIDIA’s older GT200 GPU that powered the GeForce GTX 200 series. This makes the GF104 the biggest GPU we’ve seen for the prices NVIDIA is targeting, a sign of the increasing pricing pressure between NVIDIA and AMD.

GF104 like GF100 before it is not initially being shipped in a “full” configuration. The chip has 2 Graphics Processing Clusters (GPCs) containing 4 SMs each, for a total of 8 SMs adding up to 384 CUDA cores. The GeForce GTX 460 will be shipping with 1 of the 8 SMs disabled, leaving it with 336 enabled CUDA cores. NVIDIA tells us that the reason they’re shipping the first GF104 parts with a disabled SM is due to yields – they wouldn’t be able to meet the demand for cards if they only shipped cards with 8 functional SMs. Unlike GF100, outirght poor yields don’t appear to be a huge factor here. Our impression from discussing the issue with NVIDIA is that GF104 is yielding around where it should be for a chip of its size, with NVIDIA choosing to take a hit on selling “full” chips for a higher price in order to sell more chips overall. In any case it gives them some room for expansion in the future should they decide to release a “full” GF104 based product.

Perhaps the most surprising thing about GF104 is that it’s not a simple reduced version of GF100 like what AMD did with the Evergreen series. Instead NVIDIA made some very significant changes to the design of their SMs for GF104, resulting in a waterfall product that’s undoubtedly Fermi but also notably different from GF100. There’s a lot to discuss here, so we’ll get more in to this in a bit.

Moving on to the cards, NVIDIA is launching 2 cards today. At $229 there is the GeForce GTX 460 1GB, the closest thing we’ll see to a “full” GF104 part for the time being. The GTX 460 1GB has 7 of 8 SMs enabled along with all 32 ROPs, with a 256bit memory bus connecting the GPU to 1GB of GDDR5. The core is clocked at 675MHz core, 1350MHz shader, and 900MHz (3.6GHz effective) memory. The TDP for this part is 160W, with an unofficial idle power draw in the 20W-30W range.

The other GeForce GTX 460 being launched today is the GeForce GTX 460 768MB at $199, a slightly further cut-down card. As NVIDIA’s ROPs are closely tied to their memory controllers, the only way to reduce the amount of memory on a card is to disable memory controllers along with the ROPs. As a result the GTX 460 768MB has less memory than the GTX 460 1GB, but also only 24 ROPs connected to a 192bit memory bus. The shaders remain unchanged, giving the GTX 460 768MB the same compute/shading abilities as the GTX 460 1GB, but only 75% of the ROP capability and memory bandwidth. The clocks are unchanged from the GTX 460 1GB: 675MHz core, 1350MHz shader, and 900MHz (3.6GHz effective) memory.

Given these differences, we’re a bit dumbfounded by the naming. With the differences in memory and the differences in the ROP count, the two GTX 460 cards are distinctly different. If NVIDIA changed the clockspeeds in the slightest, we’d have the reincarnation of the GTX 275 and GTX 260. NVIDIA’s position is that the cards are close enough that they should have the same name, but this isn’t something we agree with. One of these cards should have had a different model number – probably the 768MB card with something like the GTX 455. The 1GB card does not eclipse the 768MB card, but this is going to lead to a lot of buyer confusion. The best GTX 460 is not the $199 one.

Today’s launch will be a mixed bag in terms of availability. $199 has long been known to be a critical price point with buyers, which is what makes this card so important for NVIDIA as it allows them to finally tap that market once more. However to get there they’re using their entire initial run of GF104 to build the 768MB versions of the GTX 460. There should be plenty of 768MB cards available for today’s launch, but the bulk of 1GB cards are roughly 2 weeks late (1 or 2 may show up early if the vendor does rush shipping). So what we have is a hard launch for the GTX 460 768MB, but a soft launch for the GTX 460 1GB. We’re not entirely thrilled with this – particularly as we believe the 1GB cards to be the better buy – but if nothing else it’s better than the GTX 480 launch.

Today’s launch will also be resulting in an interesting mix of price points. NVIDIA has lowered the MSRPs on the GTX 470 and GTX 465, while AMD’s prices have been slowly drifting down over the last month too. As a result we end up with roughly the following:

July 2010 Video Card MSRPs
NVIDIA Price AMD
  $700 Radeon HD 5970
$500  
 
$400 Radeon HD 5870
$330  
 
$300 Radeon HD 5850
$250  
$230  
$200 Radeon HD 5830

With these prices AMD and NVIDIA both have themselves comfortably stratified until you drop below $250. AMD doesn’t have anything between the 5850 and 5830, while they have a price gap of $80-$100. Meanwhile the 5830 is priced directly against the GTX 460 768MB. NVIDIA’s pricing will be taking advantage of this gap, while giving the 5830 a run for its money at $200.

GF104: NVIDIA Goes Superscalar
Comments Locked

93 Comments

View All Comments

  • GeorgeH - Monday, July 12, 2010 - link

    I actually think it should be called a 460. People with a reasonable amount of tech knowledge are going to know the difference, and people without it are already trained to think that a bigger RAM number is always better (see the tons of "bargain" cards with 1GB+ of slow DDR.)

    Basically, the performance differential here is already clear from the full product name, and we have one fewer model number muddying a GPU market already overflowing with a ridiculous amount of model names that have identical, nearly identical, or completely misleading performance capabilities relative to other model names.
  • Daeros - Monday, July 12, 2010 - link

    I would agree with you except for the common practice of mfg's doubling up on VRAM. When that happens, there will be 768MB, 1GB, 1.5GB, and 2GB versions, and people will have no idea the 1.5GB will probably be slower than the 1GB version.
  • tigersty1e - Monday, July 12, 2010 - link

    This line made me read it twice.

    "This in turn means the blades of the fan sit at the same height as the lip, blocking direct airflow out the back. With this design the card is still exhausting at least some air out of the rear of the card, but it shouldn’t be as much as a fully-open card such as our custom Asus GTX 460."

    rear and back are the same thing. I think you meant to say the the cover blocks direct airflow out the front.
  • Ryan Smith - Monday, July 12, 2010 - link

    I define the front of the card to be the side with the display ports, so the cover blocks airflow out of the opposite end, the back.
  • fausto412 - Monday, July 12, 2010 - link

    While a very nice card...too little too late...i got a 5870 3 weeks ago. if you have a high end quad core and want to play BFBC2 in the highest settings only a gtx480 or 5870 will do. i chose the cheaper, cooler and more efficient card.
  • kumquatsrus - Monday, July 12, 2010 - link

    or gtx 460 in sli for less?
  • Jamahl - Monday, July 12, 2010 - link

    Yeah look around, not just at Anand's flawed benches.

    TPU shows sli 460's losing to the 5970 by 20% at 1680 and 1920, and a massive 30% at 2560.

    They aren't even close, but keep dreaming anyway.
  • DominionSeraph - Tuesday, July 13, 2010 - link

    The reply was to a 5870, not 5970. Just a matter of $300 difference between the two.
  • DrMrLordX - Monday, July 12, 2010 - link

    On your power consumption + noise charts (at the very least), the Zotac card's results are shown with white text superimposed over a yellow bar on the graph. This makes the white text almost completely unreadable. One can guess at the numbers based on context, but still . . .
  • bobjones32 - Monday, July 12, 2010 - link

    The benchmarks look good at first look, but then I realized I purchased my current 4870 for $150 nearly a year and a half ago.

    Looking more closely at these benchmarks, the GTX 460 beats the 4870 in most tests, but not by that much, and at lower resolutions the 4870 actually wins.

    Shouldn't it be clear-cut that a $200 brand new card from today destroys a $150 card from nearly a year and a half ago?

Log in

Don't have an account? Sign up now