Back at a trade show some time ago, I once had a reader ask me what the most important price tier was for our video card reviews. Was it the high-end cards with their groundbreaking performance? Or was it the mainstream cards, which bring existing performance down to cheaper prices?

It was a question I admittedly had to stop and think about for a moment. But in the end, as fun as the high-end cards are – and as fun as window shopping is – my answer to the question was that mainstream cards were more important. It is mainstream cards that more gamers can afford, and judging from trends in monitor sales, mainstream cards that most gamers actually need to drive their monitors.

For the mainstream market then, a traditional NVIDIA top-to-bottom launch is both exciting and trying. Exciting because you get a sneak peek at what’s coming down the pipe in a couple of months, and trying because who doesn’t want a new video card right this second? Ultimately for NVIDIA’s largest market by volume, it means there’s a bit of a wait.

But after that wait, here we are: GP106, the heart of the GeForce GTX 1060. Announced in early July, NVIDIA’s mainstream/enthusiast GPU is part of a quick succession of Pascal releases for the company, who has now released (in various fashions) 4 different Pascal GPUs. Of the Pascal family so far, GP106 is the smallest, and the cheapest. But for NVIDIA’s customers it may very well be the most important, as GTX 1060 significantly raises the performance bar for 1080p gaming.

NVIDIA GPU Specification Comparison
  GTX 1080 GTX 1070 GTX 1060 GTX 960
CUDA Cores 2560 1920 1280 1024
Texture Units 160 120 80 64
ROPs 64 64 48 32
Core Clock 1607MHz 1506MHz 1506MHz 1126MHz
Boost Clock 1733MHz 1683MHz 1709MHz 1178MHz
TFLOPs (FMA) 8.9 TFLOPs 6.5 TFLOPs 4.4 TFLOPs 2.4 TFLOPs
Memory Clock 10Gbps GDDR5X 8Gbps GDDR5 8Gbps GDDR5 7Gbps GDDR5
Memory Bus Width 256-bit 256-bit 192-bit 128-bit
VRAM 8GB 8GB 6GB 2GB
FP64 1/32 1/32 1/32 1/32
TDP 180W 150W 120W 120W
GPU GP104 GP104 GP106 GM204
Transistor Count 7.2B 7.2B 4.4B 2.94B
Manufacturing Process TSMC 16nm TSMC 16nm TSMC 16nm TSMC 28nm
Launch Date 05/27/2016 06/10/2016 07/19/2016 01/22/2015
Launch Price MSRP: $599
Founders $699
MSRP: $379
Founders $449
MSRP: $249
Founders $299
$199

We’ll start things off as always with the specifications. GeForce GTX 1060 ships with a fully enabled GP106 GPU, so right off the bat we’re seeing the full capabilities of NVIDIA’s mainstream GPU. A full GP106 (and consequently GTX 1060) ships with 1280 CUDA cores, which is half the number found on GTX 1080/GP104. Similarly, this means we’re looking at half the texture units, and half of the polymorph geometry engines.

However on the backend of the rendering pipeline, things are a bit more interesting NVIDIA deviates a bit from tradition, and a bit from making GP106 a true halving of GP104. Whereas you’d expect half of a GP104 to ship with 4 ROP/L2/MC partitions – for a total of 32 ROPs and a 128-bit memory bus – instead we get a 6 partition setup, only 2 partitions shy of GP104.

The significance of this is threefold. First, it means that GP106 isn’t strictly half of a GP104. Second, it means that it has a higher ratio of ROP throughput, L2 cache capacity, and memory bandwidth per CUDA core than parts such as GTX 1080. Third, it’s only the second time in recent history where NVIDIA has used a larger backend configuration on a mainstream part – GK106 being the first – which after NVIDIA’s performance gains and overall success with Maxwell, comes as a bit of a surprise.

In any case, what we’re left with is a slightly unbalanced design for GTX 1060/GP106. The GPU’s 10 SMs are divided up into two GPCs, half the configuration of GP104. This means that GP106 can rasterize 32 pixels per clock on the frontend, but the backend ROPs can accept 48 pixels per clock. This kind of imbalance is not a bad thing – the ROPs are responsible for other tasks such as MSAA that eat up more time – but it’s interesting thing to see happen, as it gives us a bit of an idea of what kind of bottlenecks NVIDIA needed to balance out behind the scenes in the development process.

As it stands, I suspect that the larger backend is a compromise solution for coming up with more memory bandwidth versus GTX 960/GM206. Relative to their Maxwell predecessors, all of the other Pascal parts so far have received significant bandwidth increases in order to feed their faster GPUs. For GTX 1080 this was GDDR5X, and for GTX 1070 this was faster 8 Gbps GDDR5 combined with a unified 256-bit memory bus. For GP106 to follow this path, NVIDIA would need to either implement GDDR5X (a more costly and volume-limited technology at this time), or go with a wider memory bus (via a wider overall backend), and the company has seemingly opted for the latter.

Meanwhile even with 6 ROP partitions, GP106 ends up being a relatively small chip. At 200mm2 on the dot, it’s 12% smaller than the 227mm2 GM206 that precedes it, and even smaller than the 214mm2 GK106 before that. This makes GP106 NVIDIA’s smallest mainstream (-06) GPU to date, something that no doubt is an important factor in keeping the GPU economical on TSMC’s cutting edge (read: more expensive) 16nm FinFET process.

Moving on to clockspeeds, the GTX 1060 ships with a 1.7GHz boost clock, which is smack-dab in the middle of the boost clocks for the GTX 1080 and GTX 1070. In other words, the card is clocked just as high as its higher-end siblings, and the only difference will be the overall number of processing units within the GPU. On a pure SM basis this puts GTX 1060 at 4.35 TFLOPs, giving the card 67% of the compute and geometry performance of the GTX 1070. Or to compare it to the outgoing GTX 960, we’re looking at an 80% increase in those same metrics.

Geeking out about GP106 aside, let’s talk about the rest of what makes up GeForce GTX 1060. Owing to its non-power-of-two memory bus size, GTX 1060 will be shipping with 6GB of VRAM, which is 2GB less than the GP104 cards and 2-4GB more than the older GTX 960. The card will use the same 8Gbps memory as on GTX 1070, giving the card 192GB/sec of total memory bandwidth, a hefty 71% increase over GTX 960. Combined with the increased memory compression efficiency of Pascal, it goes without saying that GTX 1060 should hopefully not find itself wanting for memory bandwidth.

As for power and thermals, NVIDIA is slotting GTX 1060 in the same TDP envelope as the outgoing GTX 960, which is 120W. This makes the GTX 1060 NVIDIA’s flagship sub-150W card, suitable for installation in systems that only offer a single 6-pin power connector or can’t ventilate GTX 1070+ levels of heat.

Overall NVIDIA’s TDPs have fluctuated a bit over the years, especially in the hyper-competitive mainstream market. After reaching a high of 170W with GTX 760, NVIDIA invested most of their Maxwell savings in lowering the TDP for GTX 960, and this seems to have worked out well for them. Still, I think it will be interesting to see if what follows GTX 1060 will stay below 150W, as we’re at the start of what’s likely a very long cycle on TSMC’s 16nm process.

What does 120W get you then? In short, it gets you just shy of GTX 980 performance combined with Pascal’s feature set for 45W (27%) less power consumption. Or to frame this in reference of the other Pascal cards, we’re looking at about 73% of GTX 1070’s performance. Consequently this card rides a sometimes fine line between being a 1080p card and a 1440p card, but in practice it’s going to be best suited as a no-compromises 1080p card, especially as newer games hit the market.

As for where it fits into the market, as with the GTX 1080 and GTX 1070, the situation is a bit more complicated due to the presence of reference cards and how they’re priced and sold. NVIDIA has extended the Founders Edition line to the GTX 1060, offering a reference card here as well. However the Founders Edition card is once again positioned as a premium card, so while its otherwise stock performance is a good proxy for the GTX 1060’s baseline performance, its price and positioning is not. Equally important, unlike the higher-end cards, the GTX 1060 Founders Edition isn’t even a retail card in the traditional sense, but rather is only sold by NVIDIA through their web store.

GeForce GTX 1060 Configurations
  Base Founders Edition
Core Clock 1506MHz 1506MHz
Boost Clock 1709MHz 1709MHz
Memory Clock 8Gbps GDDR5 8Gbps GDDR5
Cooler Manufacturer Custom
(Typical: 1 or 2 Fan Open Air)
NVIDIA Reference
(Blower)
Price Starting at $249 $299

With the GTX 1060 then, the supply of cards is almost entirely partner focused. While the reference card exists, the only card sold through traditional retailers/etailers will be their semi and fully custom designs. For partners this is a good thing, as it means they can more readily differentiate themselves from one another and need not directly compete with the reference design. At the same time for buyers this means we’re looking at a wider variety of designs, ranging from basic MSRP designs to high-end designs like the custom card we’re also having a look at today, the ASUS ROG Strix GTX 1060 OC.

Pricing then is equally variable. The base MSRP for GTX 1060 starts at $249, with the NVIDIA Founders Edition card retailing for $299. Various factory overclocked then run the gamut from $259 to $314 for the aforementioned ASUS card. The end result is that there are a lot of options on the market, both with the benefit of variety and the drawback of potentially having to do some hunting to get a specific card when it’s in stock.

At least at MSRP then, the GTX 1060 is consistently priced with NVIDIA’s other Pascal cards, which is to say higher than the 900 series. Overall, GTX 1000 series pricing is closer to that of the 700 series, with the GTX 1060 launching at the same $249 price point the GTX 760 launched at back in 2013. As a result NVIDIA is going to miss the $199 mainstream sweet spot that the GTX 960 launched at, at least for the time being.

As for the competition, at MSRP GTX 1060 is clearly a shot across the bow at AMD’s Radeon RX 480 8GB, not quite overlapping it in price but otherwise going after the same target market with very aggressive performance and power targets. The difference is going to be that while AMD offers a $199 option with the 4GB RX 480 (albeit in limited numbers), NVIDIA will be aiming higher entirely, so there’s not a perfect overlap between the two. That said, NVIDIA and its partners will still be aiming to make the GTX 1060 the better value in the long run.

The bigger issue for NV and partners right now is the supply issue. NVIDIA has struggled to keep supply in check with demand for the entire GTX 1000 series. GTX 1060 has not proven any different in that regard. Inventory trackers show that cards come into stock regularly, but they also go just as quickly. The one thing that has gone better for buyers – or at least those who can get their hands on a card – is that prices haven’t gone over MSRP like on the other GeForce cards. GTX 1060 cards still hit the market at $249. Traditional economic theory means that having prices hold to MSRP is probably worsening the supply shortage – so there is a tradeoff to be had – but it also means that NVIDIA and its partners (and their retailers) aren’t gouging consumers.

Summer 2016 GPU Pricing Comparison
AMD Price NVIDIA
  $699 GeForce GTX 1080
Radeon R9 Nano $459  
  $439 GeForce GTX 1070
Radeon R9 390X $329  
Radeon R9 390 $299  
  $249 GeForce GTX 1060
Radeon RX 480 (8GB) $239  
Radeon RX 480 (4GB) $199  
Meet the GeForce GTX 1060 Founders Edition
Comments Locked

189 Comments

View All Comments

  • anandreader106 - Friday, August 5, 2016 - link

    First thought: Still no Doom benchmarks being factored in?

    Ryan,

    You are my favorite GPU reviewer. Period. However I do think I need clarity on your Final Words.

    It's my opinion that DirectX 11 performance is "good enough" from Nvidia and AMD thus far in this new generation. So I'm left wondering, why aren't you going more in-depth with DirectX 12 and Vulcan titles/performance? Wouldn't that give us the best indication of what to expect going forward?
  • cknobman - Friday, August 5, 2016 - link

    The best indication you will get is that when reviewing Nvidia cards none of these things will be addressed?

    Why, because Nvidia is not doing so hot at them and it would not make their cards look better than AMD's.

    Look at the other 1060 benchmarks and comparisons and you will see that:
    A. Nvidia is behind on dx12 and the 480 => 1060
    B. @1080p the 1060 is overkill and a $200 480 4gb (or even a $180 470) is all you need
    C. Because of Nvidia's "founders edition" price gouge model most 3rd parties are trying to get away with charging more than $250. Reality is most 1060's are >= $270 which makes the AMD 480 the better buy.
  • StrangerGuy - Friday, August 5, 2016 - link

    It's funny the AMD fanboys always harp about the evil $300 1060 and never mentions how their favorite $200 480 is essentially vaporware and 8GB versions price gouged to death.
  • Ryan Smith - Friday, August 5, 2016 - link

    "It's my opinion that DirectX 11 performance is "good enough" from Nvidia and AMD thus far in this new generation. So I'm left wondering, why aren't you going more in-depth with DirectX 12 and Vulcan titles/performance? Wouldn't that give us the best indication of what to expect going forward?"

    The benchmark suite only gets updated periodically. It's a lot of effort to design and validate a testing sequence, and then run (and possibly re-run) 30 some-odd cards through it. So adding games has the net effect of slowing things down even further.

    At this point we're updating the testbed to Broadwell next month, at which point we'll refresh the games list as necessary.

    Though I will note that there's a reason we run so many (9) games: one game is too small of a sample size. Right now Doom is the only Vulkan game on the market,* so while it's a very interesting first look at Vulkan, it's not something that's going to be representative of Vulkan as a whole.

    * We'll ignore DOTA 2 since it's not meaningfully GPU limited on these fast cards
  • CHADBOGA - Friday, August 5, 2016 - link

    Doom is one of those few games out there that will inspire people to go one way or the other and should be included in your benchmark suite.
  • Scali - Saturday, August 6, 2016 - link

    Aside from that, the Vulkan implementation in DOOM is not yet complete.
    As you can read in the DOOM FAQ, they use AMD shader intrinsics extensions, but no equivalent for nVidia. Likewise, on AMD hardware, async compute is enabled, on nVidia it is not yet. The FAQ says they're still working on optimizing the code with nVidia.

    While it may be interesting to benchmark DOOM's Vulkan implementation to get an idea of where we currently stand, I don't think it is mature enough at this point to say anything about performance in Vulkan games in general, or how AMD and nVidia stack up, since you're comparing apples to oranges at this point.
  • rhysiam - Friday, August 5, 2016 - link

    I too am curious as to why the whole DX11 vs 12 comparison wasn't even raised. DX12 does not appear once in the conclusion page. The 1060 is the better DX11 card, no question. It's early days for DX12, but what we're seeing so far is enough to suggest things may well be quite different. The three DX12 titles in the review (Hitman, RoTR & AoS) are the three strongest games for the 480 by far. Add Doom via Vulkan into the mix and you have 4 NextGen API titles that put the 480 at or above 1060 performance. Of course we can't make hard and fast recommendations based on a few titles like this, but surely it's worth mentioning at least, if not exploring in detail?

    This might be a minor point except for the fact that you dismiss the 4GB 480 based on speculation/extrapolation that its VRAM won't be enough to keep it competitive future demanding titles. Surely those demanding titles will increasingly be (or at least offer) DX12 though? So if you're advocating a 1060 over the 480 4GB based on longevity and future performance, the DX12 question has to be raised doesn't it?
  • rj030485 - Friday, August 5, 2016 - link

    Think Ryan needs to work on his math. He says the 1060 is 17% faster than the 480 in GTA V when the difference more like 30%.
  • Ryan Smith - Friday, August 5, 2016 - link

    Oh geeze. This is what happens when you read the wrong column in a spreadsheet. Thanks!
  • onemoar@gmail.com - Friday, August 5, 2016 - link

    I don't know why anands witcher 3 scores are so low
    I am pushing 80FPS in places with everything turned up to ultra and post effects on with no hairworks

Log in

Don't have an account? Sign up now