AMD's Path to Polaris

With the benefit of hindsight, I think in reflection that the 28nm generation started out better for AMD than it ended. The first Graphics Core Next card, Radeon HD 7970, had the advantage of launching more than a quarter before NVIDIA’s competing Kepler cards. And while AMD trailed in power efficiency from the start, at least for a time there they could compete for the top spot in the market with products such as the Radeon HD 7970 GHz Edition, before NVIDIA rolled out their largest Kepler GPUs.

However I think where things really went off of the rails for AMD was mid-cycle, in 2014, when NVIDIA unveiled the Maxwell architecture. Kepler was good, but Maxwell was great; NVIDIA further improved their architectural and energy efficiency (at times immensely so), and this put AMD on the back foot for the rest of the generation. AMD had performant parts from the bottom R7 360 right up to the top Fury X, but they were never in a position to catch Maxwell’s efficiency, a quality that proved to resonate with both reviewers and gamers.

The lessons of the 28nm generation were not lost on AMD. Graphics Core Next was a solid architecture and opened the door to AMD in a number of ways, but the Radeon brand does not exist in a vacuum, and it needs to compete with the more successful NVIDIA. At the same time AMD is nothing if not scrappy, and they can surprise us when we least expect it. But sometimes the only way to learn is the hard way, and for AMD I think the latter half of the 28nm generation was for the Radeon Technologies Group learning the hard way.

So what lessons did AMD learn for Polaris? First and foremost, power efficiency matters. It matters quite a lot in fact. Every vendor – be it AMD, Intel, or NVIDIA – will play up their strongest attributes. But power efficiency caught on with consumers, more so than any other “feature” in the 28nm generation. Though its importance in the desktop market is forum argument fodder to this day, power efficiency and overall performance are two sides of the same coin. There are practical limits for how much power can be dissipated in different card form factors, so the greater the efficiency, the greater the performance at a specific form factor. This aspect is even more important in the notebook space, where GPUs are at the mercy of limited cooling and there is a hard ceiling on heat dissipation.

As a result a significant amount of the work that has gone into Polaris has been into improving power efficiency. To be blunt, AMD has to be able to better compete with NVIDIA here, but AMD’s position is more nuanced than simply beating NVIDIA. AMD largely missed the boat on notebooks in the last generation, and they don’t want to repeat their mistakes. At the same time, starting now with an energy efficient architecture means that when they scale up and scale out with bigger and faster chips, they have a solid base to work from, and ultimately, more chances to achieve better performance.

The other lesson AMD learned for Polaris is that market share matters. This is not an end-user problem – AMD’s market share doesn’t change the performance or value of their cards – but we can’t talk about what led to Polaris without addressing it. AMD’s share of the consumer GPU market is about as low as it ever has been; this translates not only into weaker sales, but it undermines AMD’s position as a whole. Consumers are more likely to buy what’s safe, and OEMs aren’t much different, never mind the psychological aspects of the bandwagon effect.

Consequently, with Polaris AMD made the decision to start with the mainstream market and then work up from there, a significant departure from the traditional top-down GPU rollouts. This means developing chips like Polaris 10 and 11 first, targeting mainstream desktops and laptops, and letting the larger enthusiast class GPUs follow. The potential payoff for AMD here is that this is the opposite of what NVIDIA has done, and that means AMD gets to go after the high volume mainstream market first while NVIDIA builds down. Should everything go according to plan, then this gives AMD the opportunity to grow out their market share, and ultimately shore up their business.

As we dive into Polaris, its abilities, and its performance, it’s these two lessons we’ll see crop up time and time again, as these were some of the guiding lessons in Polaris’s design. AMD has taken the lessons of the 28nm generation to heart and have crafted a plan to move forward with the FinFET generation, charting a different, and hopefully more successful path.

Though with this talk of energy efficiency and mainstream GPUs, let’s be clear here: this isn’t AMD’s small die strategy reborn. AMD has already announced their Vega architecture, which will follow up on the work done by Polaris. Though not explicitly stated by AMD, it has been strongly hinted at that these are the higher performance chips that in past generations we’d see AMD launch with first, offering performance features such as HBM2. AMD will have to live with the fact that for the near future they have no shot at the performance crown – and the halo effect that comes with it – but with any luck, it will put AMD in a better position to strike at the high-end market once Vega’s time does come.

The AMD Radeon RX 480 Preview The Polaris Architecture: In Brief
Comments Locked

449 Comments

View All Comments

  • Yojimbo - Thursday, June 30, 2016 - link

    Why wouldn't you use data for NVIDIA's GPUs to try to determine the GTX 1060's performance rather than use data from AMD's GPUs? The experience from the 700 series and the 900 series implies that, assuming that the 1060 has two GPCs (half that of the 1080) it should be about 20% faster than the 970 in DX 11 games and so about 20% faster than the RX 480. Pascal seems to be doing better in DX 12 than Maxwell, so it may end up being close to 20% faster than RX 480 in DX 12 games, too.
  • dragonsqrrl - Thursday, June 30, 2016 - link

    I'm not using AMDs GPUs to determine the performance of the 1060, I'm using the 1070 and 1080. What I was trying to say in my previous comment was that I've assumed roughly 50% 1080 performance (or around the 970) for the 1060. The RX480 leaks prior to launch suggested 390X-like performance, which led me to believe the 1060 would probably perform a step below it. Apparently the leaks were a bit exaggerated, so I now think the 1060 will be more competitive against the RX480 than I did before.

    I'm actually curious why your estimate is so different. Am I missing something?
  • Yojimbo - Thursday, June 30, 2016 - link


    OK, first let's look at the 900 series. The GTX 980 was released on September 18, 2014 for $549. It has a 2048:128:64 configuration @ 1126 MHz base clock for 4612 SP throughput. The GTX 970 was released on September 18, 2014 for $329. It has a 1664:104:56 core configuration @ 1050 MHz for 3494 SP throughput. The GTX 960 was released January 22, 2015 for $199. It has a 1024:64:32 core configuration @ 1127 MHz base clock for 2308 SP throughput.
    Relative: performance - 980 is 1.32 times 970, price - 980 is 1.67 times 970. performance - 980 is 2 times 960, price - 980 is 2.76 times 960. performance - 970 is 1.51 times 960, price - 970 is 1.65 times 960.

    Now the 10 series. GTX 1080 was just recently released and will presumably be available soon for $599. It has a 2560:160:64 configuration @ 1607 MHz for SP throughput of 8228. GTX 1070 was just recently released and will presumably be available soon for $379. It has a 1920:120:64 configuration @ 1506 MHz for SP throughput of 5783. Now the GTX 1060 is rumored to have a 1280:80:48 configuration. It will probably have a clock very close to the 1080 judging by the 900 series clocks. That would give it an SP throughput of 4114. Relative: performance - 980 is 1.42 times 1070, price - 1080 is 1.58 times 1070. speculative: performance - 1080 is 2 times 1060. performance - 1070 is 1.41 times 1060.

    Now the GTX 1070 has SP throughput that is 25% more than the GTX 980. It performs 20% to 40% faster than the 980 (in DX11 games. More in DX12 games), averaging more than 30% faster. 4114 SP throughput for the GTX 1060 would make it give it 18% more than the GTX 970. It should then average about 25% faster in DX11 games, and so more than 20% faster than the RX 480.

    Now, I know that you were only interested in how I got the performance numbers for the 1060, but I decided to include an argument for pricing as well while I was at it:

    The 1070 has a performance/price ratio of 1.11 wrt to the 1080. The 970 has a performance/price ratio of 1.27 wrt to the 980. The 960 has a performance/price ratio of 1.38 wrt the 980. The 960 has a performance/price ratio of 1.09 wrt the 970. You can see the 1070 is priced a lot closer to the 1080 compared to the 970's price relative to the 980, despite the 970 being closer to the 980 in performance compared with the relative performance of 1070 and 1080. The question is why is this the case? Does it have to do something with the defect density of the 14nm node, or something else? My guess is it has to do with the success of the 970 and the amount of competition from AMD in the space the cards occupy. The 970 was enourmously successful, and NVIDIA wants to push up the average selling price of the replacement card if they can, in order to tap into the prior success of the x70 card. Additionally, when the 980 and 970 were released, AMD had more competition for the 970 than for the 980. Therefore the 980 could be prices relatively higher. Now AMD does not really have competition for both the 1080 and the 1070, allowing both those cards to be priced higher. The 1060, however, faces competition. Therefore I think that the expected pricing of the 1060 would be to remain close to the relative price/performance ratio of the 960 wrt the 980, a card with competition compared with a card without much competition, rather than remain closer to the relative price/performance ratio of the 960 wrt the 970, which were both cards with competition. If we divide the price ratio of the 980 to 960 with the performance ratio of the 980 to 960 we get 2.76/2 = 1.38. This represents a conversion factor that will convert relative performance to relative price, under the assumption that the relative price performance ratio of the 980 to 960 also holds for the 1080 to 1060. Since I speculated that the 1080 will have 2 times the performance of the 1060, the 1080 would then cost 2.76 times the 1060 under these assumptions. Since the 1080 costs $599, the 1060 would be expected to cost about $217.

    $217 obviously leaves quite a bit of wiggle room for upward pricing pressure of the 1060, such as it falling closer in line with the 1070 price for whaetever reason, and still be well below the $300 that many seem to be claiming. But the point is that a $220 GTX 1060 performing 25% faster than the GTX 970 is well within the range of reasonable expectations given the recent historical data of NVIDIA's cards. If anything the GTX 1080 has even less competition than the GTX 980 had, suggesting the converstion factor might actually be greater (But I doubt it. The 1080 can't be found for the $599 at the moment and part of the reason for that is that the 1080 doesn't have any competition. That larger conversion factor is factored into the actual real world prices but probably not the MSRPs.) So the RX 480 seems to exert no extra pricing pressure on the GTX 1060 than AMD's offerings exerted on the GTX 960 when the GTX 960 was released.
  • dragonsqrrl - Thursday, June 30, 2016 - link

    "Relative: performance - 980 is 1.42 times 1070, price - 1080 is 1.58 times 1070. speculative: performance - 1080 is 2 times 1060. performance - 1070 is 1.41 times 1060."

    There's something wrong here. If the 1060 is roughly equal to the 980, and the 1080 is 2x the performance of the 1060, the 1080 would also have to be 2x the performance of the 980, which it isn't. I'm not exactly sure where the ratios or logic went wrong, but there's clearly an inconsistency there. The 1080 is about 1.65x the performance of the 980, and about 1.95x the performance of the 970. I'm not using theoretical SP performance, I'm basing this primarily off of real world DX11 performance at 1440p. This is why I assumed it would perform closer to the 970, because it's roughly 50% the performance of the 1080.
  • Yojimbo - Thursday, June 30, 2016 - link

    " If the 1060 is roughly equal to the 980, and the 1080 is 2x the performance of the 1060, the 1080 would also have to be 2x the performance of the 980, which it isn't."

    There's pretty obviously a typo there. The organization of the information should lead you to know it's a typo. It should read: ""Relative: performance - 1080 is 1.42 times 1070, price - 1080 is 1.58 times 1070. speculative: performance - 1080 is 2 times 1060. performance - 1070 is 1.41 times 1060." Does that clear things up?

    " The 1080 is about 1.65x the performance of the 980, and about 1.95x the performance of the 970."

    Does that contradict my information? If it does, then show how. It doesn't seem relevant to me, because you don't directly argue against the cross-generational comparison I did make. I established the relative performance of the 10 series to the 900 series by comparing the 1070 to the 980. The 1070 performs on average 30% faster than the 980 in real world games. The relative performance of the cards within their architecture is closely related to their theoretical performance.

    "I'm not using theoretical SP performance"

    Without considering theoretical performance there's no way whatsoever you can guess the performance of the 1060 because at this point theoretical performance of the 1060 is all we have information for.
  • sonicmerlin - Friday, July 1, 2016 - link

    Yojimbo almost certainly has Aspergers. And yet you read everything he wrote. Jesus
  • Yojimbo - Friday, July 1, 2016 - link

    Why wouldn't you want to read something that's right? Jesus
  • crimson117 - Wednesday, June 29, 2016 - link

    Go away, troll. Don't just post shit like that without backing it up. When can we get downvote buttons on AT comments?

    "Wrapping things up then, today’s launch of the Radeon RX 480 leaves AMD is in a good position. They have the mainstream market to themselves, and RX 480 is a strong showing for their new Polaris architecture. AMD will have to fend off NVIDIA at some point, but for now they can sit back and enjoy another successful launch."
  • atlantico - Wednesday, June 29, 2016 - link

    He's not wrong, for $200-240 the best GPU on the market is AMD RX480. For "backing that up" check the benchmarks in the article.
  • crimson117 - Wednesday, June 29, 2016 - link

    I was referring to the OP's comment "What a massive F-Up by AMD"

Log in

Don't have an account? Sign up now