The New PowerTune: Adding Further States

In 2010 AMD introduced their PowerTune technology alongside their Cayman GPU. PowerTune was a new, advanced method of managing GPU voltages and clockspeeds, with the goal of offering better control over power consumption at all times so that AMD could be more aggressive with their clockspeeds. PowerTune’s primary task was to reign in on programs like FurMark – power viruses as AMD calls them – so that these programs would not push a card past its thermal/electrical limits. Consequently, with PowerTune in place AMD would not need to set their maximum GPU clocks as conservatively merely to handle the power virus scenario.

This technology was brought forward for the entire Southern Islands family of GPUs, and remained virtually unchanged. PowerTune as implemented on SI cards without Boost had 3 states – idle, intermediate (low-3D), and high (full-3D). When for whatever reason PowerTune needed to clamp down on power usage to stay within the designated limits, it could either jump states or merely turn down the clockspeed, depending on how far over the limit the card was trying to go. In practice state jumps were rare – it’s a big gap between high and intermediate – so for non-boost cards it would merely turn down the GPU clockspeed until power consumption was where it needed to be.

Modulating clockspeeds in such a manner is a relatively easy thing to implement, but it’s not without its drawbacks. That drawback being that semiconductor power consumption scales at a far greater rate with voltage than it does with clockspeed. So although turning down clockspeeds does reduce power consumption, it doesn’t do so by a large degree. If you want big power savings, you need to turn down the voltage too.

Starting with 7790 and Bonaire, this is exactly what AMD is doing. Gone is pure clockspeed modulation – inferred states in AMD’s nomenclature – and instead AMD is moving to using a larger number of full states. GCN 1.1 has 8 states altogether, with no inferred states between them. With this change, when PowerTune needs to reduce clockspeeds it can drop to a nearby state, reducing power consumption through both clockspeed and voltage reductions at the same time.

With this change state jumping will also be a far more frequent occurrence. The lack of intermediate states and the lack of granularity (8 states over 700MHz is not fine-grained) effectively makes fast state jumping a requirement, as there’s a very good chance dropping down a state will leave some power/performance on the table. So if it’s throttling, 7790 will be able to state jump as quickly as every 10ms (that’s 100 jumps a second), typically bouncing between two or more states in order to keep the card within its limits.

At the same time, AMD’s formula for picking states on non-boost cards has changed. In a move similar to what AMD has done with Richland, AMD’s temperature-agnostic state selection system has been ditched in favor of one that includes temperatures into the calculation, making it a system that is now based on power, temperature, and load. There are some minor benefits to being temperature-agnostic that AMD is giving up – mainly that performance is going to vary a bit with temperature now – but at the end of the day this allows AMD to better min-max their GPUs to hit higher frequencies more often. This also brings them to parity with Intel and NVIDIA, who have long taken temperature into account.

The fact that this is a very boost-like system is not lost on us, and with these changes the line between PowerTune with and without boost starts to become foggy. Both are ultimately going to be doing the same thing – switching states based on power and temperature considerations – the only difference being whether a card adjusts down, or if it adjusts both up and down. In practice we rarely see cards adjust down outside of FurMark, so while PowerTune doesn’t dictate a clockspeed floor, base clocks are still base clocks. In which case the practical difference between whether an AMD card has boost or not is whether it can access some higher voltage, higher clockspeed states that it may not be able to maintain for long periods of time across all workloads. The 7790 isn’t a boost part of course, but AMD’s own presentation neatly lays out where boost would fit in, so if we do see future GCN 1.1 products with boost we have a good idea of what to expect.

Moving on, with the changes to PowerTune will also come changes to AMD’s API for 3rd party utilities, and what information is reported. First and foremost, due to the frequency of state changes with the new PowerTune, AMD will no longer be reporting the instantaneous state. Instead they will be reporting an average of the states used. We don’t know how big the averaging window is – we suspect it’s no more than 2 seconds – but the end result will be that MSI Afterburner, GPU-Z, and other utilities will now see those averages reported as the clockspeed. This will give most users a better idea of what the effective clockspeed (and thereby effective performance) is, but it does mean that it’s going to be virtually impossible to infer the clockspeeds/voltages of AMD’s new states.

The other change is that with the new PowerTune AMD will be exposing new tweaking options to 3rd parties. The current PowerTune (TDP) setting is going to be joined by a separate setting for adjusting a limit called Total Design Current (TDC), which as the name implies is how much current is allowed to be passed into the GPU. AMD limits cards by both TDP and TDC to keep total power, temperatures, and total currents in check, so this will open up the latter to tweakers. Unfortunately utilities with TDC controls were not ready in time for our 7790 review, so we can’t really comment on TDC at this time. With AMD’s changes to PowerTune however (and their insistence on calling TDP thermal management), TDP may be turning into a temperature control while TDC becomes the new power control.

Finally, since these controls are going to be user-accessible, this will spill-over to AMD’s partners. Partners will be able to set their own TDP and TDC limits if they wish, which will help them fine-tune their factory overclocked cards. This will give partners more headroom for such cards as opposed to being stuck shipping cards at AMD’s reference limits, but it means that different cards from different vendors may have different base TDP and TDC limits, along with different clockspeeds. This also means that in the future equalizing clockspeeds may not be enough to equalize two cards.

Bonaire’s Microarchitecture - What We’re Calling GCN 1.1 Meet The Radeon HD 7790 & Sapphire HD 7790 Dual-X Turbo
Comments Locked

107 Comments

View All Comments

  • GivMe1 - Friday, March 22, 2013 - link

    128bit interface is going to hurt high res textures...
  • CeriseCogburn - Sunday, March 24, 2013 - link

    Oh no it won't ! this is amd man! nothing hurts when it's amd ! amd yes it can !
  • Quizzical - Friday, March 22, 2013 - link

    Your chart shows Radeon HD 6870 FP64 performance as N/A. I think it's 1/20 of FP32 performance, but I'm not sure of that. It definitely can do FP64, as otherwise, it wouldn't be able to claim OpenGL 4 compliance.
  • MrSpadge - Friday, March 22, 2013 - link

    No, it doesn't have any HARDWARE FP64 capabilities. It's always possible to emulate this at slow performance via software, though.
  • Quizzical - Friday, March 22, 2013 - link

    It's basically the same as what the 7770, 7790, and 7850 do, but they're not listed as N/A. The relevant question isn't whether you can do it more slowly, but how much more slowly.
  • MrSpadge - Tuesday, March 26, 2013 - link

    No, it's not the same, the GCN cards have hardware FP64 capabilities.
  • Ryan Smith - Friday, March 22, 2013 - link

    Let's be clear here. 85W is not the TDP. The TDP is higher (likely on the order of 110W or so). However AMD chooses not to publish the TDP for these lower end cards, and instead the TBP.
  • alwayssts - Friday, March 22, 2013 - link

    Yeah, I figure ~85 TBP/105w TDP because that would be smack between 7770/7850 as well as having 20% headroom (which also allows another product to have their TBP between there and 7850's max TDP with it's max tdp above it within 150w....ie ~120-125/150w). IIRC, 80w is the powertune max (TDP) of 7770, 130w for 7850. 85w is the stock operation (TBP) of 7790.

    I really, really dislike how convoluted this power game has become...can you tell?!

    First it was max power. Then it was nvidia stating typical power (so products were within pci-e spec) with AMD still quoting max, which made them look bad. Then we get this 'awesome' product segmentation with 7000 having TBP and max powertune TDPs to separate them, while nvidia quotes TBP and hides the fact the TDP limits for their products exist unless you deduce them from the percentage you can up the boost power.

    AAAAaaaarrrrrghhhhh. I miss when the product you had could do what you wanted it to, ie before software voltage control and multiple states, as for products like this it gives the user less control and the companies a ton to create segmentation. Low-end stock products may have been less-than-stellar back in the day, but with determination you could get something out of it without some marketing stating it should fit x niche so give it y max tdp so it doesn't interfere with the market of z product.
  • CeriseCogburn - Friday, March 22, 2013 - link

    Maybe so you couldn't blow the crap out of it then return it for another one, then another one, as "you saved money" and caused everyone else to pay 25% more since you overclock freaks would blow them up, then LIE and get the freebie replacement, over and over again.

    Maybe they got sick of dealing with scam artist liars... maybe they aren't evil but the end user IS.
  • Spunjji - Friday, March 22, 2013 - link

    Why would the design power be higher than the total board power? :/ You're correct that the figure they're quoting isn't TDP but then you just went and made up a number.

    Here's some actual power consumption measurements of a 7770:
    http://www.techpowerup.com/reviews/HIS/HD_7770_iCo...

    So using Ananad's figures to extrapolate you can expect this thing to be ~90W max, usually lower than that at peak, right about where AMD put it.

Log in

Don't have an account? Sign up now