Power Gating

With 1.45 billion transistors on die, Llano relies on extensive power gating in order to keep things in order. The APU is split into two independent power islands: the CPU and the GPU. The memory controller and North Bridge both live on the GPU's power island. Each island has its own independent voltage source.

Everything from an individual CPU core to the entire GPU or virtually the entire APU package can be power gated. AMD provided photon recombination images to show the impact power gating the GPU can have on leakage current:

Although not depicted above, Llano can also fully power gate the x86 CPU cores or both the CPU and GPU if the entire APU is in a deep sleep state. Being able to completely power gate CPU cores or the GPU is an important part of enabling the next major feature of Llano: Turbo Core.

Turbo Core

All processors whether CPUs, GPUs or APUs have to be designed to strict thermal and power limits. OEMs need to know exactly what sort of chassis they'll be able to build around these chips and as a result the chip vendors provide guidance in the form of specifications, including the chip's thermal design point (TDP).

In the old days of microprocessors things were simple. You had a single core that ran all the time and it consumed all of the available thermal budget allocated for that core. AMD and Intel eventually enabled dynamic clock frequencies which let your single core underclock itself when it wasn't being used, which helped reduce power and extend battery life. Then came the multi-core era.

CPUs couldn't just start putting out twice as much heat now that they had two cores; instead, each core had to consume less power. The chip guys achieved this by running the cores at lower frequencies and voltages than they did in the single-core days. Two cores paved the way to four cores, which meant another reduction in clock speed per core. Sure we got much better multi-threaded performance, but for single-threaded applications performance wasn't as great as it could be. Users had to make a tradeoff: good multi-threaded performance or good single-threaded performance; you couldn't have both. Until power gating came along that is.

Without power gating you can never really shut off power to an idle core. The transistors aren't switching but power is still dissipated thanks to leakage current. Remember that transistors don't simply stop conducting electricity when they're off. The smaller they get, the more leaky our beloved transistors become. Power gating lets you physically block the flow of current to the transistors that are being gated, so when they're off, they're actually off. With an idle core shut off, now you have the extra TDP headroom to run any active cores at higher frequencies.

Intel does this with a technology it calls Turbo Boost. Intel looks at current draw and thermal sensors spread out all over the chip and determines when it has the available thermal headroom to turbo up any active cores. AMD implements a similar technology in Llano (and previously in their hex-core desktop parts) called Turbo Core.

I say similar but not identical because AMD's approach differs in a very important way. While Intel looks at current draw and temperature data, AMD looks at workload. Each activity within the Llano APU is assigned a certain power weight (e.g. an integer multiply is known to require a certain amount of power). Llano is aware of the operations it's currently working on and based on the weights associated with these operations it comes up with a general estimate of its power consumption on a per core basis. I mention this is an estimate because it correlates digital activity to power consumption; it doesn't actually measure power consumption.

Based on the number of events and their individual weights, AMD estimates the power consumption of each core and determines how much TDP headroom exists in the system. If the OS is requesting the highest p-state from the CPU and there's available TDP headroom, Llano will turbo up any active cores up to a maximum frequency. Like Sandy Bridge, Llano is able to temporarily exceed the APU's maximum TDP if it determines that the recent history of power consumption has been low enough that it'll take a while for the APU to ramp up to any thermal limits.

One major limit of Llano's Turbo Core is that the GPU can't turbo up in the event of the CPU cores being idle. Only the CPU cores can turbo up if they have available headroom. I suspect future versions of Llano will probably enable GPU Turbo Core as well:

It's unclear to me at this point what shortcomings or advantages exist for AMD's Turbo Core method vs. Intel's Turbo Boost. At the bare minimum the two are finally comparable although they use different approaches to attain a similar end result. AMD doesn't yet have a method of actually displaying Turbo Core frequencies, unfortunately, so we're operating a bit blind at this point. Over time I hope to have a better idea of how AMD's solution stacks up.

The GPU Introducing Mobile Llano
Comments Locked

177 Comments

View All Comments

  • ET - Wednesday, June 15, 2011 - link

    It may be impossible to know the exact speed the cores run, but it would be interesting to run a test to get some relative numbers.

    You can run a single threaded CPU bound program such as SuperPI, then run it again with the other three cores at 100% (for example by having another three instances of SuperPI running). Do this on AC and battery, and it might generate some interesting numbers. At the very least we'll be able to tell whether the 1.5GHz -> 2.4GHz ratio looks right.
  • ET - Wednesday, June 15, 2011 - link

    By the way, I just read Tom's Hardware review, which was unique in that it compared to a Phenom II X4 running at 1.5GHz and 2.4GHz. It looked from these benchmarks like the A8-3500M is always performing around the 1.5GHz level of the Phenom II X4 (sometimes it's a little faster, sometimes a little slower), which suggests that Turbo Core doesn't really kick in.
  • i_am_the_avenger - Wednesday, June 15, 2011 - link

    Maybe this will cheer the AMD Fans a bit

    This article did not mention some nifty features the APUs have (or maybe it did I did not read it line by line)...................................

    Watch the video below from engadget:

    http://www.engadget.com/2011/06/13/amds-fusion-a-s...

    It shows how these APUs can smooth out shaking videos real time, even while streaming from Youtube! and it does a very good job.

    Another feature is how it en-chances videos (colour etc.)

    This improves general user PC experience.......... something very desirable
    The video also shows how AMD wants to target general users and not work enthusiasts

    Another video shows comparison between the i7-2630QM and A8-3500M while multitasking video related applications.

    http://www.engadget.com/2011/03/01/amd-compares-up...
    ---Interesting to note that the APU Gradually increased its power consumption while i7 was like bursting to and fro, something the way turbo core acts maybe-----

    I think it is work vs general performance,
    Intel's great for work, when you need to finish tasks and it needs to be done quickly,
    while AMD APUs give you a good over all pc and multimedia performance - you watch videos, play games, so what if the zip file extracts a minute late and the fGPU performance is great....
    You may buy a i7 SNB with discreate GPU but that has a battery life hit (for same battery capacity) and also extra heat generation which requires more fans, also the extra weight..

    Please don't start judging me or something....
    I am getting confused myself, while intel looks great in every way except stock gaming and battery life(not that bad)... I think I don't need that much power, even if I work - my work isn't so CPU oriented that an i7 would matter, a 30 second task finishes in 20 ok but it does not matter to me..... but improved video and battery seems more useful to me

    I don't think that all of us have to tax our CPUs to full potential -- a few have to, not considering them -- so even if Intel have faster processors for many it does not affect them as much.
  • psychobriggsy - Wednesday, June 15, 2011 - link

    For all your moaning about not getting Asymmetric CrossFire to work, you didn't read the reviewers guide that says it only works in DX10 and DX11 mode, not DX9. So your Dirt2 benches for example clearly state DX9 for this test. I don't know about the other titles on that page - you say 5 of the others are DX9 titles. Do these titles have DX10 modes of operation - if so, USE THEM.

    Otherwise it just looks like you are trying to get the best results for the Intel Integrated Graphics.

    Just put "0 - Unsupported" for DX11 tests by HD3000 like other sites have done.
  • ET - Wednesday, June 15, 2011 - link

    The article said:

    "AMD told us in an email on Monday (after all of our testing was already complete) that the current ACF implementation on our test notebook and with the test drivers only works on DX10/11 games. It's not clear if this will be the intention for future ACF enabled laptops or if this is specific to our review sample. Even at our "High" settings, five of our ten titles are DX9 games (DiRT 2, L4D2, Mafia II, Mass Effect 2, and StarCraft II--lots of twos in there, I know!), so they shouldn't show any improvement...and they don't. Actually, the five DX9 games even show reduced performance relative to the dGPU, so not only does ACF not help but it hinders. That's the bad news. The only good news is that the other half of the games show moderate performance increases over the dGPU."

    I agree that at least in the case of DiRT 2 that's blatantly false, since that game was one of the first to use DX11, and was given with many Radeon 58x0 cards for this reason.
  • JarredWalton - Friday, June 17, 2011 - link

    DiRT 2 supports DX11, but it's only DX9 or DX11. We chose to standardize on DX9 for our Low/Med/High settings -- and actually, DX11 runs slower at the High settings than DX9 does (though perhaps it looks slightly better). Anyway, we do test DiRT 2 with DX11 for our "Ultra" settings, but Llano isn't fast enough to handle 1080p with 4xAA and DX11. So to be clear, I'm not saying DiRT 2 isn't DX11; I'm saying that the settings we standardized on over a year ago are not DX11.
  • jitttaaa - Wednesday, June 15, 2011 - link

    How is the notebook llano performing as good, if not better than the desktop llano?
  • ET - Wednesday, June 15, 2011 - link

    At least as far as CPU power is concerned, the desktop part is obviously faster. The benchmarks are mostly not compatible so it's hard to judge, but in Cinebench R10 the mobile Llano gets 2037 while the desktop gets 3390. I agree that for graphics it looks like the desktop part is performing worse in games, which is strange considering the GPU is working at a faster speed.

    Only explanation I can think of is that the faster CPU is taking too much memory bandwidth, but it doesn't make much sense since it's been said that the GPU gets priority. It's definitely something that's worth checking out with AMD.
  • ionave - Thursday, June 16, 2011 - link

    http://www.anandtech.com/show/4448/amd-llano-deskt...

    On average the A8-3850 is 58% faster than the Core i5 2500K.

    Boom. Delivered. You think its slow? It really isn't. The A8-3850 has about the performance of a DESKTOP i3. If you think that is bad performance, then you don't know what you are talking about. The battery life is amazing for having that kind of performance in a laptop. I'm sorry, but it totally destroys i7 and i5 platforms because of the sheer performance in that amazing battery life.
  • JarredWalton - Friday, June 17, 2011 - link

    Let me correct that for you:

    On average, the A8-3850 fGPU (6550D) is 58% faster than the Core i5-2500K's HD 3000 IGP, in games running at low quality settings. It is also 29% faster than the i5-2500K with a discrete HD 5450, which is a $25 graphics card. On the other hand, the i5-2500K with an HD 5570 (a $50 GPU) is on average 66% faster than the A8-3850.

    Boom. Delivered. You think that's fast? It really isn't. The 6550D has about the performance of a $35 desktop GPU. If you think that is good performance, then you don't know what you are talking about.

    At least Llano is decent for laptops, but for $650 you can already get i3-2310M with a GT 520M and Optimus. Let me spell it out for you: better performance on the CPU, similar or better performance on the GPU, and a price online that's already $50 below the suggested target of the A8-3500M. Realistically, A8-3500M will need to sell for $600 to be viable, A6 for $500, and A4 for $450 or less.

Log in

Don't have an account? Sign up now