Homework: How Turbo Mode Works

AMD and Intel both figured out the practical maximum power consumption of a desktop CPU. Intel actually discovered it first, through trial and error, in the Prescott days. At the high end that's around 130W, for the upper mainstream market that's 95W. That's why all high end CPUs ship with 120 - 140W TDPs.

Regardless of whether you have one, two, four, six or eight cores - the entire chip has to fit within that power envelope. A single core 95W chip gets to have a one core eating up all of that power budget. This is where we get very high clock speed single core CPUs from. A 95W dual core processor means that individually the cores have to use less than the single 95W processor, so tradeoffs are made: each core runs at a lower clock speed. A 95W quad core processor requires that each core uses less power than both a single or dual core 95W processor, resulting in more tradeoffs. Each core runs at a lower clock speed than the 95W dual core processor.

The diagram below helps illustrate this:

  Single Core Dual Core Quad Core Hex Core
TDP
Tradeoff

 

The TDP is constant, you can't ramp power indefinitely - you eventually run into cooling and thermal density issues. The variables are core count and clock speed (at least today), if you increase one, you have to decrease the other.

Here's the problem: what happens if you're not using all four cores of the 95W quad core processor? You're only consuming a fraction of the 95W TDP because parts of the chip are idle, but your chip ends up being slower than a 95W dual core processor since its clocked lower. The consumer has to thus choose if they should buy a faster dual core or a slower quad core processor.

A smart processor would realize that its cores aren't frequency limited, just TDP limited. Furthermore, if half the chip is idle then the active cores could theoretically run faster.

That smart processor is Lynnfield.

Intel made a very important announcement when Nehalem launched last year. Everyone focused on cache sizes, performance or memory latency, but the most important part of Nehalem was far more subtle: the Power Gate Transistor.

Transistors are supposed to act as light switches - allowing current to flow when they're on, and stopping the flow when they're off. One side effect of constantly reducing transistor feature size and increasing performance is that current continues to flow even when the transistor is switched off. It's called leakage current, and when you've got a few hundred million transistors that are supposed to be off but are still using current, power efficiency suffers. You can reduce leakage current, but you also impact performance when doing so; the processes with the lowest leakage, can't scale as high in clock speed.

Using some clever materials engineering Intel developed a very low resistance, low leakage, transistor that can effectively drop any circuits behind it to near-zero power consumption; a true off switch. This is the Power Gate Transistor.

On a quad-core Phenom II, if two cores are idle, blocks of transistors are placed in the off-state but they still consume power thanks to leakage current. On any Nehalem processor, if two cores are idle, the Power Gate transistors that feed the cores their supply current are turned off and thus the two cores are almost completely turned off - with extremely low leakage current. This is why nothing can touch Nehalem's idle power:

Since Nehalem can effectively turn off idle cores, it can free up some of that precious TDP we were talking about above. The next step then makes perfect sense. After turning off idle cores, let's boost the speed of active cores until we hit our TDP limit.

On every single Nehalem (Lynnfield included) lies around 1 million transistors (about the complexity of a 486) whose sole task is managing power. It turns cores off, underclocks them and is generally charged with the task of making sure that power usage is kept to a minimum. Lynnfield's PCU (Power Control Unit) is largely the same as what was in Bloomfield. The architecture remains the same, although it has a higher sampling rate for monitoring the state of all of the cores and demands on them.

The PCU is responsible for turbo mode.

New Heatsinks and Motherboards Lynnfield's Turbo Mode: Up to 17% More Performance
Comments Locked

343 Comments

View All Comments

  • snakeoil - Tuesday, September 8, 2009 - link

    what part of stock speed you don't understand?

    if you are going to compare 2 processors both must have a fixed stock speed, if you increase the frequency of one of them by 1Ghz over the stock speed that is illegal.

    or either both have a stock fixed speed to benchmark or both are overclocked.

    overclocking is the same it doesn't matter if it's auto overclocking or manual overclocking.

    we the people demand justice.
    we the people are not stupid.


  • Anand Lal Shimpi - Wednesday, September 9, 2009 - link

    You may find that your desire for fixed speed comparisons will become difficult in the future. Both AMD and Intel are going to be embracing this sort of an approach to clock speeds.

    Overclocking is not the same as what is happening with turbo mode. Overclocking is not officially supported by the manufacturer, it is running a part faster than it was sold at in order to improve performance. If an application crashes because you've overclocked your chip too far that's no fault of the manufacturer.

    Turbo mode runs the chip at a frequency it's guaranteed to work at, it's operating within spec. It simply re-allocates thermal resources; Intel could disable 3 of the cores and sell a Core i7 870 as a 3.6GHz single-core processor, or disable 2 of the cores and sell it as a 3.47GHz processor, or only disable one core and sell it as a 3.2GHz processor. Instead of making the end user choose, instead you get a dynamic processor that can configure itself in real time depending on the workload.

    This is in stark contrast to AMD's Overdrive utility which is actually overclocking. The chips aren't validated at the overdrive speeds and you're thus overclocking. Lynnfield is validated at both its standard clock speed and its turbo speeds, just like Bloomfield. So long as you don't exceed the TDP of the chip, it will work at those turbo frequencies. The things that will prevent it from turboing were outlined in the article.

    Once again, I am not increasing the speed of anything - Lynnfield is simply working as designed. Whether it's in a Dell machine or in a custom build, it will always work this way. It's what the end user will see the moment they turn on a Lynnfield machine. The end user would not see the same from a Core 2 or a Phenom II based machine.

    Take care,
    Anand

  • Jamahl - Wednesday, September 9, 2009 - link

    The problem with this review is it has a bunch of graphs with intel cpus with speeds rated at a lot lower than what they actually are.

    When you show a graph of an i5 @ 2.66 gigahertz beating a 3.4 gigahertz phenom II, that is false and that is a problem. This cpu was not ever at 2.66 gigahertz for any of these tests was it?

    I suggest there is a problem with your reviewer also, not to mention his attitude about 'having a laugh' because i brought up this point?
  • Anand Lal Shimpi - Wednesday, September 9, 2009 - link

    Again, that is the default clock of the processor - in many cases (especially the heavily threaded tests) it will be running at that speed. Turbo mode is dynamic, it's impossible to put down exactly what speed the chip was running at as it'll change throughout each test.

    You might see the chip run at 2.66GHz for several seconds, jump up to 3.46GHz then down to 3.2GHz, up to 3.6GHz and then back down to 2.66GHz all in the course of a single benchmark. It's repeatable, but there's no way to display all of that in a bar chart.

    Take care,
    Anand
  • snakeoil - Wednesday, September 9, 2009 - link

    What you are doing is cheating, and people is not stupid.
    you are saying that lynnfield is faster than phenom 2 because lynnfield is overclocked at least 600 mhz.
    people is not stupid as you think, and what you are doing is outrageous.
    if you are going to benchamark with turbo enabled then you have to overclock phenom 2 at least the same 600 mhz.

    show some respect for your readers. or are you really on intel's payroll?
  • Klober - Wednesday, September 9, 2009 - link

    I am usually very respectful on the AnandTech and DailyTech forums and comments areas, but you sir are exactly as stupid as you are claiming people "is" not. Please read the article from beginning to end before continuing on your unjustified tirade. The processor is being used as intended by the manufacturer - to not test it in this way would be a disservice to the engineers who designed it and the company that produces it.
  • snakeoil - Wednesday, September 9, 2009 - link

    if you dont want to disable turbo the overclock phenom 2 at least 600 mhz.
    just to be fair.
  • Gary Key - Wednesday, September 9, 2009 - link

    "if you dont want to disable turbo the overclock phenom 2 at least 600 mhz.
    just to be fair. "

    You do realize that it takes near zero or sub-zero cooling to run the 965BE in stable manner at 4GHz with a 64-bit OS. When I say stable, I mean 24/7 multi-tasking, not a CPUZ screenshot or a SuperPi 1M bench. AMD has not solved this problem with the current stepping.

    Once again, and for the last time, Intel's turbo function is a standard feature of the processor. AMD will be offering the exact same technology in their next processor family.
  • snakeoil - Wednesday, September 9, 2009 - link

    that's not true.
    we demand justice and fair benchmarks.
    you are losing all credibility and these benchmarks are worthless and unreal.
  • goinginstyle - Wednesday, September 9, 2009 - link

    How many times do they have to respond to you in a logical manner. You should be banned and in some countries that would mean a beheading for being so damn stupid.

Log in

Don't have an account? Sign up now