Homework: How Turbo Mode Works

AMD and Intel both figured out the practical maximum power consumption of a desktop CPU. Intel actually discovered it first, through trial and error, in the Prescott days. At the high end that's around 130W, for the upper mainstream market that's 95W. That's why all high end CPUs ship with 120 - 140W TDPs.

Regardless of whether you have one, two, four, six or eight cores - the entire chip has to fit within that power envelope. A single core 95W chip gets to have a one core eating up all of that power budget. This is where we get very high clock speed single core CPUs from. A 95W dual core processor means that individually the cores have to use less than the single 95W processor, so tradeoffs are made: each core runs at a lower clock speed. A 95W quad core processor requires that each core uses less power than both a single or dual core 95W processor, resulting in more tradeoffs. Each core runs at a lower clock speed than the 95W dual core processor.

The diagram below helps illustrate this:

  Single Core Dual Core Quad Core Hex Core
TDP
Tradeoff

 

The TDP is constant, you can't ramp power indefinitely - you eventually run into cooling and thermal density issues. The variables are core count and clock speed (at least today), if you increase one, you have to decrease the other.

Here's the problem: what happens if you're not using all four cores of the 95W quad core processor? You're only consuming a fraction of the 95W TDP because parts of the chip are idle, but your chip ends up being slower than a 95W dual core processor since its clocked lower. The consumer has to thus choose if they should buy a faster dual core or a slower quad core processor.

A smart processor would realize that its cores aren't frequency limited, just TDP limited. Furthermore, if half the chip is idle then the active cores could theoretically run faster.

That smart processor is Lynnfield.

Intel made a very important announcement when Nehalem launched last year. Everyone focused on cache sizes, performance or memory latency, but the most important part of Nehalem was far more subtle: the Power Gate Transistor.

Transistors are supposed to act as light switches - allowing current to flow when they're on, and stopping the flow when they're off. One side effect of constantly reducing transistor feature size and increasing performance is that current continues to flow even when the transistor is switched off. It's called leakage current, and when you've got a few hundred million transistors that are supposed to be off but are still using current, power efficiency suffers. You can reduce leakage current, but you also impact performance when doing so; the processes with the lowest leakage, can't scale as high in clock speed.

Using some clever materials engineering Intel developed a very low resistance, low leakage, transistor that can effectively drop any circuits behind it to near-zero power consumption; a true off switch. This is the Power Gate Transistor.

On a quad-core Phenom II, if two cores are idle, blocks of transistors are placed in the off-state but they still consume power thanks to leakage current. On any Nehalem processor, if two cores are idle, the Power Gate transistors that feed the cores their supply current are turned off and thus the two cores are almost completely turned off - with extremely low leakage current. This is why nothing can touch Nehalem's idle power:

Since Nehalem can effectively turn off idle cores, it can free up some of that precious TDP we were talking about above. The next step then makes perfect sense. After turning off idle cores, let's boost the speed of active cores until we hit our TDP limit.

On every single Nehalem (Lynnfield included) lies around 1 million transistors (about the complexity of a 486) whose sole task is managing power. It turns cores off, underclocks them and is generally charged with the task of making sure that power usage is kept to a minimum. Lynnfield's PCU (Power Control Unit) is largely the same as what was in Bloomfield. The architecture remains the same, although it has a higher sampling rate for monitoring the state of all of the cores and demands on them.

The PCU is responsible for turbo mode.

New Heatsinks and Motherboards Lynnfield's Turbo Mode: Up to 17% More Performance
Comments Locked

343 Comments

View All Comments

  • Scheme - Tuesday, September 8, 2009 - link

    Why is it assumed enthusiasts or technical users are only interested in overclocking? For me it's about balancing performance, temps, noise levels and power consumption, all with a reasonable cost of entry. All that considered Lynnfield seems to be a good platform.
  • Ann3x - Tuesday, September 8, 2009 - link

    Well the d0 can usually hit 3.8-4ghz on std voltages.

    So tbh temps dont really come into it. Anyone with a 920 should overclock it because there is really nothing to lose. No need to risk the cpu, negligible temperature increases, its all positive.

    If you buy a 920 and dont overclock you either should have a very good reason or you dont know how to.

    The 920 is a very meh processor at stock. The reason its so popular is its potential to overclock so easily and so highly not its stock speeds.
  • Scheme - Tuesday, September 8, 2009 - link

    3.8-4ghz will involve more than what I'd consider to be 'negligible temperature increases'.
  • Ann3x - Tuesday, September 8, 2009 - link

    I can hit 4Ghz on air with stock voltage with a max load (10hrs of LinX) of 65C. Thats so far within the thermal limits of the CPU to be considered totally negligible. My results seem very typical for d0 920s.

    Enthusiast CPUs need to be treated (and reviewed) in context with their market. If some people are too stupid to see the potential of their mid range CPUs then Id at least expect a good site like anandtech to realise it and not act like stock is the only option.

    There is a reason why 90% of X58 motherboards are marketed on their overclock and performance options.
  • erple2 - Tuesday, September 8, 2009 - link

    I'm a bit more interested in relatively low power consumption, and stable (as in for 4 years) operation, not how many fractional increases in performance I can eke out of a CPU at potentially catastrophic failure rates through overclocking. However, I'm buying a CPU for it's solid performance, reliability (the i7's haven't been out long enough to make any judgement of it's long term reliability), and lower power consumption. Also, can you still claim RMA status on a broken CPU that was potentially damaged by overclocking? I haven't read any Intel literature to suggest that you can.

    I don't care at all about what you think I my goals with buying a CPU for are. Your class of "enthusiast" is really "the overclocking user", not the "enthusiast".

    I suppose it would be more interesting to find out how many of the X58 purchasers actually overclock their CPUs (beyond what "turbo" buys you), and then make conclusions, rather than this handwaving "you're an idiot if you do something other than what I do" mentality that you are showing.

    I'd also like to see how many of those "90% of X58 mobos marketed on their overclock and performance options" (reference please!) are actually sold vs. those that are cheaper, and not marketed on overclocking performance. Maybe that "other 10%" sells about 40-50% of the market. Do you have data to suggest otherwise?
  • Ann3x - Wednesday, September 9, 2009 - link

    You really think you can damage a cpu by just increasing the clock? Wow.

    Only things that damage CPUs are overvolting (not needed for a decent OC on the i7) and high temps (usually linked directly to overvolting). The concept that an overclock at stock voltage could cause "catastrophic failure" is frankly laughable.

    A sensible overclock will be no more or less stable than stock. The only people who actually risk system stability are the ones who overvolt and push the limits. For the record 3.8-4ghz is most definitely NOT pushing the limits.

    Again ill say it. If youre buying an X58 motherboard you are PAYING for the ability to overclock. If you choose not to there is little - no point in the platform (with the possible exception of people who use very multi threaded apps). The proof in this point is actually staring you in the face in the i5/new i7, their design shows that intel realises the headroom they have in the architecture. The aggressive turbo mode of the i5/new i7 is proof that there is NO risk in overclocking within sensible limits at stock voltage.

    In reality the main "huge leap forward" of the new platform is simply the acceptance of overclocking within intel.

    That you choose to ignore the potential of your CPU is your own loss. Intel think its safe to overclock but hey, you know better right :). Im just surprised that anand chooses to for the most part ignore this and makes absurb attention grabbing statements instead of assessing the real merits of the 2 platforms. It really is a rose tinted glasses review but oh well. Keep your heads stuck in the sand.
  • yacoub - Tuesday, September 8, 2009 - link

    Great article except it would have been REALLY nice to have the i7 860 data, given that's most likely the best bang-for-the-buck Lynnfield option and it probably makes MUCH more sense price-wise than going for the 870. It REALLY sucks not knowing where that chip slots in for all the tests you did.
    This is exactly the concern I listed when the polls were posted that asked which CPU we most wanted to see benchmarked. Clearly we want to see ALL THREE.

    That said, it's nice to see Lynnfield is basically awesome, except in Crossfire/SLI which is about the LAST thing I could give a crap about. So really, this looks about perfect. (Only on page 14 though, but if it continues as it has to this point, I'm sold.) I just wish the 860 had been benchmarked.
  • Gary Key - Tuesday, September 8, 2009 - link

    I have the 860 in the upcoming mb roundups. :)
  • yacoub - Tuesday, September 8, 2009 - link

    I knew there was a reason we all love us some Gary Key. \m/. :)
  • 529th - Tuesday, September 8, 2009 - link

    I'm not sure what bios setting is Sleep State on the i7 9xx (1366) but for discussions sake, say C1E sleep state is disabled, as overclockers usually do this, does that negate the logical functioning of Turbo mode and run all cores at the max TDP & speed? So would a 1366 system be faster with turbo & C1E sleep state disabled? ...i'm not even sure if i'm asking that right.. i'm still reading up on the PCU and Turbo section

Log in

Don't have an account? Sign up now