Homework: How Turbo Mode Works

AMD and Intel both figured out the practical maximum power consumption of a desktop CPU. Intel actually discovered it first, through trial and error, in the Prescott days. At the high end that's around 130W, for the upper mainstream market that's 95W. That's why all high end CPUs ship with 120 - 140W TDPs.

Regardless of whether you have one, two, four, six or eight cores - the entire chip has to fit within that power envelope. A single core 95W chip gets to have a one core eating up all of that power budget. This is where we get very high clock speed single core CPUs from. A 95W dual core processor means that individually the cores have to use less than the single 95W processor, so tradeoffs are made: each core runs at a lower clock speed. A 95W quad core processor requires that each core uses less power than both a single or dual core 95W processor, resulting in more tradeoffs. Each core runs at a lower clock speed than the 95W dual core processor.

The diagram below helps illustrate this:

  Single Core Dual Core Quad Core Hex Core
TDP
Tradeoff

 

The TDP is constant, you can't ramp power indefinitely - you eventually run into cooling and thermal density issues. The variables are core count and clock speed (at least today), if you increase one, you have to decrease the other.

Here's the problem: what happens if you're not using all four cores of the 95W quad core processor? You're only consuming a fraction of the 95W TDP because parts of the chip are idle, but your chip ends up being slower than a 95W dual core processor since its clocked lower. The consumer has to thus choose if they should buy a faster dual core or a slower quad core processor.

A smart processor would realize that its cores aren't frequency limited, just TDP limited. Furthermore, if half the chip is idle then the active cores could theoretically run faster.

That smart processor is Lynnfield.

Intel made a very important announcement when Nehalem launched last year. Everyone focused on cache sizes, performance or memory latency, but the most important part of Nehalem was far more subtle: the Power Gate Transistor.

Transistors are supposed to act as light switches - allowing current to flow when they're on, and stopping the flow when they're off. One side effect of constantly reducing transistor feature size and increasing performance is that current continues to flow even when the transistor is switched off. It's called leakage current, and when you've got a few hundred million transistors that are supposed to be off but are still using current, power efficiency suffers. You can reduce leakage current, but you also impact performance when doing so; the processes with the lowest leakage, can't scale as high in clock speed.

Using some clever materials engineering Intel developed a very low resistance, low leakage, transistor that can effectively drop any circuits behind it to near-zero power consumption; a true off switch. This is the Power Gate Transistor.

On a quad-core Phenom II, if two cores are idle, blocks of transistors are placed in the off-state but they still consume power thanks to leakage current. On any Nehalem processor, if two cores are idle, the Power Gate transistors that feed the cores their supply current are turned off and thus the two cores are almost completely turned off - with extremely low leakage current. This is why nothing can touch Nehalem's idle power:

Since Nehalem can effectively turn off idle cores, it can free up some of that precious TDP we were talking about above. The next step then makes perfect sense. After turning off idle cores, let's boost the speed of active cores until we hit our TDP limit.

On every single Nehalem (Lynnfield included) lies around 1 million transistors (about the complexity of a 486) whose sole task is managing power. It turns cores off, underclocks them and is generally charged with the task of making sure that power usage is kept to a minimum. Lynnfield's PCU (Power Control Unit) is largely the same as what was in Bloomfield. The architecture remains the same, although it has a higher sampling rate for monitoring the state of all of the cores and demands on them.

The PCU is responsible for turbo mode.

New Heatsinks and Motherboards Lynnfield's Turbo Mode: Up to 17% More Performance
Comments Locked

343 Comments

View All Comments

  • goinginstyle - Tuesday, September 8, 2009 - link

    Really? SuperPi, WPrime, Everest, 3Dmarks and LN2 overclocking defines a better review? How does any of that correlate into real world applications and what 99% of people use their computers for on a daily basis. I counted a lot more than three tests in the AT review, go spam elsewhere.
  • C'DaleRider - Tuesday, September 8, 2009 - link

    Really. All I play is SuperPi, Everest and 3DMarks....oh, and Furmark and OCCT.

    Who would ever use their computer to encode video, run Excel spreadsheets, play games like Far Cry or Crysis or Left For Dead, or actually use any other real world application?

    Don't you know? Real elite computer users just bench synthetic crap, over and over, for hours and hours, and scoff at anyone who dares do anything productive with their computer.

    /sarcasm
  • geok1ng - Tuesday, September 8, 2009 - link

    I am not pleased by the news; i5 needs more juice for overcloking. Thats terrible: my E8600 is still on top: it runs at 4.0Ghz with 1.16v on a watercooled setup with 4x120mm fans at 1600rpm. we are talking less than 30dB of noise and less than 55w of power consumption. At 1.25v my E8600 reaches 4.25Ghz and would go a little further if wasnt for the 4 sticks of ram burning the NB. no reason to exchange systems before the 32nm parts arrive. The ability to achieve high clocks with low voltages is crucial for a good system: not only will it consume less power, but it will also be quieter, and that is a point for choosing sub-65w dual-cores in gaming rigs.
  • papapapapapapapababy - Tuesday, September 8, 2009 - link


    honestly... i lol at all the core i7 core i5 "GAMERS"( and their punny 23 lcd with shitty TN panels) also LGA1366? LGA1156? and 285$ for a shitty motherboard? XD I have a better proposition for you INTEL. how about u SUCK MY BALLS XD. MKAY? the story: my old man needed a gaming setup ( mostly simracing) So i bought the cheapest E7200 i could find (oc3.2GHZ), 4gb of ram, the cheapest intel mobo i could find -g31- ( not even pcie2) and gave him my 4770 ... the price? ridiculously cheap... almost nothing. and with all that extra money i saved i got this > a nice 42" HDTV with a perfect s-ips panel and low 1366 X 768 resolution, and a g25 wheel.

    btw, the framerates? i never, ever drop below 30fps. ALWAYS 60FPS NO MATTER WHAT, ( with nice 8xAA) XD so the best gaming cpu? the cheapest !


  • erple2 - Tuesday, September 8, 2009 - link

    Interesting. Which 42" LCD TV did you get that has a S-IPS panel? Also, a 23" panel at about 3 feet looks bigger than a 42" screen at 6 feet.
  • papapapapapapapababy - Tuesday, September 8, 2009 - link

    LG. btw a 23" TN lcd looks like shit no matter how you look at it.
  • C'DaleRider - Tuesday, September 8, 2009 - link

    My, aren't you special?

  • papapapapapapapababy - Tuesday, September 8, 2009 - link

    yes, i know! btw enjoy your small screen gaming, mr sheep XD
  • chrnochime - Tuesday, September 8, 2009 - link

    I'd say the same about that barely big enough 42" LCD, but then again why bother...
  • papapapapapapapababy - Tuesday, September 8, 2009 - link

    my bad i meant this > E7300 3.2ghz (+ a cheap 24$ modded heatpipe cooler... 14 dba XD)

Log in

Don't have an account? Sign up now