Homework: How Turbo Mode Works

AMD and Intel both figured out the practical maximum power consumption of a desktop CPU. Intel actually discovered it first, through trial and error, in the Prescott days. At the high end that's around 130W, for the upper mainstream market that's 95W. That's why all high end CPUs ship with 120 - 140W TDPs.

Regardless of whether you have one, two, four, six or eight cores - the entire chip has to fit within that power envelope. A single core 95W chip gets to have a one core eating up all of that power budget. This is where we get very high clock speed single core CPUs from. A 95W dual core processor means that individually the cores have to use less than the single 95W processor, so tradeoffs are made: each core runs at a lower clock speed. A 95W quad core processor requires that each core uses less power than both a single or dual core 95W processor, resulting in more tradeoffs. Each core runs at a lower clock speed than the 95W dual core processor.

The diagram below helps illustrate this:

  Single Core Dual Core Quad Core Hex Core
TDP
Tradeoff

 

The TDP is constant, you can't ramp power indefinitely - you eventually run into cooling and thermal density issues. The variables are core count and clock speed (at least today), if you increase one, you have to decrease the other.

Here's the problem: what happens if you're not using all four cores of the 95W quad core processor? You're only consuming a fraction of the 95W TDP because parts of the chip are idle, but your chip ends up being slower than a 95W dual core processor since its clocked lower. The consumer has to thus choose if they should buy a faster dual core or a slower quad core processor.

A smart processor would realize that its cores aren't frequency limited, just TDP limited. Furthermore, if half the chip is idle then the active cores could theoretically run faster.

That smart processor is Lynnfield.

Intel made a very important announcement when Nehalem launched last year. Everyone focused on cache sizes, performance or memory latency, but the most important part of Nehalem was far more subtle: the Power Gate Transistor.

Transistors are supposed to act as light switches - allowing current to flow when they're on, and stopping the flow when they're off. One side effect of constantly reducing transistor feature size and increasing performance is that current continues to flow even when the transistor is switched off. It's called leakage current, and when you've got a few hundred million transistors that are supposed to be off but are still using current, power efficiency suffers. You can reduce leakage current, but you also impact performance when doing so; the processes with the lowest leakage, can't scale as high in clock speed.

Using some clever materials engineering Intel developed a very low resistance, low leakage, transistor that can effectively drop any circuits behind it to near-zero power consumption; a true off switch. This is the Power Gate Transistor.

On a quad-core Phenom II, if two cores are idle, blocks of transistors are placed in the off-state but they still consume power thanks to leakage current. On any Nehalem processor, if two cores are idle, the Power Gate transistors that feed the cores their supply current are turned off and thus the two cores are almost completely turned off - with extremely low leakage current. This is why nothing can touch Nehalem's idle power:

Since Nehalem can effectively turn off idle cores, it can free up some of that precious TDP we were talking about above. The next step then makes perfect sense. After turning off idle cores, let's boost the speed of active cores until we hit our TDP limit.

On every single Nehalem (Lynnfield included) lies around 1 million transistors (about the complexity of a 486) whose sole task is managing power. It turns cores off, underclocks them and is generally charged with the task of making sure that power usage is kept to a minimum. Lynnfield's PCU (Power Control Unit) is largely the same as what was in Bloomfield. The architecture remains the same, although it has a higher sampling rate for monitoring the state of all of the cores and demands on them.

The PCU is responsible for turbo mode.

New Heatsinks and Motherboards Lynnfield's Turbo Mode: Up to 17% More Performance
Comments Locked

343 Comments

View All Comments

  • Shadowmaster625 - Tuesday, September 8, 2009 - link

    Intel releases yet another new socket type, offering negligible performance enhancements vs socket 775. Soon they will obsolete another socket type still in use. And this is a good thing? I'm still dealing with the fallout from the socket 478...
  • DJMiggy - Tuesday, September 8, 2009 - link

    Thanks! Some good info! Now to decide what to do...
  • Rabman - Tuesday, September 8, 2009 - link

    Full disclusre -- I work for AMD, my comments are my own and do not reflect my employer, etc.

    A clarification on Windows 7's Core Parking feature -- it doesn't actually "[look] at the performance penalty from migrating a thread from one core to another". Rather, Core Parking was designed as a power saving feature for multi-core server machines, and is only enabled on Windows 7 client SKUs where HT is present (I won't get into specifics as to why this decision was made). The side benefit for processors with HT is that the hyperthreads can be parked so the Windows scheduler will spread threads across the "real" cores first, resulting in better performance characteristics.
  • rbbot - Tuesday, September 8, 2009 - link

    That implies that it would have a negative effect on the chances of turbo mode engaging. On other OS, pure random chance would sometimes assign a waking thread to the hyper-core of the one already executing at full pelt. However, this means that on Windows 7, core parking prevents this happening and always wakes a 2nd core for the 2nd thread.

  • puffpio - Tuesday, September 8, 2009 - link

    If you disable turbo mode, will the individual cores still power down when unused?

    Take the 860 for example. With turbo mode enabled you get these overclocked speeds:
    3C/4C Active: 3.54GHz
    2C Active: 3.85GHz
    1C Active: 4.00GHz

    but with turbo mode disabled you get 3.99GHz at 1/2/3/4 cores active.
    If the cores are still able to be powered down w/ turbo mode disabled, it would seem that would give you the best performance at any core activity level.
  • Comdrpopnfresh - Tuesday, September 8, 2009 - link

    Specifically; power consumption, efficiency, and productivity/performance. On the consumer scale though- obviously with single-cpu boards benches geared towards commercial use would be droll.
  • AFUMCBill - Tuesday, September 8, 2009 - link

    Great Review.

    You mentioned the rising popularity of the uATX platform.
    I would guess this is related to the rising popularity of laptops.
    Except you can't find anything close to the performance of a Core i7 or i5 processor in a laptop form factor at anything remotely resembling a reasonable price - as in thousands and thousands of dollars extra. So people are headed to the uATX platform and the small(er) LAN party type boxes to get mobile performace. In my case I would like to be able to load high bitrate (25 Mbps and up) MPEG2 and MPEG4 footage into my video editor and have at it. My Q6600 handles the MPEG2 fine, but not the MPEG4 (AVCHD).

    Found the Core i7 860 available at MicroCenter for $229.99 USD.
    For me to make the buy, the only thing that is missing is USB 3.0.
    Next year is looking good...and prices are likely to be even lower then :-)
  • Peroxyde - Tuesday, September 8, 2009 - link

    Just checked at Newegg. Is there any error on the price? The newer and more performance i5 750 costs $209. The Q9550 cost $219. That sounds illogical.
  • AFUMCBill - Tuesday, September 8, 2009 - link

    I think it's called having old stock that was purchased before the new announcements. Obviously the folks they are going to be selling to are ones who are updating the processor in an older 775 socket motherboard based system - which with the new announcements are now rapidly receding into the past.
  • C'DaleRider - Tuesday, September 8, 2009 - link

    Sucks to have to depend on Newegg for buying, esp. considering what MicroCenter is doing. $199 for the i7 920 while Newegg gouges at $279, or the i5 750 for $179.

    Newegg long ago ceased being the place for the best prices.

Log in

Don't have an account? Sign up now