The Software Answer to Intel’s Turbo Mode

The Core i7 has its Turbo Mode. When only one core is active, power is shut off to all other cores and the active one can run at higher clock speeds. Even when all four cores are active, as long as the processor isn’t running too hot, the entire chip can run at a higher frequency. The default turbo speeds for the three i7s out on the market are listed below:

Processor Stock Clock Speed Max Turbo (All Four Cores Active) Max Turbo (One Core Active)
Intel Core i7-965 3.20GHz 3.33GHz 3.46GHz
Intel Core i7-940 2.93GHz 3.06GHz 3.20GHz
Intel Core i7-920 2.66GHz 2.80GHz 2.93GHz

 

Turbo mode is an interesting concept because it allows end users to experience higher performance without ever having to even know about overclocking. In its first incarnation on Nehalem, Intel was quite conservative with how far they pushed Turbo. Given that our Core i7 920 had no problems running at up to 3.8GHz without increasing its core voltage, I’d say that Intel could stand to be a bit more aggressive with its turbo modes.

Note that in all of our multithreaded tests, the Core i7 has no problems running at its “turbo” frequency. In other words, the i7-920 runs at 2.80GHz far more frequently than it runs at 2.66GHz.

AMD unfortunately does not have a similar capability in its processors. Intel dedicated around 1M transistors to an on-die microcontroller called the PCU (Power Control Unit). The Core i7’s PCU tracks power consumption and manages clock speeds and voltages to ensure that the i7 always delivers the best performance when you need it, and the lowest power consumption when you don’t.

AMD didn’t go in and introduce any major architectural changes in the Phenom II X4 955 and most likely won't for a while. Instead, AMD is hoping to deliver a turbo-like feature in software.

The software is AMD’s Overdrive Utility 3.0 and the feature is called AMD Smart Profiles.

It works like this: you select an executable, and then using AOD3 you can set what cores that executable should run on and what clock speeds those cores should run at. AMD will have predefined profiles for various games but you can also create your own profiles.

Since the majority of desktop applications aren’t multithreaded, you’d stand to benefit from running two of your cores at a higher frequency and dropping the clock speed on the remaining two cores. That way you could keep power consumption unchanged, while improving performance.

It does work, I ran a quick test on Left 4 Dead and Far Cry 2:

Processor Left 4 Dead Far Cry 2
Smart Profiles Enabled 124.1 fps 53.3 fps
Smart Profiles Disabled 121.4 fps 51.2 fps
Performance Improvement 2.2% 4.1%

 

In these cases the games were forced to run on two cores specifically (at least the .exe files) and those two cores were overclocked by 400MHz, while the remaining two cores were underclocked by 400MHz. The end result was that performance improved by 2% in Left 4 Dead and 4% in Far Cry 2.

There’s obviously more flexibility with AMD’s method than Intel’s and you could potentially get a much greater performance improvement. There are a couple of downsides. For starters, there’s no guarantee of stability but if you keep the overclocks mild and/or accompany them with a corresponding increase in core voltage then you can minimize that risk. The other obvious downside is that for the majority of applications and games you’ll have to create profiles manually. Honestly I’d prefer if the CPU did the work for me, but if you enjoy customizing your system then you’ll feel right at home with AOD3’s Smart Profiles.

AMD will add new profiles as time goes on. AMD does have plans for a turbo mode in future microprocessors, but we won’t see them until 2011.

The Move to DDR3: You Can Do It AMD Black Edition Memory Profiles: One Click Memory Overclocking
Comments Locked

65 Comments

View All Comments

  • corporategoon - Thursday, April 23, 2009 - link

    It seems every article posted at Anandtech lately has been riddled with sentence fragments. "Especially now that DDR2 and DDR3 are closer in price" isn't a sentence.

    C'mon guys, this mumbo-jumbo would never fly in print.
  • hooflung - Thursday, April 23, 2009 - link

    It has been proposed, or should I say down right leaked, that AMD is going to a 128bit register. Not in that we'll have a 128bit OS but it will be able to push 2 64bit registers once.

    This is what Bulldozer is rumored to be at. If so, then it is quite possible AMD has an answer for HT. Now once they go to 32nm, better gate efficiency, SSE5 ( if anyone actually puts out an APP for it ) and higher clocks with tripple/quad DDR3 we might see some real heavyweight cpu's that will make the i7 tremble.

    That is if AMD survives the Quarterly losses.
  • Spoelie - Friday, April 24, 2009 - link

    Larrabee derivative cores have 256bit registers and AVX extensions.. I imagine the successor to i7 will have a few of those as well.

    I hope amd drops the SSE5 bit and focuses on AVX, only reason not to is (1) pride (2) too late into bulldozer development. First is a bad reason and second would be a shame. Technically AVX is superior to SSE5.
    http://aceshardware.freeforums.org/intel-avx-kills...">http://aceshardware.freeforums.org/intel-avx-kills...

  • hooflung - Friday, April 24, 2009 - link

    It doesn't matter that Larrabee has these things. Larrabee isn't going to be used much for integers and that is what is pushing the i7's death blows to AMD's tech right now.

    The Larrabee 256bit registers have absolutely nothing to do with the 128bit registers AMD is rumored to have on bulldozer. Larrabee isn't going to replace the i7 it will enhance the ability to have a chip that can handle graphics and possibly ray tracing. AVX is the new SSE engine and you need you need 256bit registers to get better precision. While AMD's 128bit registers are for doing normal work, or having copies of the same register call in case of branch prediction failure, in the normal CPU pipeline. ( the last part is my speculation on branch prediction )

    And AMD has been talking about Fusion for years. When they go to a smaller process they can put stream proc.s from the ATI tech on their chips if they want AVX type work to be done.
  • Leyawiin - Thursday, April 23, 2009 - link

    Gaming results look very promising. I'll give it a couple months for the prices to drop a bit and I think this will be my new build.
  • TA152H - Thursday, April 23, 2009 - link

    The introduction into this article was really bad. I thought I had jumped into the middle of it. I backed out and tried again. You might have thought it was clever or different, but, maybe too different.

    What are you going to try next? Putting an intro on page 5? That would be different too, and you'd get more clicks, since people would think they were on the wrong page.
  • DeOderView - Thursday, April 23, 2009 - link

    Nice article!

    I'll bet Intel's team is poopping in their pants, now! :P

    Gratz to AMD's team on core speed achievement!

    Getting triple bus pipes (core I7) or quad. bus pipes is nothing new to AMD since that was their theory back in the day. Increasing bus pipes is just a matter of times and strategy for AMD team. By what've I meant:

    Pentium = single bus, high core speed
    Athlon = dual bus, low core speed (win)

    Core 2 Dual = dual buses, high core speed
    Phenom II = dual bused, high core speed

    Core I7 = triple buses, high core speed
    AMD = ???

    From what I've seen, core speed is pretty much hitting the ceilling and bus pipes... the sky is limited! To my surprise, you guys as a technical aspect and knowledge, and failed to realize this?!?!?!

    It's getting interesting more and more each day as AMD achieved their core speed milestone!

    Just you wait and see! :P
  • JarredWalton - Thursday, April 23, 2009 - link

    The number of memory channels has severely limited potential for returns on investment. Going from single to dual-channel was good for up to 10% back in the day, and perhaps 15% now. Tri-channel i7 will probably end up being 2-5% faster than dual-channel i5, all other things being equal.

    The problem is that adding more memory channels can dramatically increase bandwidth, but it does nothing for latency - and in some cases can even make latency higher. The big benefit for AMD in the Athlon 64 era wasn't the dual-channel chipset; it was the integrated memory controller. That has a major impact on latency, but it can only be done once. AMD and Intel have both done it now, so what's next?

    Let's put it this way: imagine a CPU running at 4GHz, so the clock cycle time is 250ps. Now let's take that same CPU with memory that has an average latency of 75ns - or 75000ps. That means any time the CPU has a cache miss and has to hit main memory, it loses 300 clock cycles!

    Of course, memory is accessed in chunks, so the CPU might load up a 256 byte cache line at a time. That means over a dual-channel 128-bit bus it needs to hit memory 16 times to load a whole cache line from memory. If you were to jump to a quad-channel memory interface, you could load a cache line into memory in half the time. The first access to memory is the one usually holding up the CPU, however, so the net benefit of doubling the channels/bandwidth is pretty small. Even with infinite bandwidth, the 300 CPU cycle latency to access main memory is still there.

    While it was written several years ago, http://www.anandtech.com/memory/showdoc.aspx?i=222...">the basic tenets of this article are still valid.
  • TA152H - Thursday, April 23, 2009 - link

    This might sound crazy, but I'm still thinking at some point they'll put SRAM back on the motherboard as a fourth level cache. You'd cut the access time dramatically, since you don't need a refresh cycle. You'd still be really slow compared to on board memory, but it would still be a lot better than DRAM. On-board memory controllers would help this, and also the relatively low clock speeds of processors compared to bus speeds would help some as well (compared to the Pentium 4 days, where the processors ran at higher clock speeds, and the memory bus at lower).

    It wouldn't add too much cost either. For Intel it might not make much sense, but for AMD, since they can't make a competitive processor, they can give it additional help in other areas. Add a 128 MB cache on the motherboard, and you'd beat Intel processors in some benchmarks, and in others you'd close the game some. The motherboards would still be cheaper than the overpriced i7 ones.

    Maybe you'd need 256mb, I don't know, but my point is, there's a real benefit, and AMD needs what help they can get to compete with the i7. It's a monster, but it still waits a long time on memory. That's an important advantage to have.
  • duploxxx - Thursday, April 23, 2009 - link

    It is good that the pricepoint is compared for all platforms, but i suggest you check the overall price again, in your final words you state it is a small price difference to go to i7... but check the board prices again on newegg.

    the cheapest amd am3 board is on sale for 95$
    the cheapest intel i7 board is on sale for 195$

    that is more then 10-70$ not to mention the am2+ compatibility and way cheaper boards although off course you lack a bit of features, but then again how much do you really need?

    http://www.newegg.com/Product/Product.aspx?Item=N8...">http://www.newegg.com/Product/Product.aspx?Item=N8...
    http://www.newegg.com/Product/ProductList.aspx?Sub...">http://www.newegg.com/Product/ProductLi...10706453...

Log in

Don't have an account? Sign up now