Turbo Boost Max 3.0 (TBM3), aka Turbo Boost Max or Turbo Boost 3.0

When Intel released the enterprise focused Broadwell-EP Xeon CPUs, there were a few features added to the platform over the previous Haswell-EP generation. One of these has come through to the consumer parts, though in a slightly different form.

For Broadwell-EP, one of the new features was the ability to have each core adjust the frequency independently depending on AVX or non-AVX workloads. Previously when an AVX load was detected, all the cores would reduce in frequency, but beginning with BDW-EP now they act separately. Intel has taken this enterprise feature and expanded it a little into a feature they're calling ‘Turbo Boost Max 3.0’.

Turbo Boost 2.0 is what Intel calls its maximum Turbo or ‘peak’ frequency. So in the case of the i7-6950X, the base frequency is 3.0 GHz and the Turbo Boost 2.0 frequency is 3.5 GHz. The CPU will use that frequency when light workloads are in play and decrease the frequency of the cores as the load increases in order to keep the power consumption more consistent. Turbo Boost 2.0 frequencies are advertised alongside the CPU on the box - TBM3 will be slightly different and not advertised.

TBM3, in a nutshell, will boost the frequency of a single CPU core when a single-threaded program is being used.

It requires a driver, similar to Skylake’s Speed Shift feature (which is not in Broadwell-E), which should be distributed in new X99 motherboard driver packages, but will also be rolled out in Windows 10 in due course. It also comes with a user interface, which might make it easier to explain:

Each of the cores in the processor can be individually accessed by the OS with the new driver, and the cores will be rated based on their performance and efficiency as they come out of Intel. In the image above, Core 9 is rated the best, with Core 0 at the bottom. This means that for TBM3, the driver will primarily use Core 9.

When enabled, TBM3 will activate in two modes: either the foreground application, or from a priority list. For the foreground selection, when the software detects a single threaded workload in play, it will attempt to pin the software to the best core (similar to changing the affinity in task manager to one core), and then boost the frequency. In priority mode, the application will look for any application on the left-hand panel (which has to be added manually). If an application with higher priority is present, then the software will unpin the current software and take the higher priority one and pin that instead.

When pinned, the software will boost the frequency of that core only. The only question now is how much is the boost, and what is the effect on performance?  Unfortunately, both of those questions have bad answers.

Intel refuses to state the effect of TBM3, saying that ‘each CPU is different and could boost by different amounts’. Now, you might think that makes sense. However…

Turbo Boost Max 3.0 has to be supported by the motherboard manufacturer in the BIOS. The TBM3 settings have to be set in the BIOS, which means that the usefulness of such a feature is actually down to the motherboard manufacturers. But they know how to do it right, right? Well, here’s where it can get worse.

On the MSI motherboard we used for most of our testing, Turbo Boost Max 3.0 was disabled by default in the BIOS. We asked about this, and they said it was a conscious decision made by management a couple of weeks prior. This makes TBM3 useless for most users who never even touch the BIOS. That sounds good, right?

Well, the BIOS also sets how much the CPU can boost by. So ultimately it doesn’t matter how much the CPU might like to boost in frequency, the system will only boost by the amount it says so in the BIOS, which is set by the motherboard manufacturer. In the case of the MSI BIOS, it was set to ‘Auto’. In my case, ‘Auto’ meant a boost of zero, despite the MSI BIOS ‘suggesting’ 4000 MHz. I had to manually set Core 9 to a 40x multiplier. Then it worked.

All in all, TBM3 was only enabled after I changed two settings and specifically setting the correct core in the BIOS. For me, this isn’t a global feature if that is the case. That’s not to mention how Multi-Core Turbo also comes into the mix, which still works with Turbo Boost 2.0 speeds by default. Based on what we've seen, it would seem at this time that TBM3 isn’t being readily embraced at this time.

It should be noted that we also had one of the new ASUS motherboards in for testing, however time was too limited before leaving for Computex to verify if this is the case on the ASUS motherboard as well. ASUS has told me that they have/will have a software package that enables TBM3 to be applied to multiple cores at once, whereas the Intel software will only accelerate a single program. It should be interesting to test.

The Reviewers Problem With Turbo Boost Max 3.0

In the options menu for TBM3, there are two primary options to take note of. The first is the utilization threshold, which is the % at which the software will take control of the single threaded application and pin it to a core. By default, this is set at 90%.

The other option is where a dilemma will be faced. It is the evaluation interval, or the period of time between checks that the software makes in order to accelerate a program. The version of the software we had started with a value of 10 seconds. That means if the software package starts one second or nine seconds into a benchmark run, it can affect the score. The answer here would be to make the evaluation interval very small, but the software only has a one-second resolution. So for benchmarks that run for only a few seconds (anyone benchmarking wPrime or SuperPi, for example), might either fail to be accelerated if the evaluation window is set at default, or only slightly when set at one second.

As you can imagine, if a reviewer does not know if TBM3 is enabled or not, there may be some odd benchmark results that seem different to what you might expect. It should be noted that because of the BIOS issue and the potential for motherboard manufacturers to do something different with every product, we ran our benchmarks with TBM3 disabled, and readers should check to see if reviewers specify how TBM3 is being used when data is published.

Package Differences: It’s Thin

When the Skylake mainstream platform was launched, it was noted that the processor packages and substrates were thinner compared to the previous generation. It would appear that Intel is using the same packaging technology for Broadwell-E as well.

On the left is the Haswell-E based Core i7-5960X, and on the right is the Broadwell-E based Core i7-6950X. Both of these platforms use a FIVR, the Fully Integrated Voltage Regulator, which Intel equipped on this microarchitecture in order to increase power efficiency. Usually the presence of the FIVR would require additional layers for power management in the package, but it would seem that Intel has been optimizing this to a certain extent. Each individual layer is certainly thinner, but it is likely that Intel has also reduced the number of layers, though my eyes cannot discern the resolution needed to see exactly how many are in each CPU (and I don’t have a microscope on hand to test).

A couple of questions will crop up from readers regarding the thinner package. Firstly, on the potential for bending the package, especially in regards to a minor story on Skylake where a couple of CPUs were found to have bent when under extreme cooler force. As far as Intel is aware, Broadwell-E should not have a problem for a number of reasons, but mostly related to the dual latch socket design and socket cooler implementation. Intel’s HEDT platforms, from Sandy Bridge-E on, have been rated as requiring 30-40% more pressure per square inch then the mainstream platforms. As a result the sockets have been designed with this in mind, ensuring the pressure of the latch and cooler stays on the heatspreader.

The other question that would come to my mind is the heatspreader itself. Intel has stated that they are not doing anything new with regards to the thermal interface material here compared to previous designs, and it is clear that the heatspreader itself is taller to compensate for the z-height difference in the processor PCB.

If we compare the ‘wing’ arrangement between the Haswell-E and Broadwell-E processors, Intel has made the layout somewhat more robust by adding more contact area between the heatspreader and the PCB, especially in the corners and sides. One would assume this is to aid the thinner PCB, although without proper stress testing tools I can’t verify that claim.

The Broadwell-E Review The Market, X99 Refresh and our Test Setup
Comments Locked

205 Comments

View All Comments

  • RussianSensation - Tuesday, May 31, 2016 - link

    It's a slap in the face when 6850/6900/6950X are also crap overclockers and will get owned hard in games and every day tasks by a $310 6700K. The only CPU that even remotely makes sense is the 6800K. For workstation use case, dual Xeons will smash the 6950X. Heck, it's better to build a 6900K + 6700K in the same case allowing one to be productive and game at the same time. Phanteks makes such cases now. 6950X is just a way to show your status, nothing more.
  • mapesdhs - Thursday, June 9, 2016 - link

    Something usually missing from reviews now is an oc'd 4820K, which is annoying because a 4c IB-E on X79 allows for quite a lot of oc headroom given the high rating of the socket and the beefy power delivery available on boards like the R4E, etc. I bet it would give many of the newer CPUs a serious pelting.
  • Drazick - Tuesday, May 31, 2016 - link

    Could we have Extreme Edition with Iris Pro + 128MB eDRAM?

    That would be a great addition (Even only for the 6 Cores Part).
  • Eden-K121D - Wednesday, June 1, 2016 - link

    Iris pro would be useless but i agree with the eDRAM acting as L4 Cache
  • barleyguy - Tuesday, May 31, 2016 - link

    Great review.

    One possible omission though: You mentioned that the Xeon E-2640 is a better deal as far as price/performance, but there are no Xeons on the benchmark charts. Do you plan to review the E-2640 at some time in the future?

    Thanks.
  • ShieTar - Tuesday, May 31, 2016 - link

    That might indeed be a statement which needs to be proven by tests?
    The 2640 has 10 Broadwell-Cores at a 2.8GHz All-Cores-Turbo, the 6950X should have a 10-core-Turbo of 3.2GHz, so you might argue you get 87.5% of the Performance for ~60% of the CPU cost. But the 2640 does have a slower verified memory speed, which may have a little impact. And its turbo boost settings are defined to hit a 90W TDP, and I don't think you can change that even in a workstation with plenty of cooling available. Add to that the fact that you can overclock to improve the performance of the 6950X, and that the 700$ price difference should be considered relative to the overall system cost, and you probably end up with very similar price-to-performance ratios.

    I think the stronger challenger to the 6950X price-to-performance figures is the 2687W v4, which can be had for just over 2k$, and gets you annother 2 cores at almost the same clocks. That's ~16% more performance for ~16% more CPU cost, which translates into less than 10% higher system cost.
  • samer1970 - Tuesday, May 31, 2016 - link

    Hello ,

    we all know that games dont use more than 8 threads today ...

    so to take advantage of an 8 cores or 10 cores CPU in Gaming you should Disable HT (Hyperthreading) and run the gaming test again to compare it against the 4 cores i7 6700K .

    and test it with SLI as well to reach the i7 6700k bottleneck !

    let me put it more simple ,

    The i7 6700K has 4 cores and can oc to 4.4 ghz easy . this CPU will give us 8 Virtual cores comparable to 2.2 GHZ clock for each virtual core .

    However the 8 Coresi7 6900K , With the HT Turned OFF , will give us 8 cores @ 4.4 ghz EACH !

    Thats double the speed of the 4 cores i7 ! if the game uses 8 threads .

    EVEN if we dont OC the 8 cores , it would be 3.2GHZ VS 2.2 GHZ !!!

    if you ask why Disable HT ? simple because the game will never use 16 Virtual cores !!! and the advantage is LOST .

    Please run the test again for games with HT turned off .

    and to stress the CPU more , TEST SLI as well , we want the i7 6700K to bottleneck !

    THANKS

    oh and Intel Should release i5 Broadwel-E CPU , 8 cores without HT , CHEAPER and BETTER for GAMERS
  • RussianSensation - Tuesday, May 31, 2016 - link

    Nice try but no cigar. 6700K @ 4.8ghz + HT is the optimal gaming CPU. No current game even scales linearly across 6 cores + HT. 8-10 core CPU with a slower architecture would lose badly to an i7 6700K @ 4.8Ghz + DDR4 4000.

    There is not a chance 6950X @ 4.4Ghz can keep up with 6700K OC.
    http://www.techspot.com/article/1171-ddr4-4000-mhz...

    By the time games use 8-10 cores, we'll be on PS5/XB2 generation in 2020-2021 and Icelake-E. Broadwell-E 8-10 cores will be outdated.
  • adamod - Wednesday, June 1, 2016 - link

    http://www.pcworld.com/article/3039552/hardware/te...
    look at the ashes bench
  • mapesdhs - Thursday, June 9, 2016 - link

    Oh grud not here aswell! You've been banging on about this HT thing on toms for ages.

Log in

Don't have an account? Sign up now