Favored Core

For Broadwell-E, the last generation of Intel’s HEDT platform, we were introduced to the term ‘Favored Core’, which was given the title of Turbo Boost Max 3.0. The idea here is that each piece of silicon that comes off of the production line is different (which is then binned to match to a SKU), but within a piece of silicon the cores themselves will have different frequency and voltage characteristics. The one core that is determined to be the best is called the ‘Favored Core’, and when Intel’s Windows 10 driver and software were in place, single threaded workloads were moved to this favored core to run faster.

In theory, it was good – a step above the generic Turbo Boost 2.0 and offered an extra 100-200 MHz for single threaded applications. In practice, it was flawed: motherboard manufacturers didn’t support it, or they had it disabled in the BIOS by default. Users had to install the drivers and software as well – without the combination of all of these at work, the favored core feature didn’t work at all.

Intel is changing the feature for Skylake-X, with an upgrade and for ease-of-use. The driver and software are now part of Windows updates, so users will get them automatically (if you don’t want it, you have to disable it manually). With Skylake-X, instead of one core being the favored core, there are two cores in this family. As a result, two apps can be run at the higher frequency, or one app that needs two cores can participate. 

Speed Shift

In Skylake-S, the processor has been designed in a way that with the right commands, the OS can hand control of the frequency and voltage back to the processor. Intel called this technology 'Speed Shift'. We’ve discussed Speed Shift before in the Skylake architecture analysis, and it now comes to Skylake-X. One of the requirements for Speed Shift is that it requires operating system support to be able to hand over control of the processor performance to the CPU, and Intel has had to work with Microsoft in order to get this functionality enabled in Windows 10.

Compared to Speed Step / P-state transitions, Intel's new Speed Shift terminology changes the game by having the operating system relinquish some or all control of the P-States, and handing that control off to the processor. This has a couple of noticeable benefits. First, it is much faster for the processor to control the ramp up and down in frequency, compared to OS control. Second, the processor has much finer control over its states, allowing it to choose the most optimum performance level for a given task, and therefore using less energy as a result. Specific jumps in frequency are reduced to around 1ms with Speed Shift's CPU control from 20-30 ms on OS control, and going from an efficient power state to maximum performance can be done in around 35 ms, compared to around 100 ms with the legacy implementation. As seen in the images below, neither technology can jump from low to high instantly, because to maintain data coherency through frequency/voltage changes there is an element of gradient as data is realigned.

The ability to quickly ramp up performance is done to increase overall responsiveness of the system, rather than linger at lower frequencies waiting for OS to pass commands through a translation layer. Speed Shift cannot increase absolute maximum performance, but on short workloads that require a brief burst of performance, it can make a big difference in how quickly that task gets done. Ultimately, much of what we do falls more into this category, such as web browsing or office work. As an example, web browsing is all about getting the page loaded quickly, and then getting the processor back down to idle.

Again, Speed Shift is something that needs to be enabled on all levels - CPU, OS, driver, and motherboard BIOS. It has come to light that some motherboard manufacturers are disabling Speed Shift on desktops by default, negating the feature. In the BIOS is it labeled either as Speed Shift or Hardware P-States, and sometimes even has non-descript options. Unfortunately, a combination of this and other issues has led to a small problem on X299 motherboards.

X299 Motherboards

When we started testing for this review, the main instructions we were given was that when changing between Skylake-X and Kaby Lake-X processors, be sure to remove AC power and hold the reset BIOS button for 30 seconds. This comes down to an issue with supporting both sets of CPUs at once: Skylake-X features some form of integrated voltage regulator (somewhat like the FIVR on Broadwell), whereas Kaby Lake-X is more motherboard controlled. As a result, some of the voltages going in to the CPU, if configured incorrectly, can cause damage. This is where I say I broke a CPU: our Kaby Lake-X Core i7 died on the test bed. We are told that in the future there should be a way to switch between the two without having this issue, but there are some other issues as well.

After speaking with a number of journalists in my close circle, it was clear that some of the GPU testing was not reflective of where the processors sat in the product stack. Some results were 25-50% worse than we expected for Skylake-X (Kaby Lake-X seemingly unaffected), scoring disastrously low frame rates. This was worrying.

Speaking with the motherboard manufacturers, it's coming down to a few issues: managing the mesh frequency (and if the mesh frequency has a turbo), controlling turbo modes, and controlling features like Speed Shift. 'Controlling' in this case can mean boosting voltages to support it better, overriding the default behavior for 'performance' which works on some tests but not others, or disabling the feature completely.

We were still getting new BIOSes two days before launch, right when I need to fly half-way across the world to cover other events. Even retesting the latest BIOS we had for the boards we had, there still seems to be an underlying issue with either the games or the power management involved. This isn't necessarily a code optimization issue for the games themselves: the base microarchitecture on the CPU is still the same with a slight cache adjustment, so if a Skylake-X starts performing below an old Sandy Bridge Core i3, it's not on the game.

We're still waiting to hear for BIOS updates, or reasons why this is the case. Some games are affected a lot, others not at all. Any game we are testing which ends up being GPU limited is unaffected, showing that this is a CPU issue.

Analyzing The Silicon: Die Size Estimates and Arrangements Power Consumption, Test Bed and Setup
Comments Locked

264 Comments

View All Comments

  • rascalion - Monday, June 19, 2017 - link

    Are the Ryzen numbers in the charts retests using the last round of bios and software updates?
  • Ian Cutress - Monday, June 19, 2017 - link

    As much as possible, the latest BIOSes are used.
    Our CPU testing suite is locked in for software versions as of March 2017. This is because testing 30/50/100+ CPUs can't be done overnight, we have to have rigid points where versions are locked in. My cycle is usually 12-18 months. (Note I'm only one person doing all this data.)
  • FreckledTrout - Monday, June 19, 2017 - link

    Ian any chance once there are a few BIOS tweaks you could say do a mini updated review on the 7820x vs Ryzen 1800x. With Ryzen having latest BIOS as well plus 3200Mhz memory. I'm just curious really how the 8-core guys line up when some of the dust settles and I think a lot of people will be.
  • Ian Cutress - Monday, June 19, 2017 - link

    Any reason why 3200? I'll have Intel people saying it is pushing the Ryzen out of spec
  • jjj - Monday, June 19, 2017 - link

    You could do a memory subsystem scaling review for all platforms, Skylake X, Threadripper, Ryzen (Summit Ridge) and Coffee Lake. Cache, interconnect, DRAM. See where they are, how they scale, where the bottlenecks are, single rank vs dual rank modules and perf impact in practice.Why not even impact on power and efficiency.

    In any case, you'll need to update Ryzen 5 and 7 results when Ryzen 3 arrives , isn't it?

    For DRAM at 3200 it might be out of spec - overclocking the core is out of spec too but that has never stopped anyone from overclocking the memory. Right now 3200 is what a lot of folks buy , at least for higher end mainstream Ofc some will argue that Ryzen scales better with memory and that's why it is unfair but it's a hell of a lot more reasonable than testing 1080p gaming with a 1080 TI since it's a popular real world scenario.

    At the end of the day the goal should be to inform, not to watch out for Intel's or AMD''s feelings.
  • vanilla_gorilla - Monday, June 19, 2017 - link

    >For DRAM at 3200 it might be out of spec - overclocking the core is out of spec too but that has never stopped anyone from overclocking the memory.

    This. Exactly. We're enthusiasts and we always push the envelope. No one cares what the specs are all we care is about what these processors are capable of in the right hands.

    And Ian I think you guys do an awesome job, there's no other place I look for CPU benchmarks. Keep up what you do, we all appreciate it, as well as your willingness to have a dialog with us about the process. Really cannot say how impressed I am by how open and engaged you are, it's really commendable.
  • Ian Cutress - Tuesday, June 20, 2017 - link

    Thanks for the comments :)

    Though on your comments about pushing things out of spec. We have a good deal of readers who want plain stock for their businesses - AT isn't solely a consumer focused site. Otherwise I'd just jack all the CPUs and just post OC results :D Our base testing will always be at stock, and for comparison testing there has to be an element of consistency - testing an OC'ed part against a stock part in a direct A vs B comparison is only going to end up with a barrage of emails being rammed down my throat. There has to be some planning involved.
  • Ian Cutress - Tuesday, June 20, 2017 - link

    I've been planning a memory scaling article, I just haven't had the time (this article was around 6 weeks of prep with all the events going on that I had to attend).

    Note we don't retest stuff every review. With our new 2017 test suite, I've been going through regression testing. Usually regression testing is done once for the full segment until the benchmarks are changed again. I'll look at my next few months (still stupidly busy) and look at the priorities here.
  • FreckledTrout - Monday, June 19, 2017 - link

    Most people can easily buy a 3200 kit for not a lot of extra money. It doesn't take a lot tweaking(well not anymore on AGESA 1.0.0.6) or silicone lottery like an OC, just a bit more cash. From what I have seen with Ryzen it is the sweet spot on price and performance. I would assume Its likely the most chosen configuration on the R7's. To make it fair use 3200 on the 7820x as well. I only ask because Ryzen did way better than I would have thought and would like to see it with 3200Mhz memory and latest updates to see really how close Intel and AMD are on 8-core systems. Then im going to build :)
  • tipoo - Monday, June 19, 2017 - link

    Launch review! Nice work dude(s).

Log in

Don't have an account? Sign up now