The Intel Skylake Mobile and Desktop Launch, with Architecture Analysis

Name: The Intel Skylake Mobile and Desktop Launch, with Architecture Analysis
Item: The Intel Skylake Mobile and Desktop Launch, with Architecture Analysis
Author: Dr. Ian Cutress

by Ian Cutress on September 1, 2015 11:05 PM EST

Posted in
CPUs
Intel
Core M
Skylake
eDRAM

173 Comments | Add A Comment

173 Comments

A small portion of this page was posted as part of our initial Skylake-K coverage.

The High Level Core

Taking a high level view, the application of the Skylake-S/H architecture features results in the following:

From most perspectives, the central image provided by Intel could have been taken from a Broadwell presentation if you scrubbed out the ‘GT4’ element of the design, but the move to Skylake suggests improvements in each of these areas either in terms of functionality, bandwidth, power consumption, or all of the above.

Out of the box, the Skylake IA core is designed to extract instruction level parallelism from code, and the design allows for more operations to be dispatched, queued and retired in a single clock. We cover this more in the next page. But to augment this, the data transfer ring between cores, cache, graphics and the system agent is also increased in read bandwidth to 128b, allowing more data to be supplied to the cores. In terms of the cores themselves, the big upgrades surround in-flight management of data, especially with respect to cache misses. We have been told that a number of these upgrades, while they might not be readily usable by current software, should have effects in certain vertical markets (finance, big data, etc).

From a memory connectivity standpoint, most processors will support both DDR3L and DDR4 with a dual memory controller design (or a single memory controller that supports both). Skylake-Y (Core M) is the exception here, and will only work in DDR3L/LPDDR3 modes. As always with multi-DRAM support designs, only one standard can be in use at any time.

On the PCI-Express Graphics allocation side, the Skylake-H and Skylake-K processors will have sixteen PCIe 3.0 lanes to use for directly attached devices to the processor, similar to Intel's previous generation processors. These can be split into a single PCIe 3.0 x16, x8/x8 or x8/x4/x4 with basic motherboard design. (Note that this is different to early reports of Skylake having 20 PCIe 3.0 lanes for GPUs. It does not.)

With this SLI will work up to x8/x8 and CrossFire to x8/x8 or x8/x4/x4 in both desktop and mobile designs. For desktops, if a motherboard supports x8/x4/x4 and a PCIe card is placed into that bottom slot, SLI will not work because only one GPU will have eight lanes. NVIDIA requires a minimum of PCIe x8 in order to enable SLI. Crossfire has no such limitation, which makes the possible configurations interesting. Below we discuss that the chipset has 20 (!) PCIe 3.0 lanes to use in five sets of four lanes, and these could be used for graphics cards as well. That means a motherboard can support x8/x8 from the CPU and PCIe 3.0 x4 from the chipset and end up with either dual-SLI or tri-CFX enabled when all the slots are populated.

For Skylake-U/Y, these processors are not typically paired with discrete graphics and as far as we can tell, the PCIe lanes have been removed from these lines. As a result, any storage based on PCIe (such as M.2) for devices based on these processors will be using the chipset PCIe lanes. As mentioned later, the chipsets on U/Y also differ to their more desktop oriented counterparts.

DMI 3.0

The Skylake-S and H processors are connected to the chipset by the four-lane DMI 3.0 interface. The DMI 3.0 protocol is an upgrade over the previous generation which used DMI 2.0 – this upgrade boosts the speed from 5.0 GT/s (2GB/sec) to 8.0 GT/s (~3.93GB/sec), essentially upgrading DMI from PCIe 2 to PCIe 3, but requires the motherboard traces between the CPU and chipset to be shorter (7 inches rather than 8 inches) in order to maintain signal speed and integrity. This also allows one of the biggest upgrades to the system, chipset connectivity, as shown below in the HSIO section.

Skylake-Y/H, by virtue of having the chipset on the processor package, can keep the interface between the CPU and IO very simple and uses what they call an OPIO – on package input-output. We have seen this as far back as Haswell to deliver 4GB/s of bandwidth at 1pJ/bit, and has been listed as being very power efficient as well as highly scalable.

CPU Power Delivery – Moving the FIVR

Moving on to power arrangements, with Skylake the situation changes as compared to Haswell. Prior to Haswell, voltage regulation was performed by the motherboard and the right voltages were then put into the processor. This was deemed inefficient for power consumption, and for the Haswell/Broadwell processors Intel decided to create a fully integrated voltage regulator (FIVR) in order to reduce motherboard cost and reduce power consumption. This had an unintended side-effect – while it was more efficient (good for mobile platforms), it also acted as a source of heat generation inside the CPU with high frequencies. As a result, overclocking was limited by temperatures and the quality of the FIVR led to a large variation in results. For Broadwell-Y, this also resulted in an increase for the z-height of the processor due to having leveling transistors on the rear of the package. As a result, in order to decrease the z-height of any corresponding product, a hole in the motherboard had to be forged:

For Skylake, the voltage regulation is moved back into the hands of the motherboard manufacturers. This should allow for cooler processors depending on how the silicon works, but it will result in slightly more expensive motherboards.

A slight indication of this will be that some motherboards will go back to having a large amount of multiplexed phases on the motherboard, and it will allow some manufacturers to use this as a differentiating point, although the usefulness of such a design is sometimes questionable.

The Claims: Performance and Power The Skylake Package: 100-Series Chipsets and DRAM

PRINT THIS ARTICLE

Post Your Comment
Please log in or sign up to comment.

Comments Locked

173 Comments

View All Comments

extide - Saturday, September 5, 2015 - link
Well, a decrease in performance from cannonlake to skylake would be correct. However, I assume you mean haswell, not cannonlake, and that is probably be due to the L2/FMUL changes. However you are also looking at chips with different clockspeeds, with haswell having a faster clock so that also contributes to this result.

It is somewhat disappointing that Intel has decided to make changes that significantly favor power consumption over performance.

I have a feeling the Xeons will not have these same changes, so it will be interesting to see what the Skylake E5's are like...
shodanshok - Sunday, September 6, 2015 - link
Mmm, I have the very opposite feeling: I think that these changes were done explicitly to the benefit of server and mobile chips. These two categories (server and mobile) are greatly limited by their power usage (and by their ability to effectively remove the generated heart), while are only marginally dependent on FPU performance.

So trading some performance for improved power efficiency suddenly make a lot of sense, especially if Intel want to continually increase Xeon's for number (and it seems so).
SeanJ76 - Saturday, September 5, 2015 - link
Not impressed...
exmachiner - Monday, September 7, 2015 - link
Why is there no Desktop SKU with GT4e/Iris Pro ? Will it launch at a later date ? There is an Iris Pro version in Broadwell IIRC.
ZachSaw - Monday, September 7, 2015 - link
I'd be interested to know what its relative performance is vs a discrete card like 750ti when it comes to the SM5.0 version of NNEDI3 with MPDN. Intel GPUs surprisingly run twice as fast with the shader version as compared to the OpenCL version (AMD loves shader too - the only exception here is NVIDIA's Maxwell architecture). It'll be interesting to see if Skylake is the perfect HTPC!
janolsen - Tuesday, September 8, 2015 - link
If I understand pg 3 slide correctly , eDRAM will only be for BGA - and thus no Iris iGPU for desktop, broadwell chips may be a bit faster for those not needing a GPU for gaming and similar.
HollyDOL - Wednesday, September 9, 2015 - link
Hm, is this a paper launch only? Only available parts until now are 6600K and 6700K (all czech big e-shops as well as ones like newegg). Awaiting 6100T eagerly (want to build mITX baby for my mom since her old (ancient) computer died 2 weeks ago)... and for obvious reasons I'd rather prefer new platform than the old one in case there was ever need to upgrade something (which I doubt but still...)
qasdfdsaq - Thursday, September 10, 2015 - link
I feel like there's something missing here. We get 15w dual-core parts with Iris GT3e, but quad-core parts are all 45w with no GT3e. Indeed, there's no quad-core mobile chips with Iris graphics although Broadwell and Haswell both had them in the 45w quad-core range. There's certainly no issue fitting it in the power envelope, given you can literally fit 3x 2-core chips with GT3e into the 45w TDP.
LDW - Friday, September 18, 2015 - link
I like to have a laptop for its portability and am not willing to buy a second system for my occasional gaming. In my experience, games like civ 5 , civ be and skyrim are happy with two processors but would like more graphics power than my current laptop. (i7-4700MQ with no additional graphics chips)...

To my surprise, I find that the H series of processors have less graphics power than the U series. I suspect that the U series 2 processors, 4 threads would be just fine for the games I play and I know they would like the additional graphics power. So I'm likely to be looking at the U series as I look at replacements for my current laptop, not the H series as I expected.

I'm curious if others reach that conclusion as well.... and am looking forward to anandtech's future comparisons between the H and U series graphics capabilities.

ldw
francisca euralia - Tuesday, October 20, 2015 - link
hello, can u give me a sumary od this page with the most important definition?

The Intel Skylake Mobile and Desktop Launch, with Architecture Analysis

The High Level Core

DMI 3.0

CPU Power Delivery – Moving the FIVR

Post Your Comment

173 Comments

View All Comments

extide - Saturday, September 5, 2015 - link

shodanshok - Sunday, September 6, 2015 - link

SeanJ76 - Saturday, September 5, 2015 - link

exmachiner - Monday, September 7, 2015 - link

ZachSaw - Monday, September 7, 2015 - link

janolsen - Tuesday, September 8, 2015 - link

HollyDOL - Wednesday, September 9, 2015 - link

qasdfdsaq - Thursday, September 10, 2015 - link

LDW - Friday, September 18, 2015 - link

francisca euralia - Tuesday, October 20, 2015 - link

Log in

Don't have an account? Sign up now