Final Words

Based on our testing of Skylake-K, performance per clock did not impress as expected, but by going back several generations it gave an overall performance boost worthy of an upgrade from 3-to-5 year old platforms where more CPU performance might be a good thing to have. The chipset side of the equation is also blown wide open, with PCIe ports being able to provide any PCIe x4 or under idea through the chipset without having to share bandwidth.

The reasons for Skylake-K’s clock-for-clock performance relates to the fact that Skylake is a mobile first design, and Intel has focused a lot of resources in two areas. First is power delivery, by adding more power gating and finer frequency control direct to the hardware through Speed Shift, the processor can take advantage of unused parts as well as improving the power consumption of a non-steady state workload. Speed Shift does two things - it requires operating system support, but it allows the operating system to hand control of frequency adjustments back to the hardware within an OS specified range. As the processor is able to adjust its own frequency settings almost 30x faster than the operating system, it can respond to dynamic workloads such as those experienced in day-to-day use of a device faster than the OS.  The second part of Speed Shift relates to the power states of a system, and the Skylake platform allows this hardware control to be substantially more granular than the high level P-states, allowing the onboard controller to determine the best efficiency mode to run the processor with respect to system power (which is also now tracked). When the system is in its most efficient state and performance is not a concern, Skylake can also induce a duty cycle for the processor cores, switching them on and off, to remain at the most efficient power state when performance is not a priority.

Also on the power side of things, the integrated graphics gets a boost here too. Intel has added a number of fixed function units, such as those for HEVC decode, to reduce power as well as more power gating and frequency domain differentiation. Generation 9 graphics also gets added RAW processing functionality for camera setups, and Multi Plane Overlay performs stretch/rotation and translation of parts of the desktop directly on the display controller rather than requiring trips out to main memory to waste power in doing so. Multi Plane Overlay is still in its infancy, but it is part of the graphics platform that Intel seems keen to improve over the years. All together, from a power perspective, it will be interesting to feel the claimed benefits of the new architecture.

The other avenue for Skylake’s design choices stems from the corporate or business aspect of Intel’s platform. At some point a form of the Skylake architecture will find its way into an extreme platform Xeon, which means the design must be able to cope from 4.5W in Skylake-Y up to a potential of 140W in a Xeon E5 (which is what currently exists on Haswell-EP). The best way to do this is by making the design more modular, adding in performance when necessary. While Intel has not been explicit about which segments are more modular than others, we have been indicated that the L2 cache and the movement from an 8-way associative design to a 4-way design is part of this.  It is worth noting that this change also affords power benefits for Skylake, at a small expense in performance.

Also, as per our architecture analysis, despite the decrease in L2 associativity, CPU performance limited environments benefit from wider and deeper execution buffers, attempting to extract as much instruction parallelism as possible. Skylake extends this by providing a larger out-of-order window for micro-ops to queue into, providing larger buffers for in-flight data (both load and store), doubling of the L2 cache miss bandwidth, general improvements to the branch predicition to reduce unnecessary speculation, an increase in micro-op dispatch and retirement as well as new instructions for better cache management. As per our understanding, these under-the-hood changes will be seen as the Skylake platform develops.

From the processor perspective, Skylake affords an interesting dynamic. For the most part, the TDP ratings in the stack of processors remain similar to those in Broadwell, meaning that the 15W design for Broadwell should only need minor adjustments (new motherboard, new ports for new features) and still provide sufficient cooling but at better performance and/or more battery life. Despite not being directly released today, Intel is talking a lot about its eDRAM implementations, both as 4+3e (quad core, GT3 with 64MB) models and 4+4e (quad core, GT4 with 128MB) models.

The addition of Iris and Iris Pro graphics to Intel's 15W and 28W SKUs stands to significantly improve the state of graphics performance on ultrabooks and near-ultrabook form factors. Intel has offered processors with more powerful graphics since Haswell, although seemingly begrudgingly, withholding larger GPUs and eDRAM for the 45W SKUs. While OEMs will be paying more for these higher performance processors, these new Iris and Iris Pro configurations are practically a love letter to Apple, who has long desired more powerful iGPUs for their products and was a major instigator of the Iris lineup in the first place. We wouldn’t be the least bit surprised to see Apple jump on these new parts for the 13” MacBook Pro and the MacBook Air series, as they deliver just what Apple has been looking for. 

This ends up being doubly beneficial for all concerned, as the actual working nature of the eDRAM has changed, making it more like a DRAM buffer with lower latency and transparent to software, but it is clear that Intel is releasing substantially more parts than it ever did on Haswell and Broadwell, with more to follow as time progresses. It is unclear if an eDRAM part will make the desktop however, though we might imagine a Skylake-H part with eDRAM in a future iMac in order to update the line.

Two new parts that will be constantly discussed over the next few months is the overclockable mobile processor, the i7-6820HK and the mobile Xeons. Firstly the overclockable processor is most likely only going to be in the large desktop replacement type of environments, coming out as a 45W part, but it will most likely end up in gaming systems where users want to add more heat and voltage to their gaming system. We might see some interesting laptop designs as a result of this.  The mobile Xeon aspect of the release is Intel's first foray into Xeons specifically targeted at mobile, rather than a repurposed desktop processor in a bulky chassis. At this point, Intel is only launching a couple of E3 v5 models at 45W, suggesting that they will still end up in large workstation chassis with professional graphics cards, but it implements a step that Intel might take down into the notebook segment with both ECC and vPro and other Xeon features that translate well into business environments.

As always, if previous analysis has proven anything, architecture discussions can promise much on paper but where it really matters is if it translates through the devices into which the architecture is used. Implementations such as Speed Shift, fixed function units and an increase in power gating/frequency control should all impact a substantial segment of users, assuming all these functions have the correct operating support. Arguably how earth shattering or paradigm shifting Intel’s Skylake platform is can be up for discussion, but this week is the tech show IFA in Berlin where we expect a number of OEMs to start showing devices as well as processors actually going on sale. We will update with product testing when we can get our hands on them.

Processor Lists and Conclusions
Comments Locked

173 Comments

View All Comments

  • extide - Saturday, September 5, 2015 - link

    Well, a decrease in performance from cannonlake to skylake would be correct. However, I assume you mean haswell, not cannonlake, and that is probably be due to the L2/FMUL changes. However you are also looking at chips with different clockspeeds, with haswell having a faster clock so that also contributes to this result.

    It is somewhat disappointing that Intel has decided to make changes that significantly favor power consumption over performance.

    I have a feeling the Xeons will not have these same changes, so it will be interesting to see what the Skylake E5's are like...
  • shodanshok - Sunday, September 6, 2015 - link

    Mmm, I have the very opposite feeling: I think that these changes were done explicitly to the benefit of server and mobile chips. These two categories (server and mobile) are greatly limited by their power usage (and by their ability to effectively remove the generated heart), while are only marginally dependent on FPU performance.

    So trading some performance for improved power efficiency suddenly make a lot of sense, especially if Intel want to continually increase Xeon's for number (and it seems so).
  • SeanJ76 - Saturday, September 5, 2015 - link

    Not impressed...
  • exmachiner - Monday, September 7, 2015 - link

    Why is there no Desktop SKU with GT4e/Iris Pro ? Will it launch at a later date ? There is an Iris Pro version in Broadwell IIRC.
  • ZachSaw - Monday, September 7, 2015 - link

    I'd be interested to know what its relative performance is vs a discrete card like 750ti when it comes to the SM5.0 version of NNEDI3 with MPDN. Intel GPUs surprisingly run twice as fast with the shader version as compared to the OpenCL version (AMD loves shader too - the only exception here is NVIDIA's Maxwell architecture). It'll be interesting to see if Skylake is the perfect HTPC!
  • janolsen - Tuesday, September 8, 2015 - link

    If I understand pg 3 slide correctly , eDRAM will only be for BGA - and thus no Iris iGPU for desktop, broadwell chips may be a bit faster for those not needing a GPU for gaming and similar.
  • HollyDOL - Wednesday, September 9, 2015 - link

    Hm, is this a paper launch only? Only available parts until now are 6600K and 6700K (all czech big e-shops as well as ones like newegg). Awaiting 6100T eagerly (want to build mITX baby for my mom since her old (ancient) computer died 2 weeks ago)... and for obvious reasons I'd rather prefer new platform than the old one in case there was ever need to upgrade something (which I doubt but still...)
  • qasdfdsaq - Thursday, September 10, 2015 - link

    I feel like there's something missing here. We get 15w dual-core parts with Iris GT3e, but quad-core parts are all 45w with no GT3e. Indeed, there's no quad-core mobile chips with Iris graphics although Broadwell and Haswell both had them in the 45w quad-core range. There's certainly no issue fitting it in the power envelope, given you can literally fit 3x 2-core chips with GT3e into the 45w TDP.
  • LDW - Friday, September 18, 2015 - link

    I like to have a laptop for its portability and am not willing to buy a second system for my occasional gaming. In my experience, games like civ 5 , civ be and skyrim are happy with two processors but would like more graphics power than my current laptop. (i7-4700MQ with no additional graphics chips)...

    To my surprise, I find that the H series of processors have less graphics power than the U series. I suspect that the U series 2 processors, 4 threads would be just fine for the games I play and I know they would like the additional graphics power. So I'm likely to be looking at the U series as I look at replacements for my current laptop, not the H series as I expected.

    I'm curious if others reach that conclusion as well.... and am looking forward to anandtech's future comparisons between the H and U series graphics capabilities.

    ldw
  • francisca euralia - Tuesday, October 20, 2015 - link

    hello, can u give me a sumary od this page with the most important definition?

Log in

Don't have an account? Sign up now