Broadwell-U: On Performance

No Broadwell-U launch would be complete without a list of performance metrics direct from Intel indicating how Broadwell-U improves over Haswell-U. Without hardware on hand to test for ourselves it is hard to verify the numbers, but they provide a number of interesting talking points, particularly in how they compare to the previous Intel presentations leading up to this launch.

Core Improvements

We covered the transistor numbers on the previous page, but Intel’s direct performance metrics are most important when we consider graphics and battery life. In terms of productivity, moving from Haswell-U to Broadwell-U will not be much of a jump, as it is a similar architecture on a different process node. The new node allows Intel to catch the low hanging fruit and move the IPC up by around 5%, achieved by the following:

Larger OoO scheduler
Faster store-to-load forwarding
Larger (+50%) L2 translation lookaside buffer (TLB)
New dedicated 1GB page mode for L2 TLB
2nd TLB page miss handler
Faster FP multiplier
Faster Radix-1024 divider
Improved address prediction for branches and returns
Targeted cryptography instruction acceleration

The node change carries more weight when it comes to power saving, as it lowers the voltage required for similar performance. Combined with Intel’s 2:1 policy for Broadwell (a feature that adds +2% performance may use at most +1% more power), this is good all around.
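As a rough illustration of the 2:1 policy, the check below treats it as a simple ratio test. This is only a sketch: the function name `passes_two_to_one` is hypothetical, and the text does not describe how Intel actually evaluates features internally, only that +2% performance may cost at most +1% power.

```python
def passes_two_to_one(perf_gain_pct: float, power_cost_pct: float) -> bool:
    """Return True if the power cost is at most half of the performance gain.

    Assumption: the 2:1 policy is modelled as a plain ratio on percentages.
    A feature with no performance gain is rejected outright.
    """
    if perf_gain_pct <= 0:
        return False
    return power_cost_pct <= perf_gain_pct / 2.0


# A +2% performance feature costing +1% power sits exactly on the limit.
print(passes_two_to_one(2.0, 1.0))
# The same feature costing +1.5% power would not qualify.
print(passes_two_to_one(2.0, 1.5))
```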

However, the bigger change is on the GPU side. Intel is quoting a +22% synthetic graphics improvement from HD 4400 to HD 5500 with 3DMark, and +50% for Cyberlink MediaEspresso for video conversion.

One might argue that Intel should alternate between CPU and GPU performance improvements each U series cycle, to give each platform a serious talking point. Haswell gave a half-generation increment in the graphics naming scheme after all (Gen7 to Gen7.5), but the CPU architecture was new compared to Ivy Bridge.

Intel is also a fan of looking at historical improvements. If you consider that a number of users are upgrading a 2-4 year old system, it makes a good amount of sense to see where the multi-generation improvements add up. On the other hand, when a person does upgrade, you would hope that every area has been improved over the 2-3 generations in the interim.

Naturally, in order to give the best comparison data, we look back at the oldest reasonable product for comparison. In this case Intel pitted an i5-5300U (HD 5500, GT2 with 24 EUs) against an i5-520UM. In the time between these two platforms, the approach to mobile devices has changed significantly because of the base level of performance available. If we put the 4.5W equivalent of the i5-520UM into a fanless tablet for example, the quality and features we know today would (I assume) feel slow almost to the point of being excruciating. One argument is that back then, in 2010-ish (and before), our concept of software features and gaming was not at the level of detail it is today (which is true), and the same comparison will most likely be made in four years looking back at this era. Not only does the hardware improve, but also the understanding of the market and the concept of user experience.

Nevertheless, now we have devices that wake from sleep in fractions of a second rather than seconds, or turn on in seconds rather than minutes. Battery life has improved because integrated graphics are a bigger portion of the equation and we have thrown the graphics card away for most devices that need a sense of mobility. My old 8lb brick of a mobile 15-inch 1200p workstation used a 45W GPU with a 35W CPU, which was a nightmare for working on-the-go. The 11-inch netbook wasn’t a lot better, with the low 1366x768 resolution and underwhelming performance. As I am writing this review, my sub-3lb UX301 laptop is in a low power mode and on this flight I have managed three hours of active writing time, looking at text on white backgrounds, and still have half of the battery remaining. At this point four years ago, I would be getting out my charger for my 8lb brick with its extended battery and then wondering if I have exceeded the power limit for the flight socket. A popular feeling is to look back fondly to the past, but when it comes to the combination of laptop battery life with performance, the only way is forward.

Battery Life and the Audio DSP

Almost all the Intel suggested use scenarios, outside static All-In-Ones and mini-desktops, rely on some form of battery, so it makes sense that power efficiency is one card in play for Broadwell-U. In the past this has meant actual performance per watt, but also time-to-sleep, especially when parts of the system can be put into a lower power state or shut off completely when not in use. This makes designs more complicated, with disconnected clock domains as introduced in previous generations and so forth.

The test used for battery life is also important, because users typically do not run blank screens at idle when performing daily tasks. Intel has provided two metrics: one is a 100 nit display idle with Windows 8.1, and the other is local HD video playback.

For the former, Intel is quoting +60 minutes of battery life on their test platform at idle, equivalent to +11.0%. Most of this power saving comes from the SoC using better power saving techniques, but the rest of the platform, such as the PCH, also reduces its power use to around half.

During the (local) video playback, a 90 minute difference equates to a substantial +20.8% gain in battery life. A small amount of this is from the SoC and platform, but the biggest saving by far is the audio. Broadwell-Y and Broadwell-U both integrate Intel’s audio DSP (Digital Signal Processor) into the PCH. This removes a couple of Realtek components from the motherboard and allows Intel to bring it under their own manufacturing process, as well as configure the power gating needed.
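Intel's two quoted gains (+60 minutes for +11.0% at idle, +90 minutes for +20.8% in video playback) also imply the baseline runtimes of the test platform, which are not stated directly. The sketch below backs them out; the function name is hypothetical, and because the percentages are rounded, the results are approximations rather than Intel's measured values.

```python
def implied_baseline_minutes(gain_minutes: float, gain_percent: float) -> float:
    """Baseline runtime implied by an absolute gain and its relative gain.

    If gain_minutes is gain_percent of the baseline, then
    baseline = gain_minutes / (gain_percent / 100).
    """
    if gain_percent <= 0:
        raise ValueError("gain_percent must be positive")
    return gain_minutes / (gain_percent / 100.0)


# Figures quoted in the text: (scenario, extra minutes, percentage gain).
for label, minutes, percent in [
    ("display idle", 60, 11.0),
    ("local HD video playback", 90, 20.8),
]:
    before = implied_baseline_minutes(minutes, percent)
    after = before + minutes
    print(f"{label}: about {before / 60:.1f} h before, {after / 60:.1f} h after")
```

On these numbers the older platform would have lasted roughly nine hours at idle and a little over seven hours during video playback, which makes the video result the larger step in both absolute and relative terms.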

The DSP is more powerful, presumably equating to good race-to-sleep performance as well as dealing with HD audio under a lower power budget. Interestingly enough, I would point out that the power usage of the DSP will be directly related to how much data is flowing through it. If an HD video with little to no audio is involved, then the power usage will be quite low anyway. I would like to put a SYL metal live-show DVD through its paces to see how this affects power consumption.

As we mentioned back during the Core M discussions, the audio DSP lends itself to being a configurable and programmable entity, much in the same way that AMD’s solution is actively promoted. Similar to the response we had back then, Intel is considering opening it up with a public SDK, although that side of the equation is not on the roadmap as of yet.

Broadwell-U Platform Controller Hub (PCH)

As a writer, my bread and butter at AnandTech these past four years has revolved around motherboards, and thus examining the connectivity provided by a chipset is always interesting. Because Intel bundles both the processor and the PCH on the same package, manufacturers can save space in their designs, and Intel can control power consumption more tightly to give better performance or longer battery life as a whole. There is still room for manufacturers to differentiate in their IO offerings, which is a good thing for consumers.

The new PCH for Broadwell-U focuses on that power consumption, especially when it comes to throttling sections and data pathways when not in use. The ‘Dynamic Power and Thermal Framework’ entry for the 5th Gen PCH should allow the performance to either respond as a function of battery life or skin temperature. This means throttling where necessary to reduce temperature or increase battery life. Wake on Voice is also a target for Intel, allowing devices to maintain a super-low power state but still respond without direct touch.

When it comes to direct connectivity, the PCH offers four SATA 6 Gbps ports, four USB 3.0 ports (two of which are muxed similar to a hub), eight USB 2.0 ports, TPM, a PCIe 2.0 x4 link and another 12 PCIe 2.0 lanes split into 6 ports, allowing six devices maximum. We asked Intel about PCIe storage support for RST, and were told that with additional hardware support (remapping logic), Broadwell-U can support one PCIe 2.0 x2 storage device. This means that if a Broadwell-U based device with PCIe storage and RST capabilities came to market, it would cost a bit more than the base model. Also worth noting is that Broadwell-U is still using PCIe 2.0. On the PCH side this is perhaps not such a big deal, and when asked about PCIe 3.0, Intel reiterated their stance on not commenting on possible future plans, adding that they are monitoring demand and industry trends.
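To put the PCIe 2.0 x2 storage link in context, the sketch below computes theoretical peak bandwidth per direction from line rate and encoding overhead. The line rates and encodings are the standard figures for each interface rather than anything from Intel's presentation, the names `LINKS` and `peak_mb_per_s` are hypothetical, and real devices will land below these numbers once protocol overhead is included.

```python
# (line rate in Gbit/s per lane, encoding efficiency)
LINKS = {
    "PCIe 2.0": (5.0, 8 / 10),      # 5 GT/s with 8b/10b encoding
    "PCIe 3.0": (8.0, 128 / 130),   # 8 GT/s with 128b/130b encoding
    "SATA 6 Gbps": (6.0, 8 / 10),   # 6 Gbit/s with 8b/10b encoding
}


def peak_mb_per_s(link: str, lanes: int = 1) -> float:
    """Theoretical peak payload bandwidth in MB/s, one direction."""
    rate_gbit, efficiency = LINKS[link]
    # Usable Gbit/s per lane -> GB/s (divide by 8) -> MB/s (times 1000).
    return rate_gbit * efficiency / 8 * 1000 * lanes


for name, lanes in [("SATA 6 Gbps", 1), ("PCIe 2.0", 2), ("PCIe 3.0", 2)]:
    print(f"{name} x{lanes}: {peak_mb_per_s(name, lanes):.0f} MB/s")
```

Under these assumptions a PCIe 2.0 x2 device has a higher ceiling than a single SATA 6 Gbps port, while a PCIe 3.0 x2 link would roughly double it again.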

On the DRAM front, we got confirmation that Broadwell-U will support a maximum of 16GB of DDR3L/DDR3L-RS or LPDDR3 memory. No comment was made on a move towards two modules per channel memory or DDR4. Regarding video connectivity, Broadwell-U was too early for HDMI 2.0 and thus has HDMI 1.4b.

WiDi 5.1

Also new on the table is WiDi 5.1, which brings support for 4K to the ecosystem.

A part of WiDi that has been lacking has been the business features, and as a result Intel is focusing on the security, privacy and controls needed for a professional environment. These will need a driver update for the ultra-early adopters of Broadwell. Intel is also driving down the cost of WiDi adapters to a more palatable price point. My Belkin WiDi receiver, for example, retailed at 120 GBP-ish back in 2013 and requires an external power supply. Compare that to the product Intel promoted on their conference call, the Actiontec Mini2, which uses HDMI and is only $40.

Intel Wireless AC-7265

While not strictly speaking new to the market, Intel is promoting its new low power WiFi solution to the manufacturers to use in conjunction with Broadwell. The AC 7265 is an upgrade over the AC 7260 that was used extensively in Haswell from mobile devices all the way up to big desktop partners, and the AC 7265 brings about both performance and power benefits.

The form factor specifically for Broadwell-U is provided as a BGA M.2 part, with the package being 12mm x 16mm (given by the 1216 form factor designation). Low powered wireless is an important part of lower performance systems, as without the right configuration a sustained network load can eat up a portion of the processor performance. Intel’s partners with Broadwell-U are presumably not bound to use the AC 7265 and can use other products based on other performance metrics, but Intel is targeting networking as a source of power drain and working to correct that issue.
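The 1216 designation follows the usual M.2 type-code convention, where the first two digits give the width in millimetres and the remaining digits give the length. A minimal sketch of that decoding is below; the function name is hypothetical, and the 2280 and 22110 codes in the usage lines are common examples from outside this article.

```python
from typing import Tuple


def m2_dimensions_mm(type_code: str) -> Tuple[int, int]:
    """Split an M.2 type code such as '1216' into (width_mm, length_mm)."""
    if not type_code.isdigit() or len(type_code) not in (4, 5):
        raise ValueError(f"unexpected M.2 type code: {type_code!r}")
    # First two digits are the width, the rest is the length.
    return int(type_code[:2]), int(type_code[2:])


for code in ("1216", "2280", "22110"):
    width, length = m2_dimensions_mm(code)
    print(f"{code}: {width}mm x {length}mm")
```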

Devices! Where and When?

Most of AnandTech are here in Vegas, attending CES 2015 and (almost literally) running between meetings, press events and product showcases. Broadwell-U is high on our priority list, and we know several devices are due for announcement this week. Watch this space.


85 Comments

  • Pork@III - Friday, January 16, 2015 - link

    Yes with Skylake we have a few new instructions; new memory controller, eDRAM on die area, also few other architectural changes. With all these changes together Intel promise significant progress in computational capacity of this generation processors. Intel did not once have promised more than what actually imagined their customers. But we still have hope.
  • jman9295 - Sunday, January 18, 2015 - link

    Intel should think about producing a mid-range quad core without HT and without the HD graphics GPU specifically for gaming laptops that come with discrete Nvidia GPUs. Before, it made sense to have the iGPU for non-gaming situations to keep the battery from draining. The laptop would switch back and forth automatically. Now, though, with Maxwell being designed as a mobile architecture from the start, there has to be some way that Nvidia can disable most of the GPU so that it can operate at extremely low power consumption when browsing or doing light tasks. I don't think Intel has ever had a mobile core-i CPU with 4 physical cores and no HT. And I'm pretty sure there never was a mobile core-i CPU that did not have an iGPU. Right now, gamers have the choice of a 4c/8t i7 or a 2c/4t i5 or i7 and nothing in between. Giving us a 4c/4t i5 would bring the cost of halfway decent gaming laptops down to under $1,000 depending on the GPU installed. I'm sure there are plenty of other applications for a mobile CPU like this other than gaming, but this would be the ideal gaming laptop CPU.
  • tipoo - Sunday, January 18, 2015 - link

    Why does each EU handle 7 threads, when they have 8 "shaders" each?
  • tipoo - Sunday, January 18, 2015 - link

    I have the Iris Pro 5200, I'll be interested to see where that 6100 falls in comparison to it. More EUs, and the other benefits to the small GPU-level caches, but no eDRAM. I think the 5200 should still beat it, but I wonder how close it can come without eDRAM.
  • boe - Wednesday, February 4, 2015 - link

    I'd certainly like to know more about the onboard GPU for HTPCs. Will it support 4K@60? 4K 3D specs etc.
