Broadwell CPU Architecture

We’ll kick off our look at Broadwell-Y with Broadwell’s CPU architecture. As this is a preview Intel isn’t telling us a great deal about the CPU at this time, but they have given us limited information about Broadwell’s architectural changes and what to expect for performance as a result.

With Broadwell Intel is at the beginning of the next cycle of their tick-tock cadence. Whereas tock products such as Haswell and Sandy Bridge designed to be the second generation of products to use a process node and as a result are focused on architectural changes, tick products such as Ivy Bridge and now Broadwell are the first generation of products on a new process node and derive much (but not all) of their advantage from manufacturing process improvements. Over the years Intel has wavered on just what a tick should contain – it’s always more than simply porting an architecture to a new process node – but at the end of the day Broadwell is clearly derived from Haswell and will be taking limited liberties in improving CPU performance as a result.

Intel's Tick-Tock Cadence
Microarchitecture Process Node Tick or Tock Release Year
Conroe/Merom 65nm Tock 2006
Penryn 45nm Tick 2007
Nehalem 45nm Tock 2008
Westmere 32nm Tick 2010
Sandy Bridge 32nm Tock 2011
Ivy Bridge 22nm Tick 2012
Haswell 22nm Tock 2013
Broadwell 14nm Tick 2014
Skylake 14nm Tock 2015

All told, Intel is shooting for a better than 5% IPC improvement over Haswell. This is similar to Ivy Bridge (4%-6%), though at this stage in the game Intel is not talking about expected clockspeeds or the resulting overall performance improvement. Intel has made it clear that they don’t regress on clockspeeds, but beyond that we’ll have to wait for further product details later this year to see how clockspeeds will compare.

To accomplish this IPC increase Intel will be relying on a number of architectural tweaks in Broadwell. Chief among these are bigger schedulers and buffers in order to better feed the CPU cores themselves. Broadwell’s out-of-order scheduling window for example is being increased to allow for more instructions to be reordered, thereby improving IPC. Meanwhile the L2 translation lookaside buffer (TLB) is being increased from 1K to 1.5K entries to reduce address translation misses.

The TLBs are also receiving some broader feature enhancements that should again improve performance. A second miss handler is being added for TLB pages, allowing Broadwell to utilize both handlers at once to walk memory pages in parallel. Otherwise the inclusion of a 1GB page mode should pay off particularly well for servers, granting Broadwell the ability to handle these very large pages on top of its existing 2MB and 4K pages.

Meanwhile, as is often the case Intel is once again iterating on their branch predictor to cut down on missed branches and unnecessary memory operations. Broadwell’s branch predictor will see its address prediction improved for both branches and returns, allowing for more accurate speculation of impending branching operations.

Of course efficiency increases can only take you so far, so along with the above changes Intel is also making some more fundamental improvements to Broadwell’s math performance. Both multiplication and division are receiving a performance boost thanks to performance improvements in their respective hardware. Floating point multiplication is seeing a sizable reduction in instruction latency from 5 cycles to 3 cycles, and meanwhile division performance is being improved by the use of an even larger Radix-1024 (10bit) divider. Even vector operations will see some improvements here, with Broadwell implementing a faster version of the vector Gather instruction.

Finally, while it’s not clear whether these will be part of AES-NI or another instruction subset entirely, Intel is once again targeting cryptography for further improvements. To that end Broadwell will bring with it improvements to multiple cryptography instructions.

Meanwhile it’s interesting to note that in keeping with Intel’s power goals for Broadwell, throughout all of this Intel put strict power efficiency requirements in place for any architecture changes. Whereas Haswell was roughly a 1:1 ratio of performance to power – a 1% increase in performance could cost no more than a 1% increase in power consumption – Broadwell’s architecture improvements were required to be at 2:1. While a 2:1 mandate is not new – Intel had one in place for Nehalem too – at the point even on the best of days meaningful IPC improvements are hard to come by at 1:1, never mind 2:1. The end result no doubt limited what performance optimizations Intel could integrate into Broadwell’s design, but it also functionally reduces power requirements for any given performance level, furthering Intel’s goals in getting Core performance in a mobile device. In the case of Broadwell this means Broadwell’s roughly 5% performance improvement comes at a cost of just a 2.5% increase in immediate power consumption.

With that said, Intel has also continued to make further power optimizations to the entire Broadwell architecture, many of which will be applicable not just to Core M but to all future Broadwell products. Broadwell will see further power gating improvements to better shut off parts of the CPU that are not in use, and more generalized design optimizations have been made to reduce power consumption of various blocks as is appropriate. These optimizations coupled with power efficiency gains from the 14nm process are a big part of the driving force in improving Intel’s power efficiency for Core M.

Intel Broadwell Architecture Preview Broadwell GPU Architecture
Comments Locked

158 Comments

View All Comments

  • isa - Monday, August 11, 2014 - link

    I appreciate the thoughts, but I'm actually an ASIC designer, so I have no problem with lingo - I just am not informed regarding my specific question. And my current laptop is a Penryn 2008 laptop, so forgive me if we ignore the need question and stay focused on what version(s) of Broadwell are intended for mainstream (non-gaming) desktop replacement laptops. thanks!
  • ZeDestructor - Monday, August 11, 2014 - link

    You want the full HQ/MQ series for your next laptop. Those are the full quad-core-enabled machines, which is something your probably want for a hi-performance machine. Since this is a gaming laptop though, you may want to look into building a small mini-ITX desktop though, they're comparable if your primary definition of "mobile gaming" is "drag computer to a LANparty/friend's place" rather than actually gaming on a train or similar.
  • name99 - Tuesday, August 12, 2014 - link

    If you can afford it, why are you dicking around with trying to save a few hundred dollars?
    If you want a serious machine, buy an rMBP with quadcore i7.
    If you want a light machine, buy an MBA.
    Then either stick Parallels on it (if you use OSX) or just run Boot Camp and never touch OSX, except maybe once every two months to update the firmware.

    If you can afford it, life's too short to waste time trying to figure out which of fifty slightly different laptop PCs will suck less in their different hardware, different pre-installed crap-ware, different drivers. Just pay a little more and get something that most people consider works well.
  • ZeDestructor - Tuesday, August 12, 2014 - link

    As an ASIC designer, OSX is just a non-option. Hell, even Linux is rather hairy compared to Windows for ASIC stuff.

    I personally like the Dell Precision line, but Lenovo Thinkpad W and HP Elitebook would also doo. In a pinch, a ridiculously-specced Compal (in it's various Sager rebrands) would also do, but IMO are just not built anywhere close.
  • Kjella - Monday, August 11, 2014 - link

    Y = tablets
    U = ultraportables
    H = laptop/desktops
    EP/EX = workstations/servers

    So H. Out in Q2 2015, by current best guesses.
  • isa - Monday, August 11, 2014 - link

    Thanks, Kjella. Upon further googling after reading your post, I learned that there apparently will be a mobile and deskop version of the H flavor, and it does look like the mobile H is the most likely for me. But that's not rumored to come out until May or June 2015, which is disappointing.

    Even weirder, since the first desktop version of Broadwell will be available in PCs in May/June 2015, and since the first version of Skylake (rumored to be a desktop version) is rumored to be "available" 2H 2015, it seems Broadwell H desktop is slated for a very, very short product life. Similarly, is Broadwell H mobile also slated for an extremely short product life?

    Perhaps I missed it, but it would be great if there were an Anandtech chart comparing Intel's definitions of "announced", "launched", "samples", "volume shipments", "available", and similar Intelese to figure out about how many months from each it is till Joe Consumer can buy the chip in question. I suspect these definitions and even the lingo can vary with each tick and tock, but some kind of cheat sheet guestimates would be great (and revised as better info arrives, of course).
  • isa - Monday, August 11, 2014 - link

    To further clarify the need for a cheat sheet, I'm familiar with the timing of tick and tock for the last few years, but it seems that 2014/2015 at a minimum will diverge so much from the last few years that previous expectations add confusion rather than clarity.
  • ZeDestructor - Monday, August 11, 2014 - link

    Judging by the delay for BW, Skylake will probably be pushed forward at least 6 months, if only to make up the R&D costs of BW. Then again, Intel wants that tablet market, so they might not either.
  • IntelUser2000 - Wednesday, August 13, 2014 - link

    "Similarly, is Broadwell H mobile also slated for an extremely short product life?"

    No its not. Companies like Intel cares about 5% profit differences, so having a short product life would make absolutely no sense. Broadwell isn't coming to "mainstream" desktops, only high-end enthusiast ones like the K series.

    So they will all happily be a family like this:
    -Skylake mainstream desktop
    -Broadwell H/U/Y/K
  • name99 - Tuesday, August 12, 2014 - link

    Semiaccurate says that quadcores (which I take to mean H-series) will not be out until 11 months from now.
    (Makes you wonder WTF has happened to the Skylake timeline. They haven't yet admitted that that has even slipped, let alone by how much.)

Log in

Don't have an account? Sign up now