What is Xe-LP?

A big part of the Tiger Lake/Ice Lake comparison will be the performance difference in graphics. Where Ice Lake has 64 Execution Units of Gen11 graphics, Tiger Lake has 96 Execution Units but of the new Xe-LP architecture. On top of that, there’s the new SuperFin transistor stack that promises to drive frequencies (and power windows) a lot higher, making Tiger Lake more scalable than before.

Straight off the bat Intel’s graphs are showing that at the same voltage, where Ice Lake Gen11 achieves 1100 MHz, the new Xe-LP graphics will get to ~1650 MHz, a raw +50% increase. That means at Ice Lake’s peak power, we should expect Tiger Lake to perform at a minimum 2.25x better. Expanding beyond that, the peak for Tiger Lake seems to be in the 1800 MHz range, ultimately giving a minimum 2.45x more performance over Ice Lake. This is before we even start talking about the fundamental differences in the Xe-LP architecture compared to Gen11.  

Intel is promoting Xe-LP as operating at 2x the performance of Gen11, so even though these numbers might easily suggest a 2.25x uplift before taking into account the architecture, it will ultimately depend on how the graphics is used.

Gen11 vs Xe-LP

For a more in-depth look into Intel’s Xe graphics portfolio, including HP, HPC, and the new gaming architecture HPG, Ryan has written an article covering Xe in greater detail. In this article, we’ll cover the basics.

In the Ice Lake Gen11 graphics system, each one of the 64 execution units consisted of two four-wide ALUs, one set of four for FP/INT, and the other set of four for FP/Extended Math. 16 of these execution units would form a sub-slide within Gen11.

For Xe-LP, that 4+4 per execution unit has been rebalanced for this target market. There are now 10 ALUs per execution unit, but in an 8+2 configuration. The 8 ALUs support 2xINT16 and INT32 data types, but also with new DP4a instructions can accelerate INT8 inference workloads. The new execution units also now work in pairs – two EUs will share a single thread control block to help assist with coordinated workload dispatch.

As with ICL, 16 of the EUs now form a sub-slice with the graphics, and slices are added in the SoC as performance is needed. What is new in Tiger Lake is that each sub-slice now has its own L1 data and texture cache, and the pixel backend runs 8 pixels/clock per two sub-slices.

Overall the graphics system can support 1536 FLOP/clock, with the samplers at 48 Tex/clock per sub-slice and a total of 24 pixel/clock in the back-end. LP in Tiger Lake has 16 MiB of its own L3 cache, separate from the rest of the L3 cache in the chip, and the interface to the memory fabric is doubled, supporting 2x64B/clock reads or writes or a combination of both.

Exact performance numbers for Xe-LP in Tiger Lake are going to be a question mark until we get closer to launch. Intel has stated that the discrete graphics version of LP, known as DG1, is due out later this year.

Xe-LP Media and Display

The other question on Tiger Lake on graphics will be the media and display support. Tiger Lake will be Intel’s first official support for the AV1 codec in decode mode, and Intel has also doubled its encode/decode throughput for other popular codecs. This means a full hardware-based 12-bit video pipeline for HDR and 8K60 playback support.

Display Support for Tiger Lake is also extended with four 4K display pipelines. Connections over DP1.4, HDMI 2.0, Thunderbolt 4, and USB4 Type-C simultaneously is how Intel expects users to operate if all four outputs are needed at once. The display engine also supports HDR10, 12-bit BT2020 color, Adaptive Sync, and support for monitors up to 360 Hz.

External Graphics and Hybrid Support

One of the interesting questions we posted to Intel during Architecture Day was surrounding how Xe-LP will operate in the presence of additional graphics, and potentially paired with a discrete version of LP later in the year. Unfortunately there seemed to be some confusion between the definitions of ‘hybrid’ graphics vs ‘switchable’ graphics, so we got that cleared up in time for the article.

At present, Intel expects almost all Tiger Lake solutions to run in devices where there is no discrete graphics solution – only the integrated graphics is provided as the primary compute for gaming and acceleration. However, Tiger Lake will support switchable graphics solutions with Xe-LP discrete graphics. Intel did not state if this was discrete graphics with respect to a built LP chip or an external discrete graphics solution through Thunderbolt.

Due to Tiger Lake’s PCIe 4.0 support and Thunderbolt 4 support, depending on how an exact Tiger Lake system is configured, Intel expects that any discrete graphics solution will operate at a lower latency, mostly due to the fact that the PCIe 4.0 lanes will be directly attached to the CPU, rather than a chipset. Intel quoted ~100 nanosecond lower latency. They also stated an 8 GB/s bandwidth to main memory, which seemed a bit low?

On the topic of hybrid graphics, where the integrated graphics and an Xe-LP discrete solution could work in tandem on the same rendering task, Intel stated that there is no plan to support a Multi-GPU solution of this configuration.

What is in a Willow Cove Core? Tiger Lake IO and Power
Comments Locked

71 Comments

View All Comments

  • anonomouse - Friday, August 14, 2020 - link

    A Lot™
  • psyclist80 - Thursday, August 13, 2020 - link

    This close to launch and no benchmarks? hmm, seems suspect. I do hope they got the kick in pants they needed to make a jump in performance needed. Looking forward to apples to apples reviews!
  • eek2121 - Thursday, August 13, 2020 - link

    Benchmarks will likely land on launch day.
  • Martimus - Thursday, August 13, 2020 - link

    You accidentally posted the wrong slide for the third slide on the "What is in a Willow Cove Core?" page.
  • LiveWell - Thursday, August 13, 2020 - link

    Are they incorporating Mears Silicon Technology into their chips?
  • drexnx - Thursday, August 13, 2020 - link

    I guess after "enhanced super fin" they can always go to "enhanced super duper fin"? 10ESDF?
  • Linustechtips12#6900xt - Wednesday, April 7, 2021 - link

    the marketing can be called ESD-F safe
  • trivik12 - Thursday, August 13, 2020 - link

    if a phone can have LPDDR5 why not a laptop. I am hoping to see premium laptops(XPS, Spectre, Thinkpad etc) to have an option of LPDDR5. Otherwise this sounds exciting. Even a 8 core 65w CPU for desktop should be interesting. I hope Intel does release that.
  • vFunct - Thursday, August 13, 2020 - link

    With 65W parts, does that mean we're going to see it in a 16" MacBook Pro?
  • AdditionalPylons - Friday, August 14, 2020 - link

    The 16" MacBook Pro uses 45 W chips (yes, even the high-end I9-9980HK is specified as 45W). I have the lower-end I7-9750H but still feel the cooling is inadequate, especially when using both CPU and GPU at the same time. Apple just wanted to make it as thin as possible, and didn't expect that Intel would fail so miserably at 10 nm. Now I'm not surprised at all that they are moving to in-house ARM CPUs.
    Personally I would have preferred AMD for x86 compatibility with my development applications at work, but I'll probably still consider an ARM Mac in a few years and in the worst case rely on ssh to workstations for development. HPCs for running anyway.

Log in

Don't have an account? Sign up now