Stage 2: Datacenter

Intel’s Datacenter Group SVP, Navin Shenoy, also took to the stage at CES in order to discuss some new products in Intel’s portfolio, as well as to deliver updates on ones that were disclosed last year. Back in 2018, Intel held its Datacenter Summit in August, where it lifted the lid on Cascade Lake, Cooper Lake, and 10nm Ice Lake. Along with this, we saw new instruction support for AI and security as the top two areas of discussion.

Cascade Lake: Get Yours Today

Intel’s first generation of Xeon Scalable processors, Skylake-SP, was launched over 18 months ago. We’ve been hearing about the update to that family, Cascade Lake-SP, for a while now, along with its brother Cascade Lake-AP and how it will tackle the market. The announcement today from Intel is that the company is now shipping Cascade Lake for revenue.

This means, to be crystal clear, that select customers are now purchasing production-quality processors. What this doesn’t mean is retail availability. These select customers are part of Intel’s early sampling program, and have likely been working with engineering samples for several months. These customers are likely the big cloud providers, the AWS / Google / Azure / Baidus of the world.

It’s worth pointing out that at Intel’s Datacenter Summit, they said that half of all of its Xeons sold were ‘custom’ processor configurations that were not sold though its distributors – these parts are often described as ‘off roadmap’. It is likely that when Intel says Cascade Lake-SP is shipping for revenue to select customers that they are likely to be purchasing these off-roadmap processors. They might be running at a higher TDP than Intel expects for the commercial parts, or have different core/cache/frequency/memory configurations as and when they are needed.

One of the big draws for Cascade Lake is Intel’s Optane DC Persistent Memory support, which will enable several terabytes of memory per socket, but also the hardware security patches for Spectre v2. Businesses who want to be sure their hardware is patched can guarantee security if it's in the hardware, rather than relying on a firmware/software stack. So this might be part of why Intel’s demand for 14nm CPUs is at an all-time high and outstripping supply – if a company wants to be 100% sure it is protected, they need the hardware with baked-in security.

The full retail launch of Cascade Lake is expected in 2019. Based on what we saw at Supercomputing in November, given by a rolling slide deck at the booth of one of Intel’s OEM partners, that time frame looks to be somewhere from March to May.

Nervana for Inference: NNP-I coming in 2019

To date, when Intel has discussed the Nervana family of processors, we have only known about them in the context of large-scale neural network acceleration. The idea is that these big pieces of silicon are designed to accelerate the types of compute commonly found in neural network training, at performance and power efficiency levels above and beyond what CPUs and GPUs can do. It has been disclosed that Intel is working on that family of parts, NNP-L, for a while now, and we are still waiting on a formal launch. But in the meantime, Intel is announcing today that it is working on a part that's optimized for inference as well.

There are two parts to implementing machine learning with neural networks: making the network learn (training), and then using the trained network on new information to do its job (inference). The algorithms are often designed such that the more you can train a network, the more accurate it is and sometimes the less computationally intensive it is to apply it to an external problem. The more resources you put into training, the better. But the scale of compute between training and inference is several orders of magnitude: you need a big processor for training, but don’t need a big processor for inference. This is where Intel’s announcement comes in.

The NNP-I is set to be a smaller version of the NNP-L and built specifically for inference, with Intel stating that it will be coming in 2019. Exact details are not being disclosed at this time, so we don't have any information on the interface (likely PCIe), power consumption, die size, architecture, etc. However, we can draw some parallels from Intel’s competition. NVIDIA has big Tesla V100 GPUs with HBM2 for training that can draw 300-350W each, with up to eight of them in a system at once. However for inference it has the Tesla P4, which is a small chip below 75W, and we’ve seen systems designed to hold 20 of NVIDIA's various inference processors at once. It is likely that this new NNP-I design is along the same lines.

Snow Ridge on 10nm: An SoC for Networking and 5G (Next-Gen Xeon-D?)

The Data Center Group will be making two specific announcements around 10nm. The first is disclosing the Snow Ridge family of processors, focused on networking and specifically targeting the wide array of 5G deployments coming up over the next decade. The purpose of Snow Ridge is to enable wireless access base stations and deployments, as well as functions required at the edge of the network, such as compute, virtualization, and potentially things like artificial intelligence.

Intel gave no other details, however going back in my mind, I realise that we’ve heard this before with Intel. They already have processors on their roadmap focused specifically on networking, with 40 GbE support and features like QuickAssist Technology to accelerate networking cryptography: the Xeon-D line of processors. This makes me believe that Snow Ridge will be the name for the next generation of Xeon D, either the Xeon D-2500 or Xeon D-3100, depending on the power envelope Intel is going for.

Given this assumption, and the fact that Intel has said that this is a 10nm processor, I suspect we’re looking at a multi-core Sunny Cove enterprise design with integrated networking MACs and support for lots of storage and lots of ECC memory. There’s an outside chance that it might support Optane, allowing for bigger memory deployments, although I wouldn’t put money on it at this stage.

Ice Lake Xeon Scalable on 10nm

To finish up Intel’s announcements, Nevin also talked about Ice Lake Xeon Scalable. At Intel’s Architecture Day, a processor was shown at the event that was described as Ice Lake Xeon, so this is just Intel repeating the fact that they now have working silicon in the labs. There is still no word as to how Intel is progressing here, with question marks over the yields of the smaller dies, let alone the larger Xeon ones. Working silicon in this case is just a functional test to make sure it works – what comes now is the tuning for frequency, power, performance, and optimizing the silicon layout to get all three. I’m hoping that Intel keeps us apprised of its progress here.



What Happened at CES 2018, and why CES 2019 is Different

A memory that will stick in my mind is Intel’s CES 2018 announcements. At the heart of the show, we wanted to know about the state of Intel’s 10nm process, and details were not readily available. 10nm wasn’t mentioned in the keynote, and when I tried to ask then-CEO Brian Krzanich about it, another Intel employee hastily cut in to the conversation saying that nothing more would be said. In the end we got a single sentence from Gregory Bryant at an early morning presentation the day after the keynote, and that sentence was only after 10 minutes of saying how well Intel was executing. That single sentence was to say that Intel was shipping 10nm parts in 2017, although so far only two consumer products (in limited quantities, and region specific) have ever been seen.

This year, coming off the back of Intel’s Architecture Day last month, shows that Intel is becoming more open to discussing future products and roadmaps. A lot of us in the press and analyst community are actively encouraging this trend to continue, and the contrast between CES 2018 and CES 2019 is clear to see. Companies tend to hide or obfuscate details when product execution isn’t going to plan; now that Intel is starting to open up with details, the outlook is clearly returning to one with more optimism.

Stage 1: Consumer
Comments Locked


View All Comments

  • HStewart - Tuesday, January 8, 2019 - link

    The article you mention states "the last of which is capable of powering a 4K display." This is related to Ice Lake. it sounds like a significant improvement on current integrated graphics - but 2020 will have new platform that competes with both NVidia and AMD on discrete graphics level.
  • sorten - Monday, January 7, 2019 - link

    Ice Lake-U, SP7 + TB3, please!
  • KateH - Monday, January 7, 2019 - link

    yes, please! i had a SP2 way back when and it would have been my perfect machine if it had a quadcore CPU, more RAM, and provision for better graphics. the newer models check the CPU and memory upgrade, all it's missing now is eGPU capability.
  • darkswordsman17 - Monday, January 7, 2019 - link

    On the Foveros chip, I was thinking for gen 2 of HoloLens. It'd let them carryover the work that went into the first one with the Atom cores, give it some more performance maybe when plugged in, and the GPU would likely be a big benefit for such a device. I think rumors point to HoloLens moving to ARM though. On the forums someone suggested Courier (which I believe Microsoft has said they are going to release as a consumer product at some point). It'd be good for tablets. I think it'd be good for standalone AR/VR headsets too. I could see it being a chip for the normal Macbook, or some hybrid iOS/Mac OS device that might function like the Surface Book, with the tablet portion having an Apple SoC, and then the base having this running MacOS. Its a neat chip that could fit quite a few devices. Likely more premium stuff though, but sounds like Intel is thinking up twists on that type of chip, so maybe we'll get some reference designs.
  • HStewart - Tuesday, January 8, 2019 - link

    Yes it sounds like Foveros Chip would be good for HoloLens Gen 2 - but also it would good route for low end laptops like using Pentium chips - think about what most people use laptops for - spreadsheets and word processor - so higher performance core chip would be good for that in primary thread and atom chips for other multi-processing. I would expect OS vendors like Microsoft will possibly have changes to optimized this. I would also not doubt - AMD will likely have a similar component one day.

    Interesting comment about iOS/Mac OS - with EMiB like in my XPS 15 2in1 - they code have Core CPU in combine with Apple SoC one day for Hybrid - who knows this could already be in plans
  • ScouserPcgamer - Tuesday, January 8, 2019 - link

    “Also in the 9th Gen space, Intel said that the mobile 9th Gen Core processors will be available in Q2.”
    Right so looks like will be two Gaming laptop released this year, RTX mobile & 8th gen CPU (end jan) and then wouldn’t be surprised that Q3 has RTX mobile and 9th Gen CPU gaming laptops, think will just wait till towards end of year before getting a gaming laptop
  • SaturnusDK - Tuesday, January 8, 2019 - link

    Intel announcing products a year in advance now promising desktop 10nm by H2. Well, we all know what happened to the W3175X promised to be out in 2018 at Computex last year, announced October 8th to be available in December, yet still nowhere to be found, not even on Intels own product pages.
    I think its a good bet we'll be well into 2020 before we see any 10nm desktop parts on the shelves anywhere.
  • HStewart - Tuesday, January 8, 2019 - link

    I think Intel is fully aware of last year or so supply issues and making changes in process so that Holiday season 2019 is a real possibly - now desktop 10nm is very likely a lower priority than mobile 10nm - this article sounds like a very positive indication that 10nm is reality for 2019 in large quantities. Combine with articles about Intel investments in FAB and Sunny Cove announcement - it seams like a sure bet in 2019.
  • jjj - Tuesday, January 8, 2019 - link

    Why do you assume that Lakefield is for PC, have they stated that? Could be a number of other things since in PC, a single big core and high cost is not that appealing and super low idle is not all that relevant.
    Unless for some foldable phone/tab.
  • jjj - Tuesday, January 8, 2019 - link

    Actually a single big core is not a good choice for a phone either, if someone wants x86 there, they need 2 cores at least.
    My guess would be Tesla for infotainment but would rather see console grade gaming in cars not this crap.

Log in

Don't have an account? Sign up now