The Fastest for Serial Workloads

If you asked ‘what made the best processor’ fifteen years ago, the obvious answers were performance, power and price. As time has marched on, this list has started to include integrated graphics, bandwidth, platform integration, platform upgradability, core-to-core latency, and of course, cores. Marching up from a single x86 core through to CPUs that carry 10 cores for consumers, 28 cores for enterprise and 72 cores for add-in cards makes the task of creating a perfect processor almost impossible – there is no way to satisfy all of the properties that build a processor today. Both AMD and Intel start from basic building blocks (a single core) and then configure processors around that core, adding in more cores, connectivity, and then binning to the right voltage/frequency and pricing appropriately. The end result is a stack of processors aimed at different segments of the market.

The pair of Kaby Lake-X processors cover one main area listed above more than any others: core performance. By having the latest CPU microarchitecture and placing it on the newest high-end desktop platform there is room at the top for more frequency leading to a higher pure performance product. As a byproduct these CPUs are power efficient, leading to a high performance per watt, and are situated in a platform with extensive IO options. Ultimately this is where the Kaby Lake-X customer will sit: someone who wants high single thread performance but is not after massive multi-core performance. This would typically cover the majority of gamers and enthusiasts, but not necessarily content creators.

The benefits in the benchmarks are clear against the nearest competition: these are the fastest CPUs to open a complex PDF, at the top for office work, and at the top for most web interactions by a noticeable amount.

The downsides are pure throughput workflows, such as neuron simulation, rendering and non-video encoding.

The parts in the middle are the ones to dissect, and these get interesting. Let me pull up a few graphs that illustrate this middle of the road position: Chromium Compilation, Agisoft Photoscan and WinRAR.

Office: Chromium Compile (v56)

System: Agisoft Photoscan 1.0 Total Time

Encoding: WinRAR 5.40

These three results show the Core i7-7740X performing above any AMD chips of similar price, but the Core i5-7640X performing below any Ryzen 7 or Ryzen 5 parts. This comes down to the workload in each of these benchmarks, and how the processor configurations affect that. All three of these real-world benchmarks are variable-thread workloads. Some elements are serialized and rely on a high single-thread performance, while other elements are fully parallelizable and can take advantage of cores and threads (sometimes threads do not necessarily help). The benchmarks are ultimately limited by Amdahl’s Law, where single thread speed affects the whole test, but multiple-threads only helps the parallelizable parts. With sufficiently parallelizable code, it becomes a balance between the two.

So for the Core i7-7740X, up against the Ryzen 7 1700 at an equivalent price, the Core i7 has eight threads and the Ryzen 7 has sixteen, but the Core i7 has a much higher single thread performance. So for these benchmarks, having a high performance metric like this means that despite having half the cores/threads of the AMD part, the Core i7 can take the lead very easily.

But the Core i5-7640X has a different task. It has four cores, like the Core i7, but no hyperthreading, so it sits at four threads. Its direct competitor, the Ryzen 5 1600X, has six cores with simultaneous multithreading, leading to twelve threads. This gives the AMD processor a 3:1 advantage in threads, and for each of these three benchmarks it can parallelize the code sufficiently that the single thread performance of the Intel CPU is not enough. Moving from a 2:1 ratio with the Core i7 to a 3:1 ratio with the Core i5 is a turning point for ST performance compared to MT performance.

So with the X299 confusion, are these CPUs worth recommending?

When Kaby Lake-X first came out, a number of technology experts were confused at Intel’s plans. It made sense to launch the latest microarchitecture on the high-end desktop platform, although launching it in a quad-core form was an idea out-of-left-field, especially for a platform that is geared towards multiple cores, more memory, and more memory bandwidth. In that paradigm, the Kaby Lake-X is an oddball processor design choice.

There are bigger factors at play however – if Intel launched 6-10 core parts on KBL, it would cannibalize their Skylake-X and Skylake-SP sales. Also, as we’ve seen with Skylake-X CPUs, those enterprise cores are now different to the consumer Skylake-S cores, with different cache structures and AVX-512. So if Intel had launched >4 cores on KBL-X, they would have likely had to scrap Skylake-X.

But that’s a slight tangent.

The Core i7-7740X appeals to users who want the fastest out-of-the-box single thread x86 processor on the market today. This means financial traders, gamers, and professionals working with serial code bases, or anyone with deep pockets that might think about upgrading to Skylake-X in the future. Enthusiast overclockers are likely to find the better binned CPUs fun as well.

That’s if you do not mind paying a premium for the X299 platform. For users who mind the cost, the Core i7-7700K is 98% of the way there on performance but can save a hundred dollars on the motherboard and offers the same functionality. In some of our benchmarks, where despite the high single thread performance having more cores helped, then spending a little more on the Skylake-X six-core Core i7-7800X is beneficial: for example, Luxmark and POV-Ray scored +33% for the 7800X over the 7740X.

The Core i7-7740X makes certain sense for a number of niche scenarios. By contrast, the Core i5-7640X doesn’t make much sense at all. There’s still the benefit of high single-thread performance and some good gaming performance in older titles, but in the variable threaded workloads it loses to AMD’s processors, sometimes by as much as 45%.  For a chip that comes in at $242, users should expect to pay about the same on a motherboard – whereas either an AMD part or the Core i5-7600K can go in a $120 motherboard and still be overclocked.

There are only two scenarios I can see where the Core i5 adds up. Firstly, users who just want to get onto X299 now and upgrade to a bigger CPU for quad-channel memory and more PCIe lanes later. The second is for professionals that know that their code cannot take advantage of hyperthreading and are happy with the performance. Perhaps in light of a hyperthreading bug (which is severely limited to minor niche edge cases), Intel felt a non-HT version was required.

In our recent CPU Buyers’ Guide (link autoupdates to the latest CPU guide) we suggested the Core i7-7740X for anyone wanting a Peak VR experience, and we still stand by that statement. It has enough threads and the biggest grunt to take on VR and the majority of enthusiast gaming experiences, if a user has pockets big enough.

The recommendations of the new CPUs boil down to platform costs. They seem a minor upgrade to the Kaby Lake-K processors and the Z270 platform, which is a platform that caters to a big audience with a more cost-sensitive structure for motherboards in mind. 

Power Consumption and Overclocking to 5.0 GHz
Comments Locked

176 Comments

View All Comments

  • mapesdhs - Monday, July 24, 2017 - link

    2700K, +1.5GHz every time.
  • shabby - Monday, July 24, 2017 - link

    So much for upgrading from a kbl-x to skl-x when the motherboard could fry the cpu, nice going intel.
  • Nashiii - Monday, July 24, 2017 - link

    Nice article Ian. What I will say is I am a little confused around this comment:

    "Intel wins for the IO and chipset, offering 24 PCIe 3.0 lanes for USB 3.1/SATA/Ethernet/storage, while AMD is limited on that front, having 8 PCIe 2.0 from the chipset."

    You forgot to mention the AMD total PCI-E IO. It has 24 PCI-E 3.0 lanes with 4xPCI-e 3.0 going to the chipset which can be set to 8x PCI-E 2.0 if 5Gbps is enough per lane, i.e in the case of USB3.0.

    I have read that Kabylake-X only has 16 PCI-E 3.0 lanes native. Not sure about PCH support though...
  • KAlmquist - Monday, July 24, 2017 - link

    With Kabylake-X, the only I/O that doesn't go through the chipset is the 16 PCI-E 3.0 lanes you mention. With Ryzen, in addition to what is provided by the chipset, the CPU provides

    1) Four USB 3.1 connections
    2) Two SATA connections
    3) 18 PCI-E 3.0 lanes, or 20 lanes if you don't use the SATA connections

    So if you just look at the CPU, Ryzen has more connectivity than Kabylake-X, but the X299 chip set used with Kabylake-X is much more capable (and expensive) than anything in the AMD lineup. Also, the X299 doesn't provide any USB 3.1 ports (or more precisely, 10 gb per second speed ports), so those are typically provided by a separate chip, adding to the cost of X299 motherboards.
  • Allan_Hundeboll - Monday, July 24, 2017 - link

    Interesting review with great benchmarks. (I don't understand why so many reviews only report average frames pr. second)
    The ryzen r5 1600 seems to offer great value for money, but i'm a bit puzzled why the slowest clocked R5 beats the higher clocked R7 in a lot of the 99% benchmarks, Im guessing its because the latency delta when moving data from one core to another penalize the higher core count R7 more?
  • BenSkywalker - Monday, July 24, 2017 - link

    The gaming benchmarks are, uhm..... pretty useless.

    Third tier graphics cards as a starting point, why bother?

    Seems like an awful lot of wasted time. As a note you may want to consider- when testing a new graphics card you get the fastest CPU you can so we can see what the card is capable of, when testing a new CPU you get the fastest GPU you can so we can see what the CPU is capable of. The way the benches are constructed, pretty useless for those of us that want to know gaming performance.
  • Tetsuo1221 - Monday, July 24, 2017 - link

    Benchmarking at 1080p... enough said.. Completely and utterly redundant
  • Qasar - Tuesday, July 25, 2017 - link

    why is benchmarking @ 1080p Completely and utterly redundant ?????
  • meacupla - Tuesday, July 25, 2017 - link

    I don't know that guy's particulars, but, to me, using X299 to game at 1080p seems like a waste.
    If I was going to throw down that kind of money, I would want to game at 1440p or 4K
  • silverblue - Tuesday, July 25, 2017 - link

    Yes, but 1080p shifts the bottleneck towards the CPU.

Log in

Don't have an account? Sign up now