Final Thoughts: Do or Do Not - There is no Try

In this review we’ve covered several important topics surrounding CPUs with large numbers of cores: power, frequency, and the need to feed the beast. Running a CPU is like the inverse of a diet – you need to put all the data in to get any data out. The more pi that can be fed in, the better the utilization of what you have under the hood.

AMD and Intel take different approaches to this. We have a multi-die solution compared to a monolithic solution. We have core complexes and Infinity Fabric compared to a MoDe-X based mesh. We have unified memory access compared to non-uniform memory access. Both are going hard against frequency and both are battling against power consumption. AMD supports ECC and more PCIe lanes, while Intel provides a more complete chipset and specialist AVX-512 instructions. Both are competing in the high-end prosumer and workstation markets, promoting high-throughput multi-tasking scenarios as the key to unlocking the potential of their processors.

The Battle
  Cores/
Threads
Base/
Turbo
XFR/
TB
L3 DRAM
1DPC
PCIe TDP Cost
(8/10)
AMD TR 1950X 16/32 3.4/4.0 +200 32 MB 4x2666 60 180W $999
Intel i9-7900X 10/20 3.3/4.3 +200 13.75 4x2666 44 140W $980
Intel i7-6950X 10/20 3.0/3.5 +500 25 MB 4x2400 40 140W $1499
AMD TR 1920X 12/24 3.5/4.0 +200 32 MB 4x2666 60 180W $799
Intel i7-7820X 8/16 3.6/4.3 +200 11 MB 4x2666 28 140W $593

What most users will see on the specification sheet is this: compared to the Core i9-7900X, the AMD Ryzen Threadripper 1950X has 6 more cores, 16 more PCIe lanes, and ECC support for the same price. Compared to the upcoming sixteen core Core i9-7960X, the Threadripper 1950X still has 16 more PCIe lanes, ECC support, but is now substantially cheaper.

On the side of the 1920X, users will again see more cores, ECC support, and over double the number of PCIe lanes compared to the Core i7-7820X for $100 difference. Simply put, if there is hardware that need PCIe lanes, AMD has the solution.

In our performance benchmarks, there are multiple angles to describe the results we have collected. AMD is still behind when it comes to raw IPC, but plays competitively in frequency. Intel still wins the single threaded tasks, especially those that rely on DRAM latency. AMD pulls ahead when anything needs serious threads by a large amount, and most of the time the memory arrangement is not as much of an Achilles heel as might be portrayed. If a user has a workload that scales, AMD is bringing the cores to help it scale as wide as possible.

Despite Threadripper's design arguably being better tuned to highly threaded workstation-like workloads, the fact that it still has high clocks compared to Ryzen 7 means that gaming is going to be a big part of the equation too. In its default Creative Mode, Threadripper’s gaming performance is middling at best: very few games can use all those threads and the variable DRAM latency means that the cores are sometimes metaphorically tripping over themselves trying to talk to each other and predict when work will be done. To solve this, AMD is offering Game Mode, which cuts the number of cores and focuses memory allocations to the DRAM nearest to the core (at the expense of peak DRAM bandwidth). This has the biggest effect on minimum frame rates rather than average frame rates, and affects 1080p more than 4K, which is perhaps the opposite end of the spectrum to what a top-level enthusiast would be gaming on. In some games, Game Mode makes no difference, while in others it can open up new possibilities. We have a full article on Game Mode here.

If I were to turn around and say that Threadripper CPUs were not pure gaming CPUs, it would annoy a fair lick of the tech audience. The data is there – it’s not the best gaming CPU. But AMD would spin it like this: it allows the user to game, to stream, to watch and to process all at the same time.

You need a lot to do in order to fill 16 cores to the max, and for those that do, it’s a potential winner. For anyone that needs hardcore throughput such as transcode, decode, rendering such as Blender, Cinema 4D or ray-tracing, it’s a great CPU to have. For multi-GPUs or multi-storage aficionados or the part of the crowd that wants to cram a bunch of six PCIe 3.0 x8 FPGAs into a system, AMD has you covered.

Otherwise, as awesome as having 16 cores in a consumer processor is – and for that matter as awesome as the whole Threadripper name is in a 90s hardcore technology kind of way – Threadripper's threads are something of a mixed blessing in consumer workloads. A few well-known workloads can fully saturate the chip – video encoding being the best example – and a number of others can't meaningfully get above a few threads. Some of this has been due to the fact that for the last 8 years, the bread-and-butter of high-end consumer processors have been Intel's quad-core chips. But more than that, pesky Amdahl's Law is never too far away as core counts increase.

The wildcard factor here – and perhaps the area where AMD is treading the most new ground – is in the non-uniform allocation of the cores. NUMA has never been a consumer concern until now, so AMD gets to face the teething issues of that introduction head on. Having multiple modes is a very smart choice, especially since there's a good bit of software out there that isn't fully NUMA-aware, but can fill the CPU if NUMA is taken out of the equation and the CPU is treated as a truly monolithic device. Less enjoyable however is the fact that switching modes requires a reboot; you can have your cake and eat it too thanks to mode switching, but it's a very high friction activity. In the long-term, NUMA-aware code would negate the need for local vs distributed if the code would pin to the lowest latency memory automatically. But in lieu of that, AMD has created the next best thing, as even in an ideal world NUMA is not without its programming challenges, and consequently it's unlikely that every program in the future will pin its own memory correctly.

In that respect, a NUMA-style CPU is currently a bit of a liability in the consumer space, as it's very good for certain extreme workloads but not as well balanced as a single Ryzen. Costs aside, this means that Threadripper isn't always a meaningful performance upgrade over Ryzen. And this isn't a catch unique to AMD – for the longest time, Intel's HEDT products have required choosing between core counts and top-tier single-threaded performance – but the product calculus has become even more complex with Threadripper. There are trade-offs to scaling a CPU to so many cores, and Threadripper bears those costs. So for the consumer market its primarily aimed at, it's more important than ever to consider your planned workloads. Do you need faster Handbrake encoding or smoother gameplay? Can you throw enough cores at Threadripper to keep the beast occupied, or do you only occasionally need more than Ryzen 7's existing 8 cores?

AMD has promised that the socket will live for at least two generations, so Threadripper 2000-series when it comes along should drop straight in after a BIOS update. What makes it interesting is that with the size of the socket and the silicon configuration, AMD could easily make those two ‘dead’ silicon packages into ‘real’ silicon packages, and offer 32 cores. (Although those extra cores would always be polling at far memory speeds).

This is the Core Wars. A point goes to the first chip that calculate the Kessel run in under twelve parsecs.

Analyzing Creator Mode and Game Mode
Comments Locked

347 Comments

View All Comments

  • Vorl - Thursday, August 10, 2017 - link

    the answer to both of you is that "this is a High end PC processor, not a workstation CPU, and not a server CPU. That was clearly covered at the start of the article.

    If you want raw number crunching info, there will be other sites that are going to have those reviews, and really, maybe anandtech will review it in that light since it really is such a powerful CPU in another review for server stuff.

    Also, there is a LOT of value in having a standardized set of tests. Even if a few tests here and there are no longer valuable like PDF opening, the same tests being used across the board are important for BENCH. you can't compare products if you aren't using the same tools.

    Unfortunately AMD is ahead of the curve currently with massive SMP being given to normal consumers now at a reasonable price. It will take a little time for dev's to catch up and really make use of this amazing CPU.

    With the processing power in a CPU like this imagine the game mechanics that can be created and used, For those of us that are more interested in making this a reasonably priced workstation/server build for VMs etc, cool for us, but that isn't where this is being marketed, and it's not really fair to jump all over the reviewer for it.
  • Zstream - Thursday, August 10, 2017 - link

    Utter rubbish. This CPU is designed for a workstation build. Some a product labeled Xeon is a workstation CPU, but this isn't?
  • mapesdhs - Friday, August 11, 2017 - link

    Yeah, TR doesn't really look like something that's massively aimed at gamers, it has too many capabilities and features which gamers wouldn't be interested in.
  • pm9819 - Friday, August 18, 2017 - link

    AMD themselves call it a consumer cpu. Is Intel paying them as well
  • Lolimaster - Friday, August 11, 2017 - link

    It's a HEDT/workstation, a year ago people called Workstation a dual Xeon 8 cores, which a sole 1950X replicates.

    Intel draws a line not supporting ECC, AMD supports ECC in all their main cpu's server or not all the way back to Athlon 64.

    16cores/32threads, ECC, 64 pci-e lanes, upgrade path to 32cores/64threads with zen3. Smells Workstation to me.

    Another thing is server cpu's which EPYC is, with features tailored to it, like a massive core count with low clock speeds to maximize efficiency and damn expensive mobos without any gamerish gizmo, just think to put on building without looking at net. TR can do a bit of that too, but optimized to an all around performance and budget friendly.
  • Ian Cutress - Thursday, August 10, 2017 - link

    Dan sums it up. Some of these tests are simply check boxes - is it adequate enough.

    Some people do say that an automated suite isn't the way to do things: unfortunately without spending over two months designing this script I wouldn't have time for nearly as much data or to test nearly as many CPUs. Automation is a key aspect to testing, and I've spent a good while making sure tests like our Chromium Compile can be process consistent across systems.

    There's always scope to add more tests (my scripts are modular now), if they can be repeatable and deterministic, but also easy to understand in how they are set up. Feel free to reach out via email if you have suggestions.
  • Johan Steyn - Thursday, August 10, 2017 - link

    Ian, I understand that you see them as checkboxes, but this is not a normal CPU John doe is going to buy. It has a very specific audience and I feel you are missing that audience badly. I guy that buys this to use for rendering or 3Dstudio Max, is not going to worry about games. Yes, it would be a great bonus to also be OK at it. Other sittes even did tests of running rendering as well as play games at the same time. TR shined like a star against Intel. This is actually something that might happen in real life. A guy could begin a render and then while waiting, decide to play a game.

    I would not buy TR to open pdf's, would I?
  • Ian Cutress - Thursday, August 10, 2017 - link

    No, but you open things like IDEs and Premiere. A PDF test is a gateway test in that regard with an abnormally large input. When a workstation is not crunching hard, it's being used to navigate through programs with perhaps the web and documents in tow where the UX is going to be indicative of something like PDF opening.
  • Lolimaster - Friday, August 11, 2017 - link

    Including useless benchs not only you waste target audience time, you too having to write and upload images from that useless benchs instead of making the article more interesting.

    How about a "the destroyer for HEDT/Workstion", a typical productivy load + some gaming, out of a sudden people will get TWICE the cpu resources, they can do things they couldn't before on the same machine.

    They could get a dual socket mobo with 2x10c Xeons paying the hefty premium with pathetic clock speeds if they wante to game a bit while doing work, TR fixed that, with mass consumer type of gaming performance while reducing the multicore costs by more than half (cores counts + ECC support without paying intel tax).
  • Lolimaster - Friday, August 11, 2017 - link

    And that audience few months ago was limited to do their productivity thing with 6-8 cores or 10 paying the huge intel tax, probably they couldn't game without hurting other things and had a 2 secondary PC for killing time.

    With TR and the massive 16 core count they can finally do all of that off a single PC or focus the entire powerhorse when they need (leaving things do work during their sleep).

Log in

Don't have an account? Sign up now