Conclusions

The art to building a good CPU is balance: you want something that is fast for individual streams of instructions and data, but also fast for multiple streams. You need something that is also power efficient, high yielding, and can be put together quite easily, with software out there already able to take advantage of what you have made.

“Opportunities multiply as they are seized.”

AMD has succeeded at a time when its competitor has struggled. As AMD launched its Zen 2 hardware across its Ryzen and EPYC product lines, built on TSMC’s 7nm, Dr. Lisa Su the CEO stated in interviews to AnandTech that:

We've executed our roadmap from the previous five years and we’re extending it into the next 5 years, all while assuming our competition will be competitive and even beating their public targets.’

At a time when Intel is struggling with its 10nm manufacturing process, AMD is targeting where Intel should have been if it had executed to time. The fact that Intel has suffered issues has benefited AMD, with its latest Ryzen and EPYC CPUs taking high praise. The follow on from these has been Threadripper, and the first two Zen 2 based Threadripper CPUs were quite good. I even used the word ‘bloodbath’ in the review, it was that impressive compared to what Intel had to offer.

Read our Initial Threadripper 3000 Series Review Here

With this third Threadripper 3000 processor, the 3990X, AMD is hoping to capitalize on its successes. The concept here is relatively simple: more of the same. Double the high-performance Zen 2 cores, at only slightly lower frequencies per core, for the same power – if a user has the right workload, then it’s the ideal processor.

And there-in likes the crux of this CPU; what is the right workload?

“Know yourself and you will win all battles”

One of the continual talking points about new CPUs is if the ecosystem is ready for them, especially with AMD pushing core counts ever higher. There’s no point having a million cores if everything is written for a few cores – not everyone runs a thousand copies of the same workload at the same time. Unfortunately this is what happened here with the 3990X. We’re in a situation where only a few software packages (that we tested) work great with the CPU, but it’s also the operating system that’s behind.

In our reviews, I prefer Windows for both comfort but also because a lot of the user base is on Windows. We typically use Windows 10 Pro, but because this CPU has 128 total threads, the regular version of Windows 10 Pro has issues – we had to move to Windows 10 Enterprise in order to see a difference. The alternative was to disable simultaneous multithreading, taking us back to one thread per core, which actually worked really well for a lot of tests, but also left some performance on the table. We suggest that 3990X users who typically have Windows 10 Pro do one of these two things: either disable SMT or use Win10 Pro for Workstations/Enterprise. This issue is down to how Windows tracks processor groups, an adage from multi-socket platforms, which shouldn’t apply here but because it’s hard coded into the OS when we have above 64 threads, it’s a pain.

Then there’s also the workload issue: we saw a number of tests, like Corona, Blender, and even NAMD, work great, which points to rendering and scientific compute benefiting from such a high core count processor. However other programs, such as 7-zip, LuxMark, Photoscan, and others did not see much (if any) of an improvement in performance compared to AMD’s own 32-core CPU.

I’ve heard a lot of silicon engineers say that adding cores helps, but adding frequency helps everything. The question then becomes whether you target workloads that can scale out (more cores) best, or whether scaling up (more frequency) is a better solution. We either end up with target CPUs for one or the other, or a combination CPU that tries to do both.

“[He] who wishes to fight must first count the cost”

In this review we evaluated two directions for AMD’s 64-core 3990X. The first was at the consumer/prosumer level, looking up to improve on their high-end desktop system. The second was at the enterprise level, looking down to see if that single 64-core CPU is actually worth it compared to a dual socket system. The conclusion might shock you. (It might not.)

For the first stage, the consumer/prosumer level, our conclusion is that the usefulness of the 3990X is limited. Aside from a few select instances (as mentioned, Corona, Blender, NAMD) the 32-core Threadripper for half the price performed on par or with margin. For this market, saving that $2000 between the 64-core and the 32-core can easily net another RTX 2080 Ti for GPU acceleration, and this would probably be the preferred option. Unless you run those specific tests (or ones like it), then go for the 32 core and spend the money elsewhere. Aside from the core count there is little to differentiate the two parts.

The second stage, the enterprise level, it becomes a no brainer to consolidate a dual socket system into a single AMD CPU – the initial outlay cost is substantially lower, and the long term power costs also come into play. This is what the enterprise likes to combine into ‘Total Cost of Ownership’, or TCO. The TCO and performance advantage of AMD here is plain to see in the benchmarks and the pricing. The situation gets a little muddier when we compare which AMD CPU to choose from: typically a server market wants RDIMM memory, which only comes from the EPYC processors. The difference between the 64-core EPYC 7702P and Threadripper 3990X is minor in terms of cost (under $500), and each CPU has its benefits: EPYC gets more PCIe lanes (128 vs 64) and more memory (8 channel RDIMM vs 4 channel UDIMM), while Threadripper gets better frequencies (2900/4300 vs 2000/3350) for a higher TDP (280W vs 200W). From a server perspective, if you need more IO or more memory, get the EPYC, otherwise Threadripper merits consideration.

“Do many calculations [to] lead to victory”

In the end, the situation for the 3990X is not as clear as it was with the 3970X. It’s a good chip, but it’s not the best chip for everything. I will tell you what it is good at though: ever seen Cinebench R20 complete in 16 seconds? Here you go:

A final thought. The AMD TR 3990X is amusingly priced at $3990. It’s a great marketing idea, and gets people talking. I’m proud to say that this price was my idea – AMD originally had it for something different. I don’t often influence change in the industry in such an obvious way, but this one was fun.

AMD 3990X Against $20k Enterprise CPUs
Comments Locked

279 Comments

View All Comments

  • ZoZo - Friday, February 7, 2020 - link

    If not for Intel, they would probably also cost at least $1000. It takes 2 for competition.
  • eva02langley - Friday, February 7, 2020 - link

    Once again software developers are late to the game. MS really needs to upper their game with their OS division because one day, they will lose that monopoly for good. If it was not for the gaming industry, Windows would probably not be where it is today.
  • extide - Friday, February 7, 2020 - link

    I mean you can clearly see that Windows supports it just fine -- you just have to go for the Workstation/Enterprise version. It's not like Windows itself is totally behind the times.
  • Kevin G - Friday, February 7, 2020 - link

    The hard work is indeed done but not configured for the more mundane version of Windows where this certainly fits into the established licensing models: this is a single socket system and NUMA is not necessary here. A simple patch would fix things here.

    Then again as this article points out, MS didn't fix that a Xeon Phi 72xx with up 288 threads would appear as a five socket system. I would imagine that such a workstation too would have benefited from applications recognizing that it could have a single NUMA node (this was configurable in hardware).
  • drothgery - Friday, February 7, 2020 - link

    And some quick googling shows Win 10 Pro Worksation is less than 10% of the cost of this CPU alone, so it's not like it'd be a big deal to anyone who actually bought one.
  • Thanny - Saturday, February 8, 2020 - link

    The Windows kernel is still badly broken when it comes to complicated NUMA scheduling. That's why the 2970WX, 2990WX, and all first-gen EPYC chips (with four dies) perform relatively badly under Windows, but quite well under Linux.

    The 64-thread limitation is quite mild compared to that problem.
  • FunBunny2 - Friday, February 7, 2020 - link

    "If it was not for the gaming industry, Windows would probably not be where it is today."

    not in corporate, it's Office.
  • Makaveli - Friday, February 7, 2020 - link

    Was just going to post this. I know everyone is all over Gaming and RGB. however that means nothing in the enterprise market.

    Microsoft get more revenue from office alone than probably the whole Xbox division and anything they get on the PC gaming side.
  • duvjones - Friday, February 7, 2020 - link

    To be fair, a chip like this is not something that Mircosoft could predict coming in the x64 space. Which is what it giving Linux (and really any POSIX system) it's advantage, this kind of power and core count use to be reserved for the academic corners of high-end computing about 15-20 years ago.... Where Windows simply doesn't apply.
    They manage now, but... Mircosoft's is only making do with a workaround. They will have to address it at some point, the question is when.
  • Whiteknight2020 - Friday, February 7, 2020 - link

    Yeah, because Windows server only supports 64 sockets and unlimited cores.....

Log in

Don't have an account? Sign up now