In this review, we took the newest member of AMD’s desktop processor line, the Athlon X4 845, and pitted it against similar comparison points dating back to the first Bulldozer based desktop processors for the mainstream segment. This new processor uses AMD’s latest microarchitecture, Excavator, to create Carrizo based cores. The Athlon X4 845 uses two Carrizo modules for four total threads, wrapped into a 65W thermal design power window, and would appear to be the only Carrizo based processor AMD is going to release for the FM2+ socket.

The Athlon X4 845 is actually a dressed up laptop processor, modified for the desktop platform. As a result we only get 2 MB of L2 cache rather than the 4 MB for all of our comparison points, but also there is only eight PCIe 3.0 lanes rather than sixteen, which can also have some knock on effects. In this review we wanted to do a direct performance comparison, clock for clock, between the new processor and the older processors. However, some of the design decisions made above the core logic have had an impact in results.

Know Your Comparison

Typically in a review like this, we talk about IPC or ‘instructions per clock’. This is a measure of how efficient the processor is at processing instructions – either a fixed set of instructions in a quicker time or more instructions in a fixed time. There are two main components to the core design that play major roles: the front/back end that actually performs the calculations, and the memory sub-system that provides the data for calcualtions. In order to get the peak IPC for a given test, both of these components need to be running near their limit or be able to compensate if waiting for the other. However, this is often test dependent – some probe the logic more than the memory, and for others the reverse is true. It depends on what you are testing.

In most circumstances, generational processor updates have similar or improved memory sub-system arrangements which makes most comparisons in IPC directly related to the logic in the core. When we compare the Excavator design to Steamroller or Piledriver however, the memory sub-system has changed for better and for worse in our benchmark suite. This makes comparisons between the two sets of core logic difficult, as the memory plays a significant part in the performance. This is wholly benchmark dependent as well. A number of professional benchmark tests are designed specifically to either test one or other of the two segments, so it becomes really important to consider what each benchmark is doing in every case. When doing a good analysis, we can determine if the core-logic has improved (either the processing latency, scheduler, prefetch or other), or if the memory subsystem is the main catalyst for improvements.

That being said, users cannot buy one set of core logic with a different memory sub-system. They come in complete packages, and as a result the full top-down result might only of interest for users wanting to buy today. This requires both the core and the memory to work together to give better performance, so it can be striking if decisions are made to affect that. It also pains both the reviewer and the user if in fact something like the memory sub-system comes in different flavors, depending on how much is spent or if the manufacturer is just trying to sell excess parts.

The March on IPC

Nonetheless, time for the conclusions to this review. Here are the main processors we tested:

AMD CPUs
  µArch /
Core
Cores Base
Turbo
TDP DDR3 L1 (I)
Cache
L1 (D)
Cache
L2
Cache
Athlon
X4 845
Excavator
Carrizo
4 3500
3800
65 W 2133 192KB
3-way
128KB
8-way
2 MB
16-way
 
Athlon
X4 860K
Steamroller
Kaveri
4 3700
4000
95 W 1866 192KB
3-way
64KB
4-way
4 MB
16-way
 
Athlon
X4 760K
Piledriver.v2
Richland
4 3800
4100
100 W 1866 128KB
2-way
64KB
4-way
4 MB
16-way
 
Athlon
X4 750K
Piledriver
Trinity
4 3400
4000
100 W 1866 128KB
2-way
64KB
4-way
4 MB
16-way

The main points of comparison are the caches: the AMD Athlon X4 845 has a double-size L1 data cache with an improved prefetch, but a half-size L2 cache, compared to the Kaveri based X4 860K. It is worth noting that we were not able to source 65W parts to match the X4 845, however one of the most poignant results out of the testing is our IPC performance analysis table after the 3 GHz testing. We set all the processors to 3 GHz, with maximum official supported memory for each, and it went a bit like this:

AMD Average IPC Increases
Benchmark Suite Richland over Trinity Kaveri over Richland Carrizo over Kaveri
Real World 0.8% 8.0% 8.8%
Office -0.1% 11.1% 4.1%
Legacy 0.1% 11.8% 8.5%
Overall
Windows
0.3% 10.3% 7.3%
 
Linux 10.4% 10.5% -12.1%
Gaming -0.4% 12.5% -5.8%

The AMD Athlon X4 845 is a Janus-like product: powerful, yet two-faced. In practically all of our Windows based CPU benchmarks, it scored increases over the previous generation in most part due to the larger L1 data cache but also the improved logic.

The benchmarks that required more memory, such as Agisoft or WinRAR, saw minor decreases, which could be predicted before we started.

However, two major segments saw significant decreases in performance. For our Linux tests, most of these were highly memory sensitive. NPD and NAMD are both scientific matrix solvers, requiring lots of memory accesses, and Redis is a key-value load store known to be highly cache size and latency sensitive – this bought these results down.

The gaming side of the equation is a different story, and the results were fairly consistent across all benchmarks and all GPUs: the X4 845 performs worse than the X4 860K clock for clock. There are two ways to attribute this, as mentioned above: PCIe 3.0 x8 and 2MB of L2 cache. Given previous experience with PCIe lane bandwidth requirements resulting in only a tiny difference in performance, it would seem that the latter has more of an effect on gaming (at this level of CPU power) than one might expect. It means a 6% decrease in performance when clock speeds are identical compared to Kaveri, but still ends up 5% over Trinity and Richland.

Wanting The Full Package

The AMD Athlon X4 845, as mentioned earlier in the review, is outside the regular efficiency range for the Carrizo core design. It was designed to be operated at 15W for the total chip, or 35W for the high power mode. AMD even noted in their slides that at 35W, the Carrizo and Kaveri designs would be similar for efficiency. So to push it to 65W would suggest that Kaveri might even be ahead, given the wider window that Kaveri was designed for. The result of pushing Carrizo to 65W means that there is no integrated graphics, and the frequencies are near but below the competing Kaveri parts, and overclocking is next to zero. What Carrizo relies on is the microarchitectural advances more than anything else.

Our new Athlon, at $70 launch price, competes mainly against the Intel Pentium G3258, known as the overclockable Haswell-based dual core Pentium that was launched for $72. Depending on the retailer, the time of day, how the wind is blowing, or what sale is on, these prices can be as low as $50, along with other Athlon and Pentium processors. The typical price/performance metric becomes more focused on just the performance in this case, and the battle between the two trades blows.

In single threaded environments, the G3258 wins out hands down, by having a 25-50% performance advantage despite having lower clock speeds.

Cinebench R15 - Single Threaded

However, due to having four threads rather than two, the X4 845 wins in any fully multithreaded test, particularly for heavy workloads such as video encoding. The G3258 lacks accelerated AES encryption as well, meaning the X4 845 gets a result 800% higher in that case.

Hybrid x265, 4K Video

Where the waters are muddied is in variable threaded workloads, or memory dependent workloads. The Pentium has larger and quicker caches, meaning that it can take the lead in some multithreaded workloads. But taking into account some benchmarks, like Google Octane, the difference is minimal:

Google Octane v2

When it comes to gaming, it depends on which benchmark/configuration you choose, but for GTA and GRID, when the Athlon is paired up with an AMD graphics card, the Athlon wins, but with an NVIDIA graphics card, the Pentium wins. For Shadow of Mordor and Alien Isolation however, the higher IPC for the Intel processor wins out no matter which GPU is used.

Carrizo, 7th Generation and the Future

When we benchmarked a number of laptops using Carrizo processors, and compared them to a Kaveri laptop, we could instantly tell that the Carrizo microarchitecture was a sufficient jump in the mobile space for performance and power, as long as OEMs would actually use dual channel memory. This was bolstered by the fact that any graphics tests relied on the integrated GPU, which saw enhancements with the new design as well. On the desktop side of the equation, the results are less clear cut. Here we have a microarchitecture with good performance characteristics for compute, but it gets let down in discrete gaming. Moreover, the competition provided by the Pentium G3258 is hard to ignore. The fact that the two processors, at stock, performed similarly for web use is an interesting element in our testing for sure.

AMD’s future will be with Bristol Ridge, using an updated Excavator microarchitecture, and the new line of high-end processors using Zen cores. Both of these are slated for the tail end of the year and/or Q1, anything from 4-8 months ahead. Is it really worth investing in a Carrizo (or Pentium) platform now only to find it has been passed later in the year? While it’s an interesting question, in my opinion it’s probably the wrong question to ask.

Bristol Ridge, using the updated Excavator core, is likely to perform similarly (within single digit %) of Carrizo in raw performance, but it will also have DDR4 and new chipsets to help deal with things like PCIe SSDs, NVMe, upgraded Ethernet and new features (features unknown at this point). For some users, especially building simple machines that just need base storage and some oomph, that will not matter much. If you are a user that slowly upgrades over time (by buying one big upgrade every now and again rather than a full system replacement), then going in for Carrizo (or Kaveri) now should be par for the course. The interesting element is whether to go for Kaveri (X4 880K) or Carrizo (X4 845), especially if the difference is only $20.

Carrizo comes with AMD’s new 95W near-silent cooler, whereas the X4 880K uses the new 125W solution. If the difference is only $20, pitch for the faster Kaveri every time. What you lose in microarchitecture will be made up by frequency and overclocking ability.

If you want to make that jump from Athlon to Zen, from mid-range to AMD’s high-end, then it might be worth investing a summer to earning more for a future system. Even if Zen doesn’t pan out completely (most users have their fingers crossed that Intel will have some competition at last), a bigger system with more storage or a better graphics card is never a bad thing.

Ultimately, the X4 845’s main let down, for gaming at least, would seem to be that 2 MB of L2 cache, and the base processor design aiming at 15W. Bristol Ridge is also aimed around 15W, and should come in 65W flavors (with integrated graphics), and it will be interesting to see what level of cache it has compared to the mobile counterparts.

AMD CPU L2 Cache Levels
Core uArch Cores L2 Cache Mobile L2 Cache Desktop
Excavator v2 Bristol Ridge 4 2x1 MB, 16-way ...?
Excavator Carrizo 4 2x1 MB, 16-way 2x1 MB, 16-way
Steamroller Kaveri 4 2x2 MB, 16-way 2x2 MB, 16-way
Piledriver v2 Richland 4 2x2 MB, 16-way 2x2 MB, 16-way
Piledriver Trinity 4 2x2 MB, 16-way 2x2 MB, 16-way
AMD's Desktop Future: AM4, Bristol Ridge and Summit Ridge
Comments Locked

131 Comments

View All Comments

  • The_Countess - Tuesday, July 19, 2016 - link

    actually bulldozer on 14nm would have been a completely different beast. it would have allowed AMD to use far more transistors per core while still making it way smaller in terms of size. that would have allowed AMD to create a far wider execution core, eliminating most of its bottlenecks.

    the high latency cache would probably still means it wouldn't be great for games but for everything else it would be a far more competitive design.

    it is also 14nm that will allow zen to make such a massive leap in IPC's as it will be a very wide Core, while still being pretty small, something that just can't be done on 28nm.

    bulldozer might not have been the best idea, but being stuck on 32/28nm for so long made all it's issues infinitely worse.
  • nandnandnand - Thursday, July 14, 2016 - link

    "Well better late than never for Andantech,"

    There was no point in Adanantech writing this review, because it is a chip for those people too stupid to wait until Zen. Zen is the only thing that matters.
  • BurntMyBacon - Friday, July 15, 2016 - link

    @nandnandnand: "There was no point in Adanantech writing this review, because it is a chip for those people too stupid to wait until Zen. Zen is the only thing that matters."

    Now, because this review exists, people as yet uninformed have concrete data to avoid decisions that might make them look (as you put it) stupid. There is very much a point.
  • Byte - Thursday, July 14, 2016 - link

    Zen will probably be the RX480 in the CPU world. Better performance, still trounced by the competition, but competently priced.
  • looncraz - Friday, July 15, 2016 - link

    That would be an improvement on the current situation. AMD is pricing their CPUs quite poorly right now.

    An Intel Celeron G3900 is $50 right now. AMD's closest competition is the A6-7400k - at $55.

    Both are dual cores, both are 65W, both have middling (but usable) graphics performance... quite similar at first glance... except the Intel runs at 2.8Ghz and the AMD runs at 3.5Ghz w/ 3.9Ghz turbo and can rather easily exceed 4Ghz when overclocked.

    Sounds like AMD should be taking home the gold on that one, until you find that the Celeron is nearly 25% faster in single threaded programs and is ~40% faster in multi-threaded programs... Bad deal going for the AMD... especially since the same board that hosts the Celeron can accept much faster CPUs and the AMD board simply doesn't have notably more powerful options available - you can upgrade to a quad core, but you won't be getting better single threaded performance no matter how hard you try. You might break even around 5Ghz, if you can manage it...

    AMD has a 40% clock-speed advantage out the gate, but loses by a large margin.
  • bananaforscale - Friday, July 15, 2016 - link

    You know what's funny? The fact that if I want to get a CPU that's faster than the FX-6100 I bought almost 5 years ago I still have to pay more than what I paid for it. Sure, Intel gives better single thread performance but I'd get fewer cores and no overclockability. Then there's the fact that I've been running that original Bulldozer with a 20% OC and it seems more stable than at stock clocks.

    Comparing single data points tells nobody a thing. Anyway, isn't that A6 in your comparison unlocked? :P
  • wumpus - Friday, July 15, 2016 - link

    I'm sure you missed an FX-8320 sale, or you really nailed the low point. Unfortunately Intel can match AMD's performance at nearly the same price, and is cutting off AMD's air supply that way.
  • artk2219 - Monday, July 18, 2016 - link

    Whats crazy is that Microcenter sells the FX 8320E's for $89.99. They also have a motherboard bundle option that you can get for $125 to $170 depending on which board you choose. Theoretically you can get a processor, motherboard, cooler, and memory for the price of a non-K core I5, or just a motherboard and processor for the price of an I3. The unfortunate thing is that not everyone has a microcenter near them, but for the ones that do you can get quite the deal, especially since those 8320E's will easily OC to FX 8350 levels, and more likely 4.2 to 4.6 from a stock clock of 3.2
  • BlueBlazer - Friday, July 15, 2016 - link

    From the leaks plus AMD's vague announcements, all points to AMD's Zen is going to be quite late (right into 2017). Why put use 28nm "placeholder" for AM4 if Zen is due soon? Also Global Foundries only has 14nm LPP which is a low power process. That may mean the frequency is going to be low (just look at the chips made on 14nm LPP like Qualcomm's Snapdragon 820, or even AMD's latest Radeon RX480). Reference http://semiengineering.com/high-performance-and-lo... quote "The “LP” processes are optimized for low power and feature design rules targeted for the lowest leakage, support lower operative voltages, and tend to have the slowest transistors of the three options".
  • wiboonsin - Sunday, November 12, 2017 - link

    I have read your article, it is very informative and helpful for me.I admire the valuable information you offer in your articles. Thanks for posting http://www.fanaticrunningwear.com/

Log in

Don't have an account? Sign up now