Real World Performance at 3 GHz

For our generational testing, we took each of the four main processors in this test and set their CPU frequencies in the BIOS to 3 GHz. This was achieved through a 30x multiplier and a 100 MHz base frequency, which for each processor is a reduction from stock speed. We fixed each CPU at 3 GHz purely to hold the frequency constant, and ran the memory in each case at the maximum frequency officially supported by the processor. Some benchmarks in the generational tests will probe the memory, and a memory controller that officially supports higher frequencies than an older processor's is a generational upgrade, as important as core or cache performance.
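As a rough illustration of the fixed-frequency setup, the sketch below computes the 30x by 100 MHz target and how far below its stock base clock each chip ends up. The stock base clocks are the ones listed in the CPU table in this section; the variable and function names are illustrative only and are not part of any test tooling.

```python
# Fixed-frequency setup: core clock = multiplier x base clock (BCLK).
MULTIPLIER = 30
BCLK_MHZ = 100
target_mhz = MULTIPLIER * BCLK_MHZ  # 3000 MHz, i.e. 3 GHz

# Stock base clocks in MHz, as listed in the CPU table of this section.
stock_base_mhz = {
    "Athlon X4 845": 3500,
    "Athlon X4 860K": 3700,
    "Athlon X4 760K": 3800,
    "Athlon X4 750K": 3400,
}


def reduction_percent(stock_mhz, fixed_mhz):
    """Percentage drop from the stock base clock to the fixed clock."""
    return 100.0 * (stock_mhz - fixed_mhz) / stock_mhz


reductions = {
    name: reduction_percent(mhz, target_mhz)
    for name, mhz in stock_base_mhz.items()
}

for name, pct in reductions.items():
    print(f"{name}: {stock_base_mhz[name]} MHz -> {target_mhz} MHz ({pct:.1f}% lower)")
```

Every chip is downclocked by a different amount, which is why the fixed-frequency results isolate microarchitecture and memory differences and say little about stock performance.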

AMD CPUs

CPU              µArch / Core              Cores  Base (MHz)  Turbo (MHz)  TDP    DDR3  L1 (I) Cache   L1 (D) Cache   L2 Cache
Athlon X4 845    Excavator / Carrizo       4      3500        3800         65 W   2133  192KB, 3-way   128KB, 8-way   2 MB, 16-way
Athlon X4 860K   Steamroller / Kaveri      4      3700        4000         95 W   1866  192KB, 3-way   64KB, 4-way    4 MB, 16-way
Athlon X4 760K   Piledriver.v2 / Richland  4      3800        4100         100 W  1866  128KB, 2-way   64KB, 4-way    4 MB, 16-way
Athlon X4 750K   Piledriver / Trinity      4      3400        4000         100 W  1866  128KB, 2-way   64KB, 4-way    4 MB, 16-way

Speaking of cache, as mentioned at the beginning of this review, the Athlon X4 845 has a significant advantage in the L1 cache layout, with an L1 data cache twice the size and a move from 4-way to 8-way associativity. Each of these changes, as a broad rule of thumb, decreases the cache miss rate by a factor of about 1.414 (the square root of 2). Combined, we should see roughly a factor of two decrease in L1 data cache misses, and this will affect a number of benchmarks when we compare each processor at a fixed frequency. On the other side of the equation, the L2 cache for the X4 845 is half that of the X4 860K, meaning that if the data is not in the L1, it is less likely to be in the L2, which adds latency.
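The rule-of-thumb arithmetic can be written out as a short sketch. It only applies the square-root-of-two estimate quoted above; the 10% baseline miss rate is a made-up number for illustration, not a measurement, and the function name is hypothetical.

```python
import math

# Rule of thumb from the text: doubling the cache size, or doubling the
# associativity, each cuts the miss rate by roughly sqrt(2).
PER_DOUBLING = math.sqrt(2)


def estimated_miss_rate(baseline, size_doublings, assoc_doublings):
    """Scale a baseline miss rate by sqrt(2) per doubling of size or ways."""
    return baseline / (PER_DOUBLING ** (size_doublings + assoc_doublings))


# Hypothetical baseline L1 data cache miss rate, for illustration only.
baseline = 0.10

# X4 860K -> X4 845: 64KB -> 128KB (one doubling), 4-way -> 8-way (one doubling).
estimate = estimated_miss_rate(baseline, size_doublings=1, assoc_doublings=1)

print(f"baseline miss rate: {baseline:.3f}")
print(f"estimated miss rate after both changes: {estimate:.3f}")
print(f"combined reduction factor: {baseline / estimate:.2f}")
```

This is only an estimate of the trend; real miss rates depend heavily on the workload's access pattern.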

Dolphin Benchmark

Many emulators are bound by single-thread CPU performance, and general reports tended to suggest that Haswell provided a significant boost to emulator performance. This benchmark runs a Wii program that raytraces a complex 3D scene inside the Dolphin Wii emulator. Performance on this benchmark is a good proxy for the speed of Dolphin CPU emulation, which is an intensive single-core task using most aspects of a CPU. Results are given in minutes, where the Wii itself scores 17.53 minutes.

Dolphin Emulation Benchmark

Emulation takes cues from high IPC and base frequency; for our generational testing, however, with the frequency fixed it is all about the microarchitecture. The Carrizo has a 9% advantage here over the Kaveri.

WinRAR 5.01

Our WinRAR test from 2013 is updated to the latest version of WinRAR at the start of 2014. We compress a set of 2867 files across 320 folders totaling 1.52 GB in size – 95% of these files are small typical website files, and the rest (90% of the size) are small 30 second 720p videos.

WinRAR 5.01, 2867 files, 1.52 GB

WinRAR's variable workload benefits from memory bandwidth, and the Kaveri has a strong showing here. The Carrizo has only 2 MB of L2 cache, which most likely puts it at a disadvantage.

3D Particle Movement v2

The second version of this benchmark is similar to the first, however it has been re-written in VS2012 with one major difference: the code has been written to address the issue of false sharing. If data required by multiple threads, say four, is in the same cache line, the software cannot read the cache line once and split the data to each thread - instead it will read four times in a serial fashion. The new software splits the data to new cache lines so reads can be parallelized and stalls minimized. As v2 is fairly new, we are still gathering data and results are currently limited.
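To make the layout change concrete, here is a minimal sketch of which cache line each thread's value lands in, first packed back to back and then padded so each value gets its own line. This is not the benchmark's code: the 64-byte line size, the 4-byte value size, and the helper name are assumptions for illustration.

```python
CACHE_LINE_BYTES = 64  # assumed line size, typical for x86 CPUs
VALUE_BYTES = 4        # assumed: one 32-bit value per thread
NUM_THREADS = 4        # "say four", as in the text


def cache_lines(stride_bytes, num_threads=NUM_THREADS, line_bytes=CACHE_LINE_BYTES):
    """Return the cache-line index holding each thread's value.

    Thread t's value is assumed to start at byte offset t * stride_bytes
    in a buffer aligned to a cache-line boundary.
    """
    return [(t * stride_bytes) // line_bytes for t in range(num_threads)]


# Packed layout: values sit back to back, so all four share one line.
packed = cache_lines(VALUE_BYTES)

# Padded layout: each value is spaced a full line apart.
padded = cache_lines(CACHE_LINE_BYTES)

print("packed layout, line per thread:", packed)
print("padded layout, line per thread:", padded)
```

The padded layout spends more memory per value in exchange for threads no longer contending for the same line, which is the trade the rewritten benchmark makes.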

3D Particle Movement v2.0 beta-1

We saw this in our laptop Carrizo testing: when the software is adjusted to avoid false sharing (which otherwise decreases performance), the Excavator microarchitecture pulls out a significant lead in 3DPMv2. Part of this is most likely down to the larger L1 data cache as well.

Web Benchmarks

On lower-end processors, general usability is a big factor in the user experience, especially as we move into the HTML5 era of web browsing.

WebXPRT 2013

WebXPRT

This benchmark can be memory intensive, as it draws various graphs and applies filters to pictures, among other things. The smaller L2 cache hurts the Carrizo here.

Google Octane v2

Google Octane v2

In contrast, Octane attempts to stay as close to the execution ports as possible, and the Carrizo cores take an 18% lead over Kaveri.

131 Comments

  • artk2219 - Thursday, July 14, 2016 - link

    They had too many parts that weren't hitting their mobile TDPs, or they just baked more chips than were needed on the mobile side. Either way, why let them sit in a warehouse or toss them at a loss, when for a very small amount you can just throw them into your standard desktop package and make some extra sales.
  • TheinsanegamerN - Thursday, July 14, 2016 - link

    Carrizo and kaveri did not use hypertransport. They would have to re-engineer their chip to work on AM3+, and to be frank, the AM3+ market is just too small to justify the tiny margins they would get.

    That money is better spent on getting zen out of the door.
  • neblogai - Thursday, July 14, 2016 - link

    Why invest in upgrading a bad product, when you can sell the same Bulldozer cores till Zen comes? And this Carrizo Athlon is just a by-product of a mobile part and can only be sold for desktop. It all makes sense financially. By the way, the new Bristol Ridge AMD 15W APUs are really nice and competitive, but laptop manufacturers are failing again: for example, the HP Envy x360 comes with the FX-9800P APU, again in a single channel memory configuration, also with an HDD installed and without the possibility to use an SSD. https://hardforum.com/threads/unboxing-1st-impress...
  • TheinsanegamerN - Friday, July 15, 2016 - link

    AMD doesn't take the mobile market seriously. If they did, they would be partnering up with the likes of MSI or Clevo to produce a good laptop line for their APUs, or at the very least make dual channel a strict requirement.
  • The_Countess - Tuesday, July 19, 2016 - link

    AMD unfortunately can't demand much of anything from OEM's currently.

    and as Intel still has a de facto monopoly, no OEM wants to piss off Intel by making a better AMD laptop.
  • nathanddrews - Thursday, July 14, 2016 - link

    So... will there ever be a desktop Carrizo w/IGP? Much of the hype around Carrizo was focused on its very low power video playback, including H.265 hardware encode/decode.
  • stardude82 - Thursday, July 14, 2016 - link

    Isn't that what Bristol Ridge is? But on the new AM3 socket.
  • Arnulf - Thursday, July 14, 2016 - link

    AM4.
  • Pissedoffyouth - Thursday, July 14, 2016 - link

    Why not bang 8 of these cores into a 125w TDP and make it for FM2+ or AM3+? Finally an upgrade for Piledriver on AM3
  • KAlmquist - Friday, July 15, 2016 - link

    If you compare the Athlon 845 with the FX-4350 (link below), the Athlon wins on some benchmarks and loses on others. The Athlon has better IPC, but the FX has a faster clock and a 3rd level cache, leaving no clear-cut winner. If we added an L3 cache to the Athlon chip, that would speed it up, but not by a lot. In other words, Excavator is a big improvement over Piledriver in terms of performance per watt, but not much in terms of absolute performance. An Excavator based FX chip (by which I mean a chip with 8 Excavator cores and 8 MB of L3 cache) would probably be a very marginal improvement over the existing FX lineup at stock frequency, and would have less overclocking potential. I can see why AMD decided not to spend the resources to develop such a chip.

    http://www.anandtech.com/bench/product/1684?vs=127...
