Cache and Memory Performance

I mentioned earlier that cache latencies are higher in order to accommodate the larger caches (8MB L2 + 8MB L3) as well as the high frequency design. We turned to our old friend cachemem to measure these latencies in clocks:

Cache/Memory Latency Comparison
  L1 L2 L3 Main Memory
AMD FX-8150 (3.6GHz) 4 21 65 195
AMD Phenom II X4 975 BE (3.6GHz) 3 15 59 182
AMD Phenom II X6 1100T (3.3GHz) 3 14 55 157
Intel Core i5 2500K (3.3GHz) 4 11 25 148

Cache latencies are up significantly across the board, which is to be expected given the increase in pipeline depth as well as cache size. But is Bulldozer able to overcome the increase through higher clocks? To find out we have to convert latency in clocks to latency in nanoseconds:

Memory Latency

We disable turbo in order to get predictable clock speeds, which lets us accurately calculate memory latency in ns. The FX-8150 at 3.6GHz has a longer trip down memory lane than its predecessor, also at 3.6GHz. The higher latency caches play a role in this as they are necessary to help drive AMD's frequency up. What happens if we turn turbo on and peg the FX-8150 at 3.9GHz? Memory latency goes down. Bulldozer still isn't able to get to main memory as quickly as Sandy Bridge, but thanks to Turbo Core it's able to do so better than the outgoing Phenom II.

L3 Cache Latency

L3 access latency is effectively a wash compared to the Phenom II thanks to the higher clock speeds enabled by Turbo Core. Latencies haven't really improved though, and Bulldozer has a long way to go before it reaches Sandy Bridge access latencies.

The Impact of Bulldozer's Pipeline Windows 7 Application Performance
Comments Locked

430 Comments

View All Comments

  • Hrel - Wednesday, October 12, 2011 - link

    yes, I use it as a term that means being cheap. My friends often call me jewish cause I hunt for bargains pretty relentlessly.
  • silverblue - Friday, October 14, 2011 - link

    I get called Scottish for the same thing. ;)
  • poohbear - Wednesday, October 12, 2011 - link

    so disappointing to read this. What on earth were they doing all this time?? AMD's NEW cpu can't even outperform its OLD CPU? well atleast i can stick with my PhenomII X6 till Ivy Bridge comes out & thank goodness i didnt buy a pricey AM3+ before reading reviews.:p So sad to see AMD has come to this.....
  • OutsideLoopComputers - Wednesday, October 12, 2011 - link

    I think when Anand publishes benchmarks with a couple of Bulldozers working together in a dual or quad-socket board (Opteron), THEN we will see why AMD designed it the way they did. If the FX achieves parity and sometimes superiority in heavily multithreaded apps vs Sandy Bridge in a single socket, then imagine how two or four of these working together will do in server applications vs Sandy Bridge Xeon. I'll bet we see superiority in most server disciplines.

    I don't think this silicon was designed to go after Intel desktop processors, but to perform directly with dual and quad socket Xeon.

    Its intended to be an Opteron right now, and as an afterthought-to be sold as an FX desktop single socket part, to bridge the gap between A-series and Opteron.
  • JohanAnandtech - Wednesday, October 12, 2011 - link

    Indeed. The market for high-end desktop parts is very small, with low margins, and shrinking! The mobile market is growing, so AMD A6 en A8 CPUs make a lot more sense.

    The server market keeps growing, and the profit margins are excellent because a large percentage of the market wants high end parts (compare that to the desktop market, where almost every one wants the midrange and budgets). the Zip and crypting benchmarks show that Bulldozer is definitely not a complete failure. We'll see :-)
  • g101 - Wednesday, October 12, 2011 - link

    Good to see an intelligent reviewer that knows how to do more than run synthetic benchmarks and games.

    It's funny seeing all the uneducated gamer "complete failure" comments.
  • bassbeast - Thursday, February 9, 2012 - link

    I'm sorry but you are wrong sir and here is why: They are marketing this chip at the CONSUMER and NOT the server, which makes it a total Faildozer.

    If they would have kept P2 for the consumer and kept BD for the Opteron then you sir would have been 100% correct, but by killing their P2 they have just admitted they are out of the desktop CPU business and for a company that small that is a seriously DUMB move. Their Athlon and P2 have been the "go to" chip for many of us system builders because it gave "good enough" performance in the apps that people use, but Faildozer is a hot pig of a chip that is worse for consumer loads in every. single. way. over the P2.

    I'm just glad i bought an X6 when i did, but when i can no longer get the P2 and Athlon II for new builds i'll be switching to intel, the BD simply is worthless for the consumer market and NEVER should have been marketed to it in the first place! so please get off your high horse and admit the truth, the BD chip should have never been sold for anything but servers.
  • haplo602 - Wednesday, October 12, 2011 - link

    This is a server CPU abused for the desktop.

    Have a look at FPU performance. Almost clock for clock (3.3G vs 3.6G) it beats 6 FPU units in Phenom X6. That's quite nice.

    Once they do some optimisations on a mature process, this will achieve SB performance levels. However until then I am going for 2389 optys ....
  • GourdFreeMan - Wednesday, October 12, 2011 - link

    You introduce the fact that AMD lengthened the pipeline transitioning to Bulldozed without explicitly mentioning the pipeline length. How many stages exactly is Bulldozer's pipeline?
  • duploxxx - Wednesday, October 12, 2011 - link

    Well there clearly seems to be something wrong with the usage of the modules in combination with the way to high latency on any cache and memory. single threaded performance is hit by that and so does lack any gaming performance.

    So I hope anandtech can have a clear look at the following thread and continue to seek further:
    http://www.xtremesystems.org/forums/showthread.php...

    secondly during OC just like previous gen, do something more with NB oc in stead of just upping the GHZ, there is more to an architecture then just the ghz....

Log in

Don't have an account? Sign up now