To say that AMD has been uncharacteristically quiet lately would be an understatement of epic proportions. The company that had been so vocal about their K8 architecture in the past will hardly say anything at all about future products, extending even to its forthcoming AM2 platform. In just two months AMD is scheduled to officially unveil its first DDR2 platform (Socket-AM2), but we've heard virtually nothing about performance expectations.

Back in January we sought to discover for ourselves what AMD's Socket-AM2 platform would have in store for end users. You'll remember that when Intel made the shift to DDR2 it basically yielded no tangible performance improvement, and we were all quite afraid that the same would be true of AM2. When we finally tested the AM2 samples that were available at the time, performance was absolutely dismal. Not only could AMD's AM2 not outperform currently shipping Socket-939 platforms, but due to serious issues with the chip's memory controller performance was significantly lower.

Given that AMD was supposed to launch in June at Computex, the fact that AM2 was performing so poorly just five months before launch was cause for worry. Despite our worries, we elected not to publish benchmark results and to give AMD more time to fix the problems. We're not interested in creating mass panic by testing a product that's clearly premature.

In February we tried once more, this time with a new spin of the AM2 silicon, but performance continued to be lower than Socket-939. Luckily for AMD, the performance had improved significantly, so it was slower than Socket-939 but not as much as before.

The next revision of the AM2 silicon we received sometime in March, and this one finally added support for DDR2-800, which is what AM2 will launch with supposedly at Computex. With the launch only three months out, we expected performance to be at final shipping levels, and we were left disappointed once more. Even with DDR2-800 at the best timings we could manage back then, Socket-AM2 was unable to outperform Socket-939 at DDR-400.

That brings us to today; we're now in the month of April, with less than two months before AMD's official unveiling of its Socket-AM2 platform at Computex in June, and yes we have a brand new spin of AM2 silicon here to test. We should note that it's not all AMD that's been holding AM2 performance behind. The motherboard makers have of course gone through their fair share of board revisions, not to mention the various chipset revisions that have changed performance as well. Regardless, according to internal AMD documents, AM2 CPUs are going to start being sold to distributors starting next month, leaving very little time for significant changes to the CPU to impact performance. We feel that now is as good of a time to preview AM2 performance and put things into perspective as we're likely to get before the official launch.

What's AM2?
Comments Locked

107 Comments

View All Comments

  • mino - Tuesday, April 11, 2006 - link

    1) 3-cycle L1 on K7/K8 is the fastest required, it goes from the internal structure if the scheduler and the pipeline that 2-cycle chache would do almost no good. Also they would have to reduce L1 size to 32k+32k which would hurt. It simply does not make sense to change L1 at all, maybe on K8L but IMHO 128k+128k would help much more than 2-cycle latency.

    2) 17-cycle L2 is PRETTY GOOD for 1M L2 with exclusive structure!!! IMHO it is possible to do 16-cycle, maybe 15, but nowhere near Dothan's 10-cycle. Also remember lower-latency L2 has scaling problems (that's why intel made prescott's L2 slower than NW's)

    3) Concerning the memory subsystem(caches + memory) (on single-socket K8/K8L) the biggest issue is the robustness(amount of on the fly acceses to memory) and latency of the memory controller. To solve this is not trivial thing. IMHO to add 2-4M L3 with random access ~50 cycles would do.

    4) In the >4 sockets front all they need is effective caching of MOESI snoops.

    You are also forgot K7/K8 is mostly KISS architecture. It is just wery well balanced so has good performance in the end. However do one wrong change and you are screwed.
    KISS == Keep It Simple Silly

    About "weak" SIMD implementation on AMD, don't fool yourselves guys. Only x86 architecture faster than K8 on SSE/SSE2 is Netburst aka SIMD-by-intel.

    About conroe, ita has twice as wide ALU's and FPU's than PIII/K7/K8, this means it has huge resources at disposal to calculate SIMD.
    Same goes for K8L 2 quarters later. That said K7/K8 core has far more FP power than P6 architecture. On FP Conroe and K8 are about aquall.
    but K8L will wipe the floor with K8 and Conroe on FP. Conroe will wipe K8 on INT and be still faster than K8L by decent margin.

    Overall we are for another PIII vs. K7 battle with single very important change - AMD has a platform it had not back in the K7 vs. PIII days.
  • fitten - Thursday, April 13, 2006 - link

    I find the K8L a somewhat odd strategy. I guess they are targeting the Itanium market because Opterons already have a good part of the HPC market. Given that the HPC people are the ones that really care about FPU performance and that they are still a fairly small market segment, it seems an odd target. Integer performance rules the roost for servers... web, database, and just about everything else you can think of other than number crunching simulations and the like. Desktop uses for FPU are a few like games and some mathmatical stuff. Intel is focusing on integer performance at least as much as FPU with Conroe (Conroe gets a good dose of both), which makes sense to me since so much of the work done on computers, both desktops and servers, is dominated by integer operations. K8L speculation says only FPU horsepower will be added... just doesn't seem like a sound decision to me.
  • Zoomer - Monday, April 10, 2006 - link

    Hey anand, could you take out 1 of the two modules and do a quick test on that?

    With doubled (in theory) bandwidth with ddr2, wouldn't the dual channel mem controller be even more redundant? Perhaps we'll see a new 754-ish socket? :)
  • Zoomer - Monday, April 10, 2006 - link

    Hey anand, could you take out 1 of the two modules and do a quick test on that?

    With doubled (in theory) bandwidth with ddr2, wouldn't the dual channel mem controller be even more redundant? Perhaps we'll see a new 754-ish socket? :)
  • Furen - Monday, April 10, 2006 - link

    I dont believe we will. Even S1 will be dual-channel, and this is what would have benefited the most from being single-channel (since the pincount would be much lower the package could be much smaller).
  • BaronMatrix - Monday, April 10, 2006 - link

    Looking at the intensive timing and bus speed tweaks USING the SAME RAM as the latest XE955 article I would have expected the same kind of thing here. Anand doesn't look at lower speed lower latency for whatever chip he used. That RAM will do 3-2-2 at 667. Obviously AMD is more sensitive to latency.
  • ChristTheGreat - Monday, April 10, 2006 - link

    AMD is sensitive to latencies, cause of the memory controller. I'm sure that 3-2-2-9 DDR2 from OCZ, would give much more performance on AMD.

    Again, this is only a CPU that they use to test, so it's not the true CPU. They wouldn't give us the performance it gives before it's launch. That's like killing yourself right now if the performance is poor....

    I saw an article, that AMD could be working on DDR2 latencies. You think that 4-4-4-12 is good timings? 12 = tRAS

    "tRAS is the time required before (or delay needed) between the active and precharge commands. In other words, how long the memory must wait before the next memory access can begin."

    In fact, you have better frequencies, but lower timings.... What you need, is higher frequencies, and lower timings.

    So we will have to wait till they launch Socket AM2, to know the true performance of AM2.
  • defter - Monday, April 10, 2006 - link

    quote:

    You think that 4-4-4-12 is good timings?


    4-4-4-12 are good timings, even for DDR2-667. It isn't easy to find reasonable priced DDR2-667 that works on those timing with standard voltage.


    Some people forget that 99% of consumers won't be using super expensive overvolted 3-3-3-10 DDR2-800 memory just to get few percents of extra performance. And if you compare AMD CPU + super fast DDR2-800 against Intel CPU (which runs fine on DDR2-667 because of FSB limitation) then you need to take into account higher price of memory on AMD system.
  • Wesley Fink - Monday, April 10, 2006 - link

    We are continuing to test the AM2 on different AM2 boards. On another motherboard we could run at 3-3-3 DDR2-800 with the OCZ PC2-8000 memory. Latency was a bit lower and bandwidth a bit higher, but nothing realy changed from Anand's conclusions. We have also been running DDR2-667 and DDR2-533 tests with this new super fast OCZ memory and cheaper mainstream DDR2 memory, and we will be sharing those results as soon as testing is complete.
  • cornfedone - Monday, April 10, 2006 - link

    The crap the mobo companies have been shoving out the doors the past couple years is pure garbage as any number of hardware review sites have confirmed. It looks like the AM2 mobos might be more half-baked crap. Until you can test the shipping CPUs on a quality mobo that allows proper memory timing, it's difficult to know what AMD's AM2 CPUs will or won't deliver. If I had a dollar for every bogus claim Intel has made, I'd be a Billionaire so I wouldn't hold my breath that Conroe will perform as Intel claims.

Log in

Don't have an account? Sign up now