AMD's Efficiency Advantage?

Before we get to the actual barrage of performance tests, there is one issue that we have been wanting to tackle for quite some time now. 

AMD has often argued that their dual core architecture is inherently more efficient than Intel's, primarily because of their System Request Queue (SRQ).  All core-to-core transfers occur via this queue instead of over a main, shared FSB, which is the case in the Pentium D. 

Johan put AMD's architecture to the test by measuring the latency of cache-to-cache transfers in AMD's dual core chips vs. Intel's. The results were quite impressively in favor of AMD's architecture.  Cache-to-cache transfers on Intel's dual core CPUs took over twice as long as on AMD's dual core CPUs, but at that time, we could not find any real world benefit to the architecture.

Armed with a bit more time, we went through all of our benchmarks and specifically focused on those that received the most performance gain from dual core architectures.  Using these multithreaded and/or multitasking benchmarks, we looked at the performance improvements that the dual core processors offered over their single core counterparts.  For AMD, making this comparison was easy; we took the Athlon 64 X2 3800+ and compared it to its single core equivalent, the Athlon 64 3200+.  For Intel, the comparison is a bit more complicated.  The inclusion of Hyper Threading makes the single-core to dual-core jump a little less impressive in some cases, thanks to the fact that virtually all single-core Pentium 4 processors these days can execute two threads simultaneously.  Thus, for Intel, we had to look at HT enabled, dual core and dual core with HT enabled, all compared to single core performance to get a complete picture of Intel's multithreaded performance scaling. 

Remember that all performance increases are with reference to a single core processor, and in the case of Intel, we are talking about a single core Pentium 4 with HT disabled.  More specifically, we used a Pentium D 830 (3.0GHz) for the dual core tests and compared it to its single core counterpart - the Pentium 4 530 (3.0GHz). 

First, we have our Winstone 2004 benchmark suite; we omitted Business Winstone 2004, since it shows virtually no performance boost from dual core CPUs and instead, focused on Multimedia Content Creation Winstone 2004 and the Multitasking Winstone tests. 

While AMD scales slightly worse than Intel (comparing the AMD Dual Core to the Intel Dual Core rows) in the MMCC Winstone test and significantly worse in the Multitasking 1 test, AMD scales better in the last two tests.  Particularly in the third multitasking test, AMD gets a whopping 68.4% from the move to dual core while Intel only improves by 39.1%. 

It is also worth noting that although Hyper Threading improves performance with a single core, enabling HT on the dual core CPU actually yields lower overall performance than if we had left it off (+24.1% vs. +39.1%).  Johan explained exactly why situations like this exist on the Pentium D in his "Quest for More Processing Power".

Next up is the SYSMark 2004 suite.  In all but two of the tests, AMD scales slightly better than Intel when going to dual core.  The scaling advantages aren't huge, but they are tangible in some of the tests. 

Once again, while Hyper Threading itself tends to impress, HT + dual core gives us a mixed bag of results, sometimes outperforming dual core alone while falling behind other times.

Finally, we have our application-specific benchmarks; here, we have AMD scaling better than Intel in 3 out of the 5 tests, but then in the remaining 2, Intel scales better. 

Out of the 15 tests, 10 of them showed that AMD scaled better from single to dual core than Intel, while the remaining 5 showed the opposite, that Intel scales better.  Out of the 10 tests where AMD offered better scaling, only 6 of them showed AMD outscaling Intel by more than a 3% margin (one test had AMD with a 2.9% advantage, but it was close enough, so we counted it).  Of the 5 tests where Intel scaled better, 4 of them had Intel at an advantage by more than 3%.

While the Athlon 64 X2 does have much better cache-to-cache transfer latencies than the Pentium D, it appears as if for the most part, those advantages don't surface in real-world desktop usage.  That being the case, the Athlon 64 X2 3800+ must outperform the Pentium D 830 based on the performance advantages of its individual cores in order to win this battle, not based on any dual core architectural efficiencies.  So, does it?

New Pricing, but Higher Cost per Core? Head to Head: Athlon 64 X2 3800+ vs. Pentium D 830
Comments Locked

109 Comments

View All Comments

  • dougSF30 - Monday, August 1, 2005 - link

    No, in that Toledo die can have 1/2-cache disabled, and run as 512K L2 x 2

    However this page is silly for other reasons-- they probably had a Manchester 4200+, not Toledo, and more importantly, it just shouldn't make any difference, as the 89W TDP is given to both Toledo and Manchester 4200+s. They are both Rev E parts, and with equal frequency and enabled cache size, should on average be almost identical, as not much changed from E4 to E6.
  • dougSF30 - Monday, August 1, 2005 - link

    This whole page should go.

    (1) Your 4200+ was probably a Manchester part, not a Toledo

    (2) Toledo vs. Manchester doesn't really matter in terms of power. Rev E4 and E6 are basically the same. AMD is simply able to choose a lower official TDP for the lower-rated parts (4200+ and 3800+), whether they are made from the Manchester die OR the Toledo die. (Both 4200+ "BV" and 4200+ "CD" receive an 89W TDP from AMD, along with the new 3800+ "BV")
  • Doormat - Monday, August 1, 2005 - link

    If I had the money. Surely though, this is a great value and I can see people getting this chip (and subsequently OCing it to 2+GHz). Mainstream performance is about to get a kick in the pants.
  • dougSF30 - Monday, August 1, 2005 - link

    All the updates fixing the '4200+/4600+ were already manchester' issue just vanished...
  • Houdani - Monday, August 1, 2005 - link

    Page 5. Third table.

    This table shows AMD with higher numbers in 3 out of 4 benches, yet AMD is given the win in all 4. Did Intel get shafted by the math here? Perhaps the D830 managed to eke out 2 wins overall?

    Yes, yes. Nitpick. Just keeping you honest.

    Also (page 1) was AMD really late to the desktop dual core scene? While Intel did announce earlier, availability-wise weren't both companies on par?
  • SDA - Monday, August 1, 2005 - link

    Higher numbers do not equal higher performance in all of these tests. To quote the review (what, didn't you see the caption?): "The iTunes scores are Encoding Times in Minutes, lower numbers are better."

    And no, Intel's dual-cores were definitely available before AMD's. I distinctly remember hearing a lot of complaints about this. The gap doesn't seem that big in retrospect, but trust me, it was there.
  • Houdani - Monday, August 1, 2005 - link

    captions = good

    Agreed -- sometimes higher is better (frames) and sometimes lower is better (time). So in that regard, wouldn't all the encode tests (and 3dsmax) be measured in time? If so, then AMD would have lost 3 of the 4, rather than the other way around. Gah!

    What am I missing here .. are the Windows Media Encoder HD and the Divx 6 Encode measured in some other way besides time?
  • SDA - Monday, August 1, 2005 - link

    No, not all are measured in time. If that were the case, it would have been stated, I'd think.

    Video encoding can also be measured in frames encoded per second. It usually IS measured that way, because that way higher == better (as our minds tend to think).
  • dougSF30 - Monday, August 1, 2005 - link

    Or at least, I noticed it back on July 12 on AMD's quick reference guide. Both BV and CD.



  • coomar - Monday, August 1, 2005 - link

    the intel memory was rated at 3-3-3@533, thats pretty good, i think the lowest you can go on ddr2 is 3-2-2@533, but that would be a lot more than 2-2-2@200 ddr ram, isn't the intel board still 50 bucks more than the amd 939 boards, then the x2 3800 would be a compeititor with the 2.8 p-d but only for a month or two until they get the prices down on the new intel chipsets

    though ddr2 seems to have an easier time with larger ram modules, 1gb modules in particular

    i'll be interested in the overclocking of this chip

Log in

Don't have an account? Sign up now