Memory Latency and Bandwidth

Currently, there are only two manufacturers of desktop Pentium M motherboards who are selling into the channel - AOpen and DFI. Both AOpen and DFI's motherboards came about not because of widespread consumer demand, but because they each had one customer that needed a Pentium M motherboard for a specific application. Once the boards were designed and built, they were later repackaged and made available to the public as an afterthought.

The major issue with both of these motherboards is that they are based on the 855GME chipset. The 855GME only features AGP 4X support, but the killer is in its single-channel DDR333 memory controller. Without DDR400 support, the 855GME starves the Pentium M for bandwidth, as it is only capable of delivering 2.7GB/s of bandwidth to main memory while the Pentium M at 2.0GHz needs 3.2GB/s of bandwidth to remain most efficient. Overclocking the memory bus is somewhat of an option, but not exactly the most desirable one for reasons that we will get to later.

One solution is Intel's recently released mobile 915 chipset, which features a dual-channel DDR1/DDR2 memory controller. The dual channel controller is more than capable of supplying the appropriate memory bandwidth to the Pentium M, if not a bit overboard, but right now mobile 915 isn't an option on the desktop.

With an unsatisfactory amount of memory bandwidth, the Pentium M will undoubtedly be held back in performance in applications where memory bandwidth is most important. As we all know, memory bandwidth and latency are interdependent, so let's see how the latency to main memory compares.

For our memory latency tests, we once again turn to ScienceMark 2.0:

 CPU  Memory Latency
(in ns)
AMD Athlon 64 3200+ (2.0GHz) 50ns
Intel Pentium 4E 560 (3.6GHz) 80ns
Intel Pentium M 755 (2.0GHz) 80ns

With an on-die memory controller, the Athlon 64 obviously offers the lowest latency memory access out of the group. The reason why we used a 2.0GHz Athlon 64 for this comparison was to show the memory latency seen by a CPU clocked identically to the Pentium M. As strong as the Pentium M's branch predictor may be, the trip to main memory will always be longer than the Athlon 64 - increasing the penalty from having a longer pipeline.

When you compare the Pentium M to the Pentium 4, you see the real harm in only having a single channel DDR333 memory controller - the time for the Pentium M to get to main memory is very similar to that of the Pentium 4, even when the latter is using higher latency DDR2 memory. High memory latency will send the performance of the Pentium M tumbling as soon as it leaves the sanctity of its low latency L2 cache.

Low Latency L2 Cache Floating Point Performance
Comments Locked

77 Comments

View All Comments

  • Lupine - Wednesday, February 16, 2005 - link

    I'm surprised at these results. I'm setting up a new Dell Inspiron 9200 (M 725 @ 1.6GHz/400MHz FSB) and it is schooling both my Barton 2500+ @ 2.2GHz and TBred B 1700+ @ 2.2GHz running Stanford's Folding@Home project (600 point proteins: ~37min per frame for the XP boxes compared to ~34min per frame w/ the laptop).

    So, if it is so weak, what is allowing it to process WUs at such a competitive rate? Sure, that is slower than an A64, but competitive w/ most P4 procs.
  • fitten - Thursday, February 10, 2005 - link

    Something else to remember about the Banias/Dothan line of chips... Agressive power reduction was the #1 goal of the design process. In a 'normal' chip design, not all pipeline stages are the same length, the clock speed it runs at is the speed of the slowest part of the CPU. Since power usage is directly related to the frequency of the switching gates, the Intel engineers actually deliberately slowed down some parts of the chip to match the target release speeds (or get close to them) to reduce power consumption. This is, perhaps, the main reason why the frequencies don't scale so well as some would want them to scale.
  • Visual - Thursday, February 10, 2005 - link

    here's another thought... when the opterons launched initially at ECC DDR266, there were similar comments like "give it unbuffered DDR400 or higher and stay out of its way" :) well, now that we have that, ok it did improve performance a bit. but not hugely. shouldn't help the dothan significantly more too.
  • Visual - Thursday, February 10, 2005 - link

    I like how AMD got beaten by the P-M :) not because im intel fan, just because this will make things more interesting now.

    don't catch flame from this comment :p its my oppinion

    Funny how you picked the game benchmarks btw, its almost as if you wanted to show the P-M lacking behind the A64... from what I've seen it beats A64 in HL2 and CSS, and that's a game you don't skip usually :) so why now?

    Also looks suspicious how in lots of tests where P-M performs well with the A64 clock-for-clock or beats it, there is almost no difference in the 3800+ and 4000+ results... like if L2 isnt all that important, yet L2 is exactly how everyone explains the P-M success

    Maybe we'll see some 2MB L2 A64 "emergency edition" once Dothan gets a decent desktop chipset, just like what intel did to (try to) save P4 from the A64 :)
    actually i'd be happy if Dothan motivates AMD to develop faster L2 cache or something.

    Knowing Intel, i dont expect they'd even try to match AMD's prices with the P-M... and there's a lot of room for AMD to decreace prices, as they're selling with quite a margin now. So for sure the P-M won't be cost-effective compared to A64, not if you don't care for ultra-low power consumption at least.

    also it doesn't look likely Dothan could scale beyond 2.6GHz on current 90nm tech. by the time it gets there, AMD should've launched the 2.8 FX and most likely 3GHz too. so I have no doubts AMD will keep the lead for quite a while... maybe the race to 65nm will be the next turning point, as it seems its going smooth for intel (at least for P-M)

    anyway, even if AMD is better in absolute performance, pricepoint and (arguably) clock-for-clock, you gotta admit it to the P-M, it does quite a punch. fun times are coming :)
  • Zebo - Wednesday, February 9, 2005 - link

    dobwal buy intel if you want mhz, AMD is for performance.
  • dobwal - Wednesday, February 9, 2005 - link

    i wasn't referring to the FX series. Plus you are not understanding the point i was trying to make. Lets take a look at the FX series.

    OPN Model Operating Freq. Package ADAFX55DEI5AS FX55 2600MHz 939-Pin
    ADAFX53DEP5AS FX53 2400MHz 939-Pin
    ADAFX53CEP5AT FX53 2400MHz 940-Pin
    ADAFX51CEP5AT FX51 2200MHz 940-Pin
    ADAFX51CEP5AK FX51 2200MHz 940-Pin

    the first FX51 was release around late third quarter 2003. So in a little over a year the FX series has only increased 400 Mhz. Can you automatically assume that the FX has poor scalability in terms of cpu speed. NO. You know why, because the EE is underperforming and can't touch the FX. AMD has no need to push large scale speed increases out of the FX line, which would do nothing but increase cost with each new stepping it used to boost performance.

    The same goes for the Dothan at 2.26Ghz by the end of 2005. What other cpu offers the same level of performance vs. battery life. So why push for performance except to push sales.

    You simply can't determine the scalabiltiy of a cpu based on its roadmap especially when its the performance leader in its market segment and has no current viable competitor or one in the near future.
  • Aileur - Wednesday, February 9, 2005 - link

    Oh and, superpi relies on the fpu to do its calculations, so so much for this fpu is crap trend we have going here.

    http://mod.vr-zone.com.sg/Aopen_i855_review/25sPIm...
  • Aileur - Wednesday, February 9, 2005 - link

    Oh and before you start bragging about the better superpi1mb result of the a64
    http://www.akiba-pc.com/DFI_855/d17g_2608_spi1m.gi...

    this is 1 sec better, with 100mhz less, and single channel ram.
  • Aileur - Wednesday, February 9, 2005 - link

    Since you seem to like xtremesystems
    http://www.akiba-pc.com/DFI_855/d15g_2435_spi1m.PN...
    also a 1ghz overclock, also on default voltage

    Id like to see how an a64 would perform on a kt266 (if that were possible)

    Give the pentium m time to mature and all those "OMG HAHA YOU CPUZ IS SO HOT LOLOL!!!1111" will be obsoleet.
  • Zebo - Wednesday, February 9, 2005 - link

    58 "How long has A64 been stuck on 2.4Ghz."
    ----------------------------------

    There not. 2.6 FX-55 been out for months. More importantly AMD does'nt have to release new chips the way they dominate the benchmarks now. Could they? Hell ya.They got a nice buffer going, New FX's hit 3.0 on stock air. Cheap 90nm's are now hitting 2.7 on default Vcore and air. And by air I mean AMD's cheap all aluminum HS with a itty bitty 15mmx70mm fan, not Prescotts copper core screamers.

    T8000- You're clueless. Maybe it's the heat generated by your prescott making your head woozy, I dunno, but have a look here..1800 Mhz to 2800 Mhz on default Vcore stock fan.
    http://www.xtremesystems.org/forums/showthread.php...

Log in

Don't have an account? Sign up now