Literally Dual Core

One of the major changes with Presler is that unlike Smithfield, the two cores are not a part of the same piece of silicon. Instead, you actually have a single chip with two separate die on it.  By splitting the die in two, Intel can reduce total failure rates and even be far more flexible with their manufacturing (since one Presler chip is nothing more than two Cedar Mill cores on a single package). 


The chip at the bottom of the image is Presler; note the two individual cores.

Intel's architecture, featuring no on-die memory controller, allows for such a split to be made without any major changes.  Even on Smithfield, all traffic between the cores actually had to travel out one core, off the chip and onto the external FSB and then back into the other core.  With Presler, the same type of communication can take place without any disruptions. The only difference is that the data from core to core has a slightly longer distance to travel. 

In order to find out if there was an appreciable increase in core-to-core communication latency, we used a tool called Cache2Cache, which Johan first used in his series on multi-core processors.  Johan's description of the utility follows:
"Michael S. started this extremely interesting thread at the Ace's hardware Technical forum. The result was a little program coded by Michael S. himself, which could measure the latency of cache-to-cache data transfer between two cores or CPUs. In his own words: "it is a tool for comparison of the relative merits of different dual-cores."

"Cache2Cache measures the propagation time from a store by one processor to a load by the other processor. The results that we publish are approximately twice the propagation time. For those interested, the source code is available here."
Armed with Cache2Cache, we looked at the added latency seen by Presler over Smithfield:

   Cache2Cache Latency in ns (Lower is Better)
AMD Athlon 64 X2 4800+ 101
Intel Smithfield 2.8GHz 253.1
Intel Presler 2.8GHz 244.2

Not only did we not find an increase in latency between the two cores on Presler, communication actually occurs faster than on Smithfield.  We made sure that it had nothing to do with the faster FSB by clocking the chip at 2.8GHz with an 800MHz FSB and repeated the tests only to find consistent results. 

We're not sure why, but core-to-core communication is faster on Presler than on Smithfield.  That being said, a difference of less than 9ns just isn't going to be noticeable in the real world - given that we've already seen that the Athlon 64 X2's 100ns latency doesn't really help it scale better when going from one to two cores.

Power Consumption and The Test Larger L2, but no increase in latency?
Comments Locked

84 Comments

View All Comments

  • Betwon - Friday, December 30, 2005 - link

    edit:
    Why is it slower the latency of the memory? 101ns is much more than 5x ns. where is the 'on-die' communication? Your test program is wrong?
  • Viditor - Friday, December 30, 2005 - link

    Thanks Anand!

    I don't know if you'll have time, but one of the things lacking in all of the other reviews of the OC XE955 is a comparison to an OC X2 4800...
    Speculation is quite rife, and it would be a good comparison IMNSHO.

    Cheers!
  • Gigahertz19 - Friday, December 30, 2005 - link

    Intel's back...back again...backkkkkkk again..backkkkkkkk again...du dah duh da
  • yacoub - Tuesday, January 3, 2006 - link

    If by "back" you mean finally (barely) able to compete with existing AMD performance, then yes. ;P

Log in

Don't have an account? Sign up now