Final Words

We can't draw too many sweeping conclusions based on the data here today, but here's what we do know:

- Gaming performance is much improved with Barcelona over K8, this is most likely a result of the improvements to the SSE engine and the wider front end of the core.

- Encoding performance is improved (again SSE128 rearing its head), but 3dsmax saw an unexpectedly large performance improvement.

- With a 10 - 15% increase in performance on average, Phenom should be more competitive than K8 was on the desktop (as expected).

Here's where things get complicated; we knew Phenom/Barcelona would be faster clock for clock, it was only a matter of how big of an improvement we'd get. If we are to believe that 15% is the best we'll get on average, taking into account that Penryn is around 5% faster than Conroe, the updated architecture from AMD alone isn't enough to really compete with Intel. In other words, price matters.

We saw how competitive AMD became after the first round of price cuts this year, but after the second set Intel went back to dominating. The trouble for AMD this time around is that Phenom is a much larger chip than the outgoing Athlon 64 X2, whereas Intel's Penryn family will actually be smaller than Conroe. AMD is already losing a considerable amount of money each quarter, so fabbing a larger chip at the same price as current CPUs will only make the situation worse. However, Intel can afford to continue to keep its processors as aggressively priced, especially moving to 45nm.

To put it plainly: Phenom/Barcelona make this price war more difficult on AMD, while Penryn makes it easier on Intel. What's the end game? Is there a solution? We're not sure, all we know today is a starting point for Phenom expectations.

Barc Scaling: 2.0 vs. 2.5GHz
Comments Locked

70 Comments

View All Comments

  • MadBoris - Monday, September 10, 2007 - link

    hmm, especially if it is only @cas5, as mentioned above.
    It will be interesting to see if it yields anything more than just a few percent, as to scaling, and if benefits compound per socket.
    As to one socket and 4 cores I don't really envision it being that much more than a few percent, but then again, I'm not investing any thought or speculation to try and figure out what will be answered when it actually matters and HW is available.

    Major point for me is, being able to OC a q6600($280) to 3.2GHz - 3.4GHz on air is going to be real stiff competition for AMD's Phenom, as to my purchasing decisions, which is all I am concerned about mainly.

    Also I believe all peoples talk about "true" quad is going to fall a bit flat for the majority of applications/games in real world comparisons with Kentsfield. Because already anyone that is interested to research it can see that the cache/bus penalties in scaling from 2 to 4 cores is basically nonexistent on applications that actually 'fully leverage' all 4 cores. Some apps will benefit, but I expect this to come to light before long and people will see that the penalty of 2 cores in one (Intel Quad), was more speculation, than actual reality, for 'most' consumer applications and games.

    I do like AMD's advances but we seriously need more frequency, CPI cannot be overlooked.
  • duploxxx - Monday, September 10, 2007 - link

    if AMD is already able to show multiple phenom systems on 3.0GHZ without dditional cooling (just boxed heatpipe cooler) then i wouldn't be too worried about oc performance of k10
  • ilkhan - Monday, September 10, 2007 - link

    15% over K8 is not going to be enough if it launches at (or at least doesn't overclock easily to) 3.2Ghz. At the 2.5 indicated here, yorkfield@3.2+ is going to eat agena for lunch, while being more profitable for intel than agena can hope to be for AMD.
    Pity.
  • JackPack - Monday, September 10, 2007 - link

    Based on these numbers, consumers are likely going to stick with Intel quads.

    Clock for clock, Kentsfield was often >30% faster than Quad FX. Barcelona being 15% faster than K8 is reasonable but it's clearly not going to touch Penryn/Yorkfield.
  • duploxxx - Monday, September 10, 2007 - link

    Alltough it is nice to see what anand tried to put here on electronic paper. I can't be compared to the real phenom in a few months.

    If you want to know why, check Anand's memory review of a year a go and check how well k8 and also k10 is scaling with better/faster memory.

    in a barcelona rig you have reg 667@cas5.

    so people who are already making conclusions on these benches, one reply: too early.
  • JackPack - Monday, September 10, 2007 - link

    This isn't K8 though. The L3 in Barcelona is going to make it less sensitive to memory bandwidth and latency.
  • Regs - Monday, September 10, 2007 - link

    Memory hits and misses (latency) have nothing to do with the L3. The L3 is there as a buffer for the information being proportion to the 4 cores.
  • JackPack - Monday, September 10, 2007 - link

    Look up the term "memory hierarchy."
  • Regs - Tuesday, September 11, 2007 - link

    what do you think pulls the data into the L3? God?
  • JackPack - Tuesday, September 11, 2007 - link

    It's called prefetching. The data is in the L3 before the CPU needs it, reducing memory traffic and latency.

    Not only that, but Barcelona has a L3 latency of 20ns. To get data from the main memory, it has to go through all levels of cache. When you look at the cumulative latency of the memory hierarchy, the one or two cycle penalty of RDDR2 is trivial.

Log in

Don't have an account? Sign up now