The Long Awaited Penryn Update

In an unprecedented move, Intel made a very full disclosure of its first 45nm processor family, codenamed Penryn a couple of weeks ago. There were some vague elements of the initial Penryn disclosure that we’ve since cleared up.

First off, Penryn is designed to support up to a 1600MHz FSB, however we wondered whether desktop chips would even see the faster FSB support given that we haven’t so much as heard of support for it on Intel’s upcoming 3 series chipsets (e.g. P35, X38). It turns out that Intel is only confirming 1600MHz FSB support for Penryn based Xeon processors for the HPC market, not for the mobile or desktop markets. This tells us two things: 1) Intel is feeling AMD’s bandwidth advantage and strength in the HPC market and is using the faster FSB to help level the playing field, and 2) the desktop will most likely not see a FSB faster than 1333MHz.

Remember that with Nehalem being introduced in 2008, Intel will begin shifting away from its aging FSB architecture to a point-to-point interface akin to what AMD introduced with the K8 back in 2003. It doesn’t make a lot of sense for Intel to invest much money into moving cost focused desktop platforms to 1600MHz FSB only to abandon the efforts in a year’s time. While Intel hasn’t said anything, we’re expecting Penryn desktop parts to be 1333MHz FSB only, which makes sense given that the upcoming P35 chipset officially supports a maximum FSB frequency of 1333MHz.

Intel also mentioned that Penryn would support SSE4, but is its implementation complete or will we have to wait until Nehalem for that? It turns out that Penryn will support a total of 47 SSE4 instructions, not the full implementation of the ISA extensions. There will be an additional 7 instructions that Intel is stating will come in future microprocessors, we’re assuming that Intel is talking about Nehalem but it’s not yet set in stone.

The two interesting power related technologies that will make their debut with Penryn will apparently be mobile-only for now. Intel’s C6 state and EDAT (Enhanced Dynamic Acceleration Technology) will only be supported on mobile Penryn platforms given the nature of the two features. As a recap, the C6 power state allows for an extremely low power operating mode, the closest to a full reset of the CPU, while idle. Data is completely expelled from the on-die caches and the caches themselves are powered off, while core voltage is reduced to the lowest amount allowed by the process. The CPU’s state is saved in some on-chip storage, then the majority of the chip is powered down into a virtually off state. Recovery from C6 is possible, the state is read back from the CPU and the chip is powered up as it would from reset but with memory of what it was doing before it entered C6. The wakeup process does take some time (not noticeable to the user) thus it impacts performance and is suitable for mobile environments where the impact on battery life is worth the reduction in system performance.

Intel’s EDAT is the other mobile-only Penryn technology Intel talked about in its disclosure, and it allows the increase in clock speed of one core on a mobile Penryn when the other core is not in use. The idea is simple: in a notebook you are constrained by the cooling system used, not by the maximum clock speed attainable by the CPU itself. When running single threaded applications (or multithreaded applications with only one CPU intensive thread), the remaining core can power down reducing the total thermal footprint of the CPU itself. An EDAT enabled mobile Penryn core can then detect that only one core is being used and increase the clock speed of that operational core by a single speed bin (e.g. 2.40GHz to 2.66GHz) in order to provide a boost in performance to that one active thread. Once again, EDAT will be mobile-only.

Finally, with regards to motherboard support, Intel isn’t making any guarantees about Penryn’s backwards compatibility. While Penryn will still use the LGA-775 socket that Prescott and Conroe have used, motherboard support will require more than just the presence of the socket. If the appropriate VRM spec is implemented, then Penryn will work on your LGA-775 motherboard, the problem is that motherboard manufacturers haven’t yet released information on which of their boards will support the Penryn VRM changes. If history repeats itself, you can expect very limited official support for Penryn in currently shipping motherboards and guaranteed support with boards based on Intel’s new 3 series chipsets (e.g. P35). We did see Penryn up and running on an Intel BadAxe2 board, but it had a hardware VRM modification done to it in order to properly support Penryn. Penryn may also be able to work on boards without a VRM mod, however at increased (potentially out-of-spec) voltage settings.

At IDF Beijing Intel unveiled a little more about Penryn performance; it compared a quad-core 3.33GHz (1333MHz FSB) Yorkfield with 12MB of L2 cache (2 x 6MB per dual core die) to a quad-core Core 2 Extreme QX6800 2.93GHz (1066MHz FSB) Kentsfield with 8MB of L2 cache (2 x 4MB). According to Intel’s own benchmarks, Intel saw a 15% increase in imaging related applications, 25% in 3D rendering tests, greater than 40% in games, and a greater than 40% increase in video encoding performance when SSE4 support was utilized.

Obviously some of the performance improvement can be attributed to the higher clock speed and faster FSB of the Yorkfield system, while the remaining would be due to architectural enhancements and larger cache of Penryn. The percentage improvement Intel is indicating with Penryn is quite high, but as we’re comparing across different clock speeds it’s a bit of a skewed comparison. Don’t expect Penryn to have the same performance impact that Conroe did upon its introduction, but rather expect an evolutionary continuation of the performance we’ve seen from Intel thus far. Unlike the other P in Intel’s codename history, there are no terrible surprises with Penryn that will result in a step back in performance.

Index Project Larrabee
Comments Locked

13 Comments

View All Comments

  • rqle - Monday, April 16, 2007 - link

    My fear for AMD is that intel can clock their chip relatively higher and can ramp it up much faster than AMD. K10 will probably prove to be a stronger Quad chip clock for clock. But intel easy response is going to ramp up its Penryn chip up to 3.5+ range either matching or beating AMD chips and then KO? with its Nehalem onboard controller chip VERY shortly after the plan AMD launch.
  • fikimiki - Tuesday, April 17, 2007 - link

    You forget that every MHz means more power to consume. 3.5GHz chip will have 150W TDP or more. And 45nm won't help.
  • defter - Tuesday, April 17, 2007 - link

    Why wouldn't 45nm help? At 65nm, Intel will achieve 3GHz for quad core 130W TDP parts. It's quite logical to assume that Intel will hit at least 3.6GHz for quad core parts in Q1 2008 if necessary with 130W TDP naturally. That's only 20% clockspeed increase at the same TDP.

Log in

Don't have an account? Sign up now