Intel Developer Forum - Beijing 2007: Penryn and Intel's High End GPU
by Anand Lal Shimpi on April 16, 2007 9:00 PM EST- Posted in
- CPUs
The Long Awaited Penryn Update
In an unprecedented move, Intel made a very full disclosure of its first 45nm processor family, codenamed Penryn a couple of weeks ago. There were some vague elements of the initial Penryn disclosure that we’ve since cleared up.
First off, Penryn is designed to support up to a 1600MHz FSB, however we wondered whether desktop chips would even see the faster FSB support given that we haven’t so much as heard of support for it on Intel’s upcoming 3 series chipsets (e.g. P35, X38). It turns out that Intel is only confirming 1600MHz FSB support for Penryn based Xeon processors for the HPC market, not for the mobile or desktop markets. This tells us two things: 1) Intel is feeling AMD’s bandwidth advantage and strength in the HPC market and is using the faster FSB to help level the playing field, and 2) the desktop will most likely not see a FSB faster than 1333MHz.
Remember that with Nehalem being introduced in 2008, Intel will begin shifting away from its aging FSB architecture to a point-to-point interface akin to what AMD introduced with the K8 back in 2003. It doesn’t make a lot of sense for Intel to invest much money into moving cost focused desktop platforms to 1600MHz FSB only to abandon the efforts in a year’s time. While Intel hasn’t said anything, we’re expecting Penryn desktop parts to be 1333MHz FSB only, which makes sense given that the upcoming P35 chipset officially supports a maximum FSB frequency of 1333MHz.
Intel also mentioned that Penryn would support SSE4, but is its implementation complete or will we have to wait until Nehalem for that? It turns out that Penryn will support a total of 47 SSE4 instructions, not the full implementation of the ISA extensions. There will be an additional 7 instructions that Intel is stating will come in future microprocessors, we’re assuming that Intel is talking about Nehalem but it’s not yet set in stone.
The two interesting power related technologies that will make their debut with Penryn will apparently be mobile-only for now. Intel’s C6 state and EDAT (Enhanced Dynamic Acceleration Technology) will only be supported on mobile Penryn platforms given the nature of the two features. As a recap, the C6 power state allows for an extremely low power operating mode, the closest to a full reset of the CPU, while idle. Data is completely expelled from the on-die caches and the caches themselves are powered off, while core voltage is reduced to the lowest amount allowed by the process. The CPU’s state is saved in some on-chip storage, then the majority of the chip is powered down into a virtually off state. Recovery from C6 is possible, the state is read back from the CPU and the chip is powered up as it would from reset but with memory of what it was doing before it entered C6. The wakeup process does take some time (not noticeable to the user) thus it impacts performance and is suitable for mobile environments where the impact on battery life is worth the reduction in system performance.
Intel’s EDAT is the other mobile-only Penryn technology Intel talked about in its disclosure, and it allows the increase in clock speed of one core on a mobile Penryn when the other core is not in use. The idea is simple: in a notebook you are constrained by the cooling system used, not by the maximum clock speed attainable by the CPU itself. When running single threaded applications (or multithreaded applications with only one CPU intensive thread), the remaining core can power down reducing the total thermal footprint of the CPU itself. An EDAT enabled mobile Penryn core can then detect that only one core is being used and increase the clock speed of that operational core by a single speed bin (e.g. 2.40GHz to 2.66GHz) in order to provide a boost in performance to that one active thread. Once again, EDAT will be mobile-only.
Finally, with regards to motherboard support, Intel isn’t making any guarantees about Penryn’s backwards compatibility. While Penryn will still use the LGA-775 socket that Prescott and Conroe have used, motherboard support will require more than just the presence of the socket. If the appropriate VRM spec is implemented, then Penryn will work on your LGA-775 motherboard, the problem is that motherboard manufacturers haven’t yet released information on which of their boards will support the Penryn VRM changes. If history repeats itself, you can expect very limited official support for Penryn in currently shipping motherboards and guaranteed support with boards based on Intel’s new 3 series chipsets (e.g. P35). We did see Penryn up and running on an Intel BadAxe2 board, but it had a hardware VRM modification done to it in order to properly support Penryn. Penryn may also be able to work on boards without a VRM mod, however at increased (potentially out-of-spec) voltage settings.
At IDF Beijing Intel unveiled a little more about Penryn performance; it compared a quad-core 3.33GHz (1333MHz FSB) Yorkfield with 12MB of L2 cache (2 x 6MB per dual core die) to a quad-core Core 2 Extreme QX6800 2.93GHz (1066MHz FSB) Kentsfield with 8MB of L2 cache (2 x 4MB). According to Intel’s own benchmarks, Intel saw a 15% increase in imaging related applications, 25% in 3D rendering tests, greater than 40% in games, and a greater than 40% increase in video encoding performance when SSE4 support was utilized.
Obviously some of the performance improvement can be attributed to the higher clock speed and faster FSB of the Yorkfield system, while the remaining would be due to architectural enhancements and larger cache of Penryn. The percentage improvement Intel is indicating with Penryn is quite high, but as we’re comparing across different clock speeds it’s a bit of a skewed comparison. Don’t expect Penryn to have the same performance impact that Conroe did upon its introduction, but rather expect an evolutionary continuation of the performance we’ve seen from Intel thus far. Unlike the other P in Intel’s codename history, there are no terrible surprises with Penryn that will result in a step back in performance.
13 Comments
View All Comments
Roy2001 - Tuesday, April 17, 2007 - link
BTW, I am not fan of any brand. I just buy the best. I switched from P3 to Athlon, now I switched back to Core2.DeepThought86 - Tuesday, April 17, 2007 - link
Wow, this IDF is like deja vu - of AMD's plans that is. Torrenza ripped off, Quad FX ripped off, on-die mem controller ripped off, Fusion ripped off, GPGPU ripped off.Even if AMD dies, at least their ideas will live on. Looks like Intel just does a search/replace on AMD's plans and announces it 1-3 years later under a new name.
IntelUser2000 - Tuesday, April 17, 2007 - link
Most of them, if not all aren't even AMD's idea. If you are an engineer at a CPU company such as Intel/AMD, they would think of the ideas long time ago. ALL AMD did was bring to market. Oh wait, they only brought two on the market, and one is completely useless(Quad FX). If Barcelona came before Conroe, people would think the architecture enhancements were copied by Intel. But since Conroe came first, people assume AMD copied Intel. Which does sound pretty stupid to assume.
DeepThought86 - Wednesday, April 18, 2007 - link
Given your username and your comment, I'd say you switch your brain off when you hear the word Intel so there's no point arguing the facts. Intell...droool...gooooodRoy2001 - Tuesday, April 17, 2007 - link
AMD is in real trouble as I can see.Slaimus - Tuesday, April 17, 2007 - link
Didn't Intel acquire 3DLabs? The 3DLabs Realizm line was pretty impressive in terms of features.
Missing Ghost - Tuesday, April 17, 2007 - link
Quote from Wikipedia:"3Dlabs was previously a graphics card vendor that developed high-end graphics chip technology and marketed its Wildcat, Oxygen and Permedia computer graphics solutions to design professionals in the CAD and content creation industries. Formerly independent, it became a subsidiary of Creative Labs in 2002, but announced plans to spin in November 2006."
Visual - Tuesday, April 17, 2007 - link
considering AMD were talking about their best k10 also having a 40% better performance than intel's best quad (and at the time that was at lower clocks, wasn't it) in some floating point benchmark, this performance preview of the penryn must sound scary to them...i wonder if AMD can implement this sse4 thing or their k10 is too far in development already, as it seems to have quite an effect.
wooter - Tuesday, April 17, 2007 - link
Tolapai is an SoC designed for embedded markets that essentially combines CPU, north/south bridge, and network processor. No integrated graphics sorry.DigitalFreak - Monday, April 16, 2007 - link
I would hope that Intel would be smart enough to introduce a new workstation chipset that supports dual CPUs but does not require FB DIMMs for this project. Don't see why they'd need more than 4 DIMM slots on the boards.