Affordable Dual Core from AMD: Athlon 64 X2 3800+
by Anand Lal Shimpi on August 1, 2005 9:36 AM EST- Posted in
- CPUs
AMD's Efficiency Advantage?
Before we get to the actual barrage of performance tests, there is one issue that we have been wanting to tackle for quite some time now.AMD has often argued that their dual core architecture is inherently more efficient than Intel's, primarily because of their System Request Queue (SRQ). All core-to-core transfers occur via this queue instead of over a main, shared FSB, which is the case in the Pentium D.
Johan put AMD's architecture to the test by measuring the latency of cache-to-cache transfers in AMD's dual core chips vs. Intel's. The results were quite impressively in favor of AMD's architecture. Cache-to-cache transfers on Intel's dual core CPUs took over twice as long as on AMD's dual core CPUs, but at that time, we could not find any real world benefit to the architecture.
Armed with a bit more time, we went through all of our benchmarks and specifically focused on those that received the most performance gain from dual core architectures. Using these multithreaded and/or multitasking benchmarks, we looked at the performance improvements that the dual core processors offered over their single core counterparts. For AMD, making this comparison was easy; we took the Athlon 64 X2 3800+ and compared it to its single core equivalent, the Athlon 64 3200+. For Intel, the comparison is a bit more complicated. The inclusion of Hyper Threading makes the single-core to dual-core jump a little less impressive in some cases, thanks to the fact that virtually all single-core Pentium 4 processors these days can execute two threads simultaneously. Thus, for Intel, we had to look at HT enabled, dual core and dual core with HT enabled, all compared to single core performance to get a complete picture of Intel's multithreaded performance scaling.
Remember that all performance increases are with reference to a single core processor, and in the case of Intel, we are talking about a single core Pentium 4 with HT disabled. More specifically, we used a Pentium D 830 (3.0GHz) for the dual core tests and compared it to its single core counterpart - the Pentium 4 530 (3.0GHz).
First, we have our Winstone 2004 benchmark suite; we omitted Business Winstone 2004, since it shows virtually no performance boost from dual core CPUs and instead, focused on Multimedia Content Creation Winstone 2004 and the Multitasking Winstone tests.
While AMD scales slightly worse than Intel (comparing the AMD Dual Core to the Intel Dual Core rows) in the MMCC Winstone test and significantly worse in the Multitasking 1 test, AMD scales better in the last two tests. Particularly in the third multitasking test, AMD gets a whopping 68.4% from the move to dual core while Intel only improves by 39.1%.
It is also worth noting that although Hyper Threading improves performance with a single core, enabling HT on the dual core CPU actually yields lower overall performance than if we had left it off (+24.1% vs. +39.1%). Johan explained exactly why situations like this exist on the Pentium D in his "Quest for More Processing Power".
Next up is the SYSMark 2004 suite. In all but two of the tests, AMD scales slightly better than Intel when going to dual core. The scaling advantages aren't huge, but they are tangible in some of the tests.
Once again, while Hyper Threading itself tends to impress, HT + dual core gives us a mixed bag of results, sometimes outperforming dual core alone while falling behind other times.
Finally, we have our application-specific benchmarks; here, we have AMD scaling better than Intel in 3 out of the 5 tests, but then in the remaining 2, Intel scales better.
While the Athlon 64 X2 does have much better cache-to-cache transfer latencies than the Pentium D, it appears as if for the most part, those advantages don't surface in real-world desktop usage. That being the case, the Athlon 64 X2 3800+ must outperform the Pentium D 830 based on the performance advantages of its individual cores in order to win this battle, not based on any dual core architectural efficiencies. So, does it?
109 Comments
View All Comments
archcommus - Tuesday, August 2, 2005 - link
Call me old fashioned, but since when was $350 affordable for a CPU? I prefer not over $200. :Dceefka - Tuesday, August 2, 2005 - link
The argument "with Intel you'd need a new motherboard" is invalid if you haven't built anything yet and start from scratch. That would easily leave options open for anyone to chose either. I agree that if your budget can handle it, you should at least consider the X2.People still complaining about the price of the X2 should realize that this is no ordinairy gaming CPU and the newest tech never came cheap. Since the Pentium D is like two cores slapped together, it shouldn't cost anymore than it does.
I wonder if Intel's Pentium D had a slick architecture like the X2, it would be as cheap as the current Pentiums D. It's not the core itself perhaps that increases the cost as it is the tech that connects the two like the X2 does. Yes, that's included in the price of an X2 ;-)
SDA - Tuesday, August 2, 2005 - link
Actually, you are wrong. It is the core itself that increases the cost. Larger core equals less cores per wafer and (generally) more defective cores per batch (if the possibility of a defect happening in one square millimeter is one in X..).The technology connecting the two, R&D costs, are paid back in the A64's cost. I suppose in a sense they're paid back in every A64's cost, but the DIFFERENCE between the A64 and A64X2 has nothing to do with slick technology.
coldpower27 - Tuesday, August 2, 2005 - link
There is also something to keep in mind, why shouldn't a processor with with a die size of 199mm2 Toledo core cost 75% more then the 114mm2 San Diego core? I mean you still want to get as much profit as possible per silicon wafer. It doesn't really help your bottom line if you sell more silicon area for a lower price to me.Neither Intel's or AMD's procesor are double for Dual core die size,
Prescott = 112mm2, Smithfield = 206mm2. 84% Increase
San Diego = 114mm2, Toledo = 199mm2. 75% Increase
Venice = 84mm2, Manchester = 147mm2. 75% Increase
Though since Intel is just basically slapping two cores together with arbiter logic, if one core is defective on the silicon wafer, they can salvage a Prescott core from it, AMD can't do this, due to their Dual core implementation, though if the defect is in the cache, they can sell it as a Athlon 64x2, 3800+, 4200+, 4600+.
AMD's pricing structure though currently allows, for more margins on Dual Core processors while for Intel it is the opposite, margins are higher for their Prescott, Prescott-2M. Though they don't have to put up with this situation that much longer as Intel, has economical NetBurst Dual Cores for 65nm process. Though on that process there ar more interesting Dual cores as well.
masher - Tuesday, August 2, 2005 - link
> "...why shouldn't a processor with with a die size of 199mm2 Toledo core cost 75% more then the 114mm2 San Diego...It doesn't really help your bottom line if you sell more silicon area for a lower price to me."Because there are fixed unit costs in addition to the raw cost of processing a square mm of silicon. Costs that add to a lot more than the raw cost itself. You have to package the silicon, test it, pack it, ship it..not to mention R&D it, market it, and sell it. Those costs predominate in most cases. Which is why when AMD or Intel don't cut their prices in half the moment they move to the next lithography node.
Given a zero defect rate, a 75% larger die should be should be maybe a third more costly to sell. But that larger die also increases your defects/wafer by 75% (roughly) as well.
Example. Assuming a 70% yield (30% defect rate) on a single core chip, you'd expect around a 48% yield on the dual-core version. So each wafer gives you (0.75)(.48)/(.70) = 51% of the chips per wafer.
It gets much worse with low yields. For instance, a 50% single-core yield translates to a dual-core yield of a pitiful 12.5%! So when defects are high, you have to stick with small die sizes.
masher - Tuesday, August 2, 2005 - link
> "Neither Intel's or AMD's procesor are double for Dual core die size..."True enough; I spoke loosely. Intel is considerably closer to double, though, which was my point. All else being equal, it should be AMD who can provide a cheaper second core rather than Intel.
> "Though since Intel is just basically slapping two cores together with arbiter logic, if one core is defective on the silicon wafer, they can salvage a Prescott core from it..."
An excellent point, and that may indeed be a larger factor in the price differential than the defect rate.
coldpower27 - Tuesday, August 2, 2005 - link
I don't really call a difference of increases of 9% that much, but I guess it's all a matter of perspective. Though in the end of the day, the difference between Intel Smithfield and AMD's Toledo is no more then 4% approx on die size.There are also other cost advantages Intel enjoys, remember all 90nm production is on 300mm wafer processing, which allows for less waste and simply more die per wafer, and reduced resource use, while AMD won't be there till Q1 2006 when commerical production begins on Fab 36 and the activation of their Charter partner fab.
AMD's also uses SOI technology, which we see has benefits in curbing leakage, but we don't have a good idea on how much this technology adds to the cost of the wafer, from what I have seen, since AMD's han't made proclamations on how inexepnsive it was to implmement, cost is not a strong point of this technology.
masher - Wednesday, August 3, 2005 - link
> "I don't really call a difference of increases of 9% that much"Well, a 12% differential (1-.84/.75) to be technical...but its not huge. The point was just that it exists...and that it favors AMD, not Intel. Sans all the other factors of course.
> "remember all 90nm production is on 300mm wafer processing, which allows for less waste..."
Very true..and the wastage fraction gets worse with increased die size also.
> "AMD's also uses SOI technology...we don't have a good idea on how much this technology adds to the cost of the wafer"
A year ago, SOI wafers were triple the cost of bulk wafers. Probably a good bit less now...and the raw wafer cost doesn't include the processing and consumables cost. Finally, Intel's wafers are hardly bulk-grade either.
coldpower27 - Tuesday, August 2, 2005 - link
Addendum: Coming on the 65nm process :Dceefka - Tuesday, August 2, 2005 - link
Sounds logcial. What I wanted to stretch is: there is still the difference in development costs for the Pentium D and the X2. The D being cheaper to develop than the X2 and then of course the volumes in which Intel can sell its double whopper.