The Best Server CPUs part 2: the Intel "Nehalem" Xeon X5570
by Johan De Gelas on March 30, 2009 3:00 PM EST- Posted in
- IT Computing
OLTP: Dell DVD Store on MS SQL Server 2005 | |
Operating System | Windows 2008 Enterprise RTM (64-bit) |
Software | SQL Server 2005 Enterprise x64 SP3 (64-bit) |
Benchmark software | Dell DVDStore 2 |
Database Size | 3.5 GB |
Typical error margin | 2-4% |
DVD Store is a project that the Linux department of Dell developed in 2005 as a test for its internal server laboratory. The DVD Store database schema consists of only eight tables, but it does include stored procedures and transactions. The beauty is that it is available as open source software. This allowed us to turn this into a custom benchmark. With the default settings, the database can only be three sizes: 10MB, 1GB, or 100GB. A 10MB database size is simply too small. A 1GB size does not allow us to scale well, as too much locking contention happens. Those two options are out, but to run DVD Store with a 100GB database as a CPU benchmark, we need to take out a second mortgage on all the houses of our team to pay for the necessary storage racks.
We decided to recompile the test, allowing us to use a 3.5GB database. A 3.5GB database proved to be a good compromise between not needing too much storage speed and making the database scale well to eight cores and beyond. As you can read on our benchmark configuration page, we used a RAID 0 set of six SSDs for the data, and four 15000RPM SAS disks for the logs. We monitored the DQL (Disk Queue Length) to ensure our test was not bottlenecked by the storage subsystem.
Most of the time, our storage subsystem copes well with the transactions (DQL <1), but there are a few brief spikes where the disks are limiting throughput. This means that our fastest CPUs are running at a slightly lower CPU load (Xeon X5570 is at slightly less than 80%) than the slowest CPU (85%). Giving our fastest CPUs an even faster storage system hardly improved performance despite somewhat higher CPU load levels. In reality, it is very unlikely that you will add a few drives because you wish to run your CPU at an 82% instead of 78% CPU load, so we feel this small variation in CPU load is acceptable. This is especially true as the resulting variation in performance is much smaller: we are talking about 2-3% performance variations, well within the error margin of our test.
You can test this OLTP database via a very thin web tier or directly. As the web tier only added noise (it uses the slow ODBC driver!) to our results, we tested directly. All servers were tested in a dual CPU configuration.
The AMD Shanghai has no trouble leaving the older Xeons behind, even at a lower clock speed. The Xeon 5570 does not play in the same class. Thanks to SMT, it is capable of outperforming its older brother by 78% and the competition by 66%. Hyper-Threading gives the Xeon 5570 a 21% performance boost.
One Xeon 5570 server is capable of replacing 3 to 4 older server systems based on the Xeon 50xx series.
44 Comments
View All Comments
rkchary - Tuesday, June 16, 2009 - link
We've a customer who is interested in upgrading to Nehalem. He's running on Windows with Oracle database for SAP Enterprise Portals.Could you kindly let us know your recommendations please?
The approximate concurrent users would be around 3000 Portal users.
Keenly looking forward for your response and if you could state any instances of Nehalem installed in SAP environment for production usage, that would be a great deal of help.
Regards,
Chary
Adun - Thursday, April 9, 2009 - link
Hello,I understand the PHP not-enough-threads explanation as to why Dual X5570 doesn't scale up.
But, can anyone please explain why when you add another AMD Opteron 2384 the increase is from 42.9 to 63.9, while when you add another Xeon X5570 there isn't such an increase?
Thank you for the article,
Adun.
stimudent - Thursday, April 2, 2009 - link
Was it really too much effort to clean off the processor before posting a picture of it? Or were they trying to show that it was used, tested?LizVD - Friday, April 3, 2009 - link
Would you perhaps like us to draw a smiley face on it as well? ;-)GazzaF - Wednesday, April 1, 2009 - link
Well done on an excellent review using as many real-world tests as possible. The VMWare test is a real eye opener and shows how the 55xx can match double the number of CPUs from the last generation of Xeons *AND* crucially save $$$$ on licensing from Windows and MS SQL and other per-socket licensed software, plus the power saving which is again a financial saving if you hire rack space in a datacentre.I eagerly await your own in-house VM tests. Please consider also testing using Windows 2008 Hyper-V which I think doesn't have the 55xx optimisations that the latest release of VMWare has (and might not have until R2?).
Thanks for the time you put in to running the endless tests. The results make a brilliant business case for anyone wanting to upgrade their servers. You must have had the chips a good week before Intel officially launched them. :-) I do feel sorry for AMD though. I'm sure they have plenty of motivation to come back with a vengeance like they did a few years ago.
JohanAnandtech - Thursday, April 2, 2009 - link
Thanks! Good to hear from another professional. I believe the current Hyper Beta R2 already has some form of support for EPT.Our virtualization testing is well under way. I'll give an update soon on our blog page.
Lifted - Wednesday, April 1, 2009 - link
You mention octal servers from Sun and HP for VM's, but does anybody really use these systems for VM's? I can't imagine why anybody would, since you are paying a serious premium for 8 sockets vs. 2 x 4 socket servers, or even 4 x 2 socket servers. Then the redundancy options are much lower when running only a few 8 socket servers vs many 2 or 4 socket servers when utilizing v-motion, and the expansion options are obviously far less w/ NIC's and HBA's. From what I've seen, most 8 socket systems are for DB's.Veteran - Wednesday, April 1, 2009 - link
What i mentioned after reading the review is there are very few benches on benchmarks a little bit favored by AMD.For example, only 1 3DSmax test (so unusefull) at least 2 are needed
Only 1 virtualization benchmark, which is really a shame....
Virtualization is becoming so important and you guys only throw in one test?
Besides that, the review feels a bit biased towards intel, but i will check some other reviews of the xeon 5570
duploxxx - Wednesday, April 1, 2009 - link
Virtualization benchmark come from the official Vmmark scores.However there is something real strange going on in the results...
HP HP ProLiant DL370 G6
VMware ESX Build #148783 VMmark v1.1
23.96@16tiles
View Disclosure 2 sockets
8 total cores
16 total threads 03/30/09
Dell Dell PowerEdge R710
VMware ESX Build #150817 VMmark v1.1
23.55@16tiles
View Disclosure 2 sockets
8 total cores
16 total threads 03/30/09
Inspur Inspur NF5280
VMware ESX Build #148592 VMmark v1.1
23.45@17tiles
View Disclosure 2 sockets
8 total cores
16 total threads 03/30/09
Intel Intel Supermicro 6026-NTR+
VMware ESX v3.5.0 Update 4 VMmark v1.1
14.22@10 tiles
View Disclosure 2 sockets
8 total cores
16 total threads 03/30/09
So lets see all the prebuilds of esx3.5 update 4 get a real high score of 16 tiles almost as much as a 4s shanghai while Vmware performance team themselves stated that we should never see the HT core as a real cpu in Vmware (even with the new code for HT) while yet the benchmark shows a high performance increase, no not like anandtech is stating that this is due to the more available memory and its bandwith, those Vmmarks are not memory starving. Now look at the official Intel benchmark with ESX update 4, it provides 10 tiles and a healthy increase, that from a technical point of view seems much more realistic. All other marketing stuff like switching time etc, all nice, but then again is within the same line of current shanghai.
JohanAnandtech - Wednesday, April 1, 2009 - link
What kind of tests are you looking for? The techreport guys have a lot of HPC tests, we are focusing on the business apps."very few benches on benchmarks a little bit favored by AMD."
That is a really weird statement. First of all, what is a test favored by AMD?
Secondly, this new kind of testing with OLTP/OLAP testing was introduced in the Shanghai review. And it really showed IMHO that there was a completely wrong perception about harpertown vs Shanghai. Because Shanghai won in the tests that mattered the most to the market. While many tests (inclusive those of Intels) were emphasizing purely CPU intensive stuff like Blackscholes, rendering and HPC tests. But that is a very small percentage of the market, and that created the impression that Intel was on average faster, but that was absolutely not the case.
"Only 1 virtualization benchmark, which is really a shame..."
Repeat that again in a few weeks :-). We have just succesfully concluded our testing on Nehalem.
Personally I am a bit shocked about the "not enough tests" :-). Because any professional knows how hard these OLTP/OLAP tests are to set up and how much time they take. But they might not appeal to the enthousiast, I am not sure.