Measuring Real-World Power Consumption

The Equal Workload (EWL) version of vApus FOS is very similar to our previous vApus Mark II "Real-world Power" test. To create a real-world “equal workload” scenario, we throttle the number of users in each VM to a point where you typically get somewhere between 20% and 80% CPU load on a modern dual CPU server. The amount of requests is the same for each system, hence "equal workload". The CPU load is typically around 30-50%, with peaks up to 65% (for more info see here). At the end of the test, we get to a low 10%, which is ideal for the machine to boost to higher CPU clocks (Turbo) and race to idle.

We used the "Balanced" power policy and enabled C-states as the current ESXi settings make poor use of the C6 capabilities of the latest Opterons and Xeons.

First let's check out the response times.

vApus FOS Response times (ms)
CPU PhpBB1 PHPBB2 MySQL OLAP Zimbra
AMD Opteron 6276 101 30 3.8 41
AMD Opteron 6174 118 41 3.8 45
Intel Xeon X5650 45 18 2.4 29
Intel Xeon E5-2660 41 18 2.5 25
Intel Xeon E5-2690 27 14 2.3 23

It's worth noting that enabling the C-states in ESXi improves the performance/watt ratio of the Opteron 6276 quite a bit. Not only is the power consumption lower (see below), but enabling C6 allows higher turbo clocks, which in turn benefits response times. Compared to our previous test (standard out of the box "Balanced") all response times improve by 10% except for MySQL (which is already very low).

Even with that improvement however it is not enough to beat the Xeon E5. The Xeon E5 delivers extremely low response times....

vApus FOS EWL Power consumption

... while sipping very little power, despite being run inside a feature rich server. Kudos to Intel for a job very well done.

Virtualization Performance: Linux VMs on ESXi SQL Server 2008 R2 "OLAP" Workload
Comments Locked

81 Comments

View All Comments

  • think-ITB-live-OTB - Tuesday, March 6, 2012 - link

    Can i ask you a question? do you at least get paid when you bend over for Intel?

    These are Server Chips - who cares about single-threaded application performance.. or Corporate IPOs. AMD has delivered far greater TCO/performance than Intel has for at least a Decade and running.

    You want to praise a company like a Deity? ARM Holdings. nuff said. They can design a 35 dollar computer that can decode H.264 better than Intel can on SoCs that run 4x's the price. Currently have more Chips in more devices than in Intels entire history and Push Power envelopes far beyond anything Intel could ever muster.

    Just you wait before the Storm ARM and its Licensees unleash as it will eventually take over ALL markets including the Server space (Calxeda much?). Oh and as for Apple. (an ARM Licensee itself... i can see them moving to in-house ARM designs pretty soon). 4-6-8 Core Cortex A15 (with A7 core for low power iPod/tablet sync) Macbook Airs anyone?

    Intel is becoming the strongest of the Dinosaurs. But even the T-Rex fell eventually.
  • swizeus - Wednesday, March 7, 2012 - link

    We have been using the Flemish/Dutch Web 2.0 website Nieuws.be as a benchmark for some time. 99% of the loads on the database are selects and about 5% of them are stored procedures.

    The database is loaded 104%. is it possible ?
  • JohanAnandtech - Wednesday, March 7, 2012 - link

    Stored procedures can contain selects :-)
  • fredisdead - Saturday, April 7, 2012 - link

    From the 'article' .....

    'The Opteron might also have a role in the low end, price sensitive HPC market, where it still performs very well. It won't have much of chance in the high end clustered one as Intel has the faster and more power efficient PCIe interface'

    Well, if that's the case, why exactly would AMD be scoring so many design wins with Interlagos. Including this one ...

    http://www.pcmag.com/article2/0,2817,2394515,00.as...

    http://www.eweek.com/c/a/IT-Infrastructure/Cray-Ti...

    U think those guys at Cray were going for low performance ? In fact, seems like AMD has being rather cleaning up in the HPC market since the arrival of Interlagos. And the markets have picked up on it, AMD stock is thru the roof since the start of the year. Or just see how many Intel processors occupy the the top 10 supercomputers on the planet. Nuff said ...
  • InsaneScientist - Wednesday, March 7, 2012 - link

    Johan, where in the specs where you have this line:
    Transistors (Billion) 2,26 2x 1,2 2x 904 1,17

    I sure hope that 2x 904 (Billion) is a typo... otherwise AMD has some serious explaining to do. ;)

    Should be 2x ,904 (I think? Would be 2x .904 for me, I assume you follow the same rules...)
  • iliev - Wednesday, March 7, 2012 - link

    Page 5, Benchmark Configuration

    R2208GZ4GSSPP specs table... E5-2660 is 2.2Ghz, and not 2.9GHz
  • dodge776 - Wednesday, March 7, 2012 - link

    Hi Johan,
    Always look forward to reading your server reviews at AT, but no SAPS benchmarks this time?
  • ppennisi - Wednesday, March 7, 2012 - link

    For maximum VMware performance on Opteron Interlagos cpu under VMWARE it's better to disable C1E and enable, where available, HPC mode.

    I found myself on a fresh installation of ESXi 5.0 on Dell R715 that leaving C1E enable literally crippled vm performance.
  • boudini - Thursday, March 8, 2012 - link

    I'm not sure I would recommend using iray as a reliable benchmark renderer in 3ds max. It is not a self configuring mental ray, but an unbiased renderer which behaves fairly differently to mental ray, and most other renderers such as vray, final render and brazil. It is comparible to maxwell and fryrender, but is very new compared to those two longer established unbiased render engines. It also attempts to use the gpu to add to its calculations as well - which could significantly skew results.

    Using mental ray or vray might well give you quite a different result, and besides I don't think iray is widely used in the industry.
  • omega4711 - Friday, March 9, 2012 - link

    This. The results of iray are mostly dependent on the GPU. The lack of proper scaling certainly isn't due to Amdahl's law. Just use mentalray with small enough render buckets and you can easily satisfy 64+ threads.

    Also, due to the limitations of iray, it can (at this moment) only be used in about 1-3% of real world scenarios.

    Please, for all the people that care about these benchmarks, use mentalray and/or vray.

    Otherwise, it's a brilliant article.

Log in

Don't have an account? Sign up now