Video Transcoding

With our lightly threaded tests behind us, it's time to move to more CPU intensive pastures.

We migrated to the latest verison of the x264 HD benchmark which features a much newer version of x264 and a much heavier workload. The focus here is on quality rather than speed, thus the benchmark uses a 2-pass encode and reports the average frame rate in each pass.

Windows 8 - x264 HD 5.0.1 - 1st Pass

Windows 8 - x264 HD 5.0.1 - 2nd Pass

Here we see all of the 6-core parts rise to the top, including Intel's old Gulftown based Core i7-990X. Despite being a few years old at this point, the 990X's 6-core design and relatively high clock speed gives it better performance here than the quad-core Haswell 4770K.

The 4960X manages to be around 30% faster than the old 990X, and is 40% faster than the 4770K. For heavily threaded applications, there's simply no replacement for more cores.

Just like I did earlier, I dusted off one of our really old x264 tests so we'd have comparison data to even older CPUs including the Pentium 4 and Penryn based Extreme Edition parts:

x264 HD Encode Test - 1st pass - x264 0.59.819

x264 HD Encode Test - 2nd pass - x264 0.59.819

The 4960X manages to deliver nearly 3x the performance of Intel's flagship from 6 years ago. The Pentium EE 955 comparison is even more insane. IVB-E is basically an order of magnitude faster than the last high-end Pentium 4s to come out of Intel back in 2005.

3D Rendering

Our new POV-Ray benchmark uses the latest beta binary (3.7RC6) and runs through both single and multithreaded versions of the popular raytracing benchmark.

Windows 8 - POV-Ray 3.7RC6 - Single Threaded

Isolating a single core shows us exactly what we're missing by having Ivy Bridge at the heart of the 4960X instead of Haswell. Here the 4770K manages a 16% performance advantage over the 4960X, which costs 3x as much and draws substantially more power. Looking at AMD's FX-8350 however it's clear why Intel can get away with launching a high-end 6-core chip without its latest cores. Piledriver's single threaded performance falls somewhere between Nehalem and Sandy Bridge, giving Intel room to launch another Ivy Bridge based high-end SKU in 2013 and get away with it.

Windows 8 - POV-Ray 3.7RC6 - Multi Threaded

The multithreaded performance story is very different. Here even the Gulftown based 990X is faster than Haswell thanks to its six cores. The 4960X is 40% faster than the Haswell based 4770K. Even AMD's FX-8350 does really well here, basically equalling Haswell's performance.

Created by the Cinema 4D folks we have Cinebench, a popular 3D rendering benchmark that gives us both single and multi-threaded 3D rendering results.

Cinebench 11.5 - Single Threaded

In Cinebench, the single threaded Haswell/IVB-E gap narrows to 5%.

Cinebench 11.5 - Multi-Threaded

Multithreaded performance continues to be just stellar. Here the 4960X is just under 50% faster than the 4770K. Note the relatively small gap between the 4960X and the SNB-E based 3970X however - the performance gain is only 5%. The bulk of Ivy Bridge's advancements were in GPU performance (not applicable to IVB-E) and power consumption (which we'll get to shortly).

Our final two Cinebench tests use the R10 benchmark to enable a comparison to more/older data points:

Cinebench R10 - Single Threaded Benchmark

Cinebench R10 - Multi-Threaded Benchmark

Memory & General Purpose Performance Visual Studio, Photoshop, File Compression & Excel Math Perf
Comments Locked

120 Comments

View All Comments

  • ShieTar - Tuesday, September 3, 2013 - link

    Whats the point? A 10-core only runs at 2GHz, and a 8-core only runs at 3 GHz, so both have less overall performance than a 6-core overclocked to more than 4GHz. You simply cannot put more computing power into a reasonable power envelope for a single socket. If a water-cooled Enthusiast 6-core is not enough for your needs, you automatically need a 2-socket system.

    And its not like that is not feasible for enthusiasts. The ASUS Z9PE-D8 WS, the EVGA Classified SR-X and the Supermicro X9DAE are mainboard aiming at the enthusiast / workstation market, combining two sockets for XEON-26xx with the capability to run GPUs in SLI/CrossFire. And if you are looking to spend significantly more than 1k$ for a CPU, the 400$ on those boards and the extra cost for ECC Memory should not scare you either.

    Just go and check Anandtech own benchmarking: http://www.anandtech.com/show/6808/westmereep-to-s... . It's clear that you need two 8-cores to be faster then the enthusiast 6-cores even before overclocking is taken into account.

    Maybe with Haswell-E we can get 8 cores with >3.5GHz into <130W, but with Ivy Bridge, there is simply no point.
  • f0d - Tuesday, September 3, 2013 - link

    who cares if the power envelope is "reasonable"?
    i already have my SBE overclocked to 5.125Ghz and if they release a 10core i would oc that thing like a mutha******

    that link you posted is EXACTLY why i want a 10/12 core instead of dual socket (which i could afford if it made sense performance wise) - its obvious that video encoding doesnt work well with NUMA and dual sockets but it does work well with multi cored single cpu's

    so i say give me a 10 core and let me OC it like crazy - i dont care if it ends up using 350W+ i have some pretty insane watercooling to suck it up (3k ultra kaze's in push/pull on a rx480rad 24v laingd5s raystorm wb - a little over the top but isnt that what these extreme cpu's are for?)
  • 1Angelreloaded - Tuesday, September 3, 2013 - link

    I have to agree with you in the extreme market who gives a damn about being green, most will run 1200watt Plat mod PSUs with an added extra 450 watt in the background, and 4GPUs as this is pretty much the only reason to buy into 2011 socket in the first place 2 extra cors and 40x PCIe lanes.
  • crouton - Tuesday, September 3, 2013 - link

    I could not agree with you more! I have a OC'd i920 that just keeps chugging along and if I'm going to drop some coin on an upgrade, I want it to be an UPGRADE. Let ME decide what's reasonable for power consumption. If I burn up a 8/10 core CPU with some crazy cooling solution then it's MY fault. I accept this. This is the hobby that I've chosen and it comes with risks. This is not some elementary school "color by numbers" hobby where you can follow a simple set of instructions to get the desired result in 10 minutes. This is for the big boys. It takes weeks or more to get it right and even then, we know we can do better. Not interested in XEON either.
  • Assimilator87 - Tuesday, September 3, 2013 - link

    The 12 core models run at 2.7Ghz, which will be slightly faster than six cores at 5.125Ghz. You could also bump up the bclk to 105, which would put the CPU at 2.835Ghz.
  • Casper42 - Tuesday, September 3, 2013 - link

    2690 v2 will be 10c @ 3.0 and 130W. Effectively 30Ghz.
    2697 v2 will be 12c @ 2.7 and 130W. Effectively 32.4Ghz

    Assuming a 6 Core OC'd to 5Ghz Stable, 6c @ 5.0 and 150W? (More Power due to OC)
    effectively 30Ghz.

    So tell me again how a highly OC'd and large unavailable to the masses 6c is better than a 10/12c when you need Multiple Threads?
    Keep in mind those 10 and 12 core Server CPUs are almost entirely AIR cooled and not overclocked.

    I think they should have released an 8 and 10 core Enthusiast CPU. Hike up the price and let the market decide which one they want.
  • MrSpadge - Tuesday, September 3, 2013 - link

    6c @ 5.0 will eat more like 200+ W instead of 130/150.
  • ShieTar - Wednesday, September 4, 2013 - link

    For Sandy Bridge, we had:
    2687, 8c @ 3.1 GHz => 24.8 GHz effectively
    3970X, 6c @ 3.5 GHz => 21 GHz before overclocking, only 4.2 GHz required to exceed the Xeon.

    Fair enough, for Ivy Bridge Xeons, the 10core at 3 GHz has been announced. I'll believe that claim when I see some actual benchmarks on it. I have some serious doubts that a 10core at 3 GHz can actually use less power than an 8 core at 3.4 GHz. So lets see on what frequency those parts will actually run, under load.

    Furthermore, the effective GHz are not the whole truth, even on highly parallel tasks. While cache seems to scale with the number of cores for most Xeons, memory bandwidth does not, and there are always overheads due to the common use of the L3 cache and the memory.

    Finally, not directly towards you but to several people talking about "green": Entirely not the point. No matter how much power your cooling system can remove, you are always creating thermal gradients when generating too much heat on a very small space. Why do you guys think there was no 3.5GHz 8 core for Sandy Bridge-EP? The silicon is the same for 6-core and 8-core, the core itself could run the speed. But INTEL is not going to verify the continued operation of a chip with a TDP >150W.

    They give a little leeway when it comes to the K-class, because there the risk is with customer to a certain point. But they just won't go and sell a CPU which reliably destroys itself or the MB the very moment somebody tries to overclock it.
  • psyq321 - Thursday, September 5, 2013 - link

    I am getting 34.86 @Cinebench with dual Xeon 2697 v2 running @3 GHz (max all-core turbo).

    Good luck reaching that with superclocked 4930/4960X ;-)
  • piroroadkill - Tuesday, September 3, 2013 - link

    All I really learn from these high end CPU results is that if you actually invested in high end 1366 in the form of 980x all that time ago, you've got probably the longest lasting system in terms of good performance that I can even think of.

Log in

Don't have an account? Sign up now