Video Transcoding

With our lightly threaded tests behind us, it's time to move to more CPU intensive pastures.

We migrated to the latest verison of the x264 HD benchmark which features a much newer version of x264 and a much heavier workload. The focus here is on quality rather than speed, thus the benchmark uses a 2-pass encode and reports the average frame rate in each pass.

Windows 8 - x264 HD 5.0.1 - 1st Pass

Windows 8 - x264 HD 5.0.1 - 2nd Pass

Here we see all of the 6-core parts rise to the top, including Intel's old Gulftown based Core i7-990X. Despite being a few years old at this point, the 990X's 6-core design and relatively high clock speed gives it better performance here than the quad-core Haswell 4770K.

The 4960X manages to be around 30% faster than the old 990X, and is 40% faster than the 4770K. For heavily threaded applications, there's simply no replacement for more cores.

Just like I did earlier, I dusted off one of our really old x264 tests so we'd have comparison data to even older CPUs including the Pentium 4 and Penryn based Extreme Edition parts:

x264 HD Encode Test - 1st pass - x264 0.59.819

x264 HD Encode Test - 2nd pass - x264 0.59.819

The 4960X manages to deliver nearly 3x the performance of Intel's flagship from 6 years ago. The Pentium EE 955 comparison is even more insane. IVB-E is basically an order of magnitude faster than the last high-end Pentium 4s to come out of Intel back in 2005.

3D Rendering

Our new POV-Ray benchmark uses the latest beta binary (3.7RC6) and runs through both single and multithreaded versions of the popular raytracing benchmark.

Windows 8 - POV-Ray 3.7RC6 - Single Threaded

Isolating a single core shows us exactly what we're missing by having Ivy Bridge at the heart of the 4960X instead of Haswell. Here the 4770K manages a 16% performance advantage over the 4960X, which costs 3x as much and draws substantially more power. Looking at AMD's FX-8350 however it's clear why Intel can get away with launching a high-end 6-core chip without its latest cores. Piledriver's single threaded performance falls somewhere between Nehalem and Sandy Bridge, giving Intel room to launch another Ivy Bridge based high-end SKU in 2013 and get away with it.

Windows 8 - POV-Ray 3.7RC6 - Multi Threaded

The multithreaded performance story is very different. Here even the Gulftown based 990X is faster than Haswell thanks to its six cores. The 4960X is 40% faster than the Haswell based 4770K. Even AMD's FX-8350 does really well here, basically equalling Haswell's performance.

Created by the Cinema 4D folks we have Cinebench, a popular 3D rendering benchmark that gives us both single and multi-threaded 3D rendering results.

Cinebench 11.5 - Single Threaded

In Cinebench, the single threaded Haswell/IVB-E gap narrows to 5%.

Cinebench 11.5 - Multi-Threaded

Multithreaded performance continues to be just stellar. Here the 4960X is just under 50% faster than the 4770K. Note the relatively small gap between the 4960X and the SNB-E based 3970X however - the performance gain is only 5%. The bulk of Ivy Bridge's advancements were in GPU performance (not applicable to IVB-E) and power consumption (which we'll get to shortly).

Our final two Cinebench tests use the R10 benchmark to enable a comparison to more/older data points:

Cinebench R10 - Single Threaded Benchmark

Cinebench R10 - Multi-Threaded Benchmark

Memory & General Purpose Performance Visual Studio, Photoshop, File Compression & Excel Math Perf
Comments Locked

120 Comments

View All Comments

  • Rick83 - Tuesday, September 3, 2013 - link

    If you really want that number of cores, Ivy Bridge E5/E7 Xeons are going to deliver that, in the 150W power envelope. This is useful in the server market, but will only sell in homeopathic quantities in the desktop market. Still, you should be able to find them in retail around Christmas. Knock yourself out!

    Really IB-E is a free product for Intel, which is the only reason it made it to market at all. They need the 6-core dies for the medium density servers anyway, which is where they actually make sense over SB-Xeons, due to the smaller power envelope/higher efficiency. The investment to turn that core into a consumer product on an existing platform is almost zero, short of a small marketing budget, and possibly a tiny bit of (re-)validation.

    This was never a product designed for the enthusiast market, and is being shoe-horned into that position. Due to the smaller die Intel can probably make better margin over SB-E, which is the only reason to introduce this product in the sector anyway, and possibly to get some brand awareness going with the launch of a new flagship.

    From an economical point of view it makes no sense for Intel to have an actual enthusiast platform. Haswell refresh will be unlikely to bring more cores either (and without the extra I/O they would be a bit hobbled, I imagine), so possibly with Skylake there will be a 6-core upper mainstream solution. Still unlikely from an economical point of view, as Intel would probably prefer sticking to two dies, and going 6/4 may not be economical, whereas selling 6-core CPUs as quads (as they do with 48xx) doesn't work that well in the part of the market that generates reasonable volume.
  • f0d - Tuesday, September 3, 2013 - link

    the problem with xeon is that you cant overclock them so my 5ghz SBE would be close to as good as a 8/10 core xeon

    i dont really care about why intel are not releasing high core count cpu's i just know i want them at a decent price ($1k and under) and overclockable - these 6 core ones just dont make the cut anymore

    i just hate the direction cpu's are going with low power low core count highly integrated everythiing, 5 years ago i was dreaming of 8 core cpu's being standard about now but we still have 4 (6 with sbe) core cpu's as standard which blows and per core performance hasnt really changed much going from sandy bridge to haswell

    i dont care about power and heat just give me the performance i want to encode highest quality handbrake movies in less than 24 hours.!!
  • ShieTar - Tuesday, September 3, 2013 - link

    "All" you want is Intel to invest a massive development effort in order to produce for the first time an overclocking CPU with a TDP of around 200W, with silicone for which their business customers would pay 2k$ to 3k$, and sell it to you and the other 500 people in your niche for less than 1k$?

    Intel already offer you a solution if you need more processing power than the enthusiast solution gives you: 2 socket workstation boards, 4 socket server boards, 60-core co-processor cards.
  • f0d - Tuesday, September 3, 2013 - link

    2 socket is inefficient for my workloads
    they could just release a xeon that is unlocked and let me do what i want with it - its not like the workstation/server guys would overclock so its not like intel would be losing any money
    no development needed
    2-3k? i can already buy 8 core SBE for 1k - why not let me oc that?
  • wallysb01 - Tuesday, September 3, 2013 - link

    Many would overclock when Intel is charging hundreds of dollars for just small GHz bumps. You won't seem the academic or large corporation clusters doing it, but the small businesses with just a handful of workstations? They might.

    Look at the 2660 v2 2.2GHz at $1590 and the 2680 v2 2.8GHz at $1943. That's $353 for 600MHz. On a dual processor system its $700, then you have to pay the markups from those actually selling the computers (ie Dell/HP), which takes $700 to $1000 or more. One small little tweak and you're saving yourself $1000, while not stressing the system all that much (assuming you don't go crazy and try to get 3.5GHz from that 2.2GHz base chip).
  • mapesdhs - Wednesday, September 4, 2013 - link


    The catch though is that the mbds used for these systems don't have
    BIOS setups which support oc'ing, and the people who use them aren't
    generally experienced in such things. I know someone at a larger movie
    company who said it'd be neat to be able to experiment with this,
    especially an unlocked XEON, but in reality the pressures of time, the
    scale of the budgets involved, the large number of systems used for
    renderfarms, the OS management required, etc., all these issues mean
    it's easier to just buy off the shelf units and scale as required (the
    renderfarm at the company I'm thinking of has more than 7000 cores
    total, mostly based on Dell blade servers) and management isn't that
    interested in doing anything different or innovative/risky. It's easy
    to think a smaller company might be more likely to try such a thing,
    but in reality for a smaller company it would be a much larger
    financial risk to do so. Bigger companies could afford to try it, but
    aren't geared up for such ideas.

    Btw, oc'ing a XEON is viable with single-socket mbds that happen to
    support them and have chipsets which don't rely on the CPU multiplier
    for oc'ing, eg. an X5570 on an Asrock X58 Extreme6 works ok (I have
    one); the chip advantage is a higher TDP and 50% faster QPI compared to
    a clock-comparable i7 950.

    Sadly, other companies often don't bother supporting XEONs anyway;
    Gigabyte does on some of its boards (X58A-UD3R is a good example) but
    ASUS tends not to.

    Some have posted about core efficiency and they're correct; I have a
    Dell T7500 with two X5570s, but my oc'd 3930K beats it for highly
    threaded tasks such as CB 11.5, and it's about 2X faster for single-
    threaded ops. The 3930K's faster RAM probably helps aswell (64GB
    DDR3/2400, vs. only DDR3/1333 in the Dell which one can't change).

    Someone commented about Intel releasing an unlocked XEON. Of course
    they could, but they won't because they don't need to, and biz users
    wouldn't really care, it's not what they want, and note that power
    efficiency is very important for big server setups, something of which
    oc'ing can of course make utterly ruin. :D Someone said who cares about
    power guzzling when it comes to enthusiast builds, and that's true, but
    when it comes to XEONs the main target market does care, so again Intel
    has no incentive to bother releasing an unlocked XEON.

    I agree with the poster who said 40 PCIe lanes isn't ridiculous. We had
    such provision with X58, so if anything for a top-end platform only 40
    lanes isn't that impressive IMO. Far worse is the continued limit of
    just 2 SATA3 ports; that really is a pain, because the 3rd party
    controllers are generally awful. The Asrock X79 Extreme11 solved this
    to some extent by offering onboard SAS, but they kinda crippled it by
    not having any cache RAM as part of the built-in SAS chip.

    Ian.
  • wallysb01 - Wednesday, September 4, 2013 - link

    "It's easy to think a smaller company might be more likely to try such a thing, but in reality for a smaller company it would be a much larger financial risk to do so. Bigger companies could afford to try it, but aren't geared up for such ideas.

    Btw, oc'ing a XEON is viable with single-socket mbds that happen to support them and have chipsets which don't rely on the CPU multiplier for oc'ing, eg. an X5570 on an Asrock X58 xtreme6 works ok (I have one); the chip advantage is a higher TDP and 50% faster QPI compared to a clock-comparable i7 950."

    These two statements work against eachother. If OC a SP xeon is relatively easy, only if supported, there isn't much reason a DP xeon set up couldn't be OCed within reason without much effort.

    I'm not going to say this would be a common thing, but the small shops run by someone with a "tinkerer" mind set towards computing would certainly be interested in attempting to get that extra 10-20% performance, which Intel would change another $1000 or more for, but get it for free.
  • psyq321 - Thursday, September 5, 2013 - link

    Z9PE-D8 WS has decent overclocking options (not like their consumer X79 boards, but not bad either).

    However, apart from a small BCLK bump, this is useless as SNB-EP and IVB-EP Xeons are locked.

    The best I can do with dual Xeon 2697 v2 is ~3150 MHz (I might be able to go a bit further but I did not bother) for all-core turbo.

    Even if Intel ignores the business reasons NOT to allow Xeon overclocking (to force high-performance-trading people to buy more expensive Xeons as they showed willingness to overclock and, so, potentially cannibalize market for more expensive EX parts) technically this would be a huge challenge.

    Why? Well, 12-core Xeon 2697 power-usage would literally explode if you allow running this on 4+ GHz and with voltages normally seen in overclocking world. I am sure the power draw of the single part would be more than 300W, so 600W for a dual-socket board.

    This is not unheard of (after all, high-end GPUs can draw comparable power) - however, this would mandate significantly higher specs for the motherboard components and put people in actual danger of fires by using inadequate components.

    Maybe when Intel moves to Haswell E/EP - when the voltage regulation becomes CPUs's business, maybe they can find a way to allow overclocking of such huge CPUs after passing lots of checks. Otherwise, Intel runs huge risk of being sued for causing fires.
  • mapesdhs - Sunday, August 13, 2017 - link

    Four years later, who could have imagined we'd end up with Threadripper, and the mess Intel is now in? Funny old world. :D
  • stephenbrooks - Monday, September 23, 2013 - link

    --[i just hate the direction cpu's are going with low power low core count highly integrated everythiing, 5 years ago i was dreaming of 8 core cpu's being standard about now but we still have 4]--

    So I got an AMD FX8350, that's 8 cores and 4GHz before turbo. Quite a bit cheaper than Intel's too.

    OK, obviously AMD gets less operations per clock and the 8 cores only have 4 "real" FPUs between them but I wanted 8 cores to test scaling of computer programs on without breaking the bank.

Log in

Don't have an account? Sign up now