CPU Office Tests

The office programs we use for benchmarking aren't specific programs per-se, but industry standard tests that hold weight with professionals. The goal of these tests is to use an array of software and techniques that a typical office user might encounter, such as video conferencing, document editing, architectural modeling, and so on and so forth.

All of our benchmark results can also be found in our benchmark engine, Bench.

Chromium Compile (v56)

Our new compilation test uses Windows 10 Pro, VS Community 2015.3 with the Win10 SDK to compile a nightly build of Chromium. We've fixed the test for a build in late March 2017, and we run a fresh full compile in our test. Compilation is the typical example given of a variable threaded workload - some of the compile and linking is linear, whereas other parts are multithreaded.

Office: Chromium Compile (v56)

One of the interesting data points in our test is the Compile, and it is surprising to see the 1920X only just beat the Ryzen 7 chips. Because this test requires a lot of cross-core communication, the fewer cores per CCX there are, the worse the result. This is why the 1950X in SMT-off mode beats the 3 cores-per-CCX 1920X, along with lower latency memory support. We know that this test is not too keen on victim caches either, but it does seem that the 2MB per core ratio does well for the 1950X, and could explain the performance difference moving from 8 to 12 to 16 cores under the Zen microarchitecture.

PCMark8: link

Despite originally coming out in 2008/2009, Futuremark has maintained PCMark8 to remain relevant in 2017. On the scale of complicated tasks, PCMark focuses more on the low-to-mid range of professional workloads, making it a good indicator for what people consider 'office' work. We run the benchmark from the commandline in 'conventional' mode, meaning C++ over OpenCL, to remove the graphics card from the equation and focus purely on the CPU. PCMark8 offers Home, Work and Creative workloads, with some software tests shared and others unique to each benchmark set.

Office: PCMark8 Home (non-OpenCL)

Office: PCMark8 Work (non-OpenCL)

Strangely, PCMark 8's Creative test seems to be failing across the board. We're trying to narrow down the issue.

SYSmark 2014 SE: link

SYSmark is developed by Bapco, a consortium of industry CPU companies. The goal of SYSmark is to take stripped down versions of popular software, such as Photoshop and Onenote, and measure how long it takes to process certain tasks within that software. The end result is a score for each of the three segments (Office, Media, Data) as well as an overall score. Here a reference system (Core i3-6100, 4GB DDR3, 256GB SSD, Integrated HD 530 graphics) is used to provide a baseline score of 1000 in each test.

A note on context for these numbers. AMD left Bapco in the last two years, due to differences of opinion on how the benchmarking suites were chosen and AMD believed the tests are angled towards Intel processors and had optimizations to show bigger differences than what AMD felt was present. The following benchmarks are provided as data, but the conflict of opinion between the two companies on the validity of the benchmark is provided as context for the following numbers.

Office: SYSMark 2014 SE (Overall)

Benchmarking Performance: CPU Encoding Tests Benchmarking Performance: CPU Legacy Tests
Comments Locked

347 Comments

View All Comments

  • launchcodemexico - Thursday, August 10, 2017 - link

    Why did you end all the gaming review sections with something like "Switching it to Game mode would have made better numbers..."? Why didn't you run the benchmarks in Gaming mode in the first place?
  • Ian Cutress - Thursday, August 10, 2017 - link

    Gaming mode is not default, and we run gaming mode alongside the default - there's two sets of values in each graming test.
  • DanNeely - Thursday, August 10, 2017 - link

    You might want to call that out more clearly in the text. I also missed that you have two sets of 1950X results; and probably wouldn't've figured out what the -G suffix meant without a hint.
  • Ian Cutress - Thursday, August 10, 2017 - link

    I mentioned it in the Game vs Creator mode page, but I'll propagate it through.
  • lordken - Thursday, August 10, 2017 - link

    read before you complain, it is stated at beginning of the review that -G is for game mode...
  • DanNeely - Thursday, August 10, 2017 - link

    Especially during the work day a lot of people just are doing quick glances at the most interesting parts. I'll end to end read it sometime tonight.
  • mapesdhs - Thursday, August 10, 2017 - link

    If people quick-glance, that's their problem for missing key info. :D When learning about something as new as this, I read everything. Otherwise, it's like the tech equivalent of crossing a road while gawping at a phone. :}

    Last time I read so much about a new CPU launch was Nehalem/X58.

    Ian.
  • smilingcrow - Thursday, August 10, 2017 - link

    It seemed really clear to me but for people who didn't read the long text on NUMA etc maybe not.
    The dangers of skimming!
  • mapesdhs - Friday, August 11, 2017 - link

    Indeed. :D Reminds me of when a long time ebay seller told me that long item decriptions are pointless, because most bidders only read the first paragraph, often only the first sentence.
  • Ian Cutress - Thursday, August 10, 2017 - link

    The test suite is a global glove: rather than have 20 tests for each segment, it's a global band of 80 tests for every situation. Johan does different tests as his office is several hundred miles away from where I am (and we're thousands of miles away from any other reviewer).

    For the gaming benchmarks, there are big differences in 99th percentile frame rates and Time Under analysis. As games become more and more GPU bottlenecked for average frame rates, this is where the differentiation point is. It's a reason why we still test 1080p as well. With regards the AI test, I've asked the Civ team repeatedly to make the AI test accessible from the command line so I can rope it into my testing scripts easily (they already do it with the main GPU test). But like many other game studios, getting them to unlock a flag is a frustrating endeavor when they don't even respond to messages.

Log in

Don't have an account? Sign up now