CPU Performance: Office and Web

Our previous set of ‘office’ benchmarks have often been a mix of science and synthetics, so this time we wanted to keep our office section purely on real world performance.

Agisoft Photoscan 1.3.3: link

Photoscan stays in our benchmark suite from the previous benchmark scripts, but is updated to the 1.3.3 Pro version. As this benchmark has evolved, features such as Speed Shift or XFR on the latest processors come into play as it has many segments in a variable threaded workload.

The concept of Photoscan is about translating many 2D images into a 3D model - so the more detailed the images, and the more you have, the better the final 3D model in both spatial accuracy and texturing accuracy. The algorithm has four stages, with some parts of the stages being single-threaded and others multi-threaded, along with some cache/memory dependency in there as well. For some of the more variable threaded workload, features such as Speed Shift and XFR will be able to take advantage of CPU stalls or downtime, giving sizeable speedups on newer microarchitectures.

For the update to version 1.3.3, the Agisoft software now supports command line operation. Agisoft provided us with a set of new images for this version of the test, and a python script to run it. We’ve modified the script slightly by changing some quality settings for the sake of the benchmark suite length, as well as adjusting how the final timing data is recorded. The python script dumps the results file in the format of our choosing. For our test we obtain the time for each stage of the benchmark, as well as the overall time.

The final result is a table that looks like this:

(1-1) Agisoft Photoscan 1.3, Complex Test

As explained in the power tests, the 4800U with double the cores wins out here, and due to the vector pressure also wins on power efficiency. There’s still a sizeable uplift from Ice Lake to Tiger Lake at 15 W, although 28 W is needed to get something sizeable.

Mozilla Kraken 1.1

Kraken is a 2010 benchmark from Mozilla and does a series of JavaScript tests. These tests are a little more involved than previous tests, looking at artificial intelligence, audio manipulation, image manipulation, json parsing, and cryptographic functions. The benchmark starts with an initial download of data for the audio and imaging, and then runs through 10 times giving a timed result.

Automation involves loading the direct webpage where the test is run and putting it through. All CPUs finish the test in under a couple of minutes, so we put that as the end point and copy the page contents into the clipboard before parsing the result. Each run of the test on most CPUs takes from half-a-second to a few seconds.

(7-1) Kraken 1.1 Web Test

Both the Tiger Lake results are very fast, not showing much difference between the power modes. Intel pushes ahead of AMD here, and ultimately a sizable jump over Ice Lake.

Google Octane 2.0

Our second test is also JavaScript based, but uses a lot more variation of newer JS techniques, such as object-oriented programming, kernel simulation, object creation/destruction, garbage collection, array manipulations, compiler latency and code execution.

Octane was developed after the discontinuation of other tests, with the goal of being more web-like than previous tests. It has been a popular benchmark, making it an obvious target for optimizations in the JavaScript engines. Ultimately it was retired in early 2017 due to this, although it is still widely used as a tool to determine general CPU performance in a number of web tasks.

Octane’s automation is a little different than the others: there is no direct website to go to in order to run the benchmark. The benchmark page is opened, but the user has to navigate to the ‘start’ button or open the console and initiate the JavaScript required to run the test. The test also does not show an obvious end-point, but luckily does try and aim for a fixed time for each processor. This is similar to some of our other tests, that loop around a fixed time before ending. Unfortunately this doesn’t work if the first loop goes beyond that fixed time, as the loop still has to finish. For Octane, we have set it to 75 seconds per run, and we loop the whole test four times.

(7-2) Google Octane 2.0 Web Test

The Tiger Lake system reaches new records in Optane. If there’s anything this system is fast at, it is web workloads.

Speedometer 2: JavaScript Frameworks

Our newest web test is Speedometer 2, which is a test over a series of JavaScript frameworks to do three simple things: built a list, enable each item in the list, and remove the list. All the frameworks implement the same visual cues, but obviously apply them from different coding angles.

Our test goes through the list of frameworks, and produces a final score indicative of ‘rpm’, one of the benchmarks internal metrics. Rather than use the main interface, we go to the admin interface through the about page and manage the results there. It involves saving the webpage when the test is complete and parsing the final result.

We repeat over the benchmark for a dozen loops, taking the average of the last five.

(7-3) Speedometer 2.0 Web Test

Again, another good win for Tiger Lake.

CPU MT Performance: SPEC 2006, SPEC 2017 CPU Performance: Simulation and Science
Comments Locked

253 Comments

View All Comments

  • MadManMark - Thursday, September 17, 2020 - link

    I don't quite get the "sardine oil basting AMD" analogy?

    Is that some English thing, can you Britsplain the meaning for this ignorant Yank?
  • MamiyaOtaru - Thursday, September 17, 2020 - link

    ironically it's actually a super American pop culture reference: like some other things in the article it's a reference to something from Tiger King
  • huangcjz - Thursday, September 17, 2020 - link

    Ars Technica has disclosed that the system was made by MSI, and that it kinda resembles what will eventually become their Prestige 14 Evo system.
  • Oxford Guy - Thursday, September 17, 2020 - link

    "Kinda resembles" doesn't mean it might not have super special cooling to show the chip in an artificially good light.

    If that level of cooling isn't going to be in the market then the results are marketing distortion.
  • Oxford Guy - Thursday, September 17, 2020 - link

    Before anyone says it's can't be special ask yourself why there are special conditions, like photographing the inside, etc. etc. The review said the cooling is overbuilt and not something for the marketplace.

    Remember Intel's overbuilt fridge that it used to sucker people?
  • Spunjji - Friday, September 18, 2020 - link

    Better cooling won't change the fact that it has a 15W power limit imposed for those particular tests, which Ian confirmed through testing - it just means that temperature won't be the limit. I really, really don't think this is a distorting factor for this comparison.

    If the OEMs put it into laptops that can't actually cool 15W, that's kind of on them. I suspect it'll happen (thought not as much as it happens with AMD designs, natch).
  • Oxford Guy - Sunday, September 20, 2020 - link

    I really think you're likely wrong.

    Lower temperature means more work for those watts.
  • Oxford Guy - Sunday, September 20, 2020 - link

    We are also familiar with Intel's "TDP" versus how much the machines actually draw.

    I don't trust any numerical claims from Intel. Verify then trust.
  • Spunjji - Sunday, September 20, 2020 - link

    They verified the numbers...

    Eh. I give up.
  • asfletch - Friday, September 18, 2020 - link

    Yeah Dave Lee recognised it as Prestige 14 shell too...wonder why the cloak+dagger...

Log in

Don't have an account? Sign up now