** = Old results marked were performed with the original BIOS & boost behaviour as published on 7/7.

Benchmarking Performance: Web Tests

While more the focus of low-end and small form factor systems, web-based benchmarks are notoriously difficult to standardize. Modern web browsers are frequently updated, with no recourse to disable those updates, and as such there is difficulty in keeping a common platform. The fast paced nature of browser development means that version numbers (and performance) can change from week to week. Despite this, web tests are often a good measure of user experience: a lot of what most office work is today revolves around web applications, particularly email and office apps, but also interfaces and development environments. Our web tests include some of the industry standard tests, as well as a few popular but older tests.

We have also included our legacy benchmarks in this section, representing a stack of older code for popular benchmarks.

All of our benchmark results can also be found in our benchmark engine, Bench.

WebXPRT 3: Modern Real-World Web Tasks, including AI

The company behind the XPRT test suites, Principled Technologies, has recently released the latest web-test, and rather than attach a year to the name have just called it ‘3’. This latest test (as we started the suite) has built upon and developed the ethos of previous tests: user interaction, office compute, graph generation, list sorting, HTML5, image manipulation, and even goes as far as some AI testing.

For our benchmark, we run the standard test which goes through the benchmark list seven times and provides a final result. We run this standard test four times, and take an average.

Users can access the WebXPRT test at http://principledtechnologies.com/benchmarkxprt/webxprt/

WebXPRT 3 (2018)

WebXPRT 2015: HTML5 and Javascript Web UX Testing

The older version of WebXPRT is the 2015 edition, which focuses on a slightly different set of web technologies and frameworks that are in use today. This is still a relevant test, especially for users interacting with not-the-latest web applications in the market, of which there are a lot. Web framework development is often very quick but with high turnover, meaning that frameworks are quickly developed, built-upon, used, and then developers move on to the next, and adjusting an application to a new framework is a difficult arduous task, especially with rapid development cycles. This leaves a lot of applications as ‘fixed-in-time’, and relevant to user experience for many years.

Similar to WebXPRT3, the main benchmark is a sectional run repeated seven times, with a final score. We repeat the whole thing four times, and average those final scores.

WebXPRT15

Speedometer 2: JavaScript Frameworks

Our newest web test is Speedometer 2, which is a accrued test over a series of javascript frameworks to do three simple things: built a list, enable each item in the list, and remove the list. All the frameworks implement the same visual cues, but obviously apply them from different coding angles.

Our test goes through the list of frameworks, and produces a final score indicative of ‘rpm’, one of the benchmarks internal metrics. We report this final score.

Speedometer 2

Google Octane 2.0: Core Web Compute

A popular web test for several years, but now no longer being updated, is Octane, developed by Google. Version 2.0 of the test performs the best part of two-dozen compute related tasks, such as regular expressions, cryptography, ray tracing, emulation, and Navier-Stokes physics calculations.

The test gives each sub-test a score and produces a geometric mean of the set as a final result. We run the full benchmark four times, and average the final results.

Google Octane 2.0

Mozilla Kraken 1.1: Core Web Compute

Even older than Octane is Kraken, this time developed by Mozilla. This is an older test that does similar computational mechanics, such as audio processing or image filtering. Kraken seems to produce a highly variable result depending on the browser version, as it is a test that is keenly optimized for.

The main benchmark runs through each of the sub-tests ten times and produces an average time to completion for each loop, given in milliseconds. We run the full benchmark four times and take an average of the time taken.

Mozilla Kraken 1.1

Web Tests Analysis

Overall, in the web tests, the new Ryzen 3900X and 3700X perform very well with both chips showcasing quite large improvements over the 2700X.

We’re seeing quite an interesting match-up against Intel’s 9700K here, which is leading all of the benchmarks. The reason for this is that SKU has SMT turned off. The singe-threaded performance advantage of this is that the CPU core no longer has to share the µOP cache structure between to different threads, and has the whole capacity dedicated to one thread. Web workloads in particular are amongst the most instruction pressure heavy workloads out there, and they benefit extremely from turning SMT off on modern cores.

Whilst we didn’t have the time yet to test the new 3900X and 3700X with SMT off, AMD’s core and op cache works the same in that it’s sharing the capacity amongst two threads, statically partitioning it. I’m pretty sure we’d see larger increases in the web benchmarks when turning off SMT as well, and we’ll be sure to revisit this particular point in the future.

SPEC2006 & 2017: Industry Standard - ST Performance & IPC Benchmarking Performance: CPU System Tests
Comments Locked

447 Comments

View All Comments

  • Korguz - Monday, July 8, 2019 - link

    huh ???
  • mkozakewich - Saturday, July 13, 2019 - link

    "...as unfortunately the timing didn’t work out."

    You should increase his voltage a little and reboot, that might help.
  • Meteor2 - Monday, July 15, 2019 - link

    It's hard to get one's head around this, but basically: *all* the Intel benchmarks *do not* include the security patches for the MDS-class flaws. The 9000 and 8000 series tests do include the OS-side Spectre fixes, but that's it. No OS-fixes for other CPUs, and no motherboard firmware fixes for any Intel CPUs

    At the very least, all the Intel CPUs should be retested on Windows 10 1903 which has the OS-side MDS fixes.

    Most if not all the motherboards used for the Intel reviews can also have their firmware upgraded to fix Spectre and most times MDS flaws. Do it.

    This is sensible and reasonable to do: no sensible and reasonable user would leave their OS vulnerable. Maybe the motherboard, because it's a bit scary to do, but as that can be patched, it should be by reviewers.

    This would result in all the Intel scores being lower. We don't know by how much without this process actually being done. But until it is, the Intel results, and thus the review itself, are invalid.

    While you're at it Anandtech, each year buy the latest $999 GPU for CPU testing. Consider it a cost of doing business. Letting the GPU bottleneck the CPU on most game resolutions benchmarked is pointless.
  • plonk420 - Sunday, July 7, 2019 - link

    mountain time zone, best time zone... 7am 7/7!
  • exactopposite - Sunday, July 7, 2019 - link

    Been waiting on this one for a long time
  • Eris_Floralia - Sunday, July 7, 2019 - link

    It's really nice to see Andrei starting to take part in desktop processor reviews and Gavin Bonshor's hardwork!
  • mjz_5 - Sunday, July 7, 2019 - link

    Intel gets 5% better FPS In games is really not a win. I’ll consider that a tie. In multiple applications AMD gets 20% more performance. That’s a win!!
  • Dragonstongue - Sunday, July 7, 2019 - link

    hopefully Anal lists :P see how much a "win" the Ryzen 3k / x5xx / Navi truly are, not only to get AMD margins even higher but to take more market share from Intel and "stagnate" Nvidia's needing to "up the price" to make more $$$$$$ when AMD "seems" to be making "as much if not more" selling a small amount less per unit (keep in mind, AMD is next Playstation and Xbox which are 99/9% likely to be using the same silicon, so, AMD take a "small hit" to get as many Ryzen gen 3 and Navi "in the world" which drums up market/mindshare which is extremely important for any business, at this stage in the game is VITAL for AMD.
  • sor - Sunday, July 7, 2019 - link

    I’m honestly wondering what the point is of the gaming benchmarks for CPU tests anymore.

    It seems like the game is either so easy to render that we are comparing values in the hundreds of FPS, or they’re so hard to render that it’s completely GPU dependent and the graph is flat for all CPUs.

    In the vast majority of tests here one would have an identical experience with any of the top four CPUs tested.
  • Targon - Monday, July 8, 2019 - link

    Game engines are starting to use more cores, and at lower resolutions(which do not stress the video card all that much) will show improvements/benefits of using one CPU or even platform over another. In this review, due to the RAM being used, the gaming benchmarks are almost invalid(DDR4-3200 CL16), since moving to CL14 RAM would potentially eliminate any advantage Intel has in these benchmarks.

Log in

Don't have an account? Sign up now