CPU Performance, Short Form

For our motherboard reviews, we use our short form testing method. These tests usually focus on if a motherboard is using MultiCore Turbo (the feature used to have maximum turbo on at all times, giving a frequency advantage), or if there are slight gains to be had from tweaking the firmware. We put the memory settings at the CPU manufacturers suggested frequency, making it very easy to see which motherboards have MCT enabled by default.

Handbrake 1.1.0: Streaming and Archival Video Transcoding

A popular open source tool, Handbrake is the anything-to-anything video conversion software that a number of people use as a reference point. The danger is always on version numbers and optimization, for example the latest versions of the software can take advantage of AVX-512 and OpenCL to accelerate certain types of transcoding and algorithms. The version we use here is a pure CPU play, with common transcoding variations.

We have split Handbrake up into several tests, using a Logitech C920 1080p60 native webcam recording (essentially a streamer recording), and convert them into two types of streaming formats and one for archival. The output settings used are:

  • 720p60 at 6000 kbps constant bit rate, fast setting, high profile
  • 1080p60 at 3500 kbps constant bit rate, faster setting, main profile
  • 1080p60 HEVC at 3500 kbps variable bit rate, fast setting, main profile

Handbrake 1.1.0 - 720p60 x264 6000 kbps FastHandbrake 1.1.0 - 1080p60 x264 3500 kbps FasterHandbrake 1.1.0 - 1080p60 HEVC 3500 kbps Fast

In Handbrake both the G.Skill TridentZ RGB DC and ZADAK Shield RGB DC memory show a noticable benefit.  The results are beyond the margin of error associated as well as both kits performing very similar across all three formats.

Blender 2.79b: 3D Creation Suite

A high profile rendering tool, Blender is open-source allowing for massive amounts of configurability, and is used by a number of high-profile animation studios worldwide. The organization recently released a Blender benchmark package, a couple of weeks after we had narrowed our Blender test for our new suite, however their test can take over an hour. For our results, we run one of the sub-tests in that suite through the command line - a standard ‘bmw27’ scene in CPU only mode, and measure the time to complete the render.

Blender can be downloaded at https://www.blender.org/download/

Rendering: Blender 2.78

The extra capacity and design proves ineffective within Blender and actually performs similar to other normal capacity kits at similar frequencies.

Rendering - Cinebench R15: link

Cinebench is a benchmark based around Cinema 4D, and is fairly well known among enthusiasts for stressing the CPU for a provided workload. Results are given as a score, where higher is better. The benchmark was created by MAXON and integrates workloads suitable for applications such as graphic design, VFX, game development and render engines. The testing is split into single thread and multi-threaded performance and is primarily a CPU and graphics benchmark.

Cinebench R15 Single ThreadedCinebench R15 Multi-Threaded

Performance in Cinebench R15 proved similar to other kits on test with the G.Skill TridentZ RGB DC DDR4-3200 proving the best of the pack in the multi-threaded test due to sub-timings.

POV-Ray 3.7.1: Ray Tracing

The Persistence of Vision ray tracing engine is another well-known benchmarking tool, which was in a state of relative hibernation until AMD released its Zen processors, to which suddenly both Intel and AMD were submitting code to the main branch of the open source project. For our test, we use the built-in benchmark for all-cores, called from the command line.

POV-Ray can be downloaded from http://www.povray.org/

Rendering: POV-Ray 3.7

POV-Ray performance is essentially the same in this test.

WinRAR 5.60b3: Archiving Tool

My compression tool of choice is often WinRAR, having been one of the first tools a number of my generation used over two decades ago. The interface has not changed much, although the integration with Windows right click commands is always a plus. It has no in-built test, so we run a compression over a set directory containing over thirty 60-second video files and 2000 small web-based files at a normal compression rate.

WinRAR is variable threaded but also susceptible to caching, so in our test we run it 10 times and take the average of the last five, leaving the test purely for raw CPU compute performance.

Encoding: WinRAR 5.40

WinRAR is one of the more memory sensitive benchmarks and encoding can gain from having higher performance memory installed. Both kits of DC memory show benefit here and outperform the standard kits consistently.

7-zip v1805: Popular Open-Source Encoding Engine

Out of our compression/decompression tool tests, 7-zip is the most requested and comes with a built-in benchmark. For our test suite, we’ve pulled the latest version of the software and we run the benchmark from the command line, reporting the compression, decompression, and a combined score.

It is noted in this benchmark that the latest multi-die processors have very bi-modal performance between compression and decompression, performing well in one and badly in the other. There are also discussions around how the Windows Scheduler is implementing every thread. As we get more results, it will be interesting to see how this plays out.

Encoding: 7-Zip

The DC memory also displays consistently better performance over standard capacity memory in 7-Zip.

3D Particle Movement v2.1: Brownian Motion

Our 3DPM test is a custom built benchmark designed to simulate six different particle movement algorithms of points in a 3D space. The algorithms were developed as part of my PhD., and while ultimately perform best on a GPU, provide a good idea on how instruction streams are interpreted by different microarchitectures.

A key part of the algorithms is the random number generation – we use relatively fast generation which ends up implementing dependency chains in the code. The upgrade over the naïve first version of this code solved for false sharing in the caches, a major bottleneck. We are also looking at AVX2 and AVX512 versions of this benchmark for future reviews.

For this test, we run a stock particle set over the six algorithms for 20 seconds apiece, with 10 second pauses, and report the total rate of particle movement, in millions of operations (movements) per second. We use a non-AVX version here.

3DPM v2.1 can be downloaded from our server: 3DPMv2.1.rar (13.0 MB)

System: 3D Particle Movement v2.1

Results in our 3DPM benchmark are all within a percent from top to bottom and there isn't much benefit.

DigiCortex 1.20: Sea Slug Brain Simulation

This benchmark was originally designed for simulation and visualization of neuron and synapse activity, as is commonly found in the brain. The software comes with a variety of benchmark modes, and we take the small benchmark which runs a 32k neuron / 1.8B synapse simulation, equivalent to a Sea Slug.


Example of a 2.1B neuron simulation

We report the results as the ability to simulate the data as a fraction of real-time, so anything above a ‘one’ is suitable for real-time work. Out of the two modes, a ‘non-firing’ mode which is DRAM heavy and a ‘firing’ mode which has CPU work, we choose the latter. Despite this, the benchmark is still affected by DRAM speed a fair amount.

DigiCortex can be downloaded from http://www.digicortex.net/

System: DigiCortex 1.20 (32k Neuron, 1.8B Synapse)

Similarly, there is no real benefit to the new memory. The G.Skill is higher than the ZADAK again however.

ZADAK Shield RGB DC Overview Gaming Performance
Comments Locked

50 Comments

View All Comments

  • Bp_968 - Thursday, January 24, 2019 - link

    Or you could go with an AMD board and get to keep the ECC features enabled. :)
  • KarlKastor - Friday, January 25, 2019 - link

    How do u disable registered capability on a reg. DIMM? They are physically different to UDIMMs. Thats absolutely BS.
    You can run ECC UDIMM on every board, maybe without ECC working. But you will never run reg. DIMM on Hardware not supporting reg. DIMMs
  • nevcairiel - Wednesday, January 23, 2019 - link

    You cannot use ECC registered RAM on consumer motherboards.
  • CheapSushi - Thursday, January 24, 2019 - link

    Yes you can. It depends on the CPU/BIOS.
  • jordanclock - Wednesday, January 23, 2019 - link

    The kind of systems that use ECC RAM aren't the kind of systems that are going to see any overclocking.
  • alpha754293 - Thursday, January 24, 2019 - link

    To the best of my knowledge, you can put ECC RDIMMs in consumer-grade systems where you have overclocking capabilities made apparent and available to you just as people have overclocked Xeons before.

    Just because you have RDIMMs doesn't necessarily mean that you have to only use them with a motherboard that supports and requires the use of RDIMMs. (In fact, many motherboards that recommend using RDIMMs will also run with UDIMMs, but usually with a lower installed capacity limit.)
  • Lord of the Bored - Wednesday, January 23, 2019 - link

    Wouldn't two DIMMs on one card be a quad in-line memory module(QIMM)?
  • Billy Tallis - Thursday, January 24, 2019 - link

    No, because this approach doesn't make the memory bus any wider, it just stacks the memory deeper on the same bus than a normal DIMM would.
  • Lord of the Bored - Thursday, January 24, 2019 - link

    But both 8- and 32-bit SIMMs existed under the same name, so it clearly isn't bus width the name refers to.

    For what it is worth, I was wrong for a different reason. Research indicates that the D refers to rows of contacts. In a SIMM, the backside contacts were electrically the same as the frontside contacts, so there was only a single row. In a DIMM, they aren't, so you have dual contact rows. And since this still only has two rows of contacts...
  • Targon - Thursday, January 24, 2019 - link

    The connection to the motherboard and support in the BIOS and memory controller are the key. More memory on the same memory channel will not boost performance, and will even degrade performance a bit due to needing to connect the additional memory on the same channel.

    DDR5(not to be confused with GDDR5) will help. Going to Gen-Z where the memory can connect to the Gen-Z bus might potentially help as well if done right, because you want that memory bus to be able to access more RAM at the same time, so latency will hopefully go down with the next generation of desktop memory.

Log in

Don't have an account? Sign up now