CPU Performance, Short Form

For our motherboard reviews, we use our short form testing method. These tests usually focus on if a motherboard is using MultiCore Turbo (the feature used to have maximum turbo on at all times, giving a frequency advantage), or if there are slight gains to be had from tweaking the firmware. We put the memory settings at the CPU manufacturers suggested frequency, making it very easy to see which motherboards have MCT enabled by default.

Handbrake 1.1.0: Streaming and Archival Video Transcoding

A popular open source tool, Handbrake is the anything-to-anything video conversion software that a number of people use as a reference point. The danger is always on version numbers and optimization, for example the latest versions of the software can take advantage of AVX-512 and OpenCL to accelerate certain types of transcoding and algorithms. The version we use here is a pure CPU play, with common transcoding variations.

We have split Handbrake up into several tests, using a Logitech C920 1080p60 native webcam recording (essentially a streamer recording), and convert them into two types of streaming formats and one for archival. The output settings used are:

  • 720p60 at 6000 kbps constant bit rate, fast setting, high profile
  • 1080p60 at 3500 kbps constant bit rate, faster setting, main profile
  • 1080p60 HEVC at 3500 kbps variable bit rate, fast setting, main profile

Handbrake 1.1.0 - 720p60 x264 6000 kbps FastHandbrake 1.1.0 - 1080p60 x264 3500 kbps FasterHandbrake 1.1.0 - 1080p60 HEVC 3500 kbps Fast

In Handbrake both the G.Skill TridentZ RGB DC and ZADAK Shield RGB DC memory show a noticable benefit.  The results are beyond the margin of error associated as well as both kits performing very similar across all three formats.

Blender 2.79b: 3D Creation Suite

A high profile rendering tool, Blender is open-source allowing for massive amounts of configurability, and is used by a number of high-profile animation studios worldwide. The organization recently released a Blender benchmark package, a couple of weeks after we had narrowed our Blender test for our new suite, however their test can take over an hour. For our results, we run one of the sub-tests in that suite through the command line - a standard ‘bmw27’ scene in CPU only mode, and measure the time to complete the render.

Blender can be downloaded at https://www.blender.org/download/

Rendering: Blender 2.78

The extra capacity and design proves ineffective within Blender and actually performs similar to other normal capacity kits at similar frequencies.

Rendering - Cinebench R15: link

Cinebench is a benchmark based around Cinema 4D, and is fairly well known among enthusiasts for stressing the CPU for a provided workload. Results are given as a score, where higher is better. The benchmark was created by MAXON and integrates workloads suitable for applications such as graphic design, VFX, game development and render engines. The testing is split into single thread and multi-threaded performance and is primarily a CPU and graphics benchmark.

Cinebench R15 Single ThreadedCinebench R15 Multi-Threaded

Performance in Cinebench R15 proved similar to other kits on test with the G.Skill TridentZ RGB DC DDR4-3200 proving the best of the pack in the multi-threaded test due to sub-timings.

POV-Ray 3.7.1: Ray Tracing

The Persistence of Vision ray tracing engine is another well-known benchmarking tool, which was in a state of relative hibernation until AMD released its Zen processors, to which suddenly both Intel and AMD were submitting code to the main branch of the open source project. For our test, we use the built-in benchmark for all-cores, called from the command line.

POV-Ray can be downloaded from http://www.povray.org/

Rendering: POV-Ray 3.7

POV-Ray performance is essentially the same in this test.

WinRAR 5.60b3: Archiving Tool

My compression tool of choice is often WinRAR, having been one of the first tools a number of my generation used over two decades ago. The interface has not changed much, although the integration with Windows right click commands is always a plus. It has no in-built test, so we run a compression over a set directory containing over thirty 60-second video files and 2000 small web-based files at a normal compression rate.

WinRAR is variable threaded but also susceptible to caching, so in our test we run it 10 times and take the average of the last five, leaving the test purely for raw CPU compute performance.

Encoding: WinRAR 5.40

WinRAR is one of the more memory sensitive benchmarks and encoding can gain from having higher performance memory installed. Both kits of DC memory show benefit here and outperform the standard kits consistently.

7-zip v1805: Popular Open-Source Encoding Engine

Out of our compression/decompression tool tests, 7-zip is the most requested and comes with a built-in benchmark. For our test suite, we’ve pulled the latest version of the software and we run the benchmark from the command line, reporting the compression, decompression, and a combined score.

It is noted in this benchmark that the latest multi-die processors have very bi-modal performance between compression and decompression, performing well in one and badly in the other. There are also discussions around how the Windows Scheduler is implementing every thread. As we get more results, it will be interesting to see how this plays out.

Encoding: 7-Zip

The DC memory also displays consistently better performance over standard capacity memory in 7-Zip.

3D Particle Movement v2.1: Brownian Motion

Our 3DPM test is a custom built benchmark designed to simulate six different particle movement algorithms of points in a 3D space. The algorithms were developed as part of my PhD., and while ultimately perform best on a GPU, provide a good idea on how instruction streams are interpreted by different microarchitectures.

A key part of the algorithms is the random number generation – we use relatively fast generation which ends up implementing dependency chains in the code. The upgrade over the naïve first version of this code solved for false sharing in the caches, a major bottleneck. We are also looking at AVX2 and AVX512 versions of this benchmark for future reviews.

For this test, we run a stock particle set over the six algorithms for 20 seconds apiece, with 10 second pauses, and report the total rate of particle movement, in millions of operations (movements) per second. We use a non-AVX version here.

3DPM v2.1 can be downloaded from our server: 3DPMv2.1.rar (13.0 MB)

System: 3D Particle Movement v2.1

Results in our 3DPM benchmark are all within a percent from top to bottom and there isn't much benefit.

DigiCortex 1.20: Sea Slug Brain Simulation

This benchmark was originally designed for simulation and visualization of neuron and synapse activity, as is commonly found in the brain. The software comes with a variety of benchmark modes, and we take the small benchmark which runs a 32k neuron / 1.8B synapse simulation, equivalent to a Sea Slug.


Example of a 2.1B neuron simulation

We report the results as the ability to simulate the data as a fraction of real-time, so anything above a ‘one’ is suitable for real-time work. Out of the two modes, a ‘non-firing’ mode which is DRAM heavy and a ‘firing’ mode which has CPU work, we choose the latter. Despite this, the benchmark is still affected by DRAM speed a fair amount.

DigiCortex can be downloaded from http://www.digicortex.net/

System: DigiCortex 1.20 (32k Neuron, 1.8B Synapse)

Similarly, there is no real benefit to the new memory. The G.Skill is higher than the ZADAK again however.

ZADAK Shield RGB DC Overview Gaming Performance
Comments Locked

50 Comments

View All Comments

  • prateekprakash - Thursday, January 24, 2019 - link

    Could you please mention the names of the motherboards which did not post with these memories?
    Also could you please try these with Intel 6xxx/ 7xxx series CPUs with 2xx chipsets ( z270, b250).
  • mito0815 - Thursday, January 24, 2019 - link

    Any thoughts on how scalable this apporach is? I mean...the obvious issues (heatsink fan clearance being one of them) aside, 4-row-high-DIMMs would look absolutely hilarious. I'd buy them. Just for the joke.
  • KarlKastor - Thursday, January 24, 2019 - link

    I don't get why there is a need for double height.
    There are lots of DIMMs in the market, that have 18 ICs per side on a regular DIMM.

    I think it's just marketing, to show visually they have something new. The Cooler occupies the space anyway. But don't get, why every Tech-website mention it's neccessary.
  • Targon - Thursday, January 24, 2019 - link

    I suspect it is all about the memory density. So, rather than trying to get 7nm fab process RAM, these companies are using less expensive chips and just increasing the size of the board to compensate, plus the need to connect the RAM chips on the DIMM. What sort of timings are on these things, 2T, 3T, or 4T for the command rate? How about the latency ratings?
  • KarlKastor - Friday, January 25, 2019 - link

    Mh? I talk not about the number of DRAM Dies. I speak just about the size of the PCB. What has lithographie to do with PCB size?
    Here u have 16 packages per side. There are a lot of normal sized DIMMs outside with that amount of packages.
  • Danvelopment - Friday, January 25, 2019 - link

    What are the use cases? I would have thought that, by the time you need those sort of capacities, you would be better served by a quad channel Xeon.
  • NoSoMo - Friday, January 25, 2019 - link

    Interesting -- now if they could just pair them with some 3D nand and allow hybrid RAM / storage like intel wants to do with optane. Perhaps it'd come in a variant that sees 16GB PC 3000 and a slot similar to M.2 with capacities that mirror that of NVMEs thus moving storage over to the RAM bus and freeing up the PCI bus. The modules would be L shaped so that the storage addition completes the form factor thus allowing it to retain the same profile as these taller units, vs having a module hanging off the side.
  • 13Gigatons - Wednesday, January 30, 2019 - link

    Maybe they could focus on lowering the price????

    Other then that what is the case use?
  • DPete27 - Tuesday, February 12, 2019 - link

    You can fit 2 SODIMMs using a single locking mechanism on each end within the limits of a mITX board. Surely that would be much easier and more universal.
    [img]https://lh3.googleusercontent.com/-L0fCpsbFSWA/We5...[/img]
  • ExclamationMediaLLC - Wednesday, July 10, 2019 - link

    Hi Ian and Gavin! Very helpful article! I’m building a SFF workstation using these modules. I want to remove the heat spreaders but I’m afraid of damaging the DIMMs. I see you guys managed it. How risky is it? Is there anything special I should know about removing the RGB lighting strips? (Yes, everyone, I know it will void the warranty)

Log in

Don't have an account? Sign up now