Workstation Performance - SPECworkstation 3.1

SFF PCs traditionally do not lend themselves to workstation duties. However, a recent trend towards miniaturized workstations has been observed. Since Khadas markets the Mind as a modular portable workstation, we decided to benchmark the system for both content creation workloads as well as professional applications.

SPECworkstation 3.1

The SPECworkstation 3.1 benchmark measures workstation performance based on a number of professional applications. It includes more than 140 tests based on 30 different workloads that exercise the CPU, graphics, I/O and memory hierarchy. These workloads fall into different categories.

  • Media and Entertainment (3D animation, rendering)
  • Product Development (CAD/CAM/CAE)
  • Life Sciences (medical, molecular)
  • Financial Services
  • Energy (oil and gas)
  • General Operations
  • GPU Compute

Individual scores are generated for each test and a composite score for each category is calculated based on a reference machine (HP Z240 tower workstation using an Intel E3-1240 v5 CPU, an AMD Radeon Pro WX3100 GPU, 16GB of DDR4-2133, and a SanDisk 512GB SSD). Official benchmark results generated automatically by the benchmark itself are linked in the table below for the systems being compared.

SPECworkstation 3.1 Official Results (2K)
Khadas Mind Premium Run Summary
Khadas Mind Premium + Mind Dock Run Summary
ASRock NUC BOX-1260P Run Summary
Intel NUC12WSKi7 (Wall Street Canyon) Run Summary
ASRock NUC BOX-1360P-D5 (Performance) Run Summary
Beelink GTR7 Run Summary
ASRock 4X4 BOX-7735U (Performance) Run Summary
ASRock NUCS BOX-1360P-D4 Run Summary
Intel NUC13ANKi7 (Arena Canyon) Run Summary

Details of the tests in each category, as well as an overall comparison of the systems on a per-category basis are presented below.

Media and Entertainment

The Media and Entertainment category comprises of workloads from five distinct applications:

  • The Blender workload measures system performance for content creation using the open-source Blender application. Tests include rendering of scenes of varying complexity using the OpenGL and ray-tracing renderers.
  • The Handbrake workload uses the open-source Handbrake application to transcode a 4K H.264 file into a H.265 file at 4K and 2K resolutions using the CPU capabilities alone.
  • The LuxRender workload benchmarks the LuxCore physically based renderer using LuxMark.
  • The Maya workload uses the SPECviewperf 13 maya-05 viewset to replay traces generated using the Autodesk Maya 2017 application for 3D animation.
  • The 3ds Max workload uses the SPECviewperf 13 3dsmax-06 viewset to replay traces generated by Autodesk's 3ds Max 2016 using the default Nitrous DX11 driver. The workload represents system usage for 3D modeling tasks.

SPECworkstation 3.1 - Media and Entertainment

Product Development

The Product Development category comprises of eight distinct workloads:

  • The Rodinia (CFD) workload benchmarks a computational fluid dynamics (CFD) algorithm.
  • The WPCcfd workload benchmarks another CFD algorithm involving combustion and turbulence modeling.
  • The CalculiX workload uses the Calculix finite-element analysis program to model a jet engine turbine's internal temperature.
  • The Catia workload uses the catia-05 viewset from SPECviewperf 13 to replay traces generated by Dassault Systemes' CATIA V6 R2012 3D CAD application.
  • The Creo workload uses the creo-02 viewset from SPECviewperf 13 to replay traces generated by PTC's Creo, a 3D CAD application.
  • The NX workload uses the snx-03 viewset from SPECviewperf 13 to replay traces generated by the Siemens PLM NX 8.0 CAD/CAM/CAE application.
  • The Solidworks workload uses the sw-04 viewset from SPECviewperf 13 to replay traces generated by Dassault Systemes' SolidWorks 2013 SP1 CAD/CAE application.
  • The Showcase workload uses the showcase-02 viewset from SPECviewperf 13 to replay traces from Autodesk's Showcase 2013 3D visualization and presentation application

SPECworkstation 3.1 - Product Development

Life Sciences

The Life Sciences category comprises of four distinct test sets:

  • The LAMMPS set comprises of five tests simulating different molecular properties using the LAMMPS molecular dynamics simulator.
  • The NAMD set comprises of three tests simulating different molecular interactions.
  • The Rodinia (Life Sciences) set comprises of four tests - the Heartwall medical imaging algorithm, the Lavamd algorithm for calculation of particle potential and relocation in a 3D space due to mutual forces, the Hotspot algorithm to estimate processor temperature with thermal simulations, and the SRAD anisotropic diffusion algorithm for denoising.
  • The Medical workload uses the medical-02 viewset from SPECviewperf 13 to determine system performance for the Tuvok rendering core in the ImageVis3D volume visualization program.

SPECworkstation 3.1 - Life Sciences

Financial Services

The Financial Services workload set benchmarks the system for three popular algorithms used in the financial services industry - the Monte Carlo probability simulation for risk assessment and forecast modeling, the Black-Scholes pricing model, and the Binomial Options pricing model.

SPECworkstation 3 - Financial Services

Energy

The Energy category comprises of workloads simulating various algorithms used in the oil and gas industry:

  • The FFTW workload computes discrete Fourier transforms of large matrices.
  • The Convolution workload computes the convolution of a random 100x100 filter on a 400 megapixel image.
  • The SRMP workload processes the Surface-Related Multiples Prediction algorithm used in seismic data processing.
  • The Kirchhoff Migration workload processes an algorithm to calculate the back propogation of a seismic wavefield.
  • The Poisson workload takes advantage of the OpenMP multi-processing framework to solve the Poisson's equation.
  • The Energy workload uses the energy-02 viewset from SPECviewperf 13 to determine system performance for the open-source OPendTec seismic visualization application.

SPECworkstation 3 - Energy

General Operations

In the General Options category, the focus is on workloads from widely used applications in the workstation market:

  • The 7zip workload represents compression and decompression operations using the open-source 7zip file archiver program.
  • The Python workload benchmarks math operations using the numpy and scipy libraries along with other Python features.
  • The Octave workload performs math operations using the Octave programming language used in scientific computing.
  • The Storage workload evaluates the performance of the underlying storage device using transaction traces from multiple workstation applications.

SPECworkstation 3 - General Operations

GPU Compute

In the GPU Compute category, the focus is on workloads taking advantage of the GPU compute capabilities using either OpenCL or CUDA, as applicable:

  • The LuxRender benchmark is the same as the one seen in the media and entertainment category.
  • The Caffe benchmark measures the performance of the Caffe deep-learning framework.
  • The Folding@Home benchmark measures the performance of the system for distributed computing workloads focused on tasks such as protein folding and drug design.

We only process the OpenCL variants of the benchmark, as the CUDA version doesn't process correctly with default driver installs.

SPECworkstation 3 - GPU Compute

Most of the Ryzen-based systems are equipped with a large number of high-performance cores. This translates to better performance for select workloads compared to what is obtained using the hybrid scheme of performance and efficiency cores from Intel. Within the set of systems with similar number of cores, if the generation-to-generation improvements (RPL-P being better than ADL-P) are discounted, we find that the performance is essentially dependent on the available power budget. The main exception is GPU Compute - a weak point for Intel's iGPUs, as they can't complete some of the components successfully. The net result is that the Mind Premium configurations land up in the middle of the graph across all of the SPECworkstation 3.1 components.

System Performance: UL and BAPCo Benchmarks System Performance: Miscellaneous Workloads
Comments Locked

20 Comments

View All Comments

  • peterfares - Friday, September 15, 2023 - link

    This is really cool, but who is this for?
  • abufrejoval - Friday, September 15, 2023 - link

    I’d say it’s mostly for commuters, who’ll oscillate between two or more workplaces with high frequency and regularity, but don’t want to carry more than the “soul” of the computer with them.

    When I do that, I tend to make that a VM I keep on a high-speed USB stick and I then suspend the VM when I commute. Get’s the job done with a bit of overhead but in a smaller form factor but it means having a physical computer at every workplace and other compromises.

    Having a full-sized GPU dock on every location might be somewhat difficult in terms of budget, but a power primary and somewhat after-hours secondary, might be enough to satisfy a large part of the user base.

    Being able to just pick up the running machine right in the middle of something and then try catching a train or plane running might seem attractive, but Windows tends to glitch in far too many ways to make that realistic.

    I’ve had far too many Windows laptops being woken up from some powersave or even hibernation slumber in the middle of a flight, ostensibly for scheduled maintenance, only to then have them cook themselves and their battery to death for lack of cooling in the onboard luggage: I guess I should be glad they didn’t go as far as combusting, but generally I wound up without a working machine on the busy end of the trip…

    If you own a tiny home, operate in a boat, trailer or some other space constrained place this could be cool, but with an eye on longevity I’d not risk anything that wasn’t standards based and if TB isn’t enough, including dGPU, it’s really just tough luck.

    BTW, I do believe they offer TB and not just USB4, they just don’t have the certification done, because they do mention eGPU on their website.

    Pre-configured only: 32GB is certainly better than 16, even if I’d go for 64, especially at current prices for DRAM. The M.2 slots are only 30mm length so there wasn’t that much variety in terms of updates anyway, but that is currently changing because of these Steam console class devices. I guess the reason they won’t let you open the device is mostly to cut down on customer service issues, because there is just too many people out there who overestimate their technical skills and dexterity.

    I guess mostly it just inspired me into looking using a NUC for this commute style use case instead. They aren’t really that much bigger than this, especially if you don’t carry a power brick. Having to make sure they are properly hibernated, isn’t that much of an issue and they are far more economical.
  • brucethemoose - Friday, September 15, 2023 - link

    > 64-48-48-112 @ 5200

    Whoa, is this a typo?

    The timings seem awfully loose, like waay above default JEDEC.
  • meacupla - Friday, September 15, 2023 - link

    I can't even find what JEDEC specifies for LPDDR5
    Timings for LPDDR have always been looser than their regular counterparts.

    They clock higher at lower voltages, and the timings are loose as a result.
  • Kamen Rider Blade - Friday, September 15, 2023 - link

    So, instead of creating a "Proprietary Standard"?

    Why don't they use the existing PC/104 stacking Board standard that has been around for decades?
  • meacupla - Saturday, September 16, 2023 - link

    PC/104 is meant for internal only. It has exposed and unsupported pins, which makes it easy to bend the pins. It's fragile.
    This slot connector is a more robust design.

    Having said that, oculink and TB4 are plenty robust and have an existing market.
  • Kamen Rider Blade - Saturday, September 16, 2023 - link

    You do know that PC/104 has updated to PCIe/104 and uses PCIe connectors that are plenty strong. Version 3 of the spec has been ratified since Feb 17, 2015.

    Also it would be pretty easy for them to figure out how to create a base board to stack modules onto given the modular nature.

    They could've used a EPIC or EBC MoBo base board and stack modules on top.
  • meacupla - Saturday, September 16, 2023 - link

    not my fault you named the wrong spec
  • sjkpublic@gmail.com - Sunday, September 17, 2023 - link

    One main difference between this and other NUC's is the LPDDR5. This could have been a show stopper if they broke the 64GB barrier. Would consider a 128GB LPDDR5 memory version. Otherwise not much to see here.
  • xol - Wednesday, September 20, 2023 - link

    Polite reminder that "industrial design" means designing a product to be mass produced, not edgelord brushed titanium designed to convinced suckers a product is worth twice its equivalent value

Log in

Don't have an account? Sign up now