Miscellaneous Aspects and Final Words

In the process of reviewing the Western Digital Red 6 TB drives, we did face one hiccup. Our QNAP testbed NAS finished resyncing a RAID-5 volume with three of those drives, but suddenly indicated an I/O error for one of them.

We were a bit surprised (in all our experience with hard drive review units, we had never had one fail that quickly). To check into the issue, we ran the SMART diagnostics and also a short test from within the NAS UI. Even though both of them passed clean, the NAS still refused to accept the disk for inclusion in the RAID volume. Fortunately, we had a spare drive that we could use to rebuild the volume. Putting the 'failed' drive in a PC didn't reveal any problems either. We are chalking this down to compatibility issues, though it is strange that the rebuilt volume with the same disks completed benchmarking without any problems. In any case, I would advise prospective consumers to ensure that their NAS is in the compatibility list for the drive before moving forward with the purchase.

RAID Resync and Power Consumption

The other aspect of interest when it comes to hard drives and NAS units is the RAID rebuild / resync times and the associated power consumption numbers. The following table presents the relevant values for the resyncing of a RAID-5 volume involving the respective drives.

QNAP TS-EC1279U-SAS-RP RAID-5 Volume Resync
Disk Model Duration Avg. Power
Western Digital Red 6 TB 14h 27m 52s 90.48 W
Seagate Enterprise Capacity 3.5" HDD v4 6 TB 10h 24m 22s 105.42 W
HGST Ultrastar He6 6 TB 12h 34m 20s 95.36 W

Update: We also have some power consumption numbers under different scenarios. In each of these cases, we have three of the drives under consideration configured in a RAID-5 volume in the NAS. The access mode is exercised by running the corresponding IOMeter trace from 25 clients simultaneously.

QNAP TS-EC1279U-SAS-RP RAID-5 Power Consumption
Workload WD Red 6 TB Seagate Enterprise Capacity 3.5" HDD v4 6 TB HGST Ultrastar He6 6 TB
Idle 79.34 W 87.16 W 84.98 W
Max. Throughput
(100% Reads)
93.90 W 107.22 W 97.58 W
Real Life
(60% Random, 65% Reads)
84.04 W 109.25 W 94.03 W
Max. Throughput
(50% Reads)
96.74 W 112.82 W 99.25 W
Random 8 KB
(70% Reads)
85.22 W 105.65 W 91.47 W

As expected, the Seagate Enterprise Capacity 3.5" HDD v4 consumes the most power, while the He6 is much better off thanks to its HelioSeal technology while retaining the same rotational speed. The WD Red, on the other hand, wins the power efficiency battle as expected - a good thing for home consumers who value that over pure performance.

Concluding Remarks

We have taken a look at three different 6 TB drives, but it is hard to recommend any particular one as the clear cut choice unless the particular application is known. The interesting aspect here is that none of the three drives have overlapping use-cases. For home consumers who are interested in stashing their media collection / smartphone-captured photos and videos and expect only four or five clients to simultaneously access the NAS, the lower power consumption as well as the price of the WD Red 6 TB is hard to ignore. For users looking for absolute performance and those who need multiple iSCSI LUNs for virtual machines and other such applications would find the Seagate Enterprise Capacity v4 6 TB a good choice. The HGST Ultrastar He6 is based on upcoming technological advancements, and hence, carries a premium. However, the TCO aspect turns out to be in its favour, particularly when multiple drives running 24x7 are needed. It offers the best balance of power consumption, price and performance.

Multi-Client Access - NAS Environment
Comments Locked

83 Comments

View All Comments

  • NonSequitor - Thursday, July 24, 2014 - link

    So something isn't adding up here. I have a set of nine Red 3TB drives in a RAID6. They are scrubbed - a full rebuild - once a month. They have been in service for a year and a half with no drive failures. Since it's RAID6, if one drive returns garbage it will be spotted by the second parity drive. Obviously they can't be returning bad data, or they would have been failed out long ago. The array was previously made of 9 1TB Greens, scrubbed monthly, and over two and a half years I had a total of two drive failures, one hard, one a SMART pre-failure.
  • asmian - Friday, July 25, 2014 - link

    Logically, I think this might well be due to the URE masking on these Red drives - something the Green drives weren't doing. You've been lucky that with a non-degraded RAID6 you've always had that second parity drive that has perhaps enabled silent repair by the controller when a URE occurred. I've been pondering more about this and here's what I have just emailed to Ganesh, with whom I've been having a discussion...

    --------------------

    On 24/07/2014 03:45, Ganesh T S wrote:
    > Ian,
    >
    > Irrespective of the way URE needs to be interpreted, the points you
    > raise are valid within a certain scope, and I have asked WD for their
    > inputs. I will ping Seagate too, since their new NAS HDD models are
    > expected to launch in late August.

    Thanks. This problem is a lot more complicated than it looks, I think, and than a single URE figure might suggest. But the other "feature" of these Red drives is also extremely concerning to me now. I am a programmer and often think in low-level algorithms, and everything about this URE masking seems wrong in the likely usage scenarios. Please help me with my logic train here - correct me if I'm wrong. Let's assume we have an array of these Red drives. Irrespective of the chance of a URE while the array is rebuilding or in use, let's assume one occurs. You have stated WD's info that the URE is masked and the disk simply returns dummy info for the read.

    If you were in a RAID5 and you were rebuilding after you've lost a drive, that MUST mean the array is now silently corrupted, right? The drive has masked the read failure and reported no error, and the RAID controller/software has no way to detect the dummy data. Critically, having a back-up doesn't help if you don't know that you NEED to repair or restore damaged files, and without a warning, due to simple space contraints a good backup will likely be over-written with one that now contains the corrupted data... so all ways round you are screwed. Having a masked URE in this situation is worse than having one reported, as you have no chance to take any remedial action.

    If you are in RAID6 and you lost a drive, then you still have a parity check to confirm the stripe data with while rebuilding. But if there's a masked URE and dummy data, then how will the controllers react? I presume they ALWAYS test the stripe data with the remaining parity or parities for consistency... so at that point the controller MUST throw a major rebuild error, right? However, they cannot determine which drive gave the bad data - just that the parity is incorrect - unless it's a parity disc that errored and one parity calculates correctly while the other is wrong. If they knew WHICH disc had UREd then they could easily exclude it from the parity calculation and rebuild that dummy data on the fly with the spare parity, but the masking makes that impossible. The rebuild must fail at this point. At least you have your backups... hopefully.

    Obviously the above situation will also be the same for a RAID5 array in normal usage or a degraded RAID6. Checking reads with parity, a masked URE means a failure with no way to recover. If you have an unmasked URE at least the drive controller can exclude the erroring disc for that stripe and just repair data silently using the remaining redundancy, knowing exactly where the error has come from. After all, it's logically just an EXPECTED disk event with a statistically low chance of happening, not necessarily an indication of impending disk failure unless it is happening frequently on the same disc. The only issue will be the astronomically unlikely chance of another URE occurring in the same stripe on another disc.

    Fundamentally, a masked URE means you get bad data without any explanation of why a disc is returning it, which gives no information to the user (or to the RAID controller so it can take action or warn the user appropriately). For me, that's catastrophic. It all really depends on what the controllers do when they discover parity errors and UREs in rebuild situations and how robust their recovery algorithms are - an unmasked URE does not NEED to be a rebuild-killer for RAID6, as thank G*d you had a second redundancy disc...

    Anyway, the question will be whether these new huge drives will, as you say, accumulate empirical evidence from users that array failures are happening more and more frequently. Without information from reviews like yours that warn against their use in RAID5 (or mirrors) despite the marketing as NAS products, the thought-experiment above suggests the most likely scenario is extremely widespread silent array corruption. I stand by my comment that this URE masking should be a total deal-breaker in considering them for home array usage. Better a disk that at least tells you it's errored.
  • NonSequitor - Tuesday, July 29, 2014 - link

    That still doesn't add up - there are no unmasked UREs in my situation, as I'm using Linux software RAID. It is set to log any errors or reconstructions, and there are literally none. One of these arrays has a dozen scrubs on it now with no read errors whatsoever.
  • m0du1us - Friday, July 25, 2014 - link

    @asmian This is why we run 28 disk arrays minimum for SAN. If you really want RAID6 to be reliable, you need A LOT of disks, no matter the size. Larger disks decrease your odds of rebuilding the array, that just means you need more disks. Also, you should never build a RAID6 array with less than 5 disks. At 5 disks, you get no protection from a disk failure during rebuild. At 12 disks, You can have 2 spares and 2 failures during rebuild before loosing data.
  • NonSequitor - Friday, July 25, 2014 - link

    Your comment makes no sense. A five disk RAID-6 is N+2. A 28 disk RAID-6 is N+2. More disks will decrease reliability. Our storage vendor advised keeping our N+2 array sizes between 12 and 24 disks, for instance: 12 disks as a minimum before it started impacting performance, 24 disks as a maximum before failure risks started to get too high. Our production experience has borne this out. Bigger arrays are actually treated as sets of smaller arrays.
  • LoneWolf15 - Friday, July 25, 2014 - link

    asmian, it totally depends on what you are building the array for, and how you are building it.

    Myself, I'd use them for home or perhaps small business, but in a RAID 10 or RAID 6. Then I'd have some guaranteed security. That said, if I needed constant write performance (e.g., multiple IP cameras) I'd use WD RE, and if I wanted enterprise level performance, I'd use enterprise drives.

    That, and I realize that RAID != backup strategy. I have a RAID-5 at home; but I sure as heck have a backup, too.
  • tuxRoller - Monday, July 21, 2014 - link

    Hi Ganesh,

    Would you mind listing the max power draw of these drives? That is, how much power is required during spin-up?
  • WizardMerlin - Tuesday, July 22, 2014 - link

    If you're not going to show the results on the same graph then for the love of god don't change the scale of the axis between graphs - makes quick comparison completely impossible.
  • ganeshts - Tuesday, July 22, 2014 - link

    I have tried that before and the problem is that once all the drives are on the same scale, then some of them become really difficult to track absolute values across. Been down that road and decided the issues it caused are not worth the effort taken for readers (including me and my colleagues) to glance right and left on the axes side to see what the absolute numbers are.
  • Iketh - Tuesday, July 22, 2014 - link

    what? why would they become difficult to "track absolute values" ??

Log in

Don't have an account? Sign up now