Sequential Read/Write Speed

To measure sequential performance I ran a 1 minute long 128KB sequential test over the entire span of the drive at a queue depth of 1. The results reported are in average MB/s over the entire test length.

As impressive as the random read/write speeds were, at low queue depths the Vertex 4's sequential read speed is problematic:

Desktop Iometer - 128KB Sequential Read (4K Aligned)

Curious as to what's going on, I ran AS-SSD and came away with much better results:

Incompressible Sequential Read Performance - AS-SSD

Finally I turn to ATTO, giving me the answer I'm looking for. The Vertex 4's sequential read speed is slow at low queue depths with certain workloads, move to larger transfer sizes or high queue depths and the problem resolves itself:

QD2
QD4
QD8

The problem is that many sequential read operations for client workloads occur at 64 – 128KB transfer sizes, and at a queue depth of 1 - 3. Looking at the ATTO data above you'll see that this is exactly the weak point of the Vertex 4.

I went back to Iometer and varied queue depth with our 128KB sequential read test and got a good characterization of the Vertex 4's large block, sequential read performance:

The Vertex 4 performs better with heavier workloads. While other drives extract enough parallelism to deliver fairly high performance with only a single IO in the queue, the Vertex 4 needs 2 or more for large block sequential reads. Heavier read workloads do wonderfully on the drive, ironically enough it's the lighter workloads that are a problem. It's the exact opposite of what we're used to seeing. As this seemed like a bit of an oversight, I presented OCZ with my data and got some clarification.

Everest 2 was optimized primarily for non-light workloads where higher queuing is to be expected. Extending performance gains to lower queue depths is indeed possible (the Everest 1 based Octane obviously does fine here) but it wasn't deemed a priority for the initial firmware release. OCZ instead felt it was far more important to have a high-end alternative to SandForce in its lineup. Given that we're still seeing some isolated issues on non-Intel SF-2281 drives, the sense of urgency does make sense.

There are two causes for the lower than expected, low queue depth sequential read performance. First, OCZ doesn't currently enable NCQ streaming for queue depths less than 3. This one is a simple fix. Secondly, the Everest 2 doesn't currently allow pipelined read access from more than 8 concurrent NAND die. For larger transfers and queue depths this isn't an issue, but smaller transfers and lower queue depths end up delivering much lower than expected performance.

To confirm that I wasn't crazy and the Vertex 4 was capable of high, real-world sequential read speeds I created a simple test. I took a 3GB archive and copied it from the Vertex 4 to a RAM drive (to eliminate any write speed bottlenecks). The Vertex 4's performance was very good:

Sequential Read - 3GB Archive Copy to RAM Disk

Clearly the Vertex 4 is capable of reading at very high rates – particularly when it matters, however the current firmware doesn't seem tuned for any sort of low queue depth operation.

Both of these issues are apparently being worked on at the time of publication and should be rolled into the next firmware release for the drive (due out sometime in late April). Again, OCZ's aim was to deliver a high-end drive that could be offered as an alternative to the Vertex 3 as quickly as possible.

Update: Many have been reporting that the Vertex 4's performance is dependent on having an active partition on the drive due to its NCQ streaming support. While this is true, it's not the reason you'll see gains in synthetic tests like Iometer. If you don't fill the drive with valid data before conducting read tests, the Vertex 4 returns lower performance numbers. Running Iometer on a live partition requires that the drive is first filled with data before the benchmark runs, similar to what we do for our Iometer read tests anyway. The chart below shows the difference in performance between running an Iometer sequential read test on a physical disk (no partition), an NTFS partition on the same drive and finally the physical disk after all LBAs have been written to:

Notice how the NTFS and RAW+precondition lines are identical, it's because the reason for the performance gain here isn't NCQ streaming but rather the presence of valid data that you're reading back. Most SSDs tend to give unrealistically high performance numbers if you read from them immediately following a secure erase so we always precondition our drives before running Iometer. The Vertex 4 just happens to do the opposite, but this has no bearing on real world performance as you'll always be reading actual files in actual use.

Despite the shortcomings with low queue depth sequential read performance, the Vertex 4 dominated our sequential write tests, even at low queue depths. Only the Samsung SSD 830 is able to compete:

Desktop Iometer - 128KB Sequential Write (4K Aligned)

Technically the SF-2281 drives equal the Vertex 4's performance, but that's only with highly compressible data. Large sequential writes are very often composed of already compressed data, which makes the real world performance advantage of the Vertex 4 tangible.

Incompressible Sequential Write Performance - AS-SSD

AS-SSD gives us another taste of the performance of incompressible data, which again is very good on the Vertex 4. As far as writes are concerned, there's really no beating the Vertex 4.

Random Read/Write Speed AnandTech Storage Bench 2011
POST A COMMENT

126 Comments

View All Comments

  • kyuu - Wednesday, April 04, 2012 - link

    Whichever is cheaper. Performance-wise, you won't notice a difference. I promise. Reply
  • tynopik - Wednesday, April 04, 2012 - link

    don't think you said what you meant Reply
  • Anand Lal Shimpi - Wednesday, April 04, 2012 - link

    I think you're quite right :) Fixed.

    Take care,
    Anand
    Reply
  • ViviTheMage - Wednesday, April 04, 2012 - link

    I have two M4's, I find the iops to be as delicious as milk shakes. Reply
  • gloinsir - Wednesday, April 04, 2012 - link

    What's up with the huge variance in performance between the Samsung 830 512MB and 256MB drives? On the light workload test - http://www.anandtech.com/show/5719/ocz-vertex-4-re... - The 256MB version is 25-50% faster on all tests. That's an enormous difference.
    Is there a firmware difference between the two drives?

    Thanks for a great review. I hope the firmware updates and reliability work out. Competition is good!
    Reply
  • Anand Lal Shimpi - Wednesday, April 04, 2012 - link

    There is indeed a firmware difference between the drives - unfortunately Samsung's latest firmware won't install on the 512GB drive for whatever reason, so it's left with lower performance.

    Take care,
    Anand
    Reply
  • ckryan - Wednesday, April 04, 2012 - link

    Are you sure that's a FW difference? I didn't really notice much of a difference overall between my 256GB and 512GB 830s, but there are differences.

    My 512 on 01FW pulls down substantially higher QD1 4K RWs than the 256 on 01 or 03FW, but there are other differences. I think there are slightly different characteristics between the 32gbit and 64gbit dies.
    Reply
  • ViviTheMage - Wednesday, April 04, 2012 - link

    Seriously, these are probably the most nutritious, and scrumdidaliumpcious iop looking SSD's I have seen. Reply
  • edlee321 - Wednesday, April 04, 2012 - link

    Im another one of those crucial m4 fans, you cannot get better bang for the buck than these drives, especially for power consumption and reliability.

    The difference between 4k read from one ssd to another is not important, reliability and idle power consumption is whats important.

    If anand can run a one month stress test using random uncompressable data on all the current drives in his possession that would be great. I want to see what drive lasts out the longest.
    Reply
  • RussianSensation - Wednesday, April 04, 2012 - link

    It seems after extensive use and degradation, the Corsair Performance Pro is one of the best, even besting the Crucial M4:

    http://www.xbitlabs.com/articles/storage/display/m...
    Reply

Log in

Don't have an account? Sign up now