AnandTech Storage Bench 2011

Two years ago we introduced our AnandTech Storage Bench, a suite of benchmarks that took traces of real OS/application usage and played them back in a repeatable manner. Anand assembled the traces out of frustration with the majority of what we have today in terms of SSD benchmarks.

Although the AnandTech Storage Bench tests did a good job of characterizing SSD performance, they weren't stressful enough. All of the tests performed less than 10GB of reads/writes and typically involved only 4GB of writes specifically. That's not even enough exceed the spare area on most SSDs. Most canned SSD benchmarks don't even come close to writing a single gigabyte of data, but that doesn't mean that simply writing 4GB is acceptable.

Originally the benchmarks were kept short enough that they wouldn't be a burden to run (~30 minutes) but long enough that they were representative of what a power user might do with their system.

1) The MOASB, officially called AnandTech Storage Bench 2011 - Heavy Workload, mainly focuses on the times when your I/O activity is the highest. There is a lot of downloading and application installing that happens during the course of this test. Our thinking was that it's during application installs, file copies, downloading and multitasking with all of this that you can really notice performance differences between drives.

2) We tried to cover as many bases as possible with the software incorporated into this test. There's a lot of photo editing in Photoshop, HTML editing in Dreamweaver, web browsing, game playing/level loading (Starcraft II & WoW are both a part of the test) as well as general use stuff (application installing, virus scanning). We've included a large amount of email downloading, document creation and editing as well. To top it all off we even use Visual Studio 2008 to build Chromium during the test.

The test has 2,168,893 read operations and 1,783,447 write operations. The IO breakdown is as follows:

AnandTech Storage Bench 2011 - Heavy Workload IO Breakdown
IO Size % of Total
4KB 28%
16KB 10%
32KB 10%
64KB 4%

Only 42% of all operations are sequential, the rest range from pseudo to fully random (with most falling in the pseudo-random category). Average queue depth is 4.625 IOs, with 59% of operations taking place in an IO queue of 1.

AnandTech Storage Bench 2011 - Heavy Workload

Heavy Workload 2011 - Average Data Rate

The full data set including disk busy times and read/write separation can be found in our Bench.

AnandTech Storage Bench 2011 - Light Workload

Our light workload actually has more write operations than read operations. The split is as follows: 372,630 reads and 459,709 writes. The relatively close read/write ratio does better mimic a typical light workload (although even lighter workloads would be far more read centric). There's lots of web browsing, photo editing (but with a greater focus on photo consumption), video playback as well as some application installs and gaming.

The I/O breakdown is similar to the heavy workload at small IOs, however you'll notice that there are far fewer large IO transfers. Interestingly, the 480GB drive actually comes out ahead in this case, suggesting it's more capable at light workloads.

AnandTech Storage Bench 2011 - Light Workload IO Breakdown
IO Size % of Total
4KB 27%
16KB 8%
32KB 6%
64KB 5%

Light Workload 2011 - Average Data Rate

Performance vs Transfer Size Power Consumption
Comments Locked

65 Comments

View All Comments

  • ahar - Thursday, January 9, 2014 - link

    Can we also have one for the article? ;)
    "...so the number's you're seeing here..."
  • Unit Igor - Saturday, January 11, 2014 - link

    Tell me Kristian please would EVO 120GB msata have any advantage over EVO 250gb msata in longer battery life when you compare power consuptipon vs. disk busy times and mb/s.I use my ultrabook only for mails ,somtimes wathing movies and surfing.I dont need more then 120GB SSD but i am willing to buy 250Gb if it would give me more battery life.What i wanted to see in your benchmark is MobileMark 2012 because msata is for laptops and that is where battery life play big role
  • guidryp - Thursday, January 9, 2014 - link

    "endurance is fine for consumer usage"

    Thanks for your opinion, but I'll stick with MLC.

    Do you also think Multi-TB HDDs are fine for consumer use? Since HDDs went over 1TB, they have been failing/wearing out for me regularly. I am sure you can find some theoretical numbers that say these are "fine for consumer usage" as well.

    There is a big trend to bigger sizes but lower reliability. That trend can get stuffed.

    Samsungs advantage of Being the only TLC player strikes me as a reason to avoid Samsung, so I can avoid TLC and decreasing endurance that goes with it.
  • Kristian Vättö - Thursday, January 9, 2014 - link

    That's just your experience, it's not a proof that over 1TB hard drives are less reliable. We can't go out and start claiming that they are less reliable unless we have some concrete proof of that (failures on our end, statistics etc).

    The same applies for TLC. All we have is the P/E cycle number and frankly it gives us a pretty good estimation of the drive's lifespan and those numbers suggest that the endurance of TLC is completely fine for consumer usage. Or do you think our calculations are incorrect?
  • MrSpadge - Thursday, January 9, 2014 - link

    And add to that that the P/E cycles are usually conservatively estimated by manufacturers. The SSD-burn-tests at XS sometimes exceed the ratings significantly.
  • guidryp - Thursday, January 9, 2014 - link

    I think if you examine any aggregate source of reviews like Newegg you will see a significant drop in drive satisfaction do to early failures, since drives went over 1TB. So it isn't just some personal fluke that half of my >1TB drives have failed worn out, so far.

    I am really sick of this trend of declining reliability being sold as good enough. If TLC is "good enough" I will take MLC with 3X "good enough" unless the we are talking about 1/3 the price for TLC.

    Weren't the Samsung 840s failing in days for Anand last year?

    Unlike reviewers, I use my products until they fail, so reliability matters a LOT, and is something that is going in the wrong direction IMO.
  • Kristian Vättö - Thursday, January 9, 2014 - link

    Reliability is not the same as endurance. TLC has lower endurance, that's a fact, but it's not less reliable. Endurance is something you can predict (in the end all electronics have a finite lifespan) but reliability you cannot since there's a lot else than just NAND that can fail. I would claim that today's SSDs are much more reliable than the SSDs we had two years ago -- there haven't been any widespread issues with current drives (compared to e.g. early SandForce drives).

    Yes, we had a total of three 840 and 840 Pros that failed but that was on pre-production firmware. The retail units shipped with a fixed firmware.

    This isn't a new trend. Historically we can go back all the way to 1920s when light bulb companies started rigging their products so the lifespan would be shorter, which would in turn increase sales. Is it fair? Of course not. Do all companies do it? Yes.

    I do see your point but I think you're exaggerating. Even TLC SSDs will easily outlive the computer as a whole since the system will become obsolete in in a matter of years anyway if it's not updated.
  • gandergray - Saturday, January 25, 2014 - link

    For information concerning hard drive failure rates that is more objective, please see the following article: http://www.extremetech.com/extreme/175089-who-make... .
  • althaz - Thursday, January 9, 2014 - link

    TLC is NOT a trade off in reliability, but a tradeoff in longevity.

    Longevity is measured in write-cycles and with heavy consumer loads TLC drives will still last for many years.
  • bsd228 - Thursday, January 9, 2014 - link

    Other than the fact that they both store data, SSDs and HDDs have nothing in common, so it's silly to presume a problem that isn't really what you think it is in the first place. HDDs got dirt cheap as we cross the TB threshold and with it went diligent QA. You want 2TB for $80, you're going to get a higher defect rate. And going to 4 or 5 platters just increases the failure points, but the razor thin margins are the culprit here.

    In contrast, a bigger SSD just means either more chips, or higher density ones. But 16 chips is nothing new, and since there are no mechanical parts, nothing to worry about. Aside from OCZ, the SSD track record for reliability has been pretty solid, and Samsung (and Intel) far better than that. If you want to stick to 256G in your laptop out of a silly fear of TLC, you're just hurting yourself. The Anand guys have already shown how overstated the wear issue has become.

Log in

Don't have an account? Sign up now