Who is the Optane SSD 900P for?

With a price per GB a little over twice that of the the fastest flash-based consumer SSDs, the Optane SSD 900P is an exclusive high-end product. For most desktop usage, drives like the 960 PRO are already fast enough to make storage no longer a severe bottleneck. The most noticeable delays due to storage performance on a 960 PRO are when moving around large files, and the Optane SSD doesn't offer any significant improvement to sequential transfer speeds. Random writes can be a challenge for flash-based SSDs, but volatile write caches and SLC caches allow them to handle short bursts with very high performance.

The unprecedented random read performance of the Optane SSD 900P is its biggest strength on paper, but not one that will often lead to a proportional speedup in overall application performance. Too many programs and filesystems are still designed with mechanical hard drive performance in mind as the baseline, and further increases to SSD performance serve mainly to shift the bottlenecks further onto the CPU, RAM, network, and even the user's own reaction time.

The scenarios where a drive like the Optane SSD 900P can offer meaningful and worthwhile performance improvements can be broadly categorized as as situations where the Optane SSD can help with one of two problems:

1. Storage is too slow

About the only time a desktop could challenge the sequential access performance of a high-end PCIe SSD (based on flash or 3D XPoint) is when dealing with high resolution uncompressed video. The Optane SSD doesn't help much here because of its limited capacity, and the PCIe 3 x4 link itself is a bottleneck at the highest refresh rates and bit depths. For video work, flash-based SSDs are definitely a better choice, and RAID arrays of cheaper SATA SSDs may be a better option than PCIe SSDs. Desktop workloads that require extremely high sustained random write performance are very rare, and SLC caching on a flash-based SSD nicely takes care of most realistic quantities of random writes.

That said, there are some situations where higher random read performance can be quite noticeable. Searching through a large volume of data is a common case, such as searching through a video, but it usually presents enough opportunities for parallelization that the drive's queue depth will climb up to the range where flash-based SSDs come close to the Optane SSD. Game level load times can in theory benefit greatly from faster read speeds, but in practice decompressing the assets after loading them into RAM quickly becomes the bottleneck. Most of the other situations where the performance advantage of the Optane SSD will really help are better described as a different kind of problem:

2. RAM is too small

In the workstation market, there are abundant examples of compute tasks with a memory working set that doesn't fit in RAM. Almost any simulation or rendering task will have a parameter for mesh density or particle count that can very quickly scale the memory requirements from a few GB to tens or hundreds of GB. An Optane SSD is far slower than four to eight channels of DDR4, but 16GB DIMMs are least 6-7 times more expensive per GB than the Optane SSD 900P, and putting more than 128GB of DRAM in an ATX motherboard is even more expensive.

Intel PR provided an example of using SideFX Houdini to render a high-resolution animation that included a 1.1 billion particle water simulation. Their test used a machine with a 10-core CPU and 64GB of RAM, and compared the 512GB Samsung 960 PRO against the 480GB Optane SSD 900P. The total memory requirements (DRAM+swap) of the rendering job were not disclosed, but the resulting 2.7x speedup is very plausible for a task that absolutely hammers the swap device. With a sufficiently high thread count to keep the queue depth high, that margin could be narrower (especially with the fastest 2TB 960 PRO), but then context switch overhead would become problematic. With the Optane SSD 900P, the random read latency is low enough that it would be hard to host more than two swap-limited threads per core without context switch overhead wasting more time than waiting on the SSD.

Star Citizen Bundle

Even though gaming isn't the ideal workload for the Optane SSD 900P to show off its performance, Intel is marketing the 900P to gaming enthusiasts. They're bundling a code for the game Star Citizen with the 900P, and including a new in-game spaceship variant as an exclusive item for Optane SSD customers. Intel has partnered with Star Citizen developer Roberts Space Industries (RSI) to hold a launch event for the 900P at CitizenCon 2017 today, which they are streaming live on Twitch and YouTube. Attendees will have the chance to playtest the Intel-exclusive Sabre Raven ship, but it is still undergoing final QA and will not be immediately available to Optane SSD 900P customers. The web page for redeeming the Star Citizen game code had not gone live as of the time of writing, so I was unable to attempt any testing with the game. (ed: I remember when AMD was offering a Star Citizen bundle in 2014 as well. The game still hasn't shipped.)

At the media briefing for the 900P, an RSI representative said they are exploring ways to optimize the Star Citizen experience on Optane SSDs, but not many specifics were provided. One approach under consideration is using less compression for some game assets, freeing up CPU time but relying on high storage performance. It didn't sound like this work was close to release. In the game's current state, RSI claims they've seen load times improve by 20-25%, but they didn't specify what other storage device they were comparing against.

Introduction Drive Features
POST A COMMENT

204 Comments

View All Comments

  • lmcd - Friday, October 27, 2017 - link

    Funny, when SSDs came out we were promised they'd be orders of magnitude faster than mechanical HDDs. The first ones weren't. Reply
  • Drumsticks - Sunday, October 29, 2017 - link

    I think my primary point of criticism with your view is that, despite Intel not reaching their goals, it is measurably better than the competition at a better price per GB per performance.

    It might not have hit the endurance targets they wanted, but for 2-3x the cost of your opponent, they've achieved some 20x endurance, on par in the worst case, and 6-8 times better in many measurably important scenarios.

    On top of that, you tout the benefits of SLC and claim that SLC is undeniably faster than XPoint, because if SLC wasn't held back by the cheap and underpowered controllers of yesteryear, it could really fly. How do you know the same thing is not true of 3DXP? Perhaps better understanding of its use and a better controller, and maybe a second generation of the memory, will enable Intel to reach even higher performance and endurance heights. You can't claim that SLC is better, and is simply held back by 2014 era controllers, while not allowing that 3DXP could likely be held back by it's controller as well. We have no idea what the performance ceiling for 3DXP is, because we've seen all of one generation of products.

    To extend your analogy, I might be disappointed if I only received $450, but if everybody else is still only making $200, I'm going to go home happy.
    Reply
  • Rektide - Friday, October 27, 2017 - link

    This shows an Intel 900P with a total write endurance of 8.7PB, and a 850 Pro with a total write endurance of 150 TB. But a stress test of a 850 Pro in fact survived 9.1 PB[1]! Meanwhile, if you reach 8.7PB on the 900P, Intel will forcibly move your drive into read-only mode.

    If you look at Ark, Intel describes it's Endurance Rating to mean "Endurance rating indicates the expected data storage cycles to be expected over the life of the device." When they say that, they mean "and not a megabyte more". Whereas when Samsung says it, they, at least once, meant "but this drive may go 60x more than it's rating". I really severely dislike this twist of the knife, this drastic change Intel and Intel alone is perpetrating against it's consumers.

    Does the P4800X also commit seppuku the megabyte it reaches it's Endurance Lifetime? I'm not sure if all Intel drives are so malicious, or is it consumer & enthusiast drives? How do I know which drives are programmed to self destruct on me, Intel?

    [1] http://www.guru3d.com/news-story/endurance-test-of...
    Reply
  • ddriver - Friday, October 27, 2017 - link

    It is actually worse than locking in read-only mode - on the next boot cycle the drive gets bricked. So if you didn't manage to get that data in time, it is gone forever. Great feature. Reply
  • Spunjji - Saturday, October 28, 2017 - link

    I have seen this with Intel consumer SSDs. It's amazing - it doesn't even tell you that it's failing and that you have one (yes, one) chance to backup your data before it goes forever. Usually the drive just throws an error, so your average user reboots the system and bam, the drive's not even in the BIOS anymore.

    Their drive failure behaviour is criminal.
    Reply
  • voicequal - Saturday, October 28, 2017 - link

    Agreed - bricking is a terrible failure mode for a consumer drive. For an enterprise drive, it *might* make sense, since its you want things to fail hard and fast so that backup systems can detect and take over. Reply
  • FwFred - Friday, October 27, 2017 - link

    Wow, need a block button to improve SnR. Reply
  • Meteor2 - Tuesday, October 31, 2017 - link

    Indeed -- though I simply skip any comments written by ddriver. Never any value in them (but a remarkably effective troll). Reply
  • Lolimaster - Saturday, October 28, 2017 - link

    The 1000X was supposed to be on latency and endurance, not much more, and maybe 10X random 4K performance. Reply
  • CajunArson - Friday, October 27, 2017 - link

    I thought you were a reliable AMD koolaid drinker?

    It's funny how you insult these products that you have never used but would never in a million years hurl their $7000 Radeon SSG under the bus... you know, the one that uses a consumer-grade GPU slapped together with a RAID-0 array of cheap consumer-grade NVME drives to supposedly do rendering jobs faster than a regular GPU.
    Reply

Log in

Don't have an account? Sign up now