In a rather unexpected move, SanDisk has announced that it will be stepping into the storage array business with its in-house designed InfiniFlash all-flash array series. The driving force behind SanDisk's strategy is big data, as the current solutions on the market don't meet all the needs that big data customers have. Most importantly, these customers are after higher density and lower prices: their data centers can easily be hundreds of petabytes in size, so the acquisition cost as well as the cost of power, cooling and real estate play a major role.

The InfiniFlash comes in three different flavors. The IF100 is an open platform designed for OEMs and hyperscale customers with their own software platforms, although it does include a Software Development Kit (SDK) for monitoring purposes. The IF500 is aimed specifically at object storage using Ceph (an open source object storage system), which allows very high capacity clusters to be built efficiently. I admit that I need to dig in a bit deeper to fully understand the way object storage works and its benefits, but it requires less metadata than traditional file and block based storage systems do and the overhead of managing that metadata is also lower, which enables practically unlimited scale-out. Finally, the IF700 focuses on performance and is built upon Fusion-io's ION platform with database (Oracle, SAP, etc.) workloads in mind. In a nutshell, the key difference is that the IF100 is hardware only, whereas the IF500 and IF700 come with InfiniFlash OS (I was told it's Ubuntu 14.04 based). Hence the IF100 is also cheaper at less than $1 per GB, while the IF500 and IF700 will be priced somewhere between $1 and $2 per GB. For an all-flash array that's a killer price for raw storage (i.e. before compression and de-duplication) as it's not uncommon for vendors to still charge over $20 per raw gigabyte.
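
To put the per-gigabyte prices in perspective, here is a minimal sketch of the raw acquisition cost for the two capacity points. It assumes decimal units (1TB = 1,000GB) and treats $1, $2 and $20 per GB as round reference prices taken from the text, not as quotes for any actual configuration; the function name is purely illustrative.

```python
# Rough raw-flash acquisition cost at a given price per gigabyte.
# Assumption: decimal units (1 TB = 1,000 GB). The prices are the round
# figures mentioned in the article, not vendor quotes.
GB_PER_TB = 1000


def raw_cost_usd(raw_capacity_tb: float, usd_per_gb: float) -> float:
    """Return the cost in USD of raw_capacity_tb terabytes at usd_per_gb."""
    return raw_capacity_tb * GB_PER_TB * usd_per_gb


for capacity_tb in (256, 512):
    for usd_per_gb in (1.0, 2.0, 20.0):
        cost = raw_cost_usd(capacity_tb, usd_per_gb)
        print(f"{capacity_tb}TB at ${usd_per_gb:.0f}/GB: ${cost:,.0f}")
```

Since the IF100 is priced below $1 per GB, the $1 rows are an upper bound for that model.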

SanDisk InfiniFlash Specifications
Form Factor: 3U
Raw Capacity: 256TB & 512TB
Usable Capacity: 245TB & 480TB
Performance (IOPS): >780K IOPS
Performance (Throughput): 7GB/s
Power Consumption: 250W (idle) / 750W (active)
Connectivity: 8x SAS 2.0 (6Gbps)

SanDisk offers two capacity configurations: 256TB and 512TB, of which 245TB and 480TB are usable. The ~7% over-provisioning is used mostly for garbage collection with the goal of increasing sustained performance. In terms of performance, SanDisk claims over 780K IOPS, although the presentation showed up to 1 million IOPS at <1ms latency for 4KB random reads with two nodes. The endurance is 75PB of writes, which results in a ~4-year lifespan with a 90/10 read/write ratio at 6GB/s. For connectivity the InfiniFlash offers eight SAS 6Gbps ports, which can be upgraded to SAS 12Gbps in the future without any downtime.
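
The over-provisioning figure can be checked against the raw and usable capacities. This is a minimal sketch with an illustrative function name; whether SanDisk quotes spare area relative to usable or to raw capacity is not stated, so both conventions are computed.

```python
# Spare (over-provisioned) capacity as a fraction of usable or raw capacity.
# Assumption: the article does not say which convention SanDisk uses,
# so the caller chooses via relative_to.
def spare_fraction(raw_tb: float, usable_tb: float, relative_to: str = "usable") -> float:
    spare_tb = raw_tb - usable_tb
    base_tb = usable_tb if relative_to == "usable" else raw_tb
    return spare_tb / base_tb


for raw_tb, usable_tb in ((256, 245), (512, 480)):
    vs_usable = spare_fraction(raw_tb, usable_tb, "usable")
    vs_raw = spare_fraction(raw_tb, usable_tb, "raw")
    print(f"{raw_tb}TB raw / {usable_tb}TB usable: "
          f"{vs_usable:.1%} of usable, {vs_raw:.1%} of raw")
```

By this arithmetic the ~7% figure matches the 512TB configuration measured against usable capacity, while the 256TB configuration works out to roughly 4.5%.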

EDIT: My initial endurance numbers were actually wrong. The endurance for 8KB random writes is 189PB and 2,270TB for sequential 16KB writes. Assuming 6GB/s throughput that would result in a little over 10 years with 8KB random IO with 90/10 read/write ratio. 
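
The lifespan figures follow from dividing the rated write endurance by the sustained write rate. A minimal sketch, assuming decimal units (1PB = 10^15 bytes, 1GB = 10^9 bytes), a constant 6GB/s load around the clock, and that 10% of that traffic is writes; the function name is purely illustrative.

```python
# Estimated lifespan = rated write endurance / sustained write rate.
# Assumptions: decimal units, a constant 24/7 load, and a fixed write share.
SECONDS_PER_YEAR = 365.25 * 24 * 3600


def lifespan_years(endurance_pb: float, throughput_gb_s: float, write_fraction: float) -> float:
    endurance_bytes = endurance_pb * 1e15
    write_bytes_per_s = throughput_gb_s * 1e9 * write_fraction
    return endurance_bytes / write_bytes_per_s / SECONDS_PER_YEAR


for endurance_pb in (75, 189):
    years = lifespan_years(endurance_pb, throughput_gb_s=6, write_fraction=0.10)
    print(f"{endurance_pb}PB at 6GB/s with 10% writes: {years:.1f} years")
```

Under these assumptions 75PB works out to roughly 4 years and 189PB to roughly 10 years; using binary units instead shifts the results upward by a few percent.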

Architecturally the InfiniFlash is made out of 64 custom 8TB cards. The cards are SAS 6Gbps based, although as usual SanDisk wouldn't disclose the underlying controller platform. However, judging by the photo it appears to be a Marvell design (see the hint of a Marvell logo on the chip), which honestly wouldn't surprise me since SanDisk has relied heavily on Marvell for controllers, including SAS ones (e.g. Optimus Eco uses the 88SS9185 SAS 6Gbps controller). As for the NAND, SanDisk is using its own 1Ynm 64Gbit MLC (i.e. second generation 19nm) and each card consists of 64 eight-die NAND packages (this actually sums up to only 4TB, so I'm waiting to hear back if there's a typo somewhere in the equation -- maybe it's a 128Gbit die after all). SanDisk will also be using different types of NAND in the future once new nodes become available in volume.

EDIT: As I speculated, the die capacity is 128Gbit (16GB), so each package consists of eight 128Gbit dies, which yields 128GB per package, and with 64 packages per card the total raw capacity is 8TB.
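
The capacity arithmetic can be laid out step by step. This is a small sketch with an illustrative function name; it assumes the binary convention implied by the text (8,192GB counted as 8TB) and compares the originally assumed 64Gbit die with the confirmed 128Gbit die.

```python
# Raw capacity of one card from die size, dies per package and packages per card.
# Assumption: 1 TB = 1,024 GB here, matching the article's "8TB" for 8,192GB.
GB_PER_TB = 1024


def card_capacity_tb(die_gbit: int, dies_per_package: int, packages_per_card: int) -> float:
    die_gb = die_gbit / 8                      # gigabits to gigabytes
    package_gb = die_gb * dies_per_package     # capacity of one NAND package
    return package_gb * packages_per_card / GB_PER_TB


for die_gbit in (64, 128):
    card_tb = card_capacity_tb(die_gbit, dies_per_package=8, packages_per_card=64)
    array_tb = card_tb * 64                    # 64 cards per InfiniFlash unit
    print(f"{die_gbit}Gbit die: {card_tb:.0f}TB per card, {array_tb:.0f}TB per array")
```

Only the 128Gbit die is consistent with 8TB cards and a 512TB array; the 64Gbit assumption gives half of each.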

The InfiniFlash will be sold through the usual channel (i.e. through OEMs, SIs & VARs), but SanDisk is also selling it directly to some of its key hyperscale customers that insisted on working directly with SanDisk. While it may seem like SanDisk is stepping into its partners' territory with direct sales, the big hyperscale customers are known to build their own designs, so in most cases they wouldn't be buying from the OEMs, SIs or VARs anyway. Besides, the InfiniFlash doesn't include any computing resources, meaning that the partners can differentiate by integrating InfiniFlash into servers for an all-in-one approach, whereas the units SanDisk is selling are just a bunch of raw flash.

While the InfiniFlash as a product is certainly exciting and unique, it's not as interesting as the implications of this strategic move. In the past the SSD OEMs have kept their fingers away from the storage array market and have simply focused on building the drives that power the arrays. This strategy originates from the hard drive era where the separation was clear: companies X made the hard drives and companies Y integrated them into arrays. Since there wasn't really a way to customize a hard drive in terms of form factor and features, there was no point in merging X and Y given that the two had completely different expertise.

With flash that's totally different thanks to the versatility of NAND. The sky is the limit when it comes to form factors as NAND can easily scale from a tiny eMMC package to a 2.5" drive and, most importantly, to any custom form factor that is desired. SanDisk's biggest advantage in the storage array business is without a doubt its NAND expertise. Integrating NAND properly takes much more know-how and effort than integrating hard drives because ultimately you have to abandon the traditional form factors and go custom. It's not a surprise that the InfiniFlash is the densest all-flash array on the planet because as a NAND manufacturer SanDisk has the supply and engineering talent to put 8TB behind a single controller with a very space efficient design. The majority of storage array vendors simply source their drives from the likes of SanDisk, Intel and Samsung, so they are limited by the constraining form factors that are available on the open market.

The big question is how the market will react and whether this will become a trend among other SSD vendors as well. The storage array market is certainly changing and the companies that used to rule the space are starting to lose market share to smaller, yet innovative companies. As the all-flash array market matures, it seems logical for the SSD vendors to step in and eliminate the middleman for increased vertical integration. The future remains to be seen, but I wouldn't be surprised to see others following SanDisk's lead. The bar has definitely been set high, though.

Source: SanDisk InfiniFlash Product Page

23 Comments

  • bill.rookard - Wednesday, March 04, 2015

    Oh gosh, will someone PLEASE put together a semi-affordable, multi-TB drive for consumer use? Spinny disks in my NAS make me nervous (even if it is somewhat protected by RAID). I'd like to see how the Samsung VNAND brings pricing down once the yields come up.

    Just looking at this ($1/GB), a 1TB is about $1000, so the 256TB unit is... $256,000.00 - yikes.
  • SirMaster - Wednesday, March 04, 2015

    But it's still nowhere near cost effective.

    If you don't need the performance, HDDs are less than $0.04 per GB. You could store 10 backup copies of your data on HDDs and it's still cheaper than 1 copy on SSD. 10 HDD copies is far, far more reliable than 1 SSD copy.
  • Flunk - Wednesday, March 04, 2015

    I'm guessing you're not used to SAN prices because this is positively cut-rate.
  • Anonymous Blowhard - Wednesday, March 04, 2015

    Seriously. Gimme this for my VDI deployment right now.
  • ats - Wednesday, March 04, 2015

    You obviously haven't priced out commercial SAN/NAS storage. That pricing is extremely competitive. Esp for an all flash system. It's basically cheaper than that equiv capacity in 15k rpm drives and should run rings around them.

    TB SSDs for the consumer are already pretty cheap at ~40c per GB. And it's unlikely that we'll really see capacity increases on consumer drives because 1 TB SSDs already sell in fairly low numbers.

    For capacity situations nothing is going to beat spinning disks.
  • npz - Wednesday, March 04, 2015

    Spinning magnetized plates are still required for long term storage. SSD = very limited data retention. Also the smaller the process, the lower the data retention life.

    The performance issues on the Samsung 840 EVO for idle data due to cell charge leakage are an example of data retention problems inherent to all SSDs.

    And the specialized write cache optimized SSDs? 6 months retention in their own specs.

    Even when mechanical HDs die, they are salvageable since it's the motors that die while the data on the platters are perfectly preserved.
  • cb216 - Friday, April 17, 2015

    Also, I'll say that any published pricing from enterprise vendors is far from the actual selling price. The pricing floors for this kind of hardware are ALL over the board. You see discounts that can go as high as 90% in enterprise hardware sales. Yes 90%.
  • serendip - Wednesday, March 04, 2015

    Not for the likes of Joe and Jane Consumer then. The low per-GB cost and decent speed could make this a nightmare for other flash SAN providers, especially if you can do away with complex optimizations and just throw more and more flash at a problem.

    I'm still waiting for slow but cheap flash for mobile devices for use as mainly read-only storage. I wouldn't mind 32 GB of fast eMMC or even SATA on a phone with 128 or 256 GB on microSD to store static content like music and videos.
  • juhatus - Wednesday, March 04, 2015

    AT has usually steered away from enterprise storage, I feel the writer is a bit of a victim of enterprise propaganda. Well spun.
  • Anonymous Blowhard - Wednesday, March 04, 2015

    So that whole subsection under "Cloud/Datacenter and IT" dating back to 2010 is just a figment of my imagination then?

    Go back to Reddit.
