One of the key drivers of capacity growth in next-generation storage has been increasing the number of bits that can be stored per cell. The easy jump from one to two bits-per-cell gives a straight 100% increase, but it requires finer control to read and write the cell and reduces cell endurance. We’ve seen commercialization of storage up to four bits-per-cell, and talk about five. A Japanese company is now ready to start talking about its new 7 bits-per-cell solution.


Image courtesy of Plextor, up to 4 bits-per-cell

Moving from one to two bits-per-cell gives an easy doubling of capacity, but moving to three bits-per-cell is only another 50% increase. As more bits are added, the value of each extra bit diminishes, while the cost of the equipment needed to control reads and writes increases exponentially. There has to be a balance between how many bits-per-cell make economic sense and how much the control electronics cost to implement.

  • 1 bit per cell requires detection of 2 voltage levels, base capacity
  • 2 bits per cell require detection of 4 voltage levels, +100% capacity
  • 3 bits per cell require detection of 8 voltage levels, +50% capacity
  • 4 bits per cell require detection of 16 voltage levels, +33% capacity
  • 5 bits per cell require detection of 32 voltage levels, +25% capacity
  • 6 bits per cell require detection of 64 voltage levels, +20% capacity
  • 7 bits per cell require detection of 128 voltage levels, +16.7% capacity
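
The numbers in this list follow from two simple relations: n bits need 2^n voltage levels, and stepping from n-1 to n bits adds 1/(n-1) to the capacity. A minimal Python sketch that reproduces the list (the function names are illustrative and not taken from any vendor tool):

```python
def voltage_levels(bits_per_cell: int) -> int:
    """Distinct voltage levels needed to encode this many bits in one cell."""
    return 2 ** bits_per_cell


def capacity_gain_percent(bits_per_cell: int) -> float:
    """Capacity gain (in %) relative to a cell storing one bit fewer."""
    if bits_per_cell < 2:
        raise ValueError("gain is only defined relative to a previous step")
    return 100.0 / (bits_per_cell - 1)


# Walk through the same steps as the list, from 2 to 7 bits-per-cell.
for bits in range(2, 8):
    print(f"{bits} bpc: {voltage_levels(bits)} levels, "
          f"+{capacity_gain_percent(bits):.1f}% capacity")
```

The levels double with every added bit while the relative gain keeps shrinking, which is the trade-off described above.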

Also, the more bits-per-cell, the lower the endurance: when many bits are stored, the cell voltage only has to drift slightly to give the wrong result, and so repeated read/writes to a high capacity cell will make that voltage drift until the cell is unusable. Right now the market seems happy with three bits-per-cell (3bpc) for performance and four bits-per-cell (4bpc) for capacity, with a few 2bpc designs for longer-term endurance. Some of the major vendors have been working on 5bpc storage, although the low endurance may make the technology only good for WORM (write once, read many) use, the equivalent of something like an old-school CD or non-rewritable DVD.
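
To get a feel for why drift matters more at higher bit counts, here is a rough Python sketch of how the spacing between adjacent voltage levels shrinks. It assumes evenly spaced levels across a fixed voltage window; the 5.0 V window is an illustrative assumption, not a figure from Floadia or any vendor, and real cells have overlapping level distributions and error correction that this ignores.

```python
def level_spacing_volts(bits_per_cell: int, window_volts: float) -> float:
    """Spacing between adjacent levels, assuming they are evenly spread."""
    levels = 2 ** bits_per_cell
    return window_volts / (levels - 1)


def drift_margin_volts(bits_per_cell: int, window_volts: float) -> float:
    """Drift tolerated before a level sits closer to its neighbour."""
    return level_spacing_volts(bits_per_cell, window_volts) / 2


# ASSUMPTION: a 5.0 V usable window, chosen only for illustration.
WINDOW_VOLTS = 5.0

for bits in range(1, 8):
    margin_mv = drift_margin_volts(bits, WINDOW_VOLTS) * 1000
    print(f"{bits} bpc: about {margin_mv:.1f} mV of drift margin")
```

Under these assumptions each added bit roughly halves the tolerable drift, which is why retention and endurance become the limiting factors.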

Floadia Corp., a Series C startup from Japan, issued a press release this week to state that it has developed storage technology capable of seven bits-per-cell (7bpc). Still in the prototype stage, this 7bpc flash chip, likely in a WORM scenario, has an effective 10-year retention time for the data at 150°C. The company says that a standard modern memory cell with this level of control would only be able to retain the data for around 100 seconds, and so the secret in the design is to do with a new type of flash cell they have developed.

The SONOS cell uses a distributed charge trap design relying on a Silicon-Oxide-Nitride-Oxide-Silicon layout, and the company points to an effective silicon nitride film in the middle where the charges are trapped to allow for high retention. In simple voltage program and erase cycles, the company showcases 100k+ cycles with a very low voltage drift. The oxide-nitride-oxide layers rely on SiO2 and Si3N4, the latter of which is claimed to be easy to manufacture. This allows a non-volatile SONOS cell to be used in NV-SRAM or embedded designs, such as microcontrollers.

It’s actually that last point which means we’re a long time from seeing this in modern NAND flash. Floadia is currently partnering with companies like Toshiba to implement the SONOS cell in a variety of microcontrollers, rather than large NAND flash deployments, at the 40nm process node as embedded flash IP with compute-in-memory properties. Those aren’t at 7 bits-per-cell yet: the company is promoting that two cells can store up to 8 bits of network weights for machine learning inference. When we get to 8 bits-per-cell, it might be more applicable. The 10-year retention of the cell data is where it gets interesting, as embedded platforms will use algorithms with fixed weights over the lifetime of the product, except for the rare update perhaps. Even with increased longevity, Floadia doesn’t go into detail regarding cyclability at 7bpc at this time.
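
As an illustration of the "two cells for 8 bits" idea, the Python sketch below splits an 8-bit weight into two 4-bit values, one per cell, and joins them again. This layout (high nibble in one cell, low nibble in the other) is an assumption made for the example; the source does not describe how Floadia actually encodes weights.

```python
from typing import Tuple


def split_weight(weight: int) -> Tuple[int, int]:
    """Split an unsigned 8-bit weight into (high, low) 4-bit cell values."""
    if not 0 <= weight <= 0xFF:
        raise ValueError("weight must fit in 8 bits")
    return weight >> 4, weight & 0x0F


def join_weight(high: int, low: int) -> int:
    """Rebuild the 8-bit weight from two 4-bit cell values."""
    if not (0 <= high <= 0x0F and 0 <= low <= 0x0F):
        raise ValueError("each cell value must fit in 4 bits")
    return (high << 4) | low


# Hypothetical example: one 8-bit weight stored across two 4-bit cells.
high_cell, low_cell = split_weight(0xAB)
restored = join_weight(high_cell, low_cell)
```

With 8 bits-per-cell, the same weight would fit in a single cell and the split step would disappear.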

An increase from modern 3bpc to 6bpc NAND flash would double the density; however, larger cells would be needed, which would negate the benefits. There’s also the performance aspect, should >4bpc storage ever make it to consumers, which hasn’t been touched upon.

It will be an interesting technology to follow.

Source: Floadia Press Release

48 Comments

  • mode_13h - Monday, December 20, 2021 - link

    I was recently looking askance at Crucial's 4 TB refresh of their MX500. However, I found a review that went into considerable depth about Micron's shift in cell architecture and came away with the impression that endurance of their latest TLC memory could actually be somewhat comparable to their old MLC stuff. Not sure if that's accurate, but they've certainly regained some ground.
  • Kvaern1 - Sunday, January 2, 2022 - link

    It's accurate.
    The TLC Samsung 980 Pro has the same TBW as the MLC 950 Pro and wipes the floor with it on all other parameters, at 25% of the GB price of a 950 Pro in 2016.
    Not to mention the vast majority of use cases will never wear out a TLC or even a QLC SSD, and if your needs are beyond that you're now part of a very small market for which properly spec'ed professional parts exist, and you should be aware of that.
  • linuxgeex - Wednesday, December 15, 2021 - link

    P/E is Program/Erase, lol. The slowdown isn't logarithmic. It won't be floppy speed, or even SMR HDD speed. The main problem will almost certainly be that the energy cost per MB transferred will be higher, which will prevent Mobile adoption, which will seriously hamper scaling because that is where scaling starts. I suspect this is part of why their MCU efforts are at QLC, albeit there's low-power synergy with 2 cells = 8 bits as well. As for P/E, the article claims 100k P/E cycles. That's probably for when it's operating in SLC mode at 40nm. Generally you divide by 3 each time you add a level, so 100k / 3^7 is approx 45 7-layer P/E cycles, which is probably still OK for consumer-level photography memory. However only 10 years of retention when it's damn near an archival format... that is sucky and again will harm adoption and scaling. So for those reasons I suspect this will be dead in the water.
  • vlad42 - Thursday, December 16, 2021 - link

    Yeah, P/E was a derp moment with no edit button. I realized the mistake right after hitting reply.

    Floppy speed was definitely meant as hyperbole. I hope you are right that this remains a niche product.
  • back2future - Friday, December 17, 2021 - link

    "repeated read/writes to a high capacity cell will make that voltage drift until the cell is unusable"
    That's the difference to previous nand cells?
  • back2future - Sunday, December 19, 2021 - link

    Maybe someone can elaborate, at what amount *read* rate on stored data (equal read address for longer period compared to writing cells with addresses being available empty pages) is impacting voltage drift? Thx
  • back2future - Sunday, December 19, 2021 - link

    if this (https://media.springernature.com/lw785/springer-st... would be reference for 5V power supply voltage nand storage system (compared to 1.8V nand power supply, etc.) and told 5-6% deviation for slc comparable 1bit level storage voltage_th (threshold) is limit for reliable avoidance of bit error rate above acceptable rate, then voltage level for acceptable recognition for 0/1 of storage state is limited to about 0.2-0.25V deviation from reference voltage for each state (0V or maybe slightly below GND, 4-4.2V for a logical 1).
    These 5-6% are divided by 4 on MLC, div by 8 on TLC, /16 on QLC and /128 on 7bit level storage cell, what given relatively high 5V device supply might get into 1.9mV for limit of deviation from reference voltage for reliable recognition of cell state and stored bit data? Seen 0.15V drift for a 40 days data retention time at room temperature (P3 testing conditions: https://www.researchgate.net/profile/Onur-Mutlu-4/...  ), this 10yrs retention time is the real progress and news for this? Still interested, what read rate is affecting cells voltage threshold and how does storage management invisibly to users monitoring statistics (and tools) refreshes cells in reaction to amount of read capacity leakages to cell gates?
  • back2future - Sunday, December 19, 2021 - link

    Sorry, linking to graphs was not like intended:
    if this ( https://rb.gy/youkdw ) would be reference for 5V power supply voltage nand storage system (instead of compared to 1.8V nand power supply, etc.) and told 5-6% deviation for slc comparable 1bit level storage voltage_th (threshold) is limit for reliable avoidance of bit error rate above acceptable rate, then voltage level for acceptable recognition for 0/1 of storage state is limited to about 0.2-0.25V deviation from reference voltage for each state (0V or maybe slightly below GND, ~4-4.2V for a logical 1).
    These 5-6% for slc high state, are divided by 4 on MLC, div by 8 on TLC, /16 on QLC and /128 on 7bit level storage cell, what, given relatively high 5V device supply, might get into 1.9mV for limit of deviation from reference voltage for reliable recognition of cell state and stored bit data? Seen 0.15V drift for a 40 days data retention time at room temperature (P3 testing conditions: https://rb.gy/eebjgg, https://rb.gy/sbush0 ), this 10yrs retention time is the real progress and news for this? Still interested, what read rate is affecting cells voltage threshold and how does storage management invisibly to users monitoring statistics (and tools) refreshes cells in reaction to amount of read capacity leakages to cell gates?
  • back2future - Monday, January 3, 2022 - link

    While for MLC cells it was mentioned, there could be a degradation with read accesses, resulting in higher demand for data refresh write for about 1 million reads on SLC compared to 100k-20k reads on MLC cells, data for TLC or QLC are rare. Samsung (for e.g. 845DC EVO 3-bit MLC read intensive, 850DC PRO 2-bit, 24-layer MLC NAND mixed and write intensive) advertises for read intensive use cases
    "Read-intensive use cases are typical of presentation platforms such as web servers, social media hosts, search engines and content-delivery networks. Data is written once, tagged and categorized, updated infrequently if ever, and read on-demand by millions of users. Planar NAND has excellent read performance, and limiting the number of writes extends the longevity of an SSD using it."

    with mixed loading for read/write on 50/50 to 70/30 assumptions.
    https://downloadcenter.samsung.com/content/UM/2017...
    details on endurance (retention?)
    https://downloadcenter.samsung.com/content/UM/2017...

    datacenters failed storage SSDs, parameters for accurate failure identification, table 5 (page 8)
    https://www.microsoft.com/en-us/research/wp-conten...
  • back2future - Monday, January 3, 2022 - link

    retention for MLC (2012, some overview, page 7)
    https://citeseerx.ist.psu.edu/viewdoc/download?doi...
    SLC (P/E cycles) with varying recovery in seconds from programming (~100's) ~63yrs, (100's-1000's) ~20yrs
    MLC (~100's) ~20yrs, (~1000's) ~5yrs (also varying with prog. recovery from none/10s to 10000s/1d, with x10-x100 factor difference between for P/E cycles allowed for accepted retention needs, page 8 diagrams, and about x10 factor between SLC and 2-bit MLC NAND type, but absolute 1/2-1/3 data retention periods (years on 100's P/E) for MLC's on 2012 knowledge and tech, just for overview on retention development and request for TLC/QLC/'7-bit'LC data and studies)
