Why We Need 3D NAND

For years, it has been known that traditional NAND (i.e. 2D NAND) is on its last legs. Many analysts predicted that we would not see NAND scale below 20nm because reliability would simply be too low to make such a small lithography feasible. However, thanks to some clever engineering on both the hardware and firmware sides, NAND has scaled to 15nm without any significant issues, but now the limit has been reached for real. To understand the limits of 2D NAND scaling, let's say hello to our good old friend Mr. N-channel MOSFET.

Unfortunately the diagram above is a bit too simplified to truly show what we need, so let's look at a real cross-section photo instead:

Let me walk you through the structure first. At the top is the control gate, which is a part of a structure known as a wordline. In a standard NAND design the control gate wraps around the floating gate and the two gates are separated by an insulating oxide-nitride-oxide layer (i.e. ONO), which is sometimes called Inter Poly Dielectric (IPD). Under the floating gate is the tunnel oxide, which is also an insulator, followed by the silicon substrate that acts as the bitline. 

The reason why the control gate is wrapped around the floating gate is to maximize capacitance between the two. As you will soon learn, the capacitance between the gates is the key factor in NAND as it means that the control gate can control the floating gate.

The purpose of bitlines and wordlines can be rather difficult to understand when looking at a cross-section, so here is what it all looks like from the top. Basically, bitlines and wordlines are just lines going in perpendicular directions and the floating gate and other materials reside between them.

When programming a cell, a high voltage of around 20V is applied to the wordline of that cell. Of course, the problem is that you cannot apply voltage to just one cell because the whole wordline will be activated, so in order to select a specific cell, the bitline of that cell is held at 0V. At the same time, the neighbouring bitlines are charged to about 6V because this increases the capacitance between the bitline and floating gate, which in turn negates enough of the capacitance between the control and floating gate that the electrons cannot tunnel through the tunnel oxide. This is crucial because if all the bitlines were held at 0V, then all the cells along that wordline would be programmed with the same value.

To erase a cell, a reverse operation is performed by keeping the wordline at 0V while issuing a ~20V voltage on the bitline, which makes the electrons flow in the opposite direction (i.e. from the floating gate back to the bitline/silicon).
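The bitline selection during programming can be sketched in a few lines of Python. This is only a toy model under stated assumptions: the voltages are the approximate figures quoted above, the function names `program_bias` and `programmed_cells` are made up for illustration, and the "programs only if the bitline is at 0V" rule stands in for the real tunneling physics.

```python
PROGRAM_V = 20.0  # approximate wordline voltage during programming (from the text)
INHIBIT_V = 6.0   # approximate voltage on unselected bitlines (from the text)


def program_bias(num_bitlines, selected):
    """Return the bitline voltages used to program only the selected cells."""
    selected = set(selected)
    return [0.0 if i in selected else INHIBIT_V for i in range(num_bitlines)]


def programmed_cells(bitline_voltages, wordline_v=PROGRAM_V):
    """Toy rule: a cell is programmed only when its wordline is at the
    program voltage and its own bitline is held at 0V."""
    if wordline_v < PROGRAM_V:
        return []
    return [i for i, v in enumerate(bitline_voltages) if v == 0.0]


bias = program_bias(4, selected=[2])
print(bias)                          # bitline 2 at 0V, the rest inhibited
print(programmed_cells(bias))        # only cell 2 is programmed
print(programmed_cells([0.0] * 4))   # no inhibit voltage: every cell is programmed
```

The last call shows why the inhibit voltage matters: with every bitline at 0V, the whole wordline ends up programmed.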

The way NAND is programmed and erased is also its Achilles' heel. Because such a high voltage is needed, the insulators around the floating gate (i.e. ONO and tunnel oxide) wear out as the NAND goes through program and erase cycles. The wear causes the insulators to lose their insulating characteristics, meaning that electrons may now be able to escape the floating gate or get trapped in the tunnel oxide during a program or erase. This causes a change in the voltage state of the cell.

Remember, NAND uses the voltage states to define the bit value. If the charge in the floating gate is not what it is supposed to be, the cell may return an invalid value when read. With MLC and TLC this is even worse because the voltage states are much closer to each other, meaning that even a minor change in voltage may shift the cell into a neighbouring state, which means the cell value will also change. Basically, MLC and TLC have less room for voltage state changes, which is why their endurance is lower: a cell that cannot hold its charge reliably is useless.
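To make the "less room" point concrete, here is a minimal sketch that splits a voltage window evenly into states and reads a cell before and after a small voltage shift. The 0V to 4V window, the 0.4V shift and the helper names are assumptions chosen for illustration, not figures from any datasheet; real NAND uses unevenly spaced read reference voltages.

```python
def state_width(bits_per_cell, v_min=0.0, v_max=4.0):
    """Width of one voltage state when the window is split evenly."""
    return (v_max - v_min) / (2 ** bits_per_cell)


def read_cell(vth, bits_per_cell, v_min=0.0, v_max=4.0):
    """Map a cell voltage to a state index (0 .. 2**bits - 1)."""
    states = 2 ** bits_per_cell
    index = int((vth - v_min) // state_width(bits_per_cell, v_min, v_max))
    return max(0, min(states - 1, index))


# Hypothetical cell that drifts from 2.8V to 2.4V after wear.
for name, bits in (("SLC", 1), ("MLC", 2), ("TLC", 3)):
    before = read_cell(2.8, bits)
    after = read_cell(2.4, bits)
    print(name, state_width(bits), before, after, "changed" if before != after else "same")
```

With these assumed numbers the same 0.4V drift leaves the SLC and MLC readings untouched but moves the TLC cell into a different state.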

Now that we have covered the operation of NAND briefly, let's see what this has to do with scaling. Here is the same cross-section as above but with some dimensions attached.

That is what a cross-section of a single cell looks like. When NAND is scaled, all these dimensions get smaller, which means that both the individual cells and the distance between them shrink. The cross-section above is of IMFT's 25nm NAND (hence the bitline length of 25nm), so it is not exactly current generation, but unfortunately I do not have any newer photos. There is no general rule for how much the dimensions shrink because 16nm simply means that one of the lengths is 16nm, while the others may not shrink that much.

The scaling introduces a variety of issues, but I will start with the cell size. As the cell size is shrunk, the size of the floating gate is also shrunk, which means that the floating gate is able to hold fewer and fewer electrons every time the process node gets smaller. To put this into perspective, Toshiba's and SanDisk's 15nm NAND stores less than 20 electrons per NAND cell. With TLC, that is less than three electrons per voltage state, so there is certainly not much headroom for escaped electrons. In other words, the cell becomes more vulnerable to IPD and tunnel oxide wear because even the loss of one electron can be critical to the voltage state.
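The per-state figure follows from dividing the electron count by the number of voltage states, which is 2 raised to the number of bits per cell. A quick sketch, treating the 20 electrons quoted above as an upper bound (the function name is just illustrative):

```python
def electrons_per_state(electrons_per_cell, bits_per_cell):
    """Average electrons available per voltage state."""
    return electrons_per_cell / (2 ** bits_per_cell)


for name, bits in (("SLC", 1), ("MLC", 2), ("TLC", 3)):
    print(name, electrons_per_state(20, bits))
# TLC: 20 / 8 = 2.5, i.e. less than three electrons per state
```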

The second issue is the proximity of the cells. The key factor in NAND is the capacitance between the control and floating gate, but as the cells move closer to each other through scaling, the neighbouring cells will also introduce capacitive coupling. In simpler terms, the neighbouring cells will interfere more as the distance between the cells shrinks. The obstacle is that the interference varies depending on the charge of the neighbouring cell, so there is no easy way to exclude the interference. This in turn makes programming harder and more time consuming because a higher voltage will be needed to achieve sufficient capacitance between the control and floating gate to make the electrons tunnel through the oxide.

The graph above outlines the historic rate at which cell-to-cell interference increases through die shrinks. At 90nm, the interference was only around 8-9% but at 20nm it is a rather significant 40%. The interference means that 40% of the capacitive coupling comes from the other cells, making it very hard to control the gate you are trying to program or read. Fortunately, as a result of some clever engineering (i.e. an airgap between the wordlines), the interference is only about 25% at 25nm, which is much more manageable than the 40% the historic rate would have given us.
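One way to read the percentage is as the share of the floating gate's total coupling capacitance that comes from neighbouring cells. The sketch below models the floating gate as a simple capacitive divider with no stored charge; the capacitance values are arbitrary illustrative units, and the function names are assumptions rather than anything taken from the graph.

```python
def interference_share(c_control, c_neighbours):
    """Fraction of total coupling capacitance contributed by neighbours."""
    total = c_control + sum(c_neighbours)
    return sum(c_neighbours) / total


def floating_gate_voltage(v_control, c_control, neighbours):
    """Capacitive-divider estimate of the floating gate voltage.

    neighbours is a list of (voltage, capacitance) pairs, one per
    neighbouring cell. Stored charge and the tunnel oxide are ignored.
    """
    total = c_control + sum(c for _, c in neighbours)
    weighted = v_control * c_control + sum(v * c for v, c in neighbours)
    return weighted / total


# Illustrative units: control gate 6.0, two neighbours at 2.0 each.
print(interference_share(6.0, [2.0, 2.0]))                        # 0.4, i.e. 40%
print(floating_gate_voltage(20.0, 6.0, [(0.0, 2.0), (0.0, 2.0)]))
print(floating_gate_voltage(20.0, 6.0, [(2.0, 2.0), (2.0, 2.0)]))
```

The last two calls differ only in the neighbours' voltages, which is the point made above: the result depends on what the neighbouring cells hold, so the interference cannot simply be subtracted out.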

The above can be fairly tough to digest, so let's use a simple analogy that everyone should be able to understand. Imagine that you have a bunch of speakers, each playing a different song. When these speakers are relatively large and far away from each other, it is easy to properly hear the song that the speaker closest to you is playing. Now, what happens if you bring the other speakers closer to the speaker you are listening to? The other speakers will interfere and it becomes harder to tell your song apart from the others. If you turn down the volume or switch to smaller speakers with lower output volume, it becomes even harder to distinguish your song from the songs that the other speakers are playing. If you repeat this enough times, there will be a point when you are hearing your song as unclearly as the other songs.

The effect is essentially the same with NAND scaling. When the cells, or speakers in the analogy, move closer to each other, the amount of interference increases, making it harder to sense the cell or listen to the speaker. At the same time the size of the cell (or speakers) is shrunk, which further complicates the focus on one cell (or speaker). 

That is NAND scaling and its issues in a nutshell. We have seen innovations such as airgaps between the wordlines to reduce cell-to-cell interference and a high-k metal gate instead of a traditional ONO IPD to increase control gate to floating gate capacitance, but the limit has now been reached. However, like other semiconductors, NAND must follow Moore's Law in order to become more cost efficient. If you can no longer scale in the X and Y dimensions, what do you do? You hit the reset button and introduce the Z dimension.

160 Comments

  • Krakadoom - Saturday, September 27, 2014 - link

    I bought the 850 Pro 500 GB version. The transfer speeds are around (and just under) 400 MB/s and the IOPS are always around 55-60k. Not impressed - considering returning it just due to the huge gap between the rated specs and actual performance. (Of course Samsung Magician over-reports the transfer speed by quite a large margin).
  • stealth_lee - Wednesday, October 1, 2014 - link

    Someone just tipped me that V-NAND in Samsung 850 Pro is actually TLC not MLC.
    The first reason is the 86Gbit/die number is odd, if 850 Pro uses a TLC 128Gbit/die and emulates it to MLC then it would be 86Gbit/die, the numbers fit well.
    The second reason is Chipworks confirmed it in the die shots:
    http://bit.ly/YTVm9Z
    http://bit.ly/1uByKcm

    I'm just the messenger here, I'm not expert.
    So...I was wondering is it possible to hack Samsung 850 Pro to get extra storage space in TLC?
  • wcatlan - Saturday, October 25, 2014 - link

    Why isn't the lack of power loss protection a showstopper for any of these drives? I love the speed and reliability benefits under normal operation, but how can anyone get excited about a drive that can get corrupted in an instant due to power loss or computer freeze, where a hard shutdown is required? Seems that these drives are more prone to massive data issues much more than HDDs under the same power fault conditions. I keep looking for a good answer, but it seems smart people are willing to look past this seemingly fatal Achilles heel. Not sure what I might be missing. Any thoughts?
  • futurefilm - Monday, December 1, 2014 - link

    Today, Cyber Monday deals on Amazon, the 850 Pro 256 is going for $150. The 128 for $100. Get it now while it's hot.
  • saagar - Thursday, January 22, 2015 - link

    Dear Kristian Vättö,
    Fantastic review of the drives and the technology behind it. This is what readers like me expect to see on Anandtech. Thanks for breaking it down. Keep up the good work!
  • gsuburban - Wednesday, April 8, 2015 - link

    As of April 8, 2015, the 850 Pro 256GB SSD can be had for about $144 if you look hard enough.
  • rockfella79 - Saturday, June 27, 2015 - link

    I love my 850 Pro 128 GB SSD :)
  • KDT - Thursday, March 24, 2016 - link

    Please update the endurance to 300TBW for 1TB model. This was my basis for buying this SSD. This is 2nd to Crucial MX200 (320TBW on 1TB model) in terms of endurance - for client/consumer SSDs.
  • BimmerInd - Sunday, June 26, 2016 - link

    Samsung is using 40nm over Micron's 16nm. Doing the math implies that for every 2.5 16nm Micron nodes in planar section, Samsung only does 1 40nm node. If we scale vertically to 32 layers, then Micron (or others for that matter) still do only 2.5 nodes for every 32 nodes of Samsung. Which means for every 16nm node, Samsung provides 12.8 nodes. Meaning the density scaling factor for every 32 layer increments is a multiple of 12.8. Assuming the current die size for 32 layers to be 128Gbit, then the density advantage for 256Gbit is 12.8x2 times, 512Gbit is 12.8x4 times and for 1Tbit in 2017 should be 12.8x8 times for 256 layers of nodes stacked on top of each other. So the density advantage is approximately 102.4% (theoretically). Samsung can theoretically produce a 1Tbit die at a cost advantage/space advantage of nearly 100 times compared to planar and manufacturers. It is almost like you are able to earn 100 times the profit for the same die provided the cost per bit is scaled along without passing on the price advantage to the end users until other players enter 3D market.

    The same is the case with Intel's 3D Cross Point Technology. They are having a new tech in their hands that is faster than NAND and closer to DRAM. So they are also planning to price it exactly between NAND and DRAM. We are already paying high costs to shift from platters to NAND and are going to pay even more to make a shift from NAND to 3D xPoint. I just wish I can jump a few years to the future, grab a high capacity NAND/xPoint drive for cheap and come back to the present and use it. Sigh !
  • BimmerInd - Sunday, June 26, 2016 - link

    By the way this is just a rough calculation and is not to be taken literally.
