NVIDIA's Tegra 2

The old definition of a computer is a microprocessor, memory, storage, input and output. The modern definition expands it a bit to include coprocessors (e.g. GPUs) as well as multiple types and levels of memory and storage. The types of input and output devices have changed as well. In smartphones, keyboards are virtual and output is integrated into the phone. Although the definition of a computer has evolved, it’s not all that different.

In the old days, nearly every one of these parts of a computer was a discrete component. You had a CPU, memory, a hard drive and a video processor (before they were GPUs), all independent of one another. Go back prior to the 486 and you’ll even find that your CPU had to rely on an external FPU for all floating point math.

Moore’s Law has given us bigger, faster, better in all of these areas. Intel’s 486 was the first million transistor x86 processor, introduced in 1989. Sandy Bridge, introduced in January, is just shy of a billion transistors. Sixteen megabytes of memory was a big deal 20 years ago; today high end PCs ship with several gigabytes of memory. Another side effect of Moore’s Law, however, is integration.

At first, integration brought things like the FPU and a second level cache onto the processor die. Over time more components were brought into the fold. AMD integrated the memory controller in its Athlon 64 processor. Intel brought graphics on-package with Clarkdale/Arrandale, and later on-die with Sandy Bridge. AMD is doing the same with Fusion.

In the smartphone space, the integration is even more pronounced. With physical space and power as major constraints, smartphone chip makers have been forced to further trade performance for integration. The level of integration is so high within a smartphone that you almost never hear about what CPU a phone uses, but rather what Application Processor it uses, otherwise known as an SoC (System on Chip or System on a Chip).


Package on Package (DRAM on top, SoC on bottom)—source: statschippack.com

Integrate a CPU, GPU, memory controller, video decoder, audio decoder, video encoder (sometimes), camera processor, system memory and maybe even a modem onto a single chip and you’ve got something that can only be described as a System on a Chip. It’s a single physical chip that integrates nearly all of the functions of the entire computer. Nearly all of the aforementioned components are on a single piece of silicon, with the exception of any integrated memory. To save board real estate and enable smaller form factors, it’s not uncommon to stack DRAM on top of the SoC package instead of beside it. The SoC in a PoP (Package on Package) stack has contacts on its top surface that line up with the balls on the DRAM for power and signaling. PoP stacks work because the SoC underneath doesn’t dissipate much heat and thus doesn’t mind being insulated by some DRAM up top.

Examples of SoCs are Qualcomm’s Snapdragon, Texas Instruments’ OMAP 4 and of course the subject at hand, NVIDIA’s Tegra 2. Although this is a review of LG’s Optimus 2X, it’s just as much a review of NVIDIA’s Tegra 2.

Tegra 2: The SoC

As a System on a Chip, NVIDIA’s Tegra 2 has a number of processors that make up the whole. Having its roots in the PC industry and being used to briefing inquisitive press, NVIDIA put together this handy die shot that shows the various parts of the Tegra 2:

With the exception of two blocks, the Tegra 2 die is entirely NVIDIA’s own creation. The ARM7 and Cortex A9 blocks are IP licensed from ARM. The entire die is manufactured at TSMC on a 40nm process, as are NVIDIA’s high end GPUs. While its GPUs are built on TSMC’s 40nm “G” process, Tegra 2 is a slightly different beast.

Most foundries offer two variants of the same manufacturing process: general purpose (G) and low power (LP). The feature size is the same; however, the transistors are tuned differently. TSMC’s general purpose process transistors are very fast and low voltage, but unfortunately they have very high leakage current. Transistors, as you may know, are electrical on/off switches. Apply a voltage to them and current flows, remove the voltage and current stops flowing. In reality, current sometimes flows when you don’t want it to, and this is referred to as leakage. High leakage is a side effect of the nice high performance transistors we need to run the fastest processors.

TSMC’s 40nm LP process uses lower voltage, slower switching transistors (they can’t run at as high a clock speed) that, as a result, have very low leakage characteristics. The lower your leakage and the lower your voltage, the lower your overall power is.
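To see why leakage and voltage matter so much, here is a rough sketch using the standard first-order CMOS power model (switching power plus leakage power). The function name and every number below are hypothetical values picked for illustration. They are not TSMC or NVIDIA figures.

```python
def chip_power_watts(c_eff, voltage, freq_hz, leakage_amps, activity=1.0):
    """First-order CMOS power estimate: switching power plus leakage power."""
    dynamic = activity * c_eff * voltage ** 2 * freq_hz  # P_dyn = a * C * V^2 * f
    static = voltage * leakage_amps                      # P_leak = V * I_leak
    return dynamic + static


# Two hypothetical blocks sitting idle (clock stopped), identical except
# for leakage current. With f = 0, only leakage is left.
fast_leaky_idle = chip_power_watts(1e-9, 1.0, 0, leakage_amps=0.2)
slow_tight_idle = chip_power_watts(1e-9, 1.0, 0, leakage_amps=0.01)

# The same leaky block at a 1GHz clock, then with the voltage cut by 10%.
full_voltage = chip_power_watts(1e-9, 1.0, 1e9, leakage_amps=0.2)
reduced_voltage = chip_power_watts(1e-9, 0.9, 1e9, leakage_amps=0.2)

print(f"idle, high leakage: {fast_leaky_idle:.2f} W")
print(f"idle, low leakage:  {slow_tight_idle:.2f} W")
print(f"1GHz at 1.0 V:      {full_voltage:.2f} W")
print(f"1GHz at 0.9 V:      {reduced_voltage:.2f} W")
```

In this toy model the idle block burns power in direct proportion to its leakage current, and because voltage is squared in the switching term, even a small voltage drop pays off disproportionately.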

For an SoC, you’d assume that the whole thing would be built on the 40nm LP process. See those two Cortex A9 cores in the diagram above? Remember how they’re licensed from ARM? Those things are pretty high performance: they run at 1GHz. Everything else in the chip runs at 300MHz or less for the most part. On top of that, the libraries ARM provides are optimized for TSMC’s 40nm G process.

As a result, Tegra 2 uses a mixture of G and LP 40nm transistors on two separate voltage rails. The two Cortex A9 cores and the L2 cache are built using TSMC’s 40nm G process transistors, while the rest of the SoC (including the GPU) is built using 40nm LP transistors. The pair of A9s can be powered down together although not independently. We’ll get to a deeper discussion of the ARM Cortex A9 shortly.

The ISP (Image Signal Processor), located in the upper left of the die shot above, is responsible for taking the output from a camera (still/video) controller and processing it into a usable form. The Tegra 2 ISP is capable of processing images at a rate of 80 megapixels per second. The ISP supports two cameras, a 12 megapixel primary and a 5 megapixel secondary. The math works out to a maximum of 6 frames per second captured from the primary sensor at 12MP. LG uses the Tegra 2’s ISP to enable a 6 fps burst mode, as you’ll see later on in the review; unfortunately, it only works at a 2MP resolution. NVIDIA’s strong ISP looks better on paper than in practice, it seems.
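The frame rate math is simple enough to sketch. The snippet below assumes the ISP’s 80 megapixels per second is the only limit, which ignores sensor readout, memory bandwidth and storage speed; the function and variable names are ours, for illustration only.

```python
import math

ISP_THROUGHPUT_MP_PER_S = 80  # from the article: 80 megapixels per second


def max_whole_fps(sensor_megapixels, throughput=ISP_THROUGHPUT_MP_PER_S):
    """Largest whole number of full-resolution frames the ISP can handle per second."""
    return math.floor(throughput / sensor_megapixels)


primary_fps = max_whole_fps(12)    # 80 / 12 is about 6.67, so 6 whole frames
secondary_fps = max_whole_fps(5)   # 80 / 5 = 16

# LG's burst mode: 6 fps at 2MP, as a fraction of what the ISP could process.
burst_load = (6 * 2) / ISP_THROUGHPUT_MP_PER_S

print(primary_fps, secondary_fps, f"{burst_load:.0%}")
```

By this estimate, a 6 fps burst at 2MP asks for only 12 megapixels per second, a small fraction of the ISP’s rated throughput, which suggests the 2MP limit comes from somewhere other than the ISP itself.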

The video encode processor does real time H.264 video encoding, which is used when capturing video from the camera sensor. NVIDIA doesn’t provide any specs on what the encoder is capable of, but we’re not too impressed with the quality of its output (again, you’ll see more later).

The audio processor is dedicated hardware for audio encoding and decoding. This is used for audio capture as well as audio playback. Even MP3 playback is done on the dedicated audio processor so the Cortex A9s remain powered down to maximize battery life.

The Tegra 2 video decoder can fully accelerate the decode of 1080p H.264 Baseline profile videos at up to 20Mbps. The specs mostly look good on paper as you won’t be playing anything near that bitrate on your smartphone. NVIDIA includes dual-display capabilities with Tegra 2. The SoC can output the frame buffer to the smartphone’s display as well as an external display via HDMI out.
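For a sense of scale on that 20Mbps figure, here is a quick back-of-the-envelope calculation. It assumes a constant bitrate, decimal megabytes (1MB = 8 megabits), and no audio or container overhead; the helper name is hypothetical.

```python
def stream_size_mb(bitrate_mbps, seconds):
    """Size in decimal megabytes of a constant-bitrate stream."""
    return bitrate_mbps * seconds / 8  # 8 bits per byte


one_minute_mb = stream_size_mb(20, 60)          # one minute at 20Mbps
two_hours_gb = stream_size_mb(20, 2 * 60 * 60) / 1000  # a two hour movie, in GB

print(f"{one_minute_mb:.0f} MB per minute, {two_hours_gb:.0f} GB for two hours")
```

At 150MB per minute, a two hour video at the decoder’s maximum bitrate would need roughly 18GB of storage, which is why content anywhere near that bitrate is unlikely to end up on a phone.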

The ARM7 nestled in between the video decoder and the L2 cache is used as a system management core. It handles communication between blocks, power management and general SoC management. The remaining blocks (outside of I/O) are the two CPU cores and the GPU. Both of those require a lot more detail.


75 Comments


  • matt b - Tuesday, February 8, 2011

    Just curious because I've heard rumors that HP will use the Qualcomm chipset and I've also heard rumors that they will stick with Ti for their new tablets/phones. I just wondered if you know for sure since I know that you met with folks at CES. I hope that we all find out tomorrow at the HP event.
    Great review.
  • TareX - Wednesday, February 9, 2011

    I'd like to see Tegra 2 on the Xoom compared to Tegra 2 on the Optimus 2X.

    Why? Well, simply put, the only Android version that seems to be optimized for dual-core is Honeycomb.
  • Dark Legion - Wednesday, February 9, 2011

    Why is there no Incredible on 2.2? I could understand if you had both 2.1 and 2.2, like the Evo, but as it is now does not show the full/current performance.
  • Morke - Thursday, February 10, 2011

    "It’s a strange dichotomy that LG sets up with this launcher scheme that divides “downloaded” apps from “system applications,” one that’s made on no other Android device I’ve ever seen but the Optimus One. The end result is that most of the stuff I want (you know, since I just installed it) is at the very last page or very bottom of the list, resulting in guaranteed scrolling every single time. If you’re a power user, just replace the launcher with something else entirely."

    You are not right there.
    First you can create additonal categories (aside from system applications and downloads) and move applications between them.
    Secondly you can rearrange the ordering of the applications inside a category (allowing you to have those on top which you access most frequently). You can also delete applications right away in this edit mode.

    There is a youtube video demonstrating this:
    http://www.youtube.com/watch?v=Dvvtl6pSNp8
    See time index starting with 4:21.

    Maybe you should correct your review on this?
  • Morke - Thursday, February 10, 2011

    The correct youtube URL demonstrating application launcher management is actually
    http://www.youtube.com/watch?v=lDo-1-jwLko&fea...
  • brj_texas - Thursday, February 10, 2011

    Anand,
    A question on the statement in the benchmarking section, "the SunSpider benchmark isn't explicitly multithreaded, although some of the tests within the benchmark will take advantage of more than one core. "

    My understanding was that all of the tests within sunspider are single-threaded, but a dual-core processor can run the javascript engine (and the sunspider tests) in a separate thread from the main browser thread when you call sunspider from a browser window.

    Can you clarify which tests support multi-threading in sunspider if that is in fact what you meant?

    On the topic of multi-threading, we've used moonbat, a multi-core variant of sunspider, to explicitly test multi-core performance with javascript code. I wonder if you have any other benchmarks under investigation that measure multi-core performance?
    Thanks

    -Brian
  • worldbfree4me - Saturday, February 12, 2011

    Thanks for another thorough and in-depth analysis. But I have a question to ask,

    Should we upgrade (break our 2 year contract agreement for this phone) or ride out our contract?

    We trust and value your our opinion. Tom’s hardware does a GPU hierarchy chart every few months, can you do a phone hierarchy in the future?
  • lashton - Sunday, February 13, 2011

    They have a really good idea and lead the market but it falls short because its not quite right
  • tnepres - Tuesday, April 5, 2011

    I now own a optimus 2x. The first was dead on arrival, but this one is perfect. The LG software is innovative and pleasing to the eye. In various places they made real improvements to the UI that are just brilliant,ie. the ability to sort and categorize apps. At times the UI is not as fast as you would expect, especially when adding apps/widgets to one of the 7 pages. It seems LG generates a list of widgets for you, so you can see what apps support this mode, and that takes about a second. As I recall, on HTC devices you are just presented with a list of apps and u have to try and see if you can widget it.

    The LG keybord has a brilliant feature, you tab the side of the phone to move the cursor. Sadly in other respects the keyboard is lacking, ie. when you long-pres you do not get the alternates you might wish, such as numbers.

    The batterytime is superb, using the UI consumes much less power than on my desire.

    Copy/paste in the browser does not activate via long-pres, you have to hit menu button, but on the plus side its easier to use than what HTC made.

    During 2 days of very intensive use i have had 1 app (partially) crash and that was the marketplace. No other issues so far, its my verdict that the unstability issues are overrated.

    No problems with wifi using stock ISP (TDC) supplied router. (sagemcom)

    To engadget: How on earth (!!?!!?) can you state there is no use for dualcore. When browsing one loads flash the other the rest. Its so fast you cant believe it. Try loading www.ufc.com on a non dualcore phone and you get my drift.

    I do not hesitate to give the optimus 2x my warm recommendations.

    VERDICT: 9/10 (missing 4g)
  • Sannat - Thursday, May 12, 2011

    gsmarena sound benchmark for optimus 2x isnt great...could it be a s/w issue...??
