Why Combine Quadro & Tesla?

So far we’ve covered how NVIDIA will be combining Quadro, but the more interesting question is “why?” In the NVIDIA hierarchy, Quadro is NVIDIA’s leading product. On the graphics side it’s fully unlocked, supporting quad-buffers, uncapped geometry performance, and uncapped viewport performance, while on the compute side it offers full speed FP64 support. Furthermore it’s available in the same configurations as a Tesla card, featuring the same number of CUDA cores and memory. Short of TCC, if you can compute it on a Tesla, you can compute it on a Quadro.

This actually creates some complexities for both NVIDIA and users. On a technical level, Fermi’s context switching is relatively fast for a GPU, but on an absolute level it’s still slow. CPUs can context switch in a fraction of the time, giving the impression of a concurrent thread execution even when we know that’s not the case. Furthermore for some reason context switching between rendering and compute on Fermi is particularly expensive, which means the number of context switches needs to be minimized in order to keep from wasting too much time just on context switching.

As a result of the time needed to context switch, Quadro products are not well suited to doing rendering and compute at the same time. They certainly can, but depending on what applications are being used and what they’re trying to do the result can be that compute eats up a great deal of GPU time, leaving the GUI to only update at a few frames per second with significant lag. On the consumer side NVIDIA’s ray-tracing Design Garage tech demo is a great example of this problem, and we took a quick video on a GTX 580 showcasing how running the CUDA based ray-tracer severely impacts GUI performance.

Alternatively, a highly responsive GUI means that the compute tasks aren’t getting a lot of time, and are only executing at a fraction of the performance that the hardware is capable of. As part of their product literature NVIDIA put together a few performance charts, and while they should be taken with a grain of salt, they do quantify the performance advantage of moving compute over to a dedicated GPU.

For these reasons if an application needs to do both compute and rendering at the same time then it’s best served by sending the compute task to a dedicated GPU. This is the allocation work developers previously had to take into account and that NVIDIA wants to eliminate. At the end of the day the purpose of Maximus is to efficiently allow applications to do both rendering and compute by throwing their compute workload on another GPU, because no one wants to spend $3500 on a Quadro 6000 only for it to get bogged down.

It’s worth noting that this situation closely mirrors the situation for software developers. For debug purposes NVIDIA recommends programmers have two GPUs, so that one GPU can be locked down debugging a compute or rendering task as necessary, while the other GPU is available to display the results. So NVIDIA encouraging users to have two GPUs for technical reasons is not new, but it is expanded. It also means there’s an obvious avenue for further development as NVIDIA wants to move GPU multitasking closer and closer to where CPU multitasking is today.

Moving on, the second reason NVIDIA is pursing Maximus is a result of their own actions. Because Quadro is NVIDIA’s leading product, it commands a leading price: a Quadro 6000 card is $3500 or more. This is a product of NVIDIA’s well engineered market segmentation – a GF110 GPU can be in a $500 GTX 580, a $2500 Tesla C2075, or a $3500 Quadro 6000. By disabling a few critical features on other products (e.g. geometry performance or FP64 performance) NVIDIA can push customers into buying a product at a price NVIDIA believes is best for the target market.

So what’s the problem? The Quadro 6000 is both a highly capable rendering product at a highly capable compute product, but not every professional user needs that much rendering power even if they need the compute power. Those users still need a Quadro card for its uncapped rendering performance, but they don’t necessarily need features such as Quadro 6000’s massive geometry throughput. The result is that NVIDIA was pricing themselves right out of their own market.

The solution to that is combining Quadro and Tesla. Maximus allows a Tesla C2075 to be used with any Fermi based Quadro (600/2K/4K/5K/6K), which allows NVIDIA to more appropriately tap the overlapping market of Quadro users that need top-tier compute performance. The end result for those users is that they not only pay less – a Quadro 2000 and a Tesla C2075 is $3000 versus over $3500 for a single Quadro 6000 – but they gain the aforementioned advantages of not having conflicting tasks slowing down the performance of a single Quadro card. Admittedly this is a lot of effort on NVIDIA’s part to tap a very specific market, but at the end of the day the professional market is a highly profitable market, making it worth NVIDIA’s time.

Final Words

Wrapping things up, NVIDIA has made it clear that they’re going to be pushing Maximus hard right out of the gate. Today of course was also the launch of Intel’s new Sandy Bridge E platform for high-end desktops and workstations, and in this industry there’s very little coincidence. It’s in NVIDIA’s interest to latch into workstation upgrade sales, and this is how you do it. They’ve already lined up HP, Lenovo, Fujitsu, and Dell to offer workstations pre-configured for Maximus, and we’re told those workstations will be made available for purchase today.

As to whether Maximus will be successful or not, this is going to depend both on software and marketing. On the software side NVIDIA needs to deliver on the transparency Maximus promises to developers and users – the concept is simple, but for the professional market the execution must be precise. Optimus graphics switching misbehaves now and then, but professional users will not be as willing to put up with any undesired behavior out of Maximus.

Marketing on the other hand is equally about promoting Maximus and promoting CUDA. A lot of NVIDIA’s promotional material for Maximus could easily be confused for CUDA promotional material, and this is because the videos and case studies are largely about how CUDA improved a process or a product while Maximus was the icing on the cake. Though we consider CUDA old, the fact of the matter is that much of the professional market NVIDIA is targeting has still not heard of CUDA, or has a limited understanding at best. As such NVIDIA will be using the launch of Maximus to promote the benefits of CUDA to certain targeted markets such as manufacturing, design, and broadcasting, just as much as they will be promoting the benefits of having multiple GPUs.

NVIDIA’s Maximus Technology: Quadro + Tesla, Launching Today Press Release
Comments Locked

29 Comments

View All Comments

  • MrSpadge - Monday, November 14, 2011 - link

    Short answer: no.

    The chips are the same, but it's hard-wired which functionality they are allowed to expose. The professional cards also have ECC memory.. but anyone asking for an unlock probably wouldn't be terribly interested in this anyway ;)

    MrS
  • hpvd - Monday, November 14, 2011 - link

    will the quadro be automatically used for cuda calculation if their ist no strong graphic load on it?
    Or do I loose its compute power completely if I put in an additional Tesla??

    if it could be used for computing together with the new tesla: What happens if graphic load increase?
    will there be a smooth transition? eg
    Tesla: compute load 100% , graphic load 0%
    Quadro: compute load 30%, graphic load 70%
    ?
  • Ryan Smith - Monday, November 14, 2011 - link

    It's going to depend on the software you're using. If the compute load can easily be split among multiple CUDA devices, then you can still use both the Quadro and the Tesla for compute as long as the software has a way to select this.

    NVIDIA has a case study video of 3ds Max where they show off compute device selection: http://www.youtube.com/watch?v=xCIAsvT5mYo

    However using both cards for compute automatically doesn't seem like it's possible right now. Keep in mind that NVIDIA's favorite pairing is the Quadro 2000 - a GF106 part - with the C2075, so the Q2000 isn't even in the same league as the C2075. In any case if you did assign compute workloads to both GPUs, then things would be graphically sluggish until the application in question terminated the load on the Quadro.
  • Havor - Tuesday, November 15, 2011 - link

    Dose Open CL not cover the same goals as this Maximus, so why use Maximus?

    Specially if you can use a open standard, used by all big players.

    How good your product works with OpenCL depends on how good your drivers are, so why not focus on that and have the best product with the best OpenCL drivers?

    It looks to me this is one of these product ware some will fall for the marketing crap, but as a product it will fail in the long run.
  • Dribble - Tuesday, November 15, 2011 - link

    No, Maximus is a way of using compute hardware. Open CL is compute software that competes with CUDA. I would have thought the nvidia hardware does support open cl, but everyone uses CUDA because it's much more advanced.
  • Nenad - Tuesday, November 15, 2011 - link

    Why not allowing Tesla+GeForce?

    So far Nvidia was marketing cards more or less like:
    - GeForce : primarily graphics/video
    - Tesla: primarily/only compute
    - Quadro: equaly capable of compute and graphics

    Now with Maximus they say "if you have Quadro and Tesla, use Quadro for graphics and Tesla for compute", but that leaves one question:

    If so far it was GeForce that was best suited for graphics (and its cheaper and much better performance/$ than Quadro for graphics), then why would someone want to buy Quadro just to limit it to graphics?

    Why instead not allowing to have cheaper but faster GeForce (like GTX580), and pair it with Tesla in Maximus?
  • jecs - Wednesday, November 16, 2011 - link

    It all depends on the Nvidia license and not on the hardware used.

    Look at it this way: If Nvidia allowed you to use GeForces on professional applications you either would have to pay the price difference to buy the additional "Quadro Drivers".

    The important part here are the very specilized drivers Nvidia only allows to run with qualified Quadros or Tesla cards.

    Quadro and Tesla class Drivers are very expensive in R&D, Nvidia puts to many engineering resources in the professional drivers, but those licenses are only distributed (sold) on a smaller professional user base and the why these drivers cost more even if used on a very similar hardware.

    Also you can't use a Quadro driver on any similar GeForce because is illegal, sometimes there is not and equivalent hardware on the GeForce side and also because Nvidia physically fixes the installation on the card with transistors.
  • Freakie - Sunday, November 20, 2011 - link

    PhysX, now for workstations! Unless I'm understanding it a bit wrong... but PhysX is quite similar in theory, is it not? Compute the physics of specific things (explosions/smoke/plasma cannon) with a separate card so that you have more realistic effects in your game, and then use a more powerful card to actually display everything else in the game. This is just reversing the power roles.
  • Hobstob - Saturday, May 5, 2012 - link

    It is extremely outdated! Why have they not released a new Line of quadro cards? Are they even planning on releasing a new line of card for workstations?

Log in

Don't have an account? Sign up now