Architecture

There was a great deal of talk about why architectural decisions were made, but we will concern ourselves more with what exists rather than why this path was chosen. Every architecture will have its advantages and disadvantages, but understanding what lies beneath is a necessary part of the equation for developers to create efficient code for any architecture.

The first thing of note is NVIDIA's confirmation that 3dcenter.de did a very good job of wading through the patents that cover the NV3x architecture. We will be going into the block diagram of the shader/texture core in this description, but we won't be able to take quite as technical a look at the architecture as 3dcenter. Right now, we are more interested in bringing you the scoop on how the NV36 gets its speed.

For our architecture coverage, we will jump right into the block diagram of the Shader/Texture core on NV35:

As we can see from this diagram, the architecture is very complex. The shader/texture core works by operating on "quads" at a time (in a SIMD manner). These quads enter the pipeline via the gatekeeper which handles managing which ones need to go through the pipe next. This includes quads that have come back for a second pass through the shader.

What happens in the center of this pipeline is dependent upon the shader code running or the texturing operations being done on the current set of quads. There are a certain few restrictions on what can be going on in here that go beyond simply the precision of the data. For instance, NV35 has a max of 32 registers (less if higher precision is used), the core texture unit is able to put (at most) two textures on a quad every clock cycle, the shader and combiners cannot all read the same register at the same time, along with limits on the number of triangles and quads that can be in flight at a time. These things have made it necessary for developers to pay more attention to what they are doing with their code than just writing code that produces the desired mathematic result. Of course, NVIDIA is going to try to make this less of a task through their compiler technology (which we will get to in a second).

Let us examine why the 5700 Ultra is able to pull out the performance increases we will be exploring shortly. Looking in the combiner stage of the block diagram, we can see that we are able to either have two combiners per clock or complete two math operations per clock. This was the same as NV31, with a very important exception: pre-NV35 architectures implement the combiner in fx12 (12 bit integer), NV35, NV36, and NV38 all have combiners that operate in full fp32 precision mode. This allows two more floating point operations to be done per clock cycle and is a very large factor in the increase in performance we have seen when we step up from NV30 to NV35 and from NV31 to NV36. In the end, the 5700 Ultra is a reflection of the performance delta between NV30 and NV38 for the midrange cards.

If you want to take a deeper look at this technology, the previously mentioned 3dcenter article is a good place to start. From here, we will touch on NVIDIA's Unified Compiler technology and explain how NVIDIA plans on making code run as efficiently as possible on their hardware with less hand optimization.

The GeForce FX 5700 Ultra Compilation Integration
Comments Locked

114 Comments

View All Comments

  • Anonymous User - Thursday, October 23, 2003 - link

    it seems toms review puts into question ati's optimizations moreso than nvidia's image quality
  • Anonymous User - Thursday, October 23, 2003 - link

    In any case,.....it's another round of new card releases and hopefully cheaper prices around for the
    "older" models.
  • Anonymous User - Thursday, October 23, 2003 - link

    #19, I don't think it is a fanboy thing. It's an AT thing that's costing them their respect from other hardware sites and readers.
  • Anonymous User - Thursday, October 23, 2003 - link

    ati fanboys above dont look to happy :)
  • Anonymous User - Thursday, October 23, 2003 - link

    # 15, If someone writes a crappy review then he deserves all the problems and flak the come with it.
  • gordon151 - Thursday, October 23, 2003 - link

    #14, "The GeForce FX 5700 Ultra will be debuting at $199 after a mail in rebate. If $200 is your hard limit, and you need a midrange card right now, the 5700 Ultra is the way to go if you want ****solid frame rates****." Now you could say they dodged the image quality bullet on that comment, but that's really the only recommendation they made on the 5700 Ultra.

    When the new article comes out and they do an image quality analysis, if their findings are similar to that of HardOCP and TomsHardware the conclusion will be something similar to "5700 Ultra still for solid frame rates and 9600 XT for solid frame rates *AND* image quality".

    BTW Derek I don't believe was even at the press event, that was Anand. Derek is the sole author of this article it seems and unlike Toms and HardOCP he didn't have any direct aide from other staff.
  • Anonymous User - Thursday, October 23, 2003 - link

    #13,

    No, we don't need to bitch at every AT review. But when the conclusion CONTRADICTS the very data he supplies us, then something is seriously wrong. Wouldn't you say?
  • Anonymous User - Thursday, October 23, 2003 - link

    #14, if anyone buys an expensive video card based on 1 review from 1 tech site, they deserve the problems that could come with it.
  • Anonymous User - Thursday, October 23, 2003 - link

    The review crowned a new midrange segment winner without dealing with image quality. What are they going to do, retract that later after their image tests? What about the people that bought the cards based on their review - and then they find out the cards have image quality problems?

    Other sites in the past when they discovered issues waited until they had done further testing before coming out with any review. Perhaps anandtech should have followed hardocp's lead, and instead of partying it up and brown-nosing at nvidia press events they should have been doing their image tests so they could put out a full review.
  • gordon151 - Thursday, October 23, 2003 - link

    Do we seriously need the comments crying for the authors head with *EVERY* review? They already said they were working on an article which will do a study on the image quality tests and will be posted laters. This review will clearly stress the numbers and that's where they draw conclusions. Damn, give them a frigging break.

Log in

Don't have an account? Sign up now