Motherboards Memory Storage Cases/Cooling/PSUs IT Computing Displays Mobile Mac CPUs & Chipsets Video Digital Cameras Linux Gadgets Systems Trade Shows Guides Home Increase Font Size Decrease Font Size Change Page Size
NVIDIA nForce Professional Brings Huge I/O to Opteron
NVIDIA nForce Professional Brings Huge I/O to Opteron
Date: January 24th, 2005
Topic: CPU & Chipset
Manufacturer: NVIDIA
Author: Derek Wilson
Buy the Tyan S2932WG2NR Thunder Dual n3600M
Blank
 Newegg $419.99
 
 


The Kicker(s)

On server setups with available HyperTransport lanes to processors, up to 3 nForce Pro 2050 MCPs can be added alongside a 2200 MCP.

As each MCP makes a HT connection to a processor, a dual or quad processor setup is required for a server to take advantage of 3 2050 MCPs (in addition to the 2200), but in the case where such I/O power is needed, the processing power would be an obvious match. It is possible for a single Opteron to connect to one 2200 and 2 2050 MCPs for slightly less than the maximum I/O, as any available HyperTransport link can be used to connect to one of NVIDIA's MCPs. Certainly, the flexibility now offered by AMD and NVIDIA to third party vendors has been increased multiple orders of magnitude.

So, what does a maximum of 4 MCPs in one system get us? Here's the list:

  • 80 PCI Express lanes across 16 physical connections
  • 16 SATA 3Gb/s channels
  • 4 GbE interfaces

Keep in mind that these features are all part of MCPs connected directly to physical processors with 1GHz HyperTransport connections. Other onboard solutions with massive I/O have provided multiple GbE, SATA, and PCIe lanes through a south bridge, or even the PCI bus. The potential for onboard scalability with zero performance hit is staggering. We wonder if spreading out all of this I/O across multiple HyperTransport connections and processors may even help to increase performance over the traditional AMD I/O topography.


Server configurations can drop an MCP off of each processor in an SMP system.

The fact that NVIDIA's RAID can span across all of the devices in the system (all 16 SATA 3Gb/s and the 4 PATA devices from the 2200) is quite interesting as well. Performance will be degraded by mixing PATA and SATA devices (as the system will have to wait longer on PATA drives), and likely, most will want to keep RAID limited to SATA. We are excited to see what kind of performance that we can get from this kind of system.

So, after everyone catches their breath from the initial "wow" factor of the maximum nForce Pro configuration, keep in mind that this will only be used in very high end servers using Infiniband connections, 10 GbE, and other huge I/O interfaces. Boards will be configured with multiple x8 and x4 PCIe slots. This will be a huge boon to the high end server market. We've already shown the scalability of the Opteron at the quad level to be significantly higher than Intel's Xeon (single front side bus limitations hurt large I/O), but unless Intel has some hidden card up their sleeve, this new boost will put AMD based high end servers well beyond the reach of anything that Intel could put out in the near term.

On the workstation side, NVIDIA will be pushing dual processor platforms with one 2200 and one 2050 MCP. This will allow the following:

  • 40 PCI Express lanes across 8 physical connections
  • 8 SATA 3Gb/s channels
  • 2 GbE interfaces


This is the workstation layout for a DP NFPro system.

Again, it may be interesting to see if a vendor will come out with a single processor workstation board requiring an Opteron 2xx to enable a 2200 and 2050. This way, a single processor workstation could be had with all the advantages of the system.


IWill is shipping their DK8ES as we speak and should have availability soon.

While NVIDIA hasn't announced SLI versions of Quadro or support for them, boards based on the nForce Pro will support SLI version of Quadro when available. And the most attractive feature of the nFroce Pro when talking about SLI is this: 2 full x16 PCI Express slots. Not only do both physical x16 slots have all x16 electrical connections, but NVIDIA has enough headroom left for an x4 slot and some x1 slots as well. In fact, this feature is big enough to be a boon without the promise of SLI in the future. The nForce Pro offers full bandwidth and support for multiple PCI Express cards, meaning that multimonitor support on PCI Express just got a shot in the arm.

One thing that's been missing since the move away from PCI graphics cards has been multiple card support in systems. Due to the nature of AGP, running a PCI card alongside an AGP card is nice, but PCI cards are much lower performance. Finally, those who need multiple graphics cards will have a solution with support for more than one of the latest and greatest graphics cards. This means two 9MP monitors for some, and more than two monitors for others. And each graphics card has a full x16 connection back to the rest of the system.


Tyan motherboards based on nForce Pro are also shipping soon according to NVIDIA.

The two x16 connections will also become important for workstation and server applications that wish to take advantage of GPUs for scientific, engineering, or other vector-based applications that lend themselves to huge arrays of parallel floating point hardware. We've seen plenty of interest in doing everything from fluid flow analysis to audio editing on graphics cards. It's the large bi-directional bandwidth of PCI Express that enables this, and now that we'll have two PCI Express graphics cards with full bandwidth, workstations have even more potential.

Again, workstations will be able to have RAID across all 8 SATA devices and 4 PATA devices if desired. We can expect mid-range and lower end workstation solutions only to have a single nForce 2200 MCP on them. NVIDIA sees only the very high end of the market looking at pairing the 2200 with 2050 in the workstation space.

Perhaps the only down side to the huge number of potential PCI Express lanes available is the inability of NVIDIA MCPs to share lanes between MCPs and slight lack of physical PCI Express connections. In otherwords, it will not be possible to have a motherboard with 5 x16 PCI Express slots or 10 x8 PCI Express slots, even though there are 80 lanes available. Of course, having 4 x16 slots is nothing to sneeze at. On the flip side, it's not possible to put 1 x16, 2 x4, and 6 x1 PCIe slots on a dual processor workstation. Even though there are a good 16 PCI Express lanes available, there are no more physical connections to be had. Again, this is an extreme example, and not likely to be a real world problem. But it's worth noting nonetheless.


Final Words   Next Page

 
  Index

Tools Share
Find lowest prices Find the lowest prices
Digg   del.icio.us   E-mail  
Print This Article Print this article  

55 Comments - Last by smn198, 1838 days ago
Username:
Password:
No Subject by CBone, 1842 days ago
Sweet. Can't wait.

Reply
No Subject by SDA, 1842 days ago
PCI-X is supposed to mean PCI-E, right? PCI-X != PCI-Express, now I'm confused...

Anyway, looking good. Wonder what performance will be like.

Reply
No Subject by KristopherKubicki, 1842 days ago
SDA: PCI-X is 64-bit PCI. PCI-Express also known as PCIe is a totally different animal. PCI-X is old, PCIe is new.

Kristopher

Reply
No Subject by ksherman, 1842 days ago
Ill i can say is... DAMN! I wish i had the money do get one of these my self :(

Reply
No Subject by miketheidiot, 1842 days ago
I'm impressed.

Reply
No Subject by mickyb, 1842 days ago
One has to wonder what the 2 million extra transisters are for. I would be surprised if it was "just" to allow multiple MCPs. Sounds like a lot of logic. I am also surprised about the 4 physical connector limit. I didn't realize the PCI-e lanes had to partitioned off like so. I assumed that if there were 20 lanes, they could create up to 20 connectors.

Reply
No Subject by ksherman, 1842 days ago
I agree with #6... I think the extra transistors would be used to allow all the chips to communicate.

Reply
No Subject by DerekWilson, 1842 days ago
The Intel way of doing PCI Express devices is MUCH simpler. 3 dedicated x8 PCI Express ports on their MCH. These can be split in half and treated as two logical x4 connections or combined into a x16 PEG port. This is easy to approach from a design and implimentation perspective.

Aside from NVIDIA having a more complex crossbar that needs to be setup at boot by the bios, allowing 20 devices would meant NVIDIA would potentially have to setup and stream data to 20 different PCI Express devices rather than 4 ... I'm not well versed in the issues of PCI Express, or NVIDIA's internal architecture, but this could pose a problem.

There is also absolutely no reason (or physical spec) to have 20 x1 PCI Express lanes on a motherboard ;-)

I could see an argument to have 5 physcial connections in case someone there was a client that simply needed 5 x4 connections. But that's as far as I would go with it.

The only big limitation is that the controllers can't span MCPs :-)

meaning it is NOT possible to have 5 x16 PCI Express connectors a quad Opteron mobo with the 2200 and 3 2050s. Nor is it possible to have 10 x8 slots. Max bandwidth config would be 8 x8 slots and 4 x4 slots ... or maybe throw in 2 x16, 4 x8, 4 x4 ... Thats still too many connectors for conventional boards ... I think I'll add this note to the article.

#7 ... There shouldn't be much reason for MCPs to communicate explicityly with eachother. It was probably necessary for NVIDIA to extend the RAID logic to allow it to span across multiple MCPs, and it is possible that some glue logic may have been neccessary to allow a boot device to be locate on a 2050 for instance. I can't see it being 2M transistors in MCP-to-MCP kind of stuff though.

Reply
No Subject by R3MF, 1842 days ago
two comments:

the age of cheap dual opteron speed demons is not yet up on us, because although you only need one 2200 chip to have a dual CPU rig the second CPU connects via the first, thus you only get 6.4GB bandwidth as opposed to 12.8GB. yes you can pair a 2200 and a 2050 together but i bet they will be very pricey!

the article makes mention of SLI for quaddro cards, presumably driver specific to accomodate data sharing over two different PCI-E bridges as opposed to one PCI-E bridge as is the case with nForce4 SLI. this would seem to indicate that regular PCI-E 6xxx series cards will not be able to be used in an SLI confiuration on nForce Pro boards as the ability will not be enabled in the driver. am i right?

Reply
No Subject by Illissius, 1842 days ago
Isn't the TCP offload engine thingy just ActiveArmor with a different name?

Reply
Comments Page 1 of 6

Unlicensed Software at Your Last Company
Anonymously Report Unlicensed Software with Our Form Now. Get Up to $1 Million.
We Buy Laptop and PC Memory! Sell to Us!
Min of 25 pieces required. Call us today at 239.354.1230.
Special Offer from The Economist
Get 12 issues of The Economist for $12. US subscribers only.
Free Forrester Risk Management Report
Demystifying Enterprise Risk Management. Download Free With Registration.
Download Microsoft Visual Studio ® Team System
Streamline Dev processes, Reduce time to market. Try Microsoft Visual Studio Team System, FREE!




Latest news by
DailyTech

 February 9, 2010

Blank
Blank
Blank
Blank
Blank
Blank
Blank
Blank
Blank
Blank
Blank
Blank
Blank
Blank
Blank
Blank

 February 8, 2010

Blank


more CPU & Chipset Discussions



pipeboost
Copyright © 1997-2010 AnandTech, Inc. All rights reserved. Terms, Conditions and Privacy Information.
Click Here for Advertising Information