• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Unwrapping the NVIDIA B200 and GB200 AI GPU Announcements

Joined
Aug 21, 2013
Messages
1,934 (0.47/day)
The consumer version of that chip will probably be very different rather than a simple cut down. Those datacenters GPU are generally pretty bad for gaming
Also the x100 variants lack display outputs as they are meant to be used as accelerators - even the PCIe variants.
 
Joined
Apr 5, 2023
Messages
71 (0.11/day)
Fp4! I dread to think about the type 1 and type 2 errors that can occur with ultra-low precision nibble Artificial Inference. It is such a blunt tool. If it’s a nail, it will work. If it’s a screw it won’t. And will the “users” of the Ai output have any clue
4bit inference is old hat at this point, commonly used to get parameter sets of LLMs small enough to run on client gpus. The networks are fine-tuned (i.e., re-trained) to operate at this precision precisely to minimize additional error.

A recent paper making waves proposes "1.58 bit" inference (i.e., single digit ternary arithmetic).
 
Joined
Jun 8, 2022
Messages
388 (0.42/day)
Location
Ohio, USA
System Name Trackstar
Processor AMD Ryzen 7 5800X3D -30 All Core CO (on Corsair XC5 block)
Motherboard Gigabyte B550 AORUS Elite V2 Rev 1.0 (F17 BIOS)
Cooling Corsair XD5 pump / Corsair XR5 1x 360mm (front) + 1x 420mm (top) rads
Memory 32GB G.Skill DDR4-3600 CL14 1:1 (F4-3600C14Q-32GVKA kit)
Video Card(s) ASRock RX 6950XT OC Formula (on Bykski A-AR6900XTOCF-X block)
Storage WD_BLACK SN850X 2TB w/HS (FW ver. 620361WD)
Display(s) Dell S3222DGM 32" 1440p/165Hz FreeSync
Case Fractal Design Meshify S2
Audio Device(s) Realtek ALC1200 Integrated Audio
Power Supply Super Flower Leadex Platinum SE 1200W on Liebert GXT4-1500RT120 UPS
Mouse Corsair Nightsword RGB
Keyboard Corsair K60 RGB PRO
VR HMD N/A
Software Windows 11 Pro 23H2 (Build 22631.3958)
Benchmark Scores https://www.3dmark.com/sw/1131940 https://www.3dmark.com/fs/29315810
The biggest surprise is the use of N4P node. I thought for sure Nvidia was going to use 3nm by now, at least for these 20k+ costing chips.
This does not bode well for RTX 5000 series. I very much doubt those will use 3nm either.
I don't know, I think they could still get some sizable gains out of reordering the architecture alone. The 780 Ti and 980 Ti shared the same TSMC 28nm node but GM200 easily bested GK110B. This launch could still be disappointing however there is precedent for Jensen's team pulling a rabbit out of their collective hat while using the same lithography.

780 Ti:
1710862084871.png


980 Ti:
1710862099159.png
 
Top