Inflection AI Builds Supercomputer with 22,000 NVIDIA H100 GPUs

AleksandarK

News Editor
Staff member
Joined
Aug 19, 2017
Messages
2,275 (0.92/day)
The AI hype continues to push hardware shipments, especially for GPU-equipped servers, which are in very high demand. The latest example comes from AI startup Inflection AI. Building foundational AI models, the Inflection AI crew has secured an order of 22,000 NVIDIA H100 GPUs and built a supercomputer. Assuming a configuration of a single Intel Xeon CPU with eight GPUs per node, almost 700 four-node racks should go into the supercomputer. Scaling and connecting 22,000 GPUs is easier than acquiring them, as NVIDIA's H100 GPUs are selling out everywhere due to the enormous demand for AI applications both on and off premises.

Getting 22,000 H100 GPUs is the biggest challenge here, and Inflection AI managed to get them by having NVIDIA as an investor in the startup. The supercomputer is estimated to cost around one billion USD and to consume 31 megawatts of power. The Inflection AI startup is now valued at 1.5 billion USD at the time of writing.
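
The rack and power figures in the article can be checked with a few lines of arithmetic. This is a minimal sketch using only the numbers quoted above (22,000 GPUs, eight GPUs per node, four nodes per rack, 31 MW); the function and variable names are illustrative and not from the source.

```python
import math

GPUS = 22_000          # GPU count from the article
GPUS_PER_NODE = 8      # article's assumed node configuration
NODES_PER_RACK = 4     # article's "four-node racks"
POWER_MW = 31          # article's estimated power draw

def cluster_estimate(gpus=GPUS, gpus_per_node=GPUS_PER_NODE,
                     nodes_per_rack=NODES_PER_RACK, power_mw=POWER_MW):
    """Return (nodes, racks, kW per GPU) for the assumed layout."""
    nodes = math.ceil(gpus / gpus_per_node)
    racks = math.ceil(nodes / nodes_per_rack)
    # Facility-level power divided evenly across GPUs; this includes
    # CPUs, networking and cooling, not just the accelerators.
    kw_per_gpu = power_mw * 1000 / gpus
    return nodes, racks, kw_per_gpu

nodes, racks, kw_per_gpu = cluster_estimate()
print(f"{nodes} nodes, {racks} racks, {kw_per_gpu:.2f} kW per GPU")
```

Under these assumptions the result is 2,750 nodes in 688 racks, which matches the "almost 700" in the article, and roughly 1.4 kW of total power per GPU.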



 
Joined
Jan 5, 2006
Messages
18,050 (2.69/day)
System Name AlderLake / Laptop
Processor Intel i7 12700K P-Cores @ 5Ghz / Intel i3 7100U
Motherboard Gigabyte Z690 Aorus Master / HP 83A3 (U3E1)
Cooling Noctua NH-U12A 2 fans + Thermal Grizzly Kryonaut Extreme + 5 case fans / Fan
Memory 32GB DDR5 Corsair Dominator Platinum RGB 6000MT/s CL36 / 8GB DDR4 HyperX CL13
Video Card(s) MSI RTX 2070 Super Gaming X Trio / Intel HD620
Storage Samsung 980 Pro 1TB + 970 Evo 500GB + 850 Pro 512GB + 860 Evo 1TB x2 / Samsung 256GB M.2 SSD
Display(s) 23.8" Dell S2417DG 165Hz G-Sync 1440p / 14" 1080p IPS Glossy
Case Be quiet! Silent Base 600 - Window / HP Pavilion
Audio Device(s) Panasonic SA-PMX94 / Realtek onboard + B&O speaker system / Harman Kardon Go + Play / Logitech G533
Power Supply Seasonic Focus Plus Gold 750W / Powerbrick
Mouse Logitech MX Anywhere 2 Laser wireless / Logitech M330 wireless
Keyboard RAPOO E9270P Black 5GHz wireless / HP backlit
Software Windows 11 / Windows 10
Benchmark Scores Cinebench R23 (Single Core) 1936 @ stock Cinebench R23 (Multi Core) 23006 @ stock
Joined
Jul 16, 2016
Messages
275 (0.10/day)
Location
Rochester, NY
System Name Xbox Series S
Processor AMD Zen2 8 core 3.6 GHz
Memory 10GB GDDR6
Video Card(s) RDNA2 with 20 CUs
Storage 512Gb SSD NVMe Internal + 8TB WD Black USB External
Display(s) Acer VG270U P 2k
Just when we are getting over a GPU shortage...
 
Joined
Aug 30, 2006
Messages
7,199 (1.11/day)
System Name ICE-QUAD // ICE-CRUNCH
Processor Q6600 // 2x Xeon 5472
Memory 2GB DDR // 8GB FB-DIMM
Video Card(s) HD3850-AGP // FireGL 3400
Display(s) 2 x Samsung 204Ts = 3200x1200
Audio Device(s) Audigy 2
Software Windows Server 2003 R2 as a Workstation now migrated to W10 with regrets.
I don't understand the scaling assumption of 1:8.

If something retail like this MSI B360-F PRO can do 18x, then a specialist bespoke Xeon board could easily do many more. After all, the Intel Xeon W-3400 series has 112 PCIe lanes and could therefore run 112 GPUs, let's call it 100.
 
Joined
Jan 3, 2021
Messages
2,815 (2.26/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
I don't understand the scaling assumption of 1:8.

If something retail like this MSI B360-F PRO can do 18x, then a specialist bespoke Xeon board could easily do many more. After all, the Intel Xeon W-3400 series has 112 PCIe lanes and could therefore run 112 GPUs, let's call it 100.
"Can do"? For mining, sure, with an i3 CPU at that.
Here there are huge amounts of data to move to and from storage, and part of the processing also takes place on the CPUs. Monster computing nodes with 8 GPU accelerators and twin Xeons or Epycs aren't uncommon. One variant of the MI300 is going to have as many as 24 CPU cores in the same package as the GPU, which will enable operation without a separate Epyc - and think about how much bandwidth those CPU cores need to communicate with the GPU part.
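
A rough lane budget helps frame this exchange. The sketch below assumes each H100 is given a full x16 link, while mining rigs typically hang each card off an x1 riser. The function name `max_gpus` and the link widths tried are illustrative, and the calculation ignores the lanes that network adapters and storage would also need.

```python
TOTAL_LANES = 112  # PCIe lane count quoted above for the Xeon W-3400 series

def max_gpus(total_lanes, lanes_per_gpu):
    """Upper bound on directly attached GPUs for a given link width."""
    return total_lanes // lanes_per_gpu

for width in (1, 8, 16):
    print(f"x{width:<2} links: up to {max_gpus(TOTAL_LANES, width)} GPUs")
```

At x1 the 112-GPU figure holds, but at x16 the same CPU tops out at 7 directly attached GPUs, which is in the same range as the article's 1:8 assumption.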
 
Joined
Aug 22, 2007
Messages
3,466 (0.57/day)
Location
CA, US
System Name :)
Processor Intel 13700k
Motherboard Gigabyte z790 UD AC
Cooling Noctua NH-D15
Memory 64GB GSKILL DDR5
Video Card(s) Gigabyte RTX 4090 Gaming OC
Storage 960GB Optane 905P U.2 SSD + 4TB PCIe4 U.2 SSD
Display(s) Alienware AW3423DW 175Hz QD-OLED + Nixeus 27" IPS 1440p 144Hz
Case Fractal Design Torrent
Audio Device(s) MOTU M4 - JBL 305P MKII w/2x JL Audio 10 Sealed --- X-Fi Titanium HD - Presonus Eris E5 - JBL 4412
Power Supply Silverstone 1000W
Mouse Roccat Kain 122 AIMO
Keyboard KBD67 Lite / Mammoth75
VR HMD Reverb G2 V2
Software Win 11 Pro
lol I read that as "Infection AI." :laugh:
 
Joined
Apr 24, 2020
Messages
2,591 (1.73/day)
The Inflection AI startup is now valued at 1.5 billion USD at the time of writing.

Assuming $10,000 per GPU, that's $220 Million on GPUs alone, let alone datacenter costs, CPU costs, RAM, hard drives...

A valuation of $1.5 billion sounds fair because that's not much more than the underlying hardware.
 
Joined
Jan 3, 2021
Messages
2,815 (2.26/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
So AI has actually achieved an Inflection point.
Maybe we're lucky to have a limited amount of sand and electricity to produce chips, and of course a limited number of TSMCs who can print them.
 
Joined
Apr 24, 2020
Messages
2,591 (1.73/day)
So AI has actually achieved an Inflection point.

An inflection point of venture capitalist money for sure.

For creative use, AI seems like it is here to stay with Photoshop's Generative Fill (https://www.adobe.com/products/photoshop/generative-fill.html). I'm not convinced text is quite ready yet, even with GPT4. ChatGPT / GPT4 is good enough to make very annoying spambots, but the hallucinations and outright lying in the content are just awful and make practical use of GPT4 unworkable in many cases.
 
Joined
Jul 16, 2013
Messages
205 (0.05/day)
System Name latest-greatest
Processor i7 12700K
Motherboard Z690 Rog Strix-E
Cooling Lian Li Galahad 360
Memory corsair vengeance Ddr5 4800
Video Card(s) 2080ti
Storage 980 pro gen4
Display(s) LG C1 4K 120Mhz
Case fractal meshify2
Audio Device(s) Realtec 4080
Power Supply Corsair rm1000x
31 megawatts of juice required. That is an enormous amount of power, and to what end? I wonder if adding this demand ups the cost of residential power.
 
Joined
Mar 22, 2011
Messages
213 (0.04/day)
Location
USA
System Name Liquid 2022
Processor Intel i7-12700k
Motherboard Asus Strix Z690-A GAMING WIFI D4
Cooling Custom loop with 9x120mm radiator area
Memory Team 16GB (2x8GB) DDR4@4133 C18-18-18
Video Card(s) EVGA GeForce RTX 2080ti on nickel Heatkiller IV block with Aluminum backplate
Storage 10TB SSD: Samsung 970 PRO 512GB (OS), Samsung 980 PRO 2TB, ADATA SX8200 PRO 2TB/500GB, 4TB/1TB MX500
Display(s) Dell S2716DG 27" 1440p G-SYNC, Samsung Odyssey
Case Phanteks ENTHOO 719 (grey)
Audio Device(s) Creative Sound BlasterX AE-5, Logitech Z906 5.1 speaker system
Power Supply Cooler Master V1200, custom sleeved white cables
Mouse Logitech G502
Keyboard Corsair K70 Lux RGB
Software Windows 10 Pro 64-bit (maybe 11 soon?)
Assuming $10,000 per GPU, that's $220 Million on GPUs alone, let alone datacenter costs, CPU costs, RAM, hard drives...

A valuation of $1.5 billion sounds fair because that's not much more than the underlying hardware.
The H100s are going for $40,000 each!
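
The two price assumptions in this exchange lead to very different totals. As a quick sketch, using only the per-GPU prices quoted in the thread ($10,000 and $40,000) and the 22,000-GPU count from the article, with an illustrative helper name:

```python
GPUS = 22_000  # GPU count from the article

def gpu_spend_usd(price_per_gpu, gpus=GPUS):
    """Total spend on GPUs alone, excluding servers, networking and facilities."""
    return price_per_gpu * gpus

for price in (10_000, 40_000):
    total = gpu_spend_usd(price)
    print(f"${price:,} per GPU -> ${total / 1e6:,.0f} million")
```

That gives $220 million at $10,000 per GPU and $880 million at $40,000, the latter being much closer to the article's estimate of around one billion USD for the whole system.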
 