FuriosaAI Unveils RNGD Power-Efficient AI Processor at Hot Chips 2024

Nomad76 · Aug 27, 2024

Today at Hot Chips 2024, FuriosaAI is pulling back the curtain on RNGD (pronounced "Renegade"), our new AI accelerator designed for high-performance, highly efficient large language model (LLM) and multimodal model inference in data centers. As part of his Hot Chips presentation, Furiosa co-founder and CEO June Paik is sharing technical details and providing the first hands-on look at the fully functioning RNGD card.

With a TDP of 150 watts, a novel chip architecture, and advanced memory technology like HBM3, RNGD is optimized for inference with demanding LLMs and multimodal models. It's built to deliver high performance, power efficiency, and programmability all in a single product - a trifecta that the industry has struggled to achieve in GPUs and other AI chips.

A key milestone for RNGD
As industry experts will know, the process of squeezing every drop of performance from a chip takes many steps. Furiosa achieved a full bring-up of RNGD just weeks after obtaining the first silicon samples - an exceptionally rapid timeline in the chip industry. TSMC delivered the first RNGD chips in May, we booted the hardware less than a week later, and we were running industry standard Llama 3.1 models in early June.

We started delivering the first RNGD silicon to early access customers in July and showed our first private demo last week. There's much more work to do before RNGD is running in data centers around the world, but we've reached an exciting milestone and we're pleased to be able to share these updates on our progress.

With more updates to come
Our priority now is refining our software stack as we ramp up RNGD production. This roadmap follows our successful track record with Furiosa's first-generation chip, introduced in 2021.

With our first-gen product, which targeted computer vision applications in data centers and edge server deployments, Furiosa submitted our first MLPerf benchmark results three weeks after receiving first silicon. We then used compiler enhancements to achieve a 113% performance increase in the next MLPerf submission six months later.

This is a typical path for new silicon. For example, six months after launching their powerful H100 chip and submitting it to MLPerf, NVIDIA announced 2.4x performance improvements achieved entirely through software improvements.

The process will be similar with RNGD. Right now, a single RNGD is generating about 12 queries per second when running the GPT-J 6B model, but we expect that number to increase as we refine our software stack over the coming weeks and months. We're also sharing RNGD target performance numbers on several LLMs:

Furiosa has deliberately kept a low profile until now, because we know the industry doesn't need more hype and bold promises about things that don't yet exist. (Also, Furiosa is 95% engineers, so marketing hasn't exactly been top of mind.)

Stay informed on the latest RNGD news
But Hot Chips is an exciting turning point for Furiosa and RNGD. If you come by our Hot Chips booth this week, you'll see we've brought a large engineering team to talk with anyone who is interested in our work. We're eager to hear what the AI community thinks of RNGD, what questions you have, and what you want to hear from us as we work to make the chip widely available in early 2025. We'll also showcase the first live demo of RNGD.

Stay tuned for more benchmark results, availability details, and other updates in the coming weeks and months.

View at TechPowerUp Main Site | Source

azrael · Aug 27, 2024

Daven · Aug 27, 2024

Regardless of AI, I'm happy to see a diversification of processor and co-processor architectures. Intel peaked at 478 systems in the top 500 supercomputer list in June 2019. A whopping 96%! Now we are seeing computing coming from all over the place instead of just one provider.

JasBC · Aug 27, 2024

Ugly-ass Cybertruck typeface

ZoneDymo · Aug 27, 2024

WITNESS MEEEEE

Tropick · Aug 27, 2024

That's some crazy ass power delivery circuitry for a 150W TDP :wtf:

Jism · Aug 28, 2024

Tropick said:
That's some crazy ass power delivery circuitry for a 150W TDP

At least 17 phase, with an additional 4 for it's HBM. I guess they want a peak perfect transient response - nothing should be left to it's on in regards of compute.

Wirko · Aug 28, 2024

Tropick said:
That's some crazy ass power delivery circuitry for a 150W TDP

There's also a crazy big processor die in there - unless they seriously blew it up in the render (it's either a photo or a superb render). Calculated from the known length of the PCIe connector (89 mm), the die is about 27 x 28 mm = 756 mm². Modern chips with the best cooling can reach up to about 1W/mm².

Processor	AMD Ryzen 7 3800X
Motherboard	ASUS Prime X470-Pro
Cooling	bequiet! Dark Rock Slim
Memory	64 GB ECC DDR4 2666 MHz (Samsung M391A2K43BB1-CTD)
Video Card(s)	eVGA GTX 1080 SC Gaming, 8 GB
Storage	1 TB Samsung 970 EVO Plus, 1 TB Samsung 850 EVO, 4 TB Lexar NM790, 12 TB WD HDDs
Display(s)	Acer Predator XB271HU
Case	Corsair Obsidian 550D
Audio Device(s)	Creative X-Fi Fatal1ty
Power Supply	Seasonic X-Series 560W
Mouse	Logitech G502
Keyboard	Glorious GMMK

System Name	Cyberline
Processor	Intel Core i7 2600k -> 12600k
Motherboard	Asus P8P67 LE Rev 3.0 -> Gigabyte Z690 Auros Elite DDR4
Cooling	Tuniq Tower 120 -> Custom Watercoolingloop
Memory	Corsair (4x2) 8gb 1600mhz -> Crucial (8x2) 16gb 3600mhz
Video Card(s)	AMD RX480 -> RX7800XT
Storage	Samsung 750 Evo 250gb SSD + WD 1tb x 2 + WD 2tb -> 2tb MVMe SSD
Display(s)	Philips 32inch LPF5605H (television) -> Dell S3220DGF
Case	antec 600 -> Thermaltake Tenor HTCP case
Audio Device(s)	Focusrite 2i4 (USB)
Power Supply	Seasonic 620watt 80+ Platinum
Mouse	Elecom EX-G
Keyboard	Rapoo V700
Software	Windows 10 Pro 64bit

System Name	Trackstar
Processor	AMD Ryzen 7 5800X3D -30 All Core CO (on Corsair XC5 block)
Motherboard	Gigabyte B550 AORUS Elite V2 Rev 1.0 (F17 BIOS)
Cooling	Corsair XD5 pump / Corsair XR5 1x 360mm (front) + 1x 420mm (top) rads
Memory	32GB G.Skill DDR4-3600 CL14 1:1 (F4-3600C14Q-32GVKA kit)
Video Card(s)	ASRock RX 6950XT OC Formula (on Bykski A-AR6900XTOCF-X block)
Storage	WD_BLACK SN850X 2TB w/HS (FW ver. 620361WD)
Display(s)	Dell S3222DGM 32" 1440p/165Hz FreeSync
Case	Fractal Design Meshify S2
Audio Device(s)	Realtek ALC1200 Integrated Audio
Power Supply	Super Flower Leadex Platinum SE 1200W on Liebert GXT4-1500RT120 UPS
Mouse	Corsair Nightsword RGB
Keyboard	Corsair K60 RGB PRO
VR HMD	N/A
Software	Windows 11 Pro 23H2 (Build 22631.3958)
Benchmark Scores	https://www.3dmark.com/sw/1131940 https://www.3dmark.com/fs/29315810

Processor	i5-6600K
Motherboard	Asus Z170A
Cooling	some cheap Cooler Master Hyper 103 or similar
Memory	16GB DDR4-2400
Video Card(s)	IGP
Storage	Samsung 850 EVO 250GB
Display(s)	2x Oldell 24" 1920x1200
Case	Bitfenix Nova white windowless non-mesh
Audio Device(s)	E-mu 1212m PCI
Power Supply	Seasonic G-360
Mouse	Logitech Marble trackball, never had a mouse
Keyboard	Key Tronic KT2000, no Win key because 1994
Software	Oldwin

FuriosaAI Unveils RNGD Power-Efficient AI Processor at Hot Chips 2024

News Editor