• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

FuriosaAI Unveils RNGD Power-Efficient AI Processor at Hot Chips 2024

Nomad76

News Editor
Staff member
Joined
May 21, 2024
Messages
675 (3.61/day)
Today at Hot Chips 2024, FuriosaAI is pulling back the curtain on RNGD (pronounced "Renegade"), our new AI accelerator designed for high-performance, highly efficient large language model (LLM) and multimodal model inference in data centers. As part of his Hot Chips presentation, Furiosa co-founder and CEO June Paik is sharing technical details and providing the first hands-on look at the fully functioning RNGD card.

With a TDP of 150 watts, a novel chip architecture, and advanced memory technology like HBM3, RNGD is optimized for inference with demanding LLMs and multimodal models. It's built to deliver high performance, power efficiency, and programmability all in a single product - a trifecta that the industry has struggled to achieve in GPUs and other AI chips.





A key milestone for RNGD
As industry experts will know, the process of squeezing every drop of performance from a chip takes many steps. Furiosa achieved a full bring-up of RNGD just weeks after obtaining the first silicon samples - an exceptionally rapid timeline in the chip industry. TSMC delivered the first RNGD chips in May, we booted the hardware less than a week later, and we were running industry standard Llama 3.1 models in early June.

We started delivering the first RNGD silicon to early access customers in July and showed our first private demo last week. There's much more work to do before RNGD is running in data centers around the world, but we've reached an exciting milestone and we're pleased to be able to share these updates on our progress.

With more updates to come
Our priority now is refining our software stack as we ramp up RNGD production. This roadmap follows our successful track record with Furiosa's first-generation chip, introduced in 2021.

With our first-gen product, which targeted computer vision applications in data centers and edge server deployments, Furiosa submitted our first MLPerf benchmark results three weeks after receiving first silicon. We then used compiler enhancements to achieve a 113% performance increase in the next MLPerf submission six months later.

This is a typical path for new silicon. For example, six months after launching their powerful H100 chip and submitting it to MLPerf, NVIDIA announced 2.4x performance improvements achieved entirely through software improvements.

The process will be similar with RNGD. Right now, a single RNGD is generating about 12 queries per second when running the GPT-J 6B model, but we expect that number to increase as we refine our software stack over the coming weeks and months. We're also sharing RNGD target performance numbers on several LLMs:



Furiosa has deliberately kept a low profile until now, because we know the industry doesn't need more hype and bold promises about things that don't yet exist. (Also, Furiosa is 95% engineers, so marketing hasn't exactly been top of mind.)

Stay informed on the latest RNGD news
But Hot Chips is an exciting turning point for Furiosa and RNGD. If you come by our Hot Chips booth this week, you'll see we've brought a large engineering team to talk with anyone who is interested in our work. We're eager to hear what the AI community thinks of RNGD, what questions you have, and what you want to hear from us as we work to make the chip widely available in early 2025. We'll also showcase the first live demo of RNGD.

Stay tuned for more benchmark results, availability details, and other updates in the coming weeks and months.

View at TechPowerUp Main Site | Source
 
Joined
Dec 16, 2021
Messages
332 (0.31/day)
Location
Denmark
Processor AMD Ryzen 7 3800X
Motherboard ASUS Prime X470-Pro
Cooling bequiet! Dark Rock Slim
Memory 64 GB ECC DDR4 2666 MHz (Samsung M391A2K43BB1-CTD)
Video Card(s) eVGA GTX 1080 SC Gaming, 8 GB
Storage 1 TB Samsung 970 EVO Plus, 1 TB Samsung 850 EVO, 4 TB Lexar NM790, 12 TB WD HDDs
Display(s) Acer Predator XB271HU
Case Corsair Obsidian 550D
Audio Device(s) Creative X-Fi Fatal1ty
Power Supply Seasonic X-Series 560W
Mouse Logitech G502
Keyboard Glorious GMMK
<insert random Mad Max quip here> :p
 
Joined
Dec 12, 2016
Messages
1,840 (0.63/day)
Regardless of AI, I'm happy to see a diversification of processor and co-processor architectures. Intel peaked at 478 systems in the top 500 supercomputer list in June 2019. A whopping 96%! Now we are seeing computing coming from all over the place instead of just one provider.
 
Joined
Feb 11, 2009
Messages
5,550 (0.96/day)
System Name Cyberline
Processor Intel Core i7 2600k -> 12600k
Motherboard Asus P8P67 LE Rev 3.0 -> Gigabyte Z690 Auros Elite DDR4
Cooling Tuniq Tower 120 -> Custom Watercoolingloop
Memory Corsair (4x2) 8gb 1600mhz -> Crucial (8x2) 16gb 3600mhz
Video Card(s) AMD RX480 -> RX7800XT
Storage Samsung 750 Evo 250gb SSD + WD 1tb x 2 + WD 2tb -> 2tb MVMe SSD
Display(s) Philips 32inch LPF5605H (television) -> Dell S3220DGF
Case antec 600 -> Thermaltake Tenor HTCP case
Audio Device(s) Focusrite 2i4 (USB)
Power Supply Seasonic 620watt 80+ Platinum
Mouse Elecom EX-G
Keyboard Rapoo V700
Software Windows 10 Pro 64bit
WITNESS MEEEEE
 
Joined
Jun 8, 2022
Messages
388 (0.43/day)
Location
Ohio, USA
System Name Trackstar
Processor AMD Ryzen 7 5800X3D -30 All Core CO (on Corsair XC5 block)
Motherboard Gigabyte B550 AORUS Elite V2 Rev 1.0 (F17 BIOS)
Cooling Corsair XD5 pump / Corsair XR5 1x 360mm (front) + 1x 420mm (top) rads
Memory 32GB G.Skill DDR4-3600 CL14 1:1 (F4-3600C14Q-32GVKA kit)
Video Card(s) ASRock RX 6950XT OC Formula (on Bykski A-AR6900XTOCF-X block)
Storage WD_BLACK SN850X 2TB w/HS (FW ver. 620361WD)
Display(s) Dell S3222DGM 32" 1440p/165Hz FreeSync
Case Fractal Design Meshify S2
Audio Device(s) Realtek ALC1200 Integrated Audio
Power Supply Super Flower Leadex Platinum SE 1200W on Liebert GXT4-1500RT120 UPS
Mouse Corsair Nightsword RGB
Keyboard Corsair K60 RGB PRO
VR HMD N/A
Software Windows 11 Pro 23H2 (Build 22631.3958)
Benchmark Scores https://www.3dmark.com/sw/1131940 https://www.3dmark.com/fs/29315810
That's some crazy ass power delivery circuitry for a 150W TDP :wtf:
 
Joined
Dec 30, 2010
Messages
2,198 (0.43/day)
That's some crazy ass power delivery circuitry for a 150W TDP :wtf:

At least 17 phase, with an additional 4 for it's HBM. I guess they want a peak perfect transient response - nothing should be left to it's on in regards of compute.
 
Joined
Jan 3, 2021
Messages
3,491 (2.46/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
That's some crazy ass power delivery circuitry for a 150W TDP :wtf:
There's also a crazy big processor die in there - unless they seriously blew it up in the render (it's either a photo or a superb render). Calculated from the known length of the PCIe connector (89 mm), the die is about 27 x 28 mm = 756 mm². Modern chips with the best cooling can reach up to about 1W/mm².
 
Top