• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD's VEGA Alive and Well - Announced MI25 VEGA as Deep Learning Accelerator

Raevenlord

News Editor
Joined
Aug 12, 2016
Messages
3,755 (1.24/day)
Location
Portugal
System Name The Ryzening
Processor AMD Ryzen 9 5900X
Motherboard MSI X570 MAG TOMAHAWK
Cooling Lian Li Galahad 360mm AIO
Memory 32 GB G.Skill Trident Z F4-3733 (4x 8 GB)
Video Card(s) Gigabyte RTX 3070 Ti
Storage Boot: Transcend MTE220S 2TB, Kintson A2000 1TB, Seagate Firewolf Pro 14 TB
Display(s) Acer Nitro VG270UP (1440p 144 Hz IPS)
Case Lian Li O11DX Dynamic White
Audio Device(s) iFi Audio Zen DAC
Power Supply Seasonic Focus+ 750 W
Mouse Cooler Master Masterkeys Lite L
Keyboard Cooler Master Masterkeys Lite L
Software Windows 10 x64
The team at Videocardz has published a story with some interesting slides regarding AMD's push towards the highly-lucrative deep learning market with their INSTINCT line-up of graphics cards - and VEGA being announced as a full-fledged solution means we are perhaps (hopefully) closer to seeing a solution based on it for the consumer market as well.

Alongside the VEGA-based MI25, AMD also announced the MI6 (5.7 TFLOPS in FP32 operations, with 224 GB/s of memory bandwidth and <150 W of board power), looking suspiciously like a Polaris 10 card in disguise; and the MI8 (which appropriately delivers 8.2 TFLOPS in FP32 computations, as well as 512 GB/s memory bandwidth and <175 W typical board power), with the memory bandwidth numbers being the most telling, and putting the MI8 closely along a Fiji architecture-based solution.





The MI25 VEGA-based deep learning accelerator reportedly offers 25 TFLOPS in FP16 operations (which amounts to roughly 12.5 TFLOPS when working on FP32 mode) - still about 50% higher than AMD's Fiji architecture-based solutions. The MI25 is being touted as a passively cooled Training Accelerator, offering real competition towards NVIDIA's deep learning forays. Being accelerators as they are, they don't has any display outputs, putting it closely alongside NVIDIA's Tesla line of purely computing-oriented accelerators.

AMD pegs the MI25 as being almost 2 times faster than TITAN X Maxwell in DeepBench GEMM operations, and in the same press release, touts the symbiosis between their INSTINCT line of computing accelerators and the ZEN "Naples" platform as being optimized for GPU and Accelerator Throughput computing, with lower system costs, a lower latency architecture, peer to peer communication, and a high-density footprint - endowing a 39U computing rack with 120 VEGA MI25 INSTINCT accelerators and 3 PFLOPs in FP16 performance.



View at TechPowerUp Main Site
 
Joined
Sep 6, 2013
Messages
3,328 (0.81/day)
Location
Athens, Greece
System Name 3 desktop systems: Gaming / Internet / HTPC
Processor Ryzen 5 5500 / Ryzen 5 4600G / FX 6300 (12 years latter got to see how bad Bulldozer is)
Motherboard MSI X470 Gaming Plus Max (1) / MSI X470 Gaming Plus Max (2) / Gigabyte GA-990XA-UD3
Cooling Νoctua U12S / Segotep T4 / Snowman M-T6
Memory 32GB - 16GB G.Skill RIPJAWS 3600+16GB G.Skill Aegis 3200 / 16GB JUHOR / 16GB Kingston 2400MHz (DDR3)
Video Card(s) ASRock RX 6600 + GT 710 (PhysX)/ Vega 7 integrated / Radeon RX 580
Storage NVMes, ONLY NVMes/ NVMes, SATA Storage / NVMe boot(Clover), SATA storage
Display(s) Philips 43PUS8857/12 UHD TV (120Hz, HDR, FreeSync Premium) ---- 19'' HP monitor + BlitzWolf BW-V5
Case Sharkoon Rebel 12 / CoolerMaster Elite 361 / Xigmatek Midguard
Audio Device(s) onboard
Power Supply Chieftec 850W / Silver Power 400W / Sharkoon 650W
Mouse CoolerMaster Devastator III Plus / CoolerMaster Devastator / Logitech
Keyboard CoolerMaster Devastator III Plus / CoolerMaster Devastator / Logitech
Software Windows 10 / Windows 10&Windows 11 / Windows 10
3 unknowns here.

What is NCU? Compute Unit?

What do they mean with 2x Packed Math ( compute unit(?) + gpu or maybe 2 gpus? )

And what do they mean with the High Bandwidth Cache and controller. Maybe a combination of HBM and GDDR5(X)?
 

bug

Joined
May 22, 2015
Messages
13,749 (3.96/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
Weird. AMD brings the generation melange to their professional line. Also, while these cards seem to be competing with Nvidia's Tesla, AMD compares them to Titan (and previous generation Titan at that). Though it seems even Nvidia isn't even sure which market segment the Titans belong to, so it's easy to be confused about it.
 

the54thvoid

Super Intoxicated Moderator
Staff member
Joined
Dec 14, 2009
Messages
13,041 (2.39/day)
Location
Glasgow - home of formal profanity
Processor Ryzen 7800X3D
Motherboard MSI MAG Mortar B650 (wifi)
Cooling be quiet! Dark Rock Pro 4
Memory 32GB Kingston Fury
Video Card(s) Gainward RTX4070ti
Storage Seagate FireCuda 530 M.2 1TB / Samsumg 960 Pro M.2 512Gb
Display(s) LG 32" 165Hz 1440p GSYNC
Case Asus Prime AP201
Audio Device(s) On Board
Power Supply be quiet! Pure POwer M12 850w Gold (ATX3.0)
Software W10
Post deleted, hadn't looked at slide...
 

bug

Joined
May 22, 2015
Messages
13,749 (3.96/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
Joined
Mar 14, 2008
Messages
511 (0.08/day)
Location
DK
System Name Main setup
Processor i9 12900K
Motherboard Gigabyte z690 Gaming X
Cooling Water
Memory Kingston 32GB 5200@cl30
Video Card(s) Asus Tuf RTS 4090
Storage Adata SX8200 PRO 1 adn 2 TB, Samsung 960EVO, Crucial MX300 750GB Limited edition
Display(s) HP "cheapass" 34" 3440x1440
Case CM H500P Mesh
Audio Device(s) Logitech G933
Power Supply Corsair RX850i
Mouse G502
Keyboard SteelSeries Apex Pro
Software W11
The Mi8 is a fury NANO.... :) but yes there is a good chance that the MI25 is a dual gpu....

so my guess is that it is the Fiji gpu re-done and build on 14nm, that we with a 50% chance will find on the RX490.
 

AsRock

TPU addict
Joined
Jun 23, 2007
Messages
19,076 (3.00/day)
Location
UK\USA
No Compute Unit? Specially optimized for no-op execution :D

Was thinking it had some thing than not, thinking coming from that being from the higher tier.

I guess wewill find out sooner or later.
 
Joined
Oct 2, 2004
Messages
13,791 (1.88/day)
The Mi8 is a fury NANO.... :) but yes there is a good chance that the MI25 is a dual gpu....

so my guess is that it is the Fiji gpu re-done and build on 14nm, that we with a 50% chance will find on the RX490.

Not sure how doable it is, but if you take Fiji core, replace it's shader units with latest GCN units, maybe even update them further, replace older video engine with latest one and slam everything into a smaller manufacturing node, it's probably cheaper than doing it all from scratch. And considering AMD is sticking with magical 4096 shader units, sounds like a deal. I mean, Fiji was pretty badass core, it just wasn't exactly ready to exist at the time it was launched.
 
Joined
Mar 14, 2008
Messages
511 (0.08/day)
Location
DK
System Name Main setup
Processor i9 12900K
Motherboard Gigabyte z690 Gaming X
Cooling Water
Memory Kingston 32GB 5200@cl30
Video Card(s) Asus Tuf RTS 4090
Storage Adata SX8200 PRO 1 adn 2 TB, Samsung 960EVO, Crucial MX300 750GB Limited edition
Display(s) HP "cheapass" 34" 3440x1440
Case CM H500P Mesh
Audio Device(s) Logitech G933
Power Supply Corsair RX850i
Mouse G502
Keyboard SteelSeries Apex Pro
Software W11
Top