AMD CDNA3 Architecture Sees the Inevitable Fusion of Compute Units and x86 CPU at Massive Scale

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
47,288 (7.53/day)
Location
Hyderabad, India
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard ASUS ROG Strix B450-E Gaming
Cooling DeepCool Gammax L240 V2
Memory 2x 8GB G.Skill Sniper X
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
In its 2022 Financial Analyst Day presentation, AMD unveiled its next-generation CDNA3 compute architecture, which will see something we've been expecting for a while: a compute accelerator that puts a large number of compute units for scalar processing and a large number of x86-64 CPU cores, based on some future "Zen" microarchitecture, onto a single package. The presence of CPU cores on the package would eliminate the need for the system to have an EPYC or Xeon processor at its head, and clusters of Instinct CDNA3 processors could run themselves without the need for a separate CPU and its system memory.

The Instinct CDNA3 processor will feature an advanced packaging technology that brings various IP blocks together as chiplets, each based on a node most economical to it, without compromising on its function. The package features stacked HBM memory, and this memory is shared not just by the compute units and x86 cores, but also forms part of large shared memory pools accessible across packages. 4th Generation Infinity Fabric ties it all together.



AMD is claiming a 500% (or 5 times) AI compute performance/Watt uplift over CDNA2, thanks to the combination of a 5 nm process for the compute dies, an advanced 3D chiplet packaging technology, 4th Gen Infinity Fabric, new math computing formats, Infinity Cache on the compute dies, and a unified memory architecture. The company is working toward a 2023 debut of CDNA3.

 
Joined
Dec 3, 2009
Messages
1,301 (0.24/day)
Location
The Netherlands
System Name PC ||Zephyrus G14 2023
Processor Ryzen 9 5900x || R9 7940HS @ 55W
Motherboard MAG B550M MORTAR WIFI || default
Cooling 1x Corsair XR5 360mm Rad||
Memory 2x16GB HyperX 3600 @ 3800 || 32GB DDR5 @ 4800MTs
Video Card(s) MSI RTX 2080Ti Sea Hawk EK X || RTX 4060 OC
Storage Samsung 9801TB x2 + Striped Tiered Storage Space (2x 128Gb SSD + 2x 1TB HDD) || 1TB NVME
Display(s) Iiyama PL2770QS + Samsung U28E590, || 14' 2560x1600 165Hz IPS
Case SilverStone Alta G1M ||
Audio Device(s) Asus Xonar DX
Power Supply Cooler Master V850 SFX || 240W
Mouse ROG Pugio II
Software Win 11 64bit || Win 11 64bit
"The Future Is Fusion" finally coming to complete fruition!
 
Anc13ntEvil

New Member
Joined
Jun 10, 2022
Messages
4 (0.00/day)
"AMD is claiming a 500% (or 5 times) AI compute performance/Watt uplift over CDNA2"

I don't understand why so many people don't understand how percentages compared to multiplication works...

500% is SIX TIMES, not five. If you have $10 and I increase it by 100%, you would then have $20. 100% is double, 200% is triple, and so on, yet I constantly see people get this simple math wrong. Sorry not sorry but this annoys the crap out of me.

It also bugs the crap out of me when companies list something like "1.3 times the speed" (or power).
30%...just say 30%.
 
Joined
Jul 16, 2014
Messages
8,215 (2.16/day)
Location
SE Michigan
System Name Dumbass
Processor AMD Ryzen 7800X3D
Motherboard ASUS TUF gaming B650
Cooling Artic Liquid Freezer 2 - 420mm
Memory G.Skill Sniper 32gb DDR5 6000
Video Card(s) GreenTeam 4070 ti super 16gb
Storage Samsung EVO 500gb & 1Tb, 2tb HDD, 500gb WD Black
Display(s) 1x Nixeus NX_EDG27, 2x Dell S2440L (16:9)
Case Phanteks Enthoo Primo w/8 140mm SP Fans
Audio Device(s) onboard (realtek?) - SPKRS:Logitech Z623 200w 2.1
Power Supply Corsair HX1000i
Mouse Steeseries Esports Wireless
Keyboard Corsair K100
Software windows 10 H
Benchmark Scores https://i.imgur.com/aoz3vWY.jpg?2
"AMD is claiming a 500% (or 5 times) AI compute performance/Watt uplift over CDNA2"

I don't understand why so many people don't understand how percentages compared to multiplication works...

500% is SIX TIMES, not five. If you have $10 and I increase it by 100%, you would then have $20. 100% is double, 200% is triple, and so on, yet I constantly see people get this simple math wrong. Sorry not sorry but this annoys the crap out of me.

It also bugs the crap out of me when companies list something like "1.3 times the speed" (or power).
30%...just say 30%.
All compute starts at 0, not 1; it's pretty binary. :D
 
Joined
Mar 21, 2016
Messages
2,508 (0.79/day)
Me from two days ago... "If AMD had a 5600X with a larger 3D stacked cache, or a paired APU chiplet that didn't castrate PCIe lanes in the process, it would be pretty popular for a budget AM4 build. In fact, the larger cache could be on the APU, and if the chip could utilize the cache between either, with maybe a slight latency penalty in the worst case, it would be really nice. Both CPU and GPU would have an equal set amount of cache that can also be tapped into by the other in a pinch, with only a bit more added latency when and where needed."

Seems AMD thought similarly, but I like that they took it steps further with HBM, and Zen 4 is of course a step up for such an APU designed in tandem with 3D stacked cache. It seems they went with pooled, shared unified memory as well! I'm not sure what the new "MATHS" is, but I suspect it refers to AVX-related and/or FP precision stuff, a bit o' this, a bit o' that!
 
Joined
Aug 20, 2007
Messages
21,528 (3.40/day)
System Name Pioneer
Processor Ryzen R9 9950X
Motherboard GIGABYTE Aorus Elite X670 AX
Cooling Noctua NH-D15 + A whole lotta Sunon and Corsair Maglev blower fans...
Memory 64GB (4x 16GB) G.Skill Flare X5 @ DDR5-6000 CL30
Video Card(s) XFX RX 7900 XTX Speedster Merc 310
Storage Intel 905p Optane 960GB boot, +2x Crucial P5 Plus 2TB PCIe 4.0 NVMe SSDs
Display(s) 55" LG 55" B9 OLED 4K Display
Case Thermaltake Core X31
Audio Device(s) TOSLINK->Schiit Modi MB->Asgard 2 DAC Amp->AKG Pro K712 Headphones or HDMI->B9 OLED
Power Supply FSP Hydro Ti Pro 850W
Mouse Logitech G305 Lightspeed Wireless
Keyboard WASD Code v3 with Cherry Green keyswitches + PBT DS keycaps
Software Gentoo Linux x64 / Windows 11 Enterprise IoT 2024
"AMD is claiming a 500% (or 5 times) AI compute performance/Watt uplift over CDNA2"

I don't understand why so many people don't understand how percentages compared to multiplication works...

500% is SIX TIMES, not five. If you have $10 and I increase it by 100%, you would then have $20. 100% is double, 200% is triple, and so on, yet I constantly see people get this simple math wrong. Sorry not sorry but this annoys the crap out of me.

It also bugs the crap out of me when companies list something like "1.3 times the speed" (or power).
30%...just say 30%.
That's not how "uplift" works.

A 100% uplift would be double the performance.

100% of the performance would indeed be the same.
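
To make the distinction concrete, here is a minimal Python sketch of the two readings of a percentage. The function names are made up for illustration and are not from AMD's material; the only assumption is that an "uplift" of p% means the new value is the old value plus p% of it.

```python
# Hypothetical helpers for illustration only; names are not from any library.

def multiplier_from_uplift(percent_uplift: float) -> float:
    """An uplift (increase) of p% means new = old * (1 + p/100)."""
    return 1.0 + percent_uplift / 100.0


def multiplier_from_percent_of(percent_of: float) -> float:
    """'p% of the old value' means new = old * (p/100)."""
    return percent_of / 100.0


def uplift_from_multiplier(multiplier: float) -> float:
    """Inverse of multiplier_from_uplift: an Nx result is an (N-1)*100% uplift."""
    return (multiplier - 1.0) * 100.0


if __name__ == "__main__":
    for p in (100.0, 200.0, 500.0):
        print(f"{p:.0f}% uplift -> {multiplier_from_uplift(p)}x, "
              f"{p:.0f}% of -> {multiplier_from_percent_of(p)}x")
    print(f"5x -> {uplift_from_multiplier(5.0):.0f}% uplift")
```

Under that assumption, a 500% uplift works out to a 6x multiplier, while "500% of" the CDNA2 figure would be 5x, which is the same thing as a 400% uplift.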
 

eidairaman1

The Exiled Airman
Joined
Jul 2, 2007
Messages
42,531 (6.67/day)
Location
Republic of Texas (True Patriot)
System Name PCGOD
Processor AMD FX 8350@ 5.0GHz
Motherboard Asus TUF 990FX Sabertooth R2 2901 Bios
Cooling Scythe Ashura, 2×BitFenix 230mm Spectre Pro LED (Blue,Green), 2x BitFenix 140mm Spectre Pro LED
Memory 16 GB Gskill Ripjaws X 2133 (2400 OC, 10-10-12-20-20, 1T, 1.65V)
Video Card(s) AMD Radeon 290 Sapphire Vapor-X
Storage Samsung 840 Pro 256GB, WD Velociraptor 1TB
Display(s) NEC Multisync LCD 1700V (Display Port Adapter)
Case AeroCool Xpredator Evil Blue Edition
Audio Device(s) Creative Labs Sound Blaster ZxR
Power Supply Seasonic 1250 XM2 Series (XP3)
Mouse Roccat Kone XTD
Keyboard Roccat Ryos MK Pro
Software Windows 7 Pro 64
Joined
Oct 27, 2020
Messages
796 (0.53/day)
I wonder if the work done on the CDNA3 unified memory architecture will bring some benefits for future Zen 5 based APUs in memory controller/cache/V-cache system integration, or whether everything is already sorted out in the APU space with later hUMA iterations.
 

Imaamdfanboy

New Member
Joined
Jun 12, 2022
Messages
4 (0.00/day)
Nope...500% is indeed 5 times more.
If you have two apples for comparison, they are both equal. Let's say they are both exactly the same size; then they are 1 to 1. This is why they do the 1.5x or maybe 2.3x explanation these days. The ones are the equal part, so the digits after the point are the percentage. So 1.5x is then 50 percent faster, or if you like, half again as fast as one. Percentages are not like normal maths, which comes down to 1*1 = 1; the baseline is exactly 0.
So 2.3 times will be 130% more, which relates to 1.3 times more powerful.
 