• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD Zen 5 Execution Engine Leaked, Features True 512-bit FPU

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
47,229 (7.55/day)
Location
Hyderabad, India
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard ASUS ROG Strix B450-E Gaming
Cooling DeepCool Gammax L240 V2
Memory 2x 8GB G.Skill Sniper X
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
AMD "Zen 5" CPU microarchitecture will introduce a significant performance increase for AVX-512 workloads, with some sources reported as high as 40% performance increases over "Zen 4" in benchmarks that use AVX-512. A Moore's Law is Dead report detailing the execution engine of "Zen 5" holds the answer to how the company managed this—using a true 512-bit FPU. Currently, AMD uses a dual-pumped 256-bit FPU to execute AVX-512 workloads on "Zen 4." The updated FPU should significantly improve the core's performance in workloads that take advantage of 512-bit AVX or VNNI instructions, such as AI.

Giving "Zen 5" a 512-bit FPU meant that AMD also had to scale up the ancillaries—all the components that keep the FPU fed with data and instructions. The company therefore increased the capacity of the L1 DTLB. The load-store queues have been widened to meet the needs of the new FPU. The L1 Data cache has been doubled in bandwidth, and increased in size by 50%. The L1D is now 48 KB in size, up from 32 KB in "Zen 4." FPU MADD latency has been reduced by 1 cycle. Besides the FPU, AMD also increased the number of Integer execution pipes to 10, from 8 on "Zen 4." The exclusive L2 cache per core remains 1 MB in size.



Update 07:02 UTC: Moore's Law is Dead reached out to us and said that the slide previously posted by them, which we had used in an earlier version of this article, is fake, but said that the information contained in that slide is correct, and that they stand by the information.

View at TechPowerUp Main Site | Source
 
Joined
Sep 9, 2015
Messages
287 (0.09/day)
At this point, I'm pretty sure this guy makes up these charts and plasters their YT name on it. ...womp womp, this is a 0/10 leak
 
Joined
Dec 5, 2017
Messages
157 (0.06/day)
I have no interest in AVX512 but the upgrades on the integer side look to be compelling. Look forward to seeing how beefed up the front end is, it's about time to go wider.
 

freeagent

Moderator
Staff member
Joined
Sep 16, 2018
Messages
8,502 (3.77/day)
Location
Winnipeg, Canada
Processor AMD R7 5800X3D
Motherboard Asus Crosshair VIII Dark Hero
Cooling Thermalright Frozen Edge 360, 3x TL-B12 V2, 2x TL-B12 V1
Memory 2x8 G.Skill Trident Z Royal 3200C14, 2x8GB G.Skill Trident Z Black and White 3200 C14
Video Card(s) Zotac 4070 Ti Trinity OC
Storage WD SN850 1TB, SN850X 2TB, SN770 1TB
Display(s) LG 50UP7100
Case Fractal Torrent Compact
Audio Device(s) JBL Bar 700
Power Supply Seasonic Vertex GX-1000, Monster HDP1800
Mouse Logitech G502 Hero
Keyboard Logitech G213
VR HMD Oculus 3
Software Yes
Benchmark Scores Yes
At this point, I'm pretty sure this guy makes up these charts and plasters their YT name on it. ...womp womp, this is a 0/10 leak
Lets see you do better :laugh:
 

AsRock

TPU addict
Joined
Jun 23, 2007
Messages
19,077 (3.00/day)
Location
UK\USA
"Leaked", sure... i will believe when some one gets fired for it.
 
Joined
Feb 10, 2023
Messages
275 (0.42/day)
Location
Lake Superior
This is a fake slide. But gcc patches do show 6 ALU and 4 AGU for znver5.
So whoever made it added elements of truth here and there.
 
Joined
Nov 29, 2022
Messages
819 (1.13/day)
Processor Intel i7 77OOK
Motherboard Gigabyte Aorus something
Cooling Noctua NH-U12S dual fan
Memory Ballistix 32 Go
Video Card(s) MSI 3060 Gaming X
Storage Mixed bag of M2 SSD and SATA SSD
Display(s) MSI 34" 3440x1440 Artimys 343CQR
Case Old Corsair Obsidian something
Audio Device(s) Integrated
Power Supply Old Antec HCG 620 still running good
Mouse Steelseries something
Keyboard Steelseries someting too
Benchmark Scores bench ? no time to lose with bench ! :)
Fake, legit ... who cares ?
Only trust the reviews and the FPS in game :)
 
Joined
Mar 6, 2018
Messages
131 (0.05/day)
Full 512-bit FPUs could be a curse with high power consumptions and high transistor budget.
 
Joined
Apr 19, 2018
Messages
1,227 (0.51/day)
Processor AMD Ryzen 9 5950X
Motherboard Asus ROG Crosshair VIII Hero WiFi
Cooling Arctic Liquid Freezer II 420
Memory 32Gb G-Skill Trident Z Neo @3806MHz C14
Video Card(s) MSI GeForce RTX2070
Storage Seagate FireCuda 530 1TB
Display(s) Samsung G9 49" Curved Ultrawide
Case Cooler Master Cosmos
Audio Device(s) O2 USB Headphone AMP
Power Supply Corsair HX850i
Mouse Logitech G502
Keyboard Cherry MX
Software Windows 11
The low L2 cache size is an obvious planned mistake and low hanging fruit for Zen 6 to fix, we know AMD were experimenting with larger L2 cache sizes, and that 2MB was the sweet spot, and 3MB offering only slight low single-digit uplift in perf over 2MB. One of the reasons for the infamous "AMD dip".

And it's also borderline criminal AMD do not rectify the L3 cache starvation issue without the "3D cache band-aid" cash grab. Even a better memory controller would help in this regard.
 
Last edited:
Joined
Jun 12, 2020
Messages
66 (0.04/day)
Processor AMD Ryzen 9 9950X
Motherboard Asus ROG Strix B650E-E Gaming Wifi bios 3040 w/Agesa 1.2.0.2
Cooling Thermalright Phantom Spirit 120
Memory 64 GB Kingston FURY Beast DDR5-6000 CL30 - 2x32 GB
Video Card(s) ASRock Radeon RX 7900 XTX Phantom Gaming OC
Storage 1 x WD Black SN850 1TB, 1 x Samsung 990 PRO 2TB, 2 x Samsung 860 1TB, 1 x Segate 16TB HDD
Display(s) Dell G3223Q 4K UHD
Case NZXT H7 Flow (2024) - All Black
Audio Device(s) ROG SupremeFX 7.1 Surround Sound High Definition Audio CODEC ALC4080
Power Supply Thermalright TP 1000 Watt
Mouse Razer DeathAdder v3.0 PRO
Keyboard Razer BlackWidow V4
Software Windows 11 PRO 24H2 build 26100.1882
At this point, I'm pretty sure this guy makes up these charts and plasters their YT name on it. ...womp womp, this is a 0/10 leak

Because it makes perfect sense to show the leaked slides unedited .. oh wait.
 
Joined
Jan 3, 2021
Messages
3,479 (2.46/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
I have no interest in AVX512 but the upgrades on the integer side look to be compelling. Look forward to seeing how beefed up the front end is, it's about time to go wider.
AVX512 is for integer and bitwise operations too, not only for FP. That's where SPEC-int gains, purportedly very big, come from.
 
Joined
Oct 6, 2021
Messages
1,605 (1.41/day)
This is a fake slide. But gcc patches do show 6 ALU and 4 AGU for znver5.
So whoever made it added elements of truth here and there.
Yes, it's practically confirmed that zen5 will bring some drastic changes compared to its predecessor.
Someone must have just made slides on top of this info.
 

bug

Joined
May 22, 2015
Messages
13,753 (3.96/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10

bug

Joined
May 22, 2015
Messages
13,753 (3.96/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
Maybe they've found a way to use the full AVX512 without the thermal implications and power consumption.
Thermal have certainly improved, but the discussion was more about the large amount of die space being used for specialized purposes. That's still the case. Considering the increased competition for fab capacity, you'd think "wasted" transistors is more of o problem today than it was 4 years ago.
 
Joined
Dec 12, 2016
Messages
1,824 (0.63/day)
Thermal have certainly improved, but the discussion was more about the large amount of die space being used for specialized purposes. That's still the case. Considering the increased competition for fab capacity, you'd think "wasted" transistors is more of o problem today than it was 4 years ago.

Isn’t there some AI / machine learning algorithms that can use AVX512 now?
 

bug

Joined
May 22, 2015
Messages
13,753 (3.96/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
Isn’t there some AI / machine learning algorithms that can use AVX512 now?
If run locally, maybe. But currently most models worth anything are too big to run a consumer PC. And that's not going to change: no matter how capable PCs will grow, the cloud will always be better.
 

SL2

Joined
Jan 27, 2006
Messages
2,438 (0.35/day)
If run locally, maybe. But currently most models worth anything are too big to run a consumer PC. And that's not going to change: no matter how capable PCs will grow, the cloud will always be better.
Zen 5 isn't for consumer PC's alone, tho.

I've stopped counting all the times I've read Zen as Ryzen in a leak, without thinking. That's not to say that Ryzen won't have this.
 
Joined
Mar 13, 2021
Messages
471 (0.35/day)
Processor AMD 7600x
Motherboard Asrock x670e Steel Legend
Cooling Silver Arrow Extreme IBe Rev B with 2x 120 Gentle Typhoons
Memory 4x16Gb Patriot Viper Non RGB @ 6000 30-36-36-36-40
Video Card(s) XFX 6950XT MERC 319
Storage 2x Crucial P5 Plus 1Tb NVME
Display(s) 3x Dell Ultrasharp U2414h
Case Coolermaster Stacker 832
Power Supply Thermaltake Toughpower PF3 850 watt
Mouse Logitech G502 (OG)
Keyboard Logitech G512
I'm a bit confused. A few years ago we were burning Intel to the stake for AVX-512 (https://linuxiac.com/linus-torvalds-criticizes-intel-avx-512/, but not only). Now we're cheering for the same AVX-512?
There was a lot of hubub about Intel marketing using the AVX benchmarking to show it still having a massive lead in general. When in actual fact there was little to no lead in anything that didn't use avx512

Similar to nVidia when they were releasing benchmarks with the tiniest of writing saying "using dlsss"
 

bug

Joined
May 22, 2015
Messages
13,753 (3.96/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
Zen 5 isn't for consumer PC's alone, tho.

I've stopped counting all the times I've read Zen as Ryzen in a leak, without thinking. That's not to say that Ryzen won't have this.
That's true, but so far AMD has made no difference in that regard between server and desktop.

And I'm not even saying AVX-512 is bad, my question was more about what changed in the meantime.
 

SL2

Joined
Jan 27, 2006
Messages
2,438 (0.35/day)
That's true, but so far AMD has made no difference in that regard between server and desktop.
That's why there's no point questioning any feature in a Ryzen CPU as long as it makes sense in EPYC. I'm pretty sure the latter dictates a lot of the design due to $.
And I'm not even saying AVX-512 is bad, my question was more about what changed in the meantime.
I think you've answered that already. ;)
And that's not going to change: no matter how capable PCs will grow, the cloud will always be better.
 
Last edited:
Joined
Oct 6, 2021
Messages
1,605 (1.41/day)
Thermal have certainly improved, but the discussion was more about the large amount of die space being used for specialized purposes. That's still the case. Considering the increased competition for fab capacity, you'd think "wasted" transistors is more of o problem today than it was 4 years ago.
If it translates into an advantage in AMD's most valuable market, I suppose it's worth it. The gains that AVX512 brings when used properly are massive.

I'd just like to see more mainstream consumer applications using such an instruction set.
 

bug

Joined
May 22, 2015
Messages
13,753 (3.96/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
If it translates into an advantage in AMD's most valuable market, I suppose it's worth it. The gains that AVX512 brings when used properly are massive.

I'd just like to see more mainstream consumer applications using such an instruction set.
I'm a bit more in the other camp: if it only benefits like 10% of the typical workloads, I'd rather do without and have CPUs that are 20-30% cheaper instead.

At the same time, I realize this is basically a chicken-and-egg problem: if AVX-512 isn't available, apps that use it won't be either.
 
Top