AMD Details ZEN Microarchitecture IPC Gains

btarunr · Aug 24, 2016

AMD Tuesday hosted a ZEN microarchitecture deep-dive presentation in the backdrop of Hot Chips, outlining its road to a massive 40 percent gain in IPC (translated roughly as per-core performance gains), over the current "Excavator" microarchitecture. The company credits the gains to three major changes with ZEN: better core engine, better cache system, and lower power. With ZEN, AMD pulled back from its "Bulldozer" approach to cores, in which two cores share certain number-crunching components to form "modules," and back to a self-sufficient core design.

Beyond cores, the next-level subunit of the ZEN architecture is the CPU-Complex (CCX), in which four cores share an 8 MB L3 cache. This isn't different from current Intel architectures, the cores share nothing beyond L3 cache, making them truly independent. What makes ZEN a better core, besides its independence from other cores, and additional integer pipelines; subtle upscaling in key ancillaries such as micro-Op dispatch, instruction schedulers; retire, load, and store queues; and a larger quad-issue FPU.

AMD also improved the cache system. The hierarchy is similar to pre-Bulldozer AMD architectures, with L3 cache being shared between full-fledged cores, and each core having a dedicated L2 cache. The L1 cache is now write-back (and not write-through), the SRAM that makes up the L2 and L3 caches are faster.

The L3 cache SRAM has 5 times higher bandwidth than the L3 cache found on current AMD architectures. The L1 and L2 caches have 2 times the bandwidth. Load from cache to FPU is now faster. The core is endowed with 64 KB each of L1I cache, 32 KB L1D cache; 512 KB of dedicated L2 cache, and 8 MB of L3 cache shared between four cores in a CCX.

ZEN introduces simultaneous multi-threading (SMT) to AMD processors. Intel's SMT implementation is the popular HyperThreading Technology. AMD's SMT is similar in that each core is addressed to as two threads, with each thread competing for the resources on the core.

The third key area is lower-power, and this is attributed not just to the silicon-level gains yielded from the move to the 14 nm FinFET process. The design team focused on power-draw from the very inception of the ZEN core project. The L1 write-back cache, and the Op cache lower power-draw; the various components on ZEN processors feature aggressive clock-gating, although there's no power-gating.

AMD expanded the ISA CPU instruction-sets, with AVX, AVX2, BMI1, BMI2, AES, RDRAND, sMEP, SHA1/SHA256, ADX, CFLUSHopt, XSAVEC/XSAVES/XRSTORS, and SMAP. The company also introduced a few AMD-exclusive instruction sets, which can be taken advantage of for better performance, including CLzero, and PTE Coalescing.

View at TechPowerUp Main Site

hardcore_gamer · Aug 24, 2016

Looks interesting indeed. BTW, does the agreement between AMD and Intel allow one company to start implementing new instructions introduced by the other ?
Thanks for waking up early and posting this, Tarun. :toast:

natr0n · Aug 24, 2016

Chaitanya · Aug 24, 2016

Read this couple of days back, really excited to see reviews of these new CPUs and hoping finally there will a choice for PC builders.

Durvelle27 · Aug 24, 2016

Holy hell

This really got me even more excited for Zen

ZoneDymo · Aug 24, 2016

I love that 3rd slide, "better" , "faster"

Prima.Vera · Aug 24, 2016

btarunr said:
...back to a self-sufficient core design

This is the key, right there. Specially for Games or unoptimized apps.

chaosmassive · Aug 24, 2016

with introducing AMD-exclusive instruction will path of CPU diverge like GPU ?

Arumio · Aug 24, 2016

hardcore_gamer said:
Looks interesting indeed. BTW, does the agreement between AMD and Intel allow one company to start implementing new instructions introduced by the other ?
Thanks for waking up early and posting this, Tarun.

You mean SMT? Actually AMD found it, but for some reason they don't implement it in their products in time

ViperXTR · Aug 24, 2016

Instructions? AMD 3Dnow! anyone? though it's more of an MMX enhancement at that time

medi01 · Aug 24, 2016

I'm rather skeptical on power draw claims after rather disappointing results on 480, will wait for benchmarks (although not from TPU, sorry guys)

overlord · Aug 24, 2016

hardcore_gamer said:
Looks interesting indeed. BTW, does the agreement between AMD and Intel allow one company to start implementing new instructions introduced by the other ?
Thanks for waking up early and posting this, Tarun.

Short answer yes, as part of their cross license agreement https://www.sec.gov/Archives/edgar/data/2488/000119312509236705/dex102.htm
That's why AMD64 coexist with x86 instructions.
http://www.kitguru.net/components/c...nge-of-control-terminates-agreement-for-both/
http://www.theinquirer.net/inquirer...oks-to-hsa-foundation-to-avoid-amd64-mistakes
http://www.cnet.com/news/intel-ftc-settle-antitrust-case/
http://web.archive.org/web/20000302151607/http://www1.amd.com/newsroom/display/1,1528,435,00.html

overlord · Aug 24, 2016

ViperXTR said:
Instructions? AMD 3Dnow! anyone? though it's more of an MMX enhancement at that time

Correct along with AMD64

$ReaPeR$ · Aug 24, 2016

all this is fine but with no benchmarks, there is really no point in this pr crap.

the54thvoid · Aug 24, 2016

$ReaPeR$ said:
all this is fine but with no benchmarks, there is really no point in this pr crap.

They have to release PR. It isn't for us, it's for the investors, hence the rise in share price. AMD need to be seen to be releasing a 'confident' statement on their new CPU.
Anandtech has a 'discussion' on their recent PR, mostly around the Blender bench and explain what may be happening. AT says they (AMD) aren't being as hyperbolic as Bulldozer release and are vague enough with the benchmark as to keep things within expectations.
But as said, this keeps investors happy. They all do it (Intel, Nvidia), so it's not an AMD peculiarity.

dj-electric · Aug 24, 2016

Usually 40% would impress me between CPU gens. This actually worries me a bit.
40% is what i would think is a bare minimum to compete with today's intel's IPC

$ReaPeR$ · Aug 24, 2016

the54thvoid said:
They have to release PR. It isn't for us, it's for the investors, hence the rise in share price. AMD need to be seen to be releasing a 'confident' statement on their new CPU.
Anandtech has a 'discussion' on their recent PR, mostly around the Blender bench and explain what may be happening. AT says they (AMD) aren't being as hyperbolic as Bulldozer release and are vague enough with the benchmark as to keep things within expectations.
But as said, this keeps investors happy. They all do it (Intel, Nvidia), so it's not an AMD peculiarity.

well yes, obviously. i know they all do it, i was just stating the fact. AMD seems more restrained this time and that gives me hope for the capabilities of the zen arch, but, without independent benchmarks, there is no way to know for sure what it can do. also, i don't expect investors to be that dumb and trust the PR from AMD, or any company for that matter. maybe i'm wrong though and investors are that dumb, and they buy into the hype just to complain later that it didn't match their expectations.

TheinsanegamerN · Aug 24, 2016

$ReaPeR$ said:
well yes, obviously. i know they all do it, i was just stating the fact. AMD seems more restrained this time and that gives me hope for the capabilities of the zen arch, but, without independent benchmarks, there is no way to know for sure what it can do. also, i don't expect investors to be that dumb and trust the PR from AMD, or any company for that matter. maybe i'm wrong though and investors are that dumb, and they buy into the hype just to complain later that it didn't match their expectations.

Given how gamers will be suckered by hype again and again and again despite having been burned enough to need a skin graft, I'd say its human nature to fall for this PR BS. investors are definitely not immune to that (theranos, anybody?)

ArdWar · Aug 24, 2016

How "lower power" translate into IPC gain?

TheinsanegamerN · Aug 24, 2016

ArdWar said:
How "lower power" translate into IPC gain?

Focusing on lower power draw instead of super high clocks? I doubt that lower power is part of the IPC gain, but rather is an additional bonus on top of the IPC gains.

Vayra86 · Aug 24, 2016

Well, at least we know the right ingredients are in the mix now.

And we have yet to see what AMD will really cook up with it.

RejZoR · Aug 24, 2016

Dj-ElectriC said:
Usually 40% would impress me between CPU gens. This actually worries me a bit.
40% is what i would think is a bare minimum to compete with today's intel's IPC

Intel doesn't make 40% jumps between generations...

Cvrk · Aug 24, 2016

reading this. knowing it will be out somewhere in october . i would postpone building a i5 6600k pc. whats 2 moremonths considering i will be having this computer for about 3-4 years.

EarthDog · Aug 24, 2016

Ive seen now, in a couple threads, you keep saying "October". The latest I recall seeing is 4Q 2016. This means Oct-Dec. Sorry to split hairs, but, people will take that and run with it.

That said, if you have a link that shows October, post it up!

RejZoR said:
Intel doesn't make 40% jumps between generations...

Your point? Did you quote the wrong person? He didn't say nor allude to that fact. He is looking for better than Intel performance.

ArdWar · Aug 24, 2016

TheinsanegamerN said:
Focusing on lower power draw instead of super high clocks? I doubt that lower power is part of the IPC gain, but rather is an additional bonus on top of the IPC gains.

That would means an efficiency or performance gains, which is reasonable since the slideshows are actually never indicate anything about these improvements results in IPC gain.

Or maybe lower power allows them to use more complex cores.

System Name	RBMK-1000
Processor	AMD Ryzen 7 5700G
Motherboard	ASUS ROG Strix B450-E Gaming
Cooling	DeepCool Gammax L240 V2
Memory	2x 8GB G.Skill Sniper X
Video Card(s)	Palit GeForce RTX 2080 SUPER GameRock
Storage	Western Digital Black NVMe 512GB
Display(s)	BenQ 1440p 60 Hz 27-inch
Case	Corsair Carbide 100R
Audio Device(s)	ASUS SupremeFX S1220A
Power Supply	Cooler Master MWE Gold 650W
Mouse	ASUS ROG Strix Impact
Keyboard	Gamdias Hermes E2
Software	Windows 11 Pro

System Name	ITX Desktop
Processor	Core i7 9700K
Motherboard	Gigabyte Aorus Pro WiFi Z390
Cooling	Arctic esports 34 duo.
Memory	Corsair Vengeance LPX 16GB 3000MHz
Video Card(s)	Gigabyte GeForce RTX 2070 Gaming OC White PRO
Storage	Samsung 970 EVO Plus \| Intel SSD 660p
Case	NZXT H200
Power Supply	Corsair CX Series 750 Watt

System Name	natr0n-PC
Processor	Ryzen 5950x-5600x \| 9600k
Motherboard	B450 AORUS M \| Z390 UD
Cooling	EK AIO 360 - 6 fan action \| AIO
Memory	Patriot - Viper Steel DDR4 (B-Die)(4x8GB) \| Samsung DDR4 (4x8GB)
Video Card(s)	EVGA 3070ti FTW
Storage	Various
Display(s)	Pixio PX279 Prime
Case	Thermaltake Level 20 VT \| Black bench
Audio Device(s)	LOXJIE D10 + Kinter Amp + 6 Bookshelf Speakers Sony+JVC+Sony
Power Supply	Super Flower Leadex III ARGB 80+ Gold 650W \| EVGA 700 Gold
Software	XP/7/8.1/10
Benchmark Scores	http://valid.x86.fr/79kuh6

System Name	Black Prometheus
Processor	Ryzen 7 3700X
Cooling	Thermalright PA120 SE
Storage	Sandisk X300 512GB + WD Black 6TB+WD Black 6TB
Display(s)	ACER AOPEN 34" 3440x1440 144Hz
Case	DeepCool Matrexx 55 V3 w/ 6x120mm Intake + 3x120mm Exhaust
Audio Device(s)	LG Dolby Atmos 5.1
Power Supply	EVGA 600W
Mouse	Logitech Trackman
Keyboard	Logitech K350
Software	Windows 10 EDU x64

System Name	Cyberline
Processor	Intel Core i7 2600k -> 12600k
Motherboard	Asus P8P67 LE Rev 3.0 -> Gigabyte Z690 Auros Elite DDR4
Cooling	Tuniq Tower 120 -> Custom Watercoolingloop
Memory	Corsair (4x2) 8gb 1600mhz -> Crucial (8x2) 16gb 3600mhz
Video Card(s)	AMD RX480 -> RX7800XT
Storage	Samsung 750 Evo 250gb SSD + WD 1tb x 2 + WD 2tb -> 2tb MVMe SSD
Display(s)	Philips 32inch LPF5605H (television) -> Dell S3220DGF
Case	antec 600 -> Thermaltake Tenor HTCP case
Audio Device(s)	Focusrite 2i4 (USB)
Power Supply	Seasonic 620watt 80+ Platinum
Mouse	Elecom EX-G
Keyboard	Rapoo V700
Software	Windows 10 Pro 64bit

AMD Details ZEN Microarchitecture IPC Gains

btarunr

Editor & Senior Moderator

hardcore_gamer

natr0n

Chaitanya

Durvelle27

Moderator

ZoneDymo

Prima.Vera

chaosmassive

Arumio

ViperXTR

medi01

overlord

overlord

$ReaPeR$

the54thvoid

Super Intoxicated Moderator

dj-electric

$ReaPeR$

TheinsanegamerN

ArdWar

TheinsanegamerN

Vayra86

RejZoR

Cvrk

EarthDog

ArdWar

Processor	Intel® Core™ i7-13700K
Motherboard	Gigabyte Z790 Aorus Elite AX
Cooling	Noctua NH-D15
Memory	32GB(2x16) DDR5@6600MHz G-Skill Trident Z5
Video Card(s)	ZOTAC GAMING GeForce RTX 3080 AMP Holo
Storage	2TB SK Platinum P41 SSD + 4TB SanDisk Ultra SSD + 500GB Samsung 840 EVO SSD
Display(s)	Acer Predator X34 3440x1440@100Hz G-Sync
Case	NZXT PHANTOM410-BK
Audio Device(s)	Creative X-Fi Titanium PCIe
Power Supply	Corsair 850W
Mouse	Logitech Hero G502 SE
Software	Windows 11 Pro - 64bit
Benchmark Scores	30FPS in NFS:Rivals

System Name	Scrapped Parts, Unite !
Processor	Ryzen 5 3600 @4.0 Ghz
Motherboard	MSI B450-A Pro MAX
Cooling	Stock
Memory	Team Group Elite 16 GB 3133Mhz
Video Card(s)	Colorful iGame GeForce GTX1060 Vulcan U 6G
Storage	Hitachi 500 GB, Sony 1TB, KINGSTON 400A 120GB // Samsung 160 GB
Display(s)	HP 2009f
Case	Xigmatek Asgard Pro // Cooler Master Centurion 5
Power Supply	OCZ ModXStream Pro 500 W
Mouse	Logitech G102
Software	Windows 10 x64
Benchmark Scores	Minesweeper 30fps, Tetris 40 fps, with overheated CPU and GPU

System Name	Ultima
Processor	AMD Ryzen 7 5800X
Motherboard	MSI Mag B550M Mortar
Cooling	Arctic Liquid Freezer II 240 rev4 w/ Ryzen offset mount
Memory	G.SKill Ripjaws V 2x16GB DDR4 3600
Video Card(s)	Palit GeForce RTX 4070 12GB Dual
Storage	WD Black SN850X 2TB Gen4, Samsung 970 Evo Plus 500GB , 1TB Crucial MX500 SSD sata,
Display(s)	ASUS TUF VG249Q3A 24" 1080p 165-180Hz VRR
Case	DarkFlash DLM21 Mesh
Audio Device(s)	Onboard Realtek ALC1200 Audio/Nvidia HD Audio
Power Supply	Corsair RM650
Mouse	Rog Strix Impact 3 Wireless \| Wacom Intuos CTH-480
Keyboard	A4Tech B314 Keyboard
Software	Windows 10 Pro

System Name	M3401 notebook
Processor	5600H
Motherboard	NA
Memory	16GB
Video Card(s)	3050
Storage	500GB SSD
Display(s)	14" OLED screen of the laptop
Software	Windows 10
Benchmark Scores	3050 scores good 15-20% lower than average, despite ASUS's claims that it has uber cooling.

System Name	Ryzen/Laptop/htpc
Processor	R9 3900X/i7 6700HQ/i7 2600
Motherboard	AsRock X470 Taichi/Acer/ Gigabyte H77M
Cooling	Corsair H115i pro with 2 Noctua NF-A14 chromax/OEM/Noctua NH-L12i
Memory	G.Skill Trident Z 32GB @3200/16GB DDR4 2666 HyperX impact/24GB
Video Card(s)	TUL Red Dragon Vega 56/Intel HD 530 - GTX 950m/ 970 GTX
Storage	970pro NVMe 512GB,Samsung 860evo 1TB, 3x4TB WD gold/Transcend 830s, 1TB Toshiba/Adata 256GB + 1TB WD
Display(s)	Philips FTV 32 inch + Dell 2407WFP-HC/OEM/Sony KDL-42W828B
Case	Phanteks Enthoo Luxe/Acer Barebone/Enermax
Audio Device(s)	SoundBlasterX AE-5 (Dell A525)(HyperX Cloud Alpha)/mojo/soundblaster xfi gamer
Power Supply	Seasonic focus+ 850 platinum (SSR-850PX)/165 Watt power brick/Enermax 650W
Mouse	G502 Hero/M705 Marathon/G305 Hero Lightspeed
Keyboard	G19/oem/Steelseries Apex 300
Software	Win10 pro 64bit

Processor	Ryzen 7800X3D
Motherboard	MSI MAG Mortar B650 (wifi)
Cooling	be quiet! Dark Rock Pro 4
Memory	32GB Kingston Fury
Video Card(s)	Gainward RTX4070ti
Storage	Seagate FireCuda 530 M.2 1TB / Samsumg 960 Pro M.2 512Gb
Display(s)	LG 32" 165Hz 1440p GSYNC
Case	Asus Prime AP201
Audio Device(s)	On Board
Power Supply	be quiet! Pure POwer M12 850w Gold (ATX3.0)
Software	W10

System Name	Skunkworks 3.0
Processor	5800x3d
Motherboard	x570 unify
Cooling	Noctua NH-U12A
Memory	32GB 3600 mhz
Video Card(s)	asrock 6800xt challenger D
Storage	Sabarent rocket 4.0 2TB, MX 500 2TB
Display(s)	Asus 1440p144 27"
Case	Old arse cooler master 932
Power Supply	Corsair 1200w platinum
Mouse	squeak
Keyboard	Some old office thing
Software	Manjaro

System Name	Asus X450JB
Processor	Intel Core i7-4720HQ
Motherboard	Asus
Memory	2x 4GiB
Video Card(s)	nVidia GT940M
Storage	2x 1TB

System Name	Tiny the White Yeti
Processor	7800X3D
Motherboard	MSI MAG Mortar b650m wifi
Cooling	CPU: Thermalright Peerless Assassin / Case: Phanteks T30-120 x3
Memory	32GB Corsair Vengeance 30CL6000
Video Card(s)	ASRock RX7900XT Phantom Gaming
Storage	Lexar NM790 4TB + Samsung 850 EVO 1TB + Samsung 980 1TB + Crucial BX100 250GB
Display(s)	Gigabyte G34QWC (3440x1440)
Case	Lian Li A3 mATX White
Audio Device(s)	Harman Kardon AVR137 + 2.1
Power Supply	EVGA Supernova G2 750W
Mouse	Steelseries Aerox 5
Keyboard	Lenovo Thinkpad Trackpoint II
VR HMD	HD 420 - Green Edition ;)
Software	W11 IoT Enterprise LTSC
Benchmark Scores	Over 9000

System Name	Dark Monolith
Processor	AMD Ryzen 7 5800X3D
Motherboard	ASUS Strix X570-E
Cooling	Arctic Cooling Freezer II 240mm + 2x SilentWings 3 120mm
Memory	64 GB G.Skill Ripjaws V Black
Video Card(s)	XFX Radeon RX 9070 XT Mercury OC Magnetic Air
Storage	Seagate Firecuda 530 4 TB SSD + Samsung 850 Pro 2 TB SSD + Seagate Barracuda 8 TB HDD
Display(s)	ASUS ROG Swift PG27AQDM 240Hz OLED
Case	Silverstone Kublai KL-07
Audio Device(s)	Sound Blaster AE-9 MUSES Edition + Altec Lansing MX5021 2.1 Nichicon Gold
Power Supply	BeQuiet DarkPower 11 Pro 750W
Mouse	Logitech G502 Core
Keyboard	UVI Pride MechaOptical
Software	Windows 11 Pro

Processor	Ryzen 5700x
Motherboard	MSI B350 Gaming Pro Carbon
Cooling	be quiet dark rock pro 3
Memory	GSKill Aegis 32GB (4x8GB) DDR4 3200MHz CL16
Video Card(s)	PowerColor Radeon RX 7800 XT Hellhound 16GB GDDR6 256-bit
Storage	Seagate Barracuda SATA-II 1TB , HyperX Savage 240GB SATA 3
Display(s)	Benq EX2780Q
Case	Be Quiet! Dark Base Pro 900
Audio Device(s)	Sound BlasterX G6
Power Supply	Seasonic prime TX-650
Mouse	Marvo Scorpion G981
Keyboard	Razer Blackwidow Elite - Yellow Switch
Software	Windows 10 Pro