AMD Launches 4th Gen EPYC "Genoa" Zen 4 Server Processors: 100% Performance Uplift for 50% More Cores

btarunr · Nov 10, 2022

AMD at a special media event titled "together we advance_data centers," formally launched its 4th generation EPYC "Genoa" server processors based on the "Zen 4" microarchitecture. These processors debut an all new platform, with modern I/O connectivity that includes PCI-Express Gen 5, CXL, and DDR5 memory. The processors come in CPU core-counts of up to 96-core/192-thread. There are as many as 18 processor SKUs, differentiated not just in CPU core-counts, but also the way the the cores are spread across the up to 12 "Zen 4" chiplets (CCDs). Each chiplet features up to 8 "Zen 4" CPU cores, depending on the model; up to 32 MB of L3 cache, and is built on the 5 nm EUV process at TSMC. The CCDs talk to a centralized server I/O die (sIOD), which is built on the 6 nm process.

The processors AMD is launching today are the EPYC "Genoa" series, targeting general purpose servers, although they can be deployed in large cloud data-centers, too. To large-scale cloud providers such as AWS, Azure, and Google Cloud, AMD is readying a different class of processor, codenamed "Bergamo," which is plans to launch later. In 2023, the company will launch the "Genoa-X" line of processor for technical-compute and HPC applications, which benefit from large on-die caches, as they feature the 3D Vertical Cache technology. There will also be "Siena," a class of EPYC processors targeting the telecom and edge-computing markets, which could see an integration of more Xilinx IP.

The EPYC "Genoa" processor, as we mentioned, comes in core-counts of up to 96-core/192-thread, dominating the 40-core/80-thread counts of the 3rd Gen Xeon Scalable "Ice Lake-SP," and also staying ahead of the 60-core/120-thread counts of the upcoming Xeon Scalable "Sapphire Rapids." The new AMD processor also sees a significant buff of its I/O capabilities, featuring a 12-channel (24 sub-channel) DDR5 memory interface, and a gargantuan 160-lane PCI-Express Gen 5 interface (that's ten Gen 5 x16 slots running at full bandwidth). and platform support for CXL and 2P xGMI links by subtracting some of those multipurpose lanes.

The new 6 nm server I/O die (sIOD) has a significantly higher transistor count than the 12 nm one powering past-gen EPYC processors. The high transistor count is due to two large 80-lane configurable SERDES (serializer-deserializer) components, which can be made to put out PCIe Gen 5 lanes, CXL 1.1 lanes, SATA 6 Gbps ports, or even the inter-socket Infinity Fabric enabling 2P platforms. The processor supports up to 64 CXL 1.1 lanes that can be used to connect to networked memory-pooling devices. 3rd generation Infinity Fabric connects the various components inside the sIOD, the sIOD to the twelve "Zen 4" CCDs via IFOP, and as an inter-socket interconnect. The processor features a 12-channel (24 x 40-bit sub-channels) memory interface, which supports up to 6 TB of ECC DDR5-4800 memory per socket. The latest generation Secure Processor provides SEV-SNP (secure nested paging), and AES-256-XTS, for a larger number of secure VMs.

Each of the 5 nm CPU complex dies (CCDs) is physically identical to the ones you find in Ryzen 7000-series "Raphael" desktop processors. It packs 8 "Zen 4" CPU cores, each with 1 MB of dedicated L2 cache, and 32 MB of L3 cache shared among the 8 cores. Each "Zen 4" core provides a 14% generational performance uplift compared to "Zen 3," with clock-speed kept constant. Much of this uplift comes from updates to the core's Front-end and Load/store unit, while the branch predictor, larger L2 cache, and execution engine, make smaller contributions. The biggest generational change is the ISA, which sees the introduction of support for the AVX-512 instruction-set, VNNI, and bfloat16. The new instruction sets should accelerate AVX-512 math workloads, as well as accelerate performance with AI applications. AMD says that its AVX-512 implementation is more die-efficient compared to Intel's, as it is using existing 256-bit wide FPU in a double-pumped fashion to enable 512-bit operations.

AMD is launching a total of 18 processor SKUs today, all meant for the Socket SP5 platform. It follows the nomenclature as described in the slide below. EPYC is the top-level brand, "9" is the product series. The next digit indicates core-count, with "0" denoting 8 cores, "1" denoting 16, "2" denoting 24, "3" denoting 32, "4" denoting 48, "5" being 64, and "6" being 84-96. The next digit denotes performance on a 1-10 scale. The last digit is actually a character, which could either be "P" or "F," with P denoting 2P-capable SKUs, and "F" denoting special SKUs that focus on fewer cores per CCD to improve per-core performance. The configurable TDP of all SKUs is rated up to 400 W, which seems high, but one should take into account the CPU core-count, and the impact it has on the number of server blades per rack. This is one of the reason AMD isn't scaling beyond 2 sockets per server. The company's core-density translates into 67% fewer servers, 52% less power.

In terms of performance, AMD only has Intel's dated 3rd Gen Xeon Scalable "Ice Lake-SP" processors for comparison, since "Sapphire Rapids" is still unreleased. With core-counts equalized, the 16-core EPYC 9174F is shown being 47% faster than the Xeon Gold 6346; the 32-core EPYC 9374F is 55% faster than the Xeon Platinum 8362; and the 48-core EPYC 9474F is 51% faster than the 40-core Xeon Platinum 8380. The same test group also sees 58-96% floating-point performance leadership in favor of AMD.

The complete slide-deck follows.

View at TechPowerUp Main Site

Tek-Check · Nov 10, 2022

An onslaught of slides. Need a few hours to digest this news.

Frick · Nov 10, 2022

AMD 4th Gen EPYC 9004 Series Launched: Genoa Tested In A Data Center Benchmark Gauntlet - Page 2

AMD's many-core Zen 4 EPYC beasts are here to take on serious data center workloads and scalability, and we've got benchmarks to prove it. - Page 2

hothardware.com

Some tests.

zlobby · Nov 10, 2022

Given how Zen4 fares by far, it's a safe bet these will be monsters!

Tek-Check said:
An onslaught of slides. Need a few hours to digest this news.

Them endnotes, though.

Imsochobo · Nov 10, 2022

zlobby said:
Given how Zen4 fares by far, it's a safe bet these will be monsters!

Them endnotes, though.

https://www.servethehome.com/amd-epyc-genoa-gaps-intel-xeon-in-stunning-fashion/8/

Hofnaerrchen · Nov 10, 2022

zlobby said:
Given how Zen4 fares by far, it's a safe bet these will be monsters!

They'd rather be. Desktop CPU sales are down and AM5 still is to expensive and I doubt it will change in the near future. The launch of 7600/7700 non-X will not change the problem of high motherboard and RAM prices.

CapNemo72 · Nov 10, 2022

I think that AMD has a big stock of 5000 series CPUs so is not very aggressive with 7000 series pricing. Once those stocks are gone, they will probably start to lower their prices.
By that time, there will be cheaper motherboards and DDR5 should go down in price too (I am aiming to get 64Gb DDR5 / 6000).

As for Epyc, now let's hope that OEMs will be pushing them more.

ncrs · Nov 10, 2022

Phoronix has published a very comprehensive set of benchmarks under Linux.
I am a bit surprised the "auto oc" setting with Power Determinism mode is able to achieve such results. It's an improvement from the behavior of this setting on EPYC Rome I've tested.

Wirko · Nov 10, 2022

That guy in the blue Ferrari, he might need to fit larger rearview mirrors to it very soon.

AnotherReader · Nov 10, 2022

Imsochobo said:
https://www.servethehome.com/amd-epyc-genoa-gaps-intel-xeon-in-stunning-fashion/8/

As expected, these are monsters that'll probably increase AMD's server market share. The most impressive part is AMD's diversity:

ARM servers will also be handled by Bergamo
Lower cost uses will be served by Siena
HPC will be served by Genoa-X and the MI300 APU

Tek-Check · Nov 10, 2022

AnotherReader said:
As expected, these are monsters that'll probably increase AMD's server market share.

Not probably, but surely. Conservative prediction is 23-25% server market penetration by the end of next year. And this comes on the top of ARM's entry into the game. ARM is predicted to take 8-9% by Q4 2023. So, Intel's share is being eaten by two companies. See bellow.

Performance efficiency is the mantra in server now. Why? Well, if your company can save millions every year on electricity bills, it's no brainer what to do. In 5-6 years, 2017-2023, Intel is on track to lose ~30% of server market share. It's a massive and rapid shift.

Minus Infinity · Nov 10, 2022

Can someone explain why v-cache for Epyc is being touted for HPC, but in Zen3 it only seemed to benefit gaming. I know their must be non-gaming software that surely will benefit but TechP doesn't seem to have anything in their benchmarks. I would be far more tempted to get a 7900X3D for example if I saw tangible gains in productivity apps like COMSOL, Ansys, other physics/chemistry simulations where currently Raptor Lake is much stronger than Zen 4 in general.

Wirko · Nov 11, 2022

Minus Infinity said:
Can someone explain why v-cache for Epyc is being touted for HPC, but in Zen3 it only seemed to benefit gaming. I know their must be non-gaming software that surely will benefit but TechP doesn't seem to have anything in their benchmarks. I would be far more tempted to get a 7900X3D for example if I saw tangible gains in productivity apps like COMSOL, Ansys, other physics/chemistry simulations where currently Raptor Lake is much stronger than Zen 4 in general.

There are some Epyc 7003 X3D benchmarks out there, like this one at Phoronix. Some of the results are impressive.

evernessince · Nov 11, 2022

160 lanes of integrated IO? I want that on the consumer end. Leaves space on the board for plenty of PCIe and M.2 slots.

Patriot · Nov 11, 2022

evernessince said:
160 lanes of integrated IO? I want that on the consumer end. Leaves space on the board for plenty of PCIe and M.2 slots.

Only if you use 3 links instead of 4 between the cpus. 128-160 lanes depending on configuration.

Minus Infinity · Nov 11, 2022

Wirko said:
There are some Epyc 7003 X3D benchmarks out there, like this one at Phoronix. Some of the results are impressive.

Cheers, very informative. I see OpenFoam loves cache. Given Zen 4 v-cache runs cooler and faster and there will be minimal clock speed regression this time around, Zen 4 x3d models should be very strong and at least for gaming wipe the floor with RL.

Wirko · Nov 11, 2022

evernessince said:
160 lanes of integrated IO? I want that on the consumer end. Leaves space on the board for plenty of PCIe and M.2 slots.

I won't comment on CPUs but given the price increases on the consumer end, a good Zen 3 Epyc board by Supermicro has become as cheap as an average X670E board.

Patriot · Nov 11, 2022

Wirko said:
I won't comment on CPUs but given the price increases on the consumer end, a good Zen 3 Epyc board by Supermicro has become as cheap as an average X670E board.
View attachment 269440

You can actually get a Gen3 H11 board+rome 16core off ebay for mid 500s. YMMV
Personally... I have an H12 for my Milan.

Jism · Nov 11, 2022

evernessince said:
160 lanes of integrated IO? I want that on the consumer end. Leaves space on the board for plenty of PCIe and M.2 slots.

They multiply the number based on the additional CCD added to the chip. You cant get so many lanes for a regular desktop CPU unless you opt for threadripper.

dgianstefani · Nov 12, 2022

Bruh.

Wirko · Nov 12, 2022

dgianstefani said:
Bruh.

View attachment 269670

I didn't see that (even if I often catch missspellings), however, the increased L2 and L3 latency I did notice. Doubled size may be an excuse for L2 but what about L3? And it will probably be 4 cycles more for the 3D cache die.

Xajel · Nov 13, 2022

I wonder how Zen4 based Threadripper will be.

Will it be based on the same socket as SP5 but repackaged for TR? liike TR5?

Or will it be smaller, target 64Cores and 8Channels max?

Will they have versions with AI, ML & FPGA chiplets there as well or these might come with Zen5?

Wirko · Nov 13, 2022

Xajel said:
I wonder how Zen4 based Threadripper will be.

Will it be based on the same socket as SP5 but repackaged for TR? liike TR5?

Or will it be smaller, target 64Cores and 8Channels max?

There's an 80% probability that AMD will screw everything up. They are so good at that.

Xajel said:
Will they have versions with AI, ML & FPGA chiplets there as well or these might come with Zen5?

It's also possible that even the generally available Epycs won't have any special-purpose chiplets. Just the semi-custom models.

System Name	RBMK-1000
Processor	AMD Ryzen 7 5700G
Motherboard	ASUS ROG Strix B450-E Gaming
Cooling	DeepCool Gammax L240 V2
Memory	2x 8GB G.Skill Sniper X
Video Card(s)	Palit GeForce RTX 2080 SUPER GameRock
Storage	Western Digital Black NVMe 512GB
Display(s)	BenQ 1440p 60 Hz 27-inch
Case	Corsair Carbide 100R
Audio Device(s)	ASUS SupremeFX S1220A
Power Supply	Cooler Master MWE Gold 650W
Mouse	ASUS ROG Strix Impact
Keyboard	Gamdias Hermes E2
Software	Windows 11 Pro

System Name	Black MC in Tokyo
Processor	Ryzen 5 7600
Motherboard	MSI X670E Gaming Plus Wifi
Cooling	Be Quiet! Pure Rock 2
Memory	2 x 16GB Corsair Vengeance @ 6000Mhz
Video Card(s)	XFX 6950XT Speedster MERC 319
Storage	Kingston KC3000 1TB \| WD Black SN750 2TB \|WD Blue 1TB x 2 \| Toshiba P300 2TB \| Seagate Expansion 8TB
Display(s)	Samsung U32J590U 4K + BenQ GL2450HT 1080p
Case	Fractal Design Define R4
Audio Device(s)	Plantronics 5220, Nektar SE61 keyboard
Power Supply	Corsair RM850x v3
Mouse	Logitech G602
Keyboard	Dell SK3205
Software	Windows 10 Pro
Benchmark Scores	Rimworld 4K ready!

Processor	R9 5800x3d \| R7 3900X \| 4800H \| 2x Xeon gold 6142
Motherboard	Asrock X570M \| AB350M Pro 4 \| Asus Tuf A15
Cooling	Air \| Air \| duh laptop
Memory	64gb G.skill SniperX @3600 CL16 \| 128gb \| 32GB \| 192gb
Video Card(s)	RTX 4080 \|Quadro P5000 \| RTX2060M
Storage	Many drives
Display(s)	AW3423dwf.
Case	Jonsbo D41
Power Supply	Corsair RM850x
Mouse	g502 Lightspeed
Keyboard	G913 tkl
Software	win11, proxmox

System Name	Galaxy Tab S8+
Processor	Snapdragon 8 gen 1 SOC
Cooling	passive
Memory	8 GB
Storage	256 GB + 512 GB SD
Display(s)	2.800 x 1.752 Super AMOLED
Power Supply	10.090 mAh
Software	Android 12

System Name	Zen-TR16x
Processor	AMD Threadripper 1950x
Motherboard	Gigabyte Aurus x399 Gaming
Cooling	Arctic Freezer 33 TR
Memory	32Gb 3200Mhz (4x8Gb)
Video Card(s)	Asus RTX 3070 FE
Storage	Samsung Evo 860 SSD 2Tb
Display(s)	LG 34"
Case	Phantec 500s
Power Supply	Corsair 650W
Benchmark Scores	Gears 5 : 87fps at 1080p

AMD Launches 4th Gen EPYC "Genoa" Zen 4 Server Processors: 100% Performance Uplift for 50% More Cores

btarunr

Editor & Senior Moderator

Tek-Check

Frick

Fishfaced Nincompoop

AMD 4th Gen EPYC 9004 Series Launched: Genoa Tested In A Data Center Benchmark Gauntlet - Page 2

zlobby

Imsochobo

Hofnaerrchen

CapNemo72

ncrs

Wirko

AnotherReader

Tek-Check

Minus Infinity

Wirko

evernessince

Patriot

Minus Infinity

Wirko

Patriot

Jism

dgianstefani

TPU Proofreader

Wirko

Xajel

Wirko

Processor	i5-6600K
Motherboard	Asus Z170A
Cooling	some cheap Cooler Master Hyper 103 or similar
Memory	16GB DDR4-2400
Video Card(s)	IGP
Storage	Samsung 850 EVO 250GB
Display(s)	2x Oldell 24" 1920x1200
Case	Bitfenix Nova white windowless non-mesh
Audio Device(s)	E-mu 1212m PCI
Power Supply	Seasonic G-360
Mouse	Logitech Marble trackball, never had a mouse
Keyboard	Key Tronic KT2000, no Win key because 1994
Software	Oldwin

Processor	Ryzen 7 5700X
Motherboard	ASUS TUF Gaming X570-PRO (WiFi 6)
Cooling	Noctua NH-C14S (two fans)
Memory	2x16GB DDR4 3200
Video Card(s)	Reference Vega 64
Storage	Intel 665p 1TB, WD Black SN850X 2TB, Crucial MX300 1TB SATA, Samsung 830 256 GB SATA
Display(s)	Nixeus NX-EDG27, and Samsung S23A700
Case	Fractal Design R5
Power Supply	Seasonic PRIME TITANIUM 850W
Mouse	Logitech
VR HMD	Oculus Rift
Software	Windows 11 Pro, and Ubuntu 20.04

Processor	Ryzen 7800X3D
Motherboard	ASRock X670E Taichi
Cooling	Noctua NH-D15 Chromax
Memory	32GB DDR5 6000 CL30
Video Card(s)	MSI RTX 4090 Trio
Storage	P5800X 1.6TB 4x 15.36TB Micron 9300 Pro 4x WD Black 8TB M.2
Display(s)	Acer Predator XB3 27" 240 Hz
Case	Thermaltake Core X9
Audio Device(s)	JDS Element IV, DCA Aeon II
Power Supply	Seasonic Prime Titanium 850w
Mouse	PMM P-305
Keyboard	Wooting HE60
VR HMD	Valve Index
Software	Win 10

System Name	[H]arbringer
Processor	4x 61XX ES @3.5Ghz (48cores)
Motherboard	SM GL
Cooling	3x xspc rx360, rx240, 4x DT G34 snipers, D5 pump.
Memory	16x gskill DDR3 1600 cas6 2gb
Video Card(s)	blah bigadv folder no gfx needed
Storage	32GB Sammy SSD
Display(s)	headless
Case	Xigmatek Elysium (whats left of it)
Audio Device(s)	yawn
Power Supply	Antec 1200w HCP
Software	Ubuntu 10.10
Benchmark Scores	http://valid.canardpc.com/show_oc.php?id=1780855 http://www.hwbot.org/submission/2158678 http://ww

System Name	Silent/X1 Yoga/S25U-1TB
Processor	Ryzen 9800X3D @ 5.575ghz all core 1.24 V, Thermal Grizzly AM5 High Performance Heatspreader/1185 G7
Motherboard	ASUS ROG Strix X670E-I, chipset fans replaced with Noctua A14x25 G2
Cooling	Optimus Block, HWLabs Copper 240/40 + 240/30, D5/Res, 4x Noctua A12x25, 1x A14G2, Mayhems Ultra Pure
Memory	64 GB Dominator Titanium White 6000 MT, 130 ns tRFC, active cooled
Video Card(s)	RTX 3080 Ti Founders Edition, Conductonaut Extreme, 18 W/mK MinusPad Extreme, Corsair XG7 Waterblock
Storage	Intel Optane DC P1600X 118 GB, Samsung 990 Pro 2 TB
Display(s)	32" 240 Hz 1440p Samsung G7, 31.5" 165 Hz 1440p LG NanoIPS Ultragear, MX900 dual gas VESA mount
Case	Sliger SM570 CNC Aluminium 13-Litre, 3D printed feet, custom front, LINKUP Ultra PCIe 4.0 x16 White
Audio Device(s)	Audeze Maxwell Ultraviolet w/upgrade pads & LCD headband, Galaxy Buds 3 Pro, Razer Nommo Pro
Power Supply	SF1000 Plat, full transparent custom cables, Sentinel Pro 1500 Online Double Conversion UPS w/Noctua
Mouse	Razer Viper V3 Pro 8 KHz Mercury White & Pulsar Supergrip tape, Razer Atlas, Razer Strider Chroma
Keyboard	Wooting 60HE+ module, TOFU-R CNC Alu/Brass, SS Prismcaps W+Jellykey, LekkerV2 mod, TLabs Leath/Suede
Software	Windows 11 IoT Enterprise LTSC 24H2
Benchmark Scores	Legendary

System Name	Xajel Main
Processor	AMD Ryzen 7 5800X
Motherboard	ASRock X570M Steel Legened
Cooling	Corsair H100i PRO
Memory	G.Skill DDR4 3600 32GB (2x16GB)
Video Card(s)	ZOTAC GAMING GeForce RTX 3080 Ti AMP Holo
Storage	(OS) Gigabyte AORUS NVMe Gen4 1TB + (Personal) WD Black SN850X 2TB + (Store) WD 8TB HDD
Display(s)	LG 38WN95C Ultrawide 3840x1600 144Hz
Case	Cooler Master CM690 III
Audio Device(s)	Built-in Audio + Yamaha SR-C20 Soundbar
Power Supply	Thermaltake 750W
Mouse	Logitech MK710 Combo
Keyboard	Logitech MK710 Combo (M705)
Software	Windows 11 Pro