Friday, October 11th 2024

AMD EPYC "Turin" with 192 Cores and 384 Threads Delivers Almost 40% Higher Performance Than Intel Xeon 6
AMD has unveiled its latest EPYC processors, codenamed "Turin," featuring Zen 5 and Zen 5C dense cores. Phoronix's thorough testing reveals remarkable advancements in performance, efficiency, and value. The new lineup includes the EPYC 9575F (64-core), EPYC 9755 (128-core), and EPYC 9965 (192-core) models, all showing impressive capabilities across various server and HPC workloads. In benchmarks, a dual-socket configuration of the 128-core EPYC 9755 Turin outperformed Intel's dual Xeon "Granite Rapids" 6980P setup with MRDIMM-8800 by 40% in the geometric mean of all tests. Surprisingly, even a single EPYC 9755 or EPYC 9965 matched the dual Xeon 6980P in expanded tests with regular DDR5-6400. Within AMD's lineup, the EPYC 9755 showed a 1.55x performance increase over its predecessor, the 96-core EPYC 9654 "Genoa". The EPYC 9965 surpassed the dual EPYC 9754 "Bergamo" by 45%.
These gains come with improved efficiency. While power consumption increased moderately, performance improvements resulted in better overall efficiency. For example, the EPYC 9965 used 32% more power than the EPYC 9654 but delivered 1.55x the performance. Power consumption remains competitive: the EPYC 9965 averaged 275 Watts (peak 461 Watts), the EPYC 9755 averaged 324 Watts (peak 500 Watts), while Intel's Xeon 6980P averaged 322 Watts (peak 547 Watts). AMD's pricing strategy adds to the appeal. The 192-core model is priced at $14,813, compared to Intel's 128-core CPU at $17,800. This competitive pricing, combined with superior performance per dollar and watt, has resonated with hyperscalers. Estimates suggest 50-60% of hyperscale deployments now use AMD processors.The Blue Empire is ready to strike back at AMD, with its upcoming "Sierra Forest" CPUs with up to 288 E-cores. Intel must deliver similar or greater performance metrics with its new E-core Xeon processor, keeping power consumption low and costs reasonable, so we expect to see a heated battle in the server space between Intel and AMD. Besides more cores, "Sierra Forest" will bring 12-channel DDR5 memory, so the massive core count will get adequate memory bandwidth. Until then, AMD has the crown of performance, efficiency, and value, and we are curious to see this driving competition and further innovation from both sides.
Source:
Phoronix
These gains come with improved efficiency. While power consumption increased moderately, performance improvements resulted in better overall efficiency. For example, the EPYC 9965 used 32% more power than the EPYC 9654 but delivered 1.55x the performance. Power consumption remains competitive: the EPYC 9965 averaged 275 Watts (peak 461 Watts), the EPYC 9755 averaged 324 Watts (peak 500 Watts), while Intel's Xeon 6980P averaged 322 Watts (peak 547 Watts). AMD's pricing strategy adds to the appeal. The 192-core model is priced at $14,813, compared to Intel's 128-core CPU at $17,800. This competitive pricing, combined with superior performance per dollar and watt, has resonated with hyperscalers. Estimates suggest 50-60% of hyperscale deployments now use AMD processors.The Blue Empire is ready to strike back at AMD, with its upcoming "Sierra Forest" CPUs with up to 288 E-cores. Intel must deliver similar or greater performance metrics with its new E-core Xeon processor, keeping power consumption low and costs reasonable, so we expect to see a heated battle in the server space between Intel and AMD. Besides more cores, "Sierra Forest" will bring 12-channel DDR5 memory, so the massive core count will get adequate memory bandwidth. Until then, AMD has the crown of performance, efficiency, and value, and we are curious to see this driving competition and further innovation from both sides.
63 Comments on AMD EPYC "Turin" with 192 Cores and 384 Threads Delivers Almost 40% Higher Performance Than Intel Xeon 6
I do not think splitting is a good idea, though. It requires double the R&D efforts, double the manufacturing allocation at TSMC, and results in an incoherent hardware/software stack, meaning you can only do certain things with one, but not the other. This is the same problem AMD has with RDNA and CDNA. AMD is trying to unify them within a few generations for a reason.
Edit: Splitting might be a reason for Intel's problems, imo. Too many projects going on at the same time, too much money spent on stuff that doesn't bear fruit, or doesn't have the time to bear fruit before they hop onto the next one. Not to mention, it makes their product stack a convoluted mess.
Their newest granite rapids uses the same redwood cove found in meteor lake.
Emerald rapids uses raptor cove, same found in desktop 13th gen.
Sapphire rapids used the same golden cove as the desktop 12th gen.
Sierra forest uses the same crestmont found in mtl E-cores.
Zen 5's design already allows it to be used in different nodes (3 and 4nm), while also focusing in both dense and high clock variants.
AMD just hasn't focused in improving gaming performance with this new μarch, but it's an amazing architecture for most other tasks, and is a multithreading best (its 2x4-wide decoder can't be properly used without SMT, as an example).
Besides, if gamers do not benefit as much, it doesn't mean that productivity folks do not. 9950X is a desktop productivity powerhouse that happens to be good enough in gaming, but gaming would not be a good reason to buy it, as there are cheaper and more suitable gaming CPUs.
they also have the big little thing going on
There are at least 4 implementations:
1. Zen 5 (512-bit data path)
2. Zen 5c (512-bit data path)
3. Zen 5 Lite (256-bit data path)
4. Zen 5c Lite (256-bit data path)
As mentioned above, Zen 5 also has different implementations, but it's still the same μarch in the end.
About the big little thing, they're using the little core in some xeon SKU as well, as I pointed before.
They're just not mixing and matching those cores in single SKUs, in the same manner that AMD has no epyc with both normal and C cores (whereas that mix is common in their mobile/APU offerings).
The fact Intel and AMD are showing poor game improvements despite decent IPC uplifts shows it's not a simple problem.
Whole market is a mess, I'm glad I'm not into games anymore so I honestly dgaf.
www.amd.com/en/products/processors/server/epyc/9005-series/amd-epyc-9755.html
www.amd.com/en/products/processors/server/epyc/9005-series/amd-epyc-9575f.html
Is this performance difference between the EPYC 9xx5 and 9xx4 CPUs due to improvements in the AVX-512 unit of the ZEN5 architecture?
Why doesn't the EPYC 9755 (128 cores) perform nearly twice as well as the 9575F (64 cores) in this AV1 video encode test? Is this a limitation of the CPUs or of the software?
Source:
www.tomshardware.com/pc-components/cpus/amd-launches-epyc-turin-9005-series-our-benchmarks-of-fifth-gen-zen-5-chips-with-up-to-192-cores-500w-tdp#section-encoding-benchmarks
For more consumer stuff, GN, for server stuff... www.servethehome.com/amd-epyc-9005-turin-turns-transcendent-performance-solidigm-broadcom/
If you need a server CPU for very specific workload, you will look more into those tests that benefit such workload. Performance never improves in linear way, with core count increase. CPU performance does not scale and work like that. 16 core CPU is never twice as faster in gaming than 8 core CPU.
Also, 64 core 9575F has almost double bandwidth throughout due to 8 GMI-Wide links to IOD, whereas 128 core Turin has single GMI links from each CCD to IOD. This makes 64 core EPYC very fast indeed in workloads requiring a lot of memory bandwidth.
See the article below.
chipsandcheese.com/p/amds-turin-5th-gen-epyc-launched