• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

NVIDIA GH200 72-core Grace CPU Benched Against AMD Threadripper 7000 Series

T0@st

News Editor
Joined
Mar 7, 2023
Messages
2,077 (3.17/day)
Location
South East, UK
GPTshop.ai is building prototypes of their "ultimate high-end desktop supercomputer," running the NVIDIA GH200 "Grace" CPU for AI and HPC workloads. Michael Larabel—founder and principal author of Phoronix—was first allowed to "remote access" a GPTshop.ai GH200 576 GB workstation converted model in early February—for the purpose of benchmarking it against systems based on AMD EPYC Zen 4 and Intel Xeon Emerald Rapids processors. Larabel noted: "it was a very interesting battle" that demonstrated the capabilities of 72 Arm Neoverse-V2 cores (in Grace). With this GPTshop.ai GH200 system actually being in workstation form, I also ran some additional benchmarks looking at the CPU capabilities of the GH200 compared to AMD Ryzen Threadripper 7000 series workstations."

Larabel had on-site access to two different Threadripper systems—a Hewlett-Packard (HP) Z6 G5 A workstation and a System76 Thelio Major semi-custom build. No comparable Intel "Xeon W hardware" was within reach, so the Team Green desktop supercomputer was only pitched against AMD HEDT processors. The HP review sample was configured with an AMD Ryzen Threadripper PRO 7995WX 96-core / 192-thread Zen 4 processor, 8 x 16 GB DDR5-5200 memory, and NVIDIA RTX A4000 GPU. Larabel said that it was an "all around nice high-end AMD workstation." The System76 Thelio Major was specced with an AMD Ryzen Threadripper 7980X processor "as the top-end non-PRO SKU." It is a 64-core / 128-thread part, working alongside 4 x 32 GB DDR5-4800 memory and a Radeon PRO W7900 graphics card.




Larabel noted that the three computer systems were: "all freshly tested with Ubuntu 23.10 with the Linux 6.5 kernel, the performance CPU frequency scaling governor, GCC 13.2, and other defaults for this latest Ubuntu release...All the CPUs were running at stock speeds. The high Threadripper frequencies reported is a known AMD P-State bug AMD is working to fix. As with the prior GH200 benchmarking article of the seventy-two Neoverse-V2 CPU cores with Grace, the current tests are just looking at the processor/system performance. I'm waiting on remote access again to the GH200 for running the GPU-accelerated portion of the tests so this article is intended at looking at how the Grace CPU compares to the AMD Ryzen Threadripper 7980X and PRO 7995WX x86_64 Linux workstations for various workloads. As noted in the prior article, no CPU power consumption numbers unfortunately due to no RAPL/PowerCap driver or similar exposure of just the CPU power consumption data currently under Linux for the GH200."



Phoronix's conclusion did not include any overall performance metrics for the competing processors/workstations (in 39 benchmarks)—according to further analysis conducted by Tom's Hardware, the GH200 Grace CPU beat the Threadripper 7980X in 17 tests and the 7995WX in 15. The NVIDIA processor is more of an efficiency-oriented server product, while the AMD Threadrippers are purpose-built for top desktop performance (as advertised).

The Phoronix chief author signed off with: "Those wanting to go through dozens more benchmarks of these three Linux workstations can find all of my raw data via this OB result page. So while there was a lot of NVIDIA GH200 vs. AMD EPYC vs. Intel Xeon benchmarks looking at the CPU performance earlier this month, those weighing the NVIDIA GH200 use for Linux workstation uses will hopefully find today's performance results against AMD Ryzen Threadripper useful. For HPC workloads that are AArch64-tuned and can leverage the available system memory effectively, the GH200 could deliver great performance against these Zen 4 Threadripper workstations. But for software extensively tuned for x86_64 and/or not as heavily dependent upon system memory bandwidth, the Threadripper 7980X and Threadripper PRO 7995WX are excellent workstation options."

View at TechPowerUp Main Site | Source
 
Joined
Nov 26, 2021
Messages
1,705 (1.52/day)
Location
Mississauga, Canada
Processor Ryzen 7 5700X
Motherboard ASUS TUF Gaming X570-PRO (WiFi 6)
Cooling Noctua NH-C14S (two fans)
Memory 2x16GB DDR4 3200
Video Card(s) Reference Vega 64
Storage Intel 665p 1TB, WD Black SN850X 2TB, Crucial MX300 1TB SATA, Samsung 830 256 GB SATA
Display(s) Nixeus NX-EDG27, and Samsung S23A700
Case Fractal Design R5
Power Supply Seasonic PRIME TITANIUM 850W
Mouse Logitech
VR HMD Oculus Rift
Software Windows 11 Pro, and Ubuntu 20.04
The wins are primarily due to much higher memory bandwidth. For purely CPU bound situations, the ARM cores are spanked by Zen 4.

1708967146003.png
 
Joined
Jun 8, 2022
Messages
388 (0.42/day)
Location
Ohio, USA
System Name Trackstar
Processor AMD Ryzen 7 5800X3D -30 All Core CO (on Corsair XC5 block)
Motherboard Gigabyte B550 AORUS Elite V2 Rev 1.0 (F17 BIOS)
Cooling Corsair XD5 pump / Corsair XR5 1x 360mm (front) + 1x 420mm (top) rads
Memory 32GB G.Skill DDR4-3600 CL14 1:1 (F4-3600C14Q-32GVKA kit)
Video Card(s) ASRock RX 6950XT OC Formula (on Bykski A-AR6900XTOCF-X block)
Storage WD_BLACK SN850X 2TB w/HS (FW ver. 620361WD)
Display(s) Dell S3222DGM 32" 1440p/165Hz FreeSync
Case Fractal Design Meshify S2
Audio Device(s) Realtek ALC1200 Integrated Audio
Power Supply Super Flower Leadex Platinum SE 1200W on Liebert GXT4-1500RT120 UPS
Mouse Corsair Nightsword RGB
Keyboard Corsair K60 RGB PRO
VR HMD N/A
Software Windows 11 Pro 23H2 (Build 22631.3958)
Benchmark Scores https://www.3dmark.com/sw/1131940 https://www.3dmark.com/fs/29315810
Joined
Nov 26, 2021
Messages
1,705 (1.52/day)
Location
Mississauga, Canada
Processor Ryzen 7 5700X
Motherboard ASUS TUF Gaming X570-PRO (WiFi 6)
Cooling Noctua NH-C14S (two fans)
Memory 2x16GB DDR4 3200
Video Card(s) Reference Vega 64
Storage Intel 665p 1TB, WD Black SN850X 2TB, Crucial MX300 1TB SATA, Samsung 830 256 GB SATA
Display(s) Nixeus NX-EDG27, and Samsung S23A700
Case Fractal Design R5
Power Supply Seasonic PRIME TITANIUM 850W
Mouse Logitech
VR HMD Oculus Rift
Software Windows 11 Pro, and Ubuntu 20.04
Power metrics will definitely be interesting once NV decides to expose those stats to the linux driver but on a raw performance basis it sounds like Threadripper/EPYC/Zen4 in general will remain the standard to beat.
I expect the GH200's CPU to consume less power, but it isn't due to the CPUs being inherently any more power-efficient. Rather they don't have an IO die responsible for a large chunk of power.
 

Aquinus

Resident Wat-man
Joined
Jan 28, 2012
Messages
13,171 (2.79/day)
Location
Concord, NH, USA
System Name Apollo
Processor Intel Core i9 9880H
Motherboard Some proprietary Apple thing.
Memory 64GB DDR4-2667
Video Card(s) AMD Radeon Pro 5600M, 8GB HBM2
Storage 1TB Apple NVMe, 4TB External
Display(s) Laptop @ 3072x1920 + 2x LG 5k Ultrafine TB3 displays
Case MacBook Pro (16", 2019)
Audio Device(s) AirPods Pro, Sennheiser HD 380s w/ FIIO Alpen 2, or Logitech 2.1 Speakers
Power Supply 96w Power Adapter
Mouse Logitech MX Master 3
Keyboard Logitech G915, GL Clicky
Software MacOS 12.1
The wins are primarily due to much higher memory bandwidth.
Are they really though? The strong matrix math showing I think tells a different story. Memory copy performance doesn't suggest bandwidth differs by a huge margin.
1708972181404.png
 
Joined
Nov 26, 2021
Messages
1,705 (1.52/day)
Location
Mississauga, Canada
Processor Ryzen 7 5700X
Motherboard ASUS TUF Gaming X570-PRO (WiFi 6)
Cooling Noctua NH-C14S (two fans)
Memory 2x16GB DDR4 3200
Video Card(s) Reference Vega 64
Storage Intel 665p 1TB, WD Black SN850X 2TB, Crucial MX300 1TB SATA, Samsung 830 256 GB SATA
Display(s) Nixeus NX-EDG27, and Samsung S23A700
Case Fractal Design R5
Power Supply Seasonic PRIME TITANIUM 850W
Mouse Logitech
VR HMD Oculus Rift
Software Windows 11 Pro, and Ubuntu 20.04
Are they really though? The strong matrix math showing I think tells a different story. Memory copy performance doesn't suggest bandwidth differs by a huge margin.
View attachment 336444
The results of the memory copy sub-test of stress-ng seem very odd; the 7995WX should have double the DRAM bandwidth of the 7980X, but the difference is barely 13%. Other matrix multiplication benchmarks like DGEMM show the GH200 to lag the Zen 4 based ThreadRippers appreciably. I think we need to be very careful when interpreting these results.

1708972969282.png
 
Last edited:
Joined
Jun 29, 2023
Messages
569 (1.05/day)
Location
Spain
System Name Gungnir
Processor Ryzen 5 7600X
Motherboard ASUS TUF B650M-PLUS WIFI
Cooling Thermalright Peerless Assasin 120 SE Black
Memory 2x16GB DDR5 CL36 5600MHz
Video Card(s) XFX RX 6800XT Merc 319
Storage 1TB WD SN770 | 2TB WD Blue SATA III SSD
Display(s) 1440p 165Hz VA
Case Lian Li Lancool 215
Audio Device(s) Beyerdynamic DT 770 PRO 80Ohm
Power Supply EVGA SuperNOVA 750W 80 Plus Gold
Mouse Logitech G Pro Wireless
Keyboard Keychron V6
VR HMD The bane of my existence (Oculus Quest 2)
Huh, I thought GH would do a lot better than it did, I am surprised.
 
Top