Thursday, April 6th 2023

NVIDIA H100 AI Performance Receives up to 54% Uplift with Optimizations

Apr 6th, 2023 11:41 Discuss (7 Comments)

On Wednesday, the MLCommons team released the MLPerf 3.0 Inference numbers, and there was an exciting submission from NVIDIA. Reportedly, NVIDIA has used software optimization to improve the already staggering performance of its latest H100 GPU by up to 54%. For reference, NVIDIA's H100 GPU first appeared on MLPerf 2.1 back in September of 2022. In just six months, NVIDIA engineers worked on AI optimizations for the MLPerf 3.0 release to find that basic software optimization can catalyze performance increases anywhere from 7-54%. The workloads for measuring the inferencing speed suite included RNN-T speech recognition, 3D U-Net medical imaging, RetinaNet object detection, ResNet-50 object classification, DLRM recommendation, and BERT 99/99.9% natural language processing.

What is interesting is that NVIDIA's submission is a bit modified. There are open and closed categories that vendors have to compete in, where closed is the mathematical equivalent of a neural network. In contrast, the open category is flexible and allows vendors to submit results based on optimizations for their hardware. The closed submission aims to provide an "apples-to-apples" hardware comparison. Given that NVIDIA opted to use the closed category, performance optimization of other vendors such as Intel and Qualcomm are not accounted for here. Still, it is interesting that optimization can lead to a performance increase of up to 54% in NVIDIA's case with its H100 GPU. Another interesting takeaway is that some comparable hardware, like Qualcomm Cloud AI 100, Intel Xeon Platinum 8480+, and NeuChips's ReccAccel N3000, failed to finish all the workloads. This is shown as "X" on the slides made by NVIDIA, stressing the need for proper ML system software support, which is NVIDIA's strength and an extensive marketing claim.

Source: via Tom's Hardware

Add your own comment

7 Comments on NVIDIA H100 AI Performance Receives up to 54% Uplift with Optimizations

P4-630

Does it play the last of us?..... :D

TumbleGeorge

What is this element?

Steevo

mlcommons.org/en/inference-edge-30/

Are the results not online yet? According to the official website the Qualcomm AI100 was processing 124K images a second VS 108K per second for the H100

SOAREVERSOR

TumbleGeorgeWhat is this element?

A finger to show the size, or if you want to have a bit of fun it's a tiny cock from someone in a leather jacket.

oxrufiioxo

P4-630Does it play the last of us?..... :D

They should rename it to The Last of 8GB Cards.

SOAREVERSOR

oxrufiioxoThey should rename it to The Last of 8GB Cards.

Any card under 12gb is already outdated by current consoles. Welcome to PC! The second tier red head headed step child of gaming land with 1080p, 60hz, and mid or low details.

erek

SOAREVERSORA finger to show the size, or if you want to have a bit of fun it's a tiny cock from someone in a leather jacket.

I see an entire humanoid figure holding a smartphone to take the picture and not just a finger as highlighted

NVIDIA H100 AI Performance Receives up to 54% Uplift with Optimizations

7 Comments on NVIDIA H100 AI Performance Receives up to 54% Uplift with Optimizations

Latest GPU Drivers

New Forum Posts

Popular Reviews

TPU on YouTube

Controversial News Posts

NVIDIA H100 AI Performance Receives up to 54% Uplift with Optimizations

Related News

7 Comments on NVIDIA H100 AI Performance Receives up to 54% Uplift with Optimizations

Latest GPU Drivers

New Forum Posts

Popular Reviews

TPU on YouTube

Controversial News Posts