AMD's Pain Point is ROCm Software, NVIDIA's CUDA Software is Still Superior for AI Development: Report

AleksandarK · Dec 23, 2024

The battle of AI acceleration in the data center is, as most readers are aware, insanely competitive, with NVIDIA offering a top-tier software stack. However, AMD has tried in recent years to capture a part of the revenue that hyperscalers and OEMs are willing to spend with its Instinct MI300X accelerator lineup for AI and HPC. Despite having decent hardware, the company is not close to bridging the gap software-wise with its competitor, NVIDIA. According to the latest report from SemiAnalysis, a research and consultancy firm, they have run a five-month experiment using Instinct MI300X for training and benchmark runs. And the findings were surprising: even with better hardware, AMD's software stack, including ROCm, has massively degraded AMD's performance.

"When comparing NVIDIA's GPUs to AMD's MI300X, we found that the potential on paper advantage of the MI300X was not realized due to a lack within AMD public release software stack and the lack of testing from AMD," noted SemiAnalysis, breaking down arguments in the report further, adding that "AMD's software experience is riddled with bugs rendering out of the box training with AMD is impossible. We were hopeful that AMD could emerge as a strong competitor to NVIDIA in training workloads, but, as of today, this is unfortunately not the case. The CUDA moat has yet to be crossed by AMD due to AMD's weaker-than-expected software Quality Assurance (QA) culture and its challenging out-of-the-box experience."

NVIDIA has a massive advantage in that the software is fully functional. "As fast as AMD tries to fill in the CUDA moat, NVIDIA engineers are working overtime to deepen said moat with new features, libraries, and performance updates," noted the SemiAnalysis report. Tinybox and Tinybox Pro developer Tinygrad also confirmed this multiple times on their X profile, which also had a massive issue with AMD software in the past.

When taking a look at AMD Instinct MI300X and NVIDIA H100/H200 chips from 2023, the MI300X emerges as a clear winner performance-wise. It reaches 1,307 TFLOP/s for FP16 calculations, surpassing NVIDIA's H100, which delivers 989 TFLOP/s. The MI300X has 192 GB of HBM3 memory and a memory bandwidth of 5.3 TB/s. These specifications are even favourable to NVIDIA's H200, which offers 141 GB of HBM3e memory and 4.8 TB/s of memory bandwidth. The AMD chip also even has a lower total cost of ownership model, which has a 40% cheaper networking alone. On paper, the AMD chip looks superior to NVIDIA's Hopper offerings, but in reality, not so much.

AMD's internal teams have little access to GPU boxes to develop and refine the ROCm software stack. Tensorwave, which is among the largest providers of AMD GPUs in the cloud, took their own GPU boxes and gave AMD engineers the hardware on demand, free of charge, just so the software could be fixed. This is all while Tensorwave paid for AMD GPUs, renting their own GPUs back to AMD free of charge. Finally, SemiAnalysis has noted that the AMD software stack has been improved based on their suggestions. Still, there is a long way to go before the company reaches NVIDIA's CUDA level of stability and performance. For detailed analysis, visit SemiAnalysis report here.

View at TechPowerUp Main Site | Source

Daven · Dec 23, 2024

Not retaining your own hardware for internal development is a big mistake. My company does the same thing, selling everything we produce. This severally limits any opportunity to take market share from competitors.

Finally, all hardware companies need to become software companies. Engineers and black box management are stuck in the past.

Edit: oh and the article didn’t say moat enough…moat.

dj-electric · Dec 23, 2024

Idk why my brain works like this, but upon reading the title I thought AMD has an SOC called Pain Point.

Timbaloo · Dec 23, 2024

dj-electric said:
Idk why my brain works like this, but upon reading the title I thought AMD has an SOC called Pain Point.

So I was not the only one

john_ · Dec 23, 2024

So, they still haven't learned.
Maybe if their share price drops down to $50?

phanbuey · Dec 23, 2024

they need to do what they did with Xilinx and partner->acquire an AI software company.

hsew · Dec 23, 2024

AleksandarK said:
Tensorwave, which is among the largest providers of AMD GPUs in the cloud, took their own GPU boxes and gave AMD engineers the hardware on demand, free of charge, just so the software could be fixed. This is all while Tensorwave paid for AMD GPUs, renting their own GPUs back to AMD free of charge.

Brutal. Meanwhile nVidia has Jetson Dev kits that anyone can buy for under $300. How does AMD justify this?

TPUnique · Dec 23, 2024

Damn, that's pretty bad. I want to start dipping my toes into ML projects starting from next year, and was looking forward to potentially getting a Strix Halo platform. Guess I'll put this plan on hold. And get an Intel-build as an interim product, since I really don't want to support nVidia's practices of giving as little VRAM as possible for as much as they can possibly charge..

Neo_Morpheus · Dec 23, 2024

AleksandarK said:
AMD's internal teams have little access to GPU boxes to develop and refine the ROCm software stack. Tensorwave, which is among the largest providers of AMD GPUs in the cloud, took their own GPU boxes and gave AMD engineers the hardware on demand, free of charge, just so the software could be fixed. This is all while Tensorwave paid for AMD GPUs, renting their own GPUs back to AMD free of charge.

Man, if true (not doubting but these days, lots of media love to distort things and the new normal is to only publish anti AMD articles and news) but this is beyond f*cked up on AMD's part.

But I will admit, it sounds way to crazy to be real.

Funny enough, I bumped into this:

AMD MI300X vs. Nvidia H100 SXM: Performance Comparison on Mixtral 8x7B Inference

There’s no denying Nvidia's historical dominance when it comes to AI training and inference. Nearly all production AI workloads run on their graphics cards. However, there’s been some optimism recently around AMD, seeing as the MI300X, their intended competitor to Nvidia's H100, is strictly...

blog.runpod.io

And someone that do work with MI300 hardware and ROCm, posted this:

Mere coincidence that both use the appropriate name Ngreedia.

bug · Dec 23, 2024

The "hey, we're the good guys because OSS" argument doesn't hold when there's $$$ at stake, it would seem.

Funny enough, at some point I believe AMD hardware was actually superior when it came to compute. However, what matters is the complete stack.

ymdhis · Dec 23, 2024

dj-electric said:
Idk why my brain works like this, but upon reading the title I thought AMD has an SOC called Pain Point.

It would be funny if one company decided to use that theme for codenames. Pain Point, followed by Torture Point, followed by Suffering Point, followed by Guillotine Point followed by Homicide Point followed by Genocide point, etc...

R-T-B · Dec 23, 2024

bug said:
Funny enough, at some point I believe AMD hardware was actually superior when it came to compute.

Was for a bit, for crypto compute mainly. Because everyone and their dog wrote up cheap mining programs in OpenCL...

Patriot · Dec 23, 2024

I think there is a reason semi-analysis focused on training. As AMD has focused on inference performance. Meta trains on h100/h200 and runs the models (inference) exclusively on mi300x.
This both backs up the analysis as well as shows distortion by not giving the full picture.
AMD needs to work on software to gain competitiveness on training, and there may be architectural limitations that cap its overall training performance (xGMI interconnect arch)
They definitely need better regression testing and testing in general. They have acquired several Ai software companies this year that may help with this.

So the current reality is...
If you are using off the shelf models mi300x excels, if you finetune those models, AMD excels, If you train from scratch... AMD kinda sucks.
The analysis also fails to grasp the reality of availability... sometimes its better to have not as good than nothing.

AnarchoPrimitiv · Dec 23, 2024

hsew said:
Brutal. Meanwhile nVidia has Jetson Dev kits that anyone can buy for under $300. How does AMD justify this?

This insinuated that AMD is PURELY limited by will power, is this what mostly everyone believes here? That AMD has access to equal resources that Nvidia does and the only thing limiting AMD is simply "not wanting to do better"? I'm seriously asking....

We all agree Lisa Su is competent, correct? Do any of us actually believe that people are telling her: "We need to do better with our software" and she's like "Ahhh, screw it"?

So what is it then? I imagine it's difficult for them to get ahold and maintain talent, Nvidia and Intel can afford to pay them more, and both competitors have far larger R&D budgets, is that the problem? Is it a workplace "culture" problem? It'd be amazing to hear from someone who has worked there to see if that's the case... If anyone has some educated and informed guesses, I'd love to hear them, because it surely cannot be that AMD is just being "stupid" or something.....but there definitely is a problem or problems

ScaLibBDP · Dec 23, 2024

AnarchoPrimitiv said:
This insinuated that AMD is PURELY limited by will power, is this what mostly everyone believes here? That AMD has access to equal resources that Nvidia does and the only thing limiting AMD is simply "not wanting to do better"? I'm seriously asking....

We all agree Lisa Su is competent, correct? Do any of us actually believe that people are telling her: "We need to do better with our software" and she's like "Ahhh, screw it"?

So what is it then? I imagine it's difficult for them to get ahold and maintain talent, Nvidia and Intel can afford to pay them more, and both competitors have far larger R&D budgets, is that the problem? Is it a workplace "culture" problem? It'd be amazing to hear from someone who has worked there to see if that's the case... If anyone has some educated and informed guesses, I'd love to hear them, because it surely cannot be that AMD is just being "stupid" or something.....but there definitely is a problem or problems

Here are my comments as a C/C++ Software Engineer who worked for AMD.

>>...Is it a workplace "culture" problem? It'd be amazing to hear from someone who has worked there to see if that's the case...

I worked for AMD as a contractor. I have very-very good memories for just a couple of fellow developers. No any good memories for the management of AMD. In overall: The Environment inside of AMD is Very Toxic.

>>...AMD's software stack, including ROCm, has massively degraded AMD's performance...

Worked with ROCm a lot and I would rate ROCm as A-Piece-of-Over-Complecated-Software-Crap.

>>...MI300X was not realized due to a lack within AMD public release software stack and the lack of testing from AMD...

Not true based on my experience however it is possible things have changed after my contract was over.

>>...AMD's software experience is riddled with bugs rendering out of the box training with AMD is impossible...

Partially true since I was able to see how a lot of bugs were Not fixed at all.

>>...We were hopeful that AMD could emerge as a strong competitor to NVIDIA

Not possible due to internal problems with retaining very experienced C/C++ software engineers.

>>...AMD's weaker-than-expected software Quality Assurance (QA) culture and its challenging out-of-the-box experience...

Very surprised to read about it since QA was Very Strong when I was working for AMD. It is possible things have changed after my contract was over.

>>...AMD's internal teams have little access to GPU boxes to develop and refine the ROCm software stack...

Absolutely surprised to read about it. Once again, it is possible things have changed...

remixedcat · Dec 23, 2024

AleksandarK said:
AMD's internal teams have little access to GPU boxes to develop and refine the ROCm software stack.

Stingy af!! And this is why they can't get ahead

MuhammedAbdo · Dec 23, 2024

Patriot said:
I think there is a reason semi-analysis focused on training. As AMD has focused on inference performance. Meta trains on h100/h200 and runs the models (inference) exclusively on mi300x.
This both backs up the analysis as well as shows distortion by not giving the full picture.
AMD needs to work on software to gain competitiveness on training, and there may be architectural limitations that cap its overall training performance (xGMI interconnect arch)
They definitely need better regression testing and testing in general. They have acquired several Ai software companies this year that may help with this.

Part 2 of the article is going to focus on Inference, the story is not that different there though. Companies still prefer NVIDIA for Inference for a reason, and no, Meta is not running inference exclusively on MI300.

Patriot said:
So the current reality is...
If you are using off the shelf models mi300x excels, if you finetune those models, AMD excels, If you train from scratch... AMD kinda sucks.

Wrong, the article used off the shelf models, and the MI300x sucked hard.

Patriot · Dec 24, 2024

MuhammedAbdo said:
Part 2 of the article is going to focus on Inference, the story is not that different there though. Companies still prefer NVIDIA for Inference for a reason, and no, Meta is not running inference exclusively on MI300.

Wrong, the article used off the shelf models, and the MI300x sucked hard.

Lol, I can't fault you for not reading my comment when you are replying to part 2 that hasn't even been published yet.
No no, they also suck at the thing they didn't cover...

They used off the shelf containers, to train.... not pretrained models to infer.
There is a reason Meta uses H100/200 to train and uses mi300x to infer.

Cheeseball · Dec 24, 2024

AnarchoPrimitiv said:
This insinuated that AMD is PURELY limited by will power, is this what mostly everyone believes here? That AMD has access to equal resources that Nvidia does and the only thing limiting AMD is simply "not wanting to do better"? I'm seriously asking....

We all agree Lisa Su is competent, correct? Do any of us actually believe that people are telling her: "We need to do better with our software" and she's like "Ahhh, screw it"?

So what is it then? I imagine it's difficult for them to get ahold and maintain talent, Nvidia and Intel can afford to pay them more, and both competitors have far larger R&D budgets, is that the problem? Is it a workplace "culture" problem? It'd be amazing to hear from someone who has worked there to see if that's the case... If anyone has some educated and informed guesses, I'd love to hear them, because it surely cannot be that AMD is just being "stupid" or something.....but there definitely is a problem or problems

It is most likely resources and the fact that NVIDIA has a 10/11 year head start with building their API.

From the article (I'm still reading it):

This contrasts with Nvidia’s NCCL team, which has access to R&D resources on Nvidia’s 11,000 H100 internal EOS cluster. Furthermore, Nvidia has Sylvain Jeaugey, who is the subject matter expert on collective communication. There are a lot of other world class collective experts working at Nvidia as well, and, unfortunately, AMD has largely failed to attract collective library talent due to less attractive compensation and resources – as opposed to engineers at Nvidia, where it is not uncommon to see engineers make greater than a million dollars per year thanks to appreciation in the value of RSUs.

Patriot said:
Lol, I can't fault you for not reading my comment when you are replying to part 2 that hasn't even been published yet.
No no, they also suck at the thing they didn't cover...

They used off the shelf containers, to train.... not pretrained models to infer.
There is a reason Meta uses H100/200 to train and uses mi300x to infer.

This is true:

Another core reason for this problem is that the lead maintainer of PyTorch (Meta) does not currently use MI300X internally for production LLM training, leading to code paths not used internally at Meta being buggy and not dogfooded properly. We believe AMD should partner with Meta to get their internal LLM training working on MI300X.

MuhammedAbdo · Dec 24, 2024

Patriot said:
No no, they also suck at the thing they didn't cover...

Cloud customers are sticking with NVIDIA GPUs for inferencing as they like the CUDA stack, so Microsoft is struggling to keep up with demand (Tegus).

x.com

Dr. Dro · Dec 24, 2024

I am not really surprised, this has long been a problem.

AMD Vega Frontier Edition VBIOS

8 GB HBM2, 1500 MHz GPU, 945 MHz Memory

www.techpowerup.com

Vega10 A1 XT AIR D05011 8GB 852e/945m 1.0V SWQA

This was intended for debugging the Vega FE without having access to the hardware itself by converting the regular gaming RX Vega 64 into one. It was leaked to TPU by a (likely disgruntled) AMD employee in back in March of 2020. To be fair, it becomes functionally identical, other than the halved VRAM capacity, but I still think it is absolutely mind blowing that even their driver developers do not have crates of cards at their disposal, so they can test every wild variant out there. Something no doubt both Intel and NVIDIA provided their software engineers with.

Solaris17 · Dec 24, 2024

ScaLibBDP said:
Here are my comments as a C/C++ Software Engineer who worked for AMD.

>>...Is it a workplace "culture" problem? It'd be amazing to hear from someone who has worked there to see if that's the case...

I worked for AMD as a contractor. I have very-very good memories for just a couple of fellow developers. No any good memories for the management of AMD. In overall: The Environment inside of AMD is Very Toxic.

>>...AMD's software stack, including ROCm, has massively degraded AMD's performance...

Worked with ROCm a lot and I would rate ROCm as A-Piece-of-Over-Complecated-Software-Crap.

>>...MI300X was not realized due to a lack within AMD public release software stack and the lack of testing from AMD...

Not true based on my experience however it is possible things have changed after my contract was over.

>>...AMD's software experience is riddled with bugs rendering out of the box training with AMD is impossible...

Partially true since I was able to see how a lot of bugs were Not fixed at all.

>>...We were hopeful that AMD could emerge as a strong competitor to NVIDIA

Not possible due to internal problems with retaining very experienced C/C++ software engineers.

>>...AMD's weaker-than-expected software Quality Assurance (QA) culture and its challenging out-of-the-box experience...

Very surprised to read about it since QA was Very Strong when I was working for AMD. It is possible things have changed after my contract was over.

>>...AMD's internal teams have little access to GPU boxes to develop and refine the ROCm software stack...

Absolutely surprised to read about it. Once again, it is possible things have changed...

We work with ROCm a lot. Not in the "devops haha docker" way; either. We help write some of the patches with them. The performance is fantastic, but the real metal workload ecosystem is terrible. Nvidia usually plays hard ball, everything is a "No" with them. AMD is more flexible but there tooling is basically writing equations in the sand with sticks.

tommo1982 · Dec 24, 2024

ScaLibBDP said:
>>...Is it a workplace "culture" problem? It'd be amazing to hear from someone who has worked there to see if that's the case...

I worked for AMD as a contractor. I have very-very good memories for just a couple of fellow developers. No any good memories for the management of AMD. In overall: The Environment inside of AMD is Very Toxic

Not surprising. Recent AMD behaviour doesn't look sound. Also, considering it is a large corporation focused primarily on making money, with middle management no concerned with anything but their career (I've seen this firsthand in another corp), I daubt AMD will ever catch up to Nvidia.

There's plenty decent people working hard, just not in the right places.

Visible Noise · Dec 24, 2024

Patriot said:
I think there is a reason semi-analysis focused on training. As AMD has focused on inference performance. Meta trains on h100/h200 and runs the models (inference) exclusively on mi300x.
This both backs up the analysis as well as shows distortion by not giving the full picture.
AMD needs to work on software to gain competitiveness on training, and there may be architectural limitations that cap its overall training performance (xGMI interconnect arch)
They definitely need better regression testing and testing in general. They have acquired several Ai software companies this year that may help with this.

So the current reality is...
If you are using off the shelf models mi300x excels, if you finetune those models, AMD excels, If you train from scratch... AMD kinda sucks.
The analysis also fails to grasp the reality of availability... sometimes it’s better to have not as good than nothing.

Read the linked article. Mi300 can’t even run OOB models.

AnarchoPrimitiv said:
This insinuated that AMD is PURELY limited by will power, is this what mostly everyone believes here? That AMD has access to equal resources that Nvidia does and the only thing limiting AMD is simply "not wanting to do better"? I'm seriously asking....

We all agree Lisa Su is competent, correct? Do any of us actually believe that people are telling her: "We need to do better with our software" and she's like "Ahhh, screw it"?

So what is it then? I imagine it's difficult for them to get ahold and maintain talent, Nvidia and Intel can afford to pay them more, and both competitors have far larger R&D budgets, is that the problem? Is it a workplace "culture" problem? It'd be amazing to hear from someone who has worked there to see if that's the case... If anyone has some educated and informed guesses, I'd love to hear them, because it surely cannot be that AMD is just being "stupid" or something.....but there definitely is a problem or problems

If only TPU linked to sources that answered these questions, right?

Patriot · Dec 24, 2024

Visible Noise said:
Read the linked article. Mi300 can’t even run OOB models.

Negative. The article focused only on training models using predefined containers not pretrained models for inference.
It is super super easy to run models on mi100/250x/300x 7900gre/xt/xtx

I read as far as the paywall goes.
I also... have used ROCm since vega64/mi25
I also... have used Cuda since K80/GTX690

I currently run a hive of mi100s, and a sxm v100 box.

When it comes to inference, MI300x gets day0 support. Training is very lacking and Nvidia's deep bench of software engineers shows.
I expect part 2 of the article to be a bit different.

I am fully aware of the lacking's of AMDs ecosystem, but I am also aware of its strengths.
And the ability to just grab containers and go exists... hugging face is full of native containers for ROCm, Hipify can convert most* things that are cuda native, abet at performance penalty.
But when it comes to inference AMD is not a 2nd class citizen. It has full support with triton, and flash attention...
And Llama 405b fp16 launched exclusively on mi300x, most likely due to the ram requirements.
As it was quantized down to fp8, then it could fit on 8x h100 80gb, but as it was announced, Meta and AMD announced together that all Meta 405b live instances were run on mi300x.
If that is still true or was just a limited exclusivity while it was quantized down... idk...

But claiming things like a mi300x cant run OOB models is just... ignorant af, and not even what the article claims.
It claims bad training performance and strange bugs and as a user of the ecosystem... yup. AMD has strange bugs.
They have known lockups for multi gpu instances... and the solution is to run additional grub parameters, perfectly stable with iommu=pt, randomly hangs without.
But all this information is in the tuning guides. The install process is easy, and hugging face is full of models to run.

System Name	3 desktop systems: Gaming / Internet / HTPC
Processor	Ryzen 5 7600 / Ryzen 5 4600G / Ryzen 5 5500
Motherboard	X670E Gaming Plus WiFi / MSI X470 Gaming Plus Max (1) / MSI X470 Gaming Plus Max (2)
Cooling	Aigo ICE 400SE / Segotep T4 / Νoctua U12S
Memory	Kingston FURY Beast 32GB DDR5 6000 / 16GB JUHOR / 32GB G.Skill RIPJAWS 3600 + Aegis 3200
Video Card(s)	ASRock RX 6600 / Vega 7 integrated / Radeon RX 580
Storage	NVMes, ONLY NVMes / NVMes, SATA Storage / NVMe, SATA, external storage
Display(s)	Philips 43PUS8857/12 UHD TV (120Hz, HDR, FreeSync Premium) / 19'' HP monitor + BlitzWolf BW-V5
Case	Sharkoon Rebel 12 / CoolerMaster Elite 361 / Xigmatek Midguard
Audio Device(s)	onboard
Power Supply	Chieftec 850W / Silver Power 400W / Sharkoon 650W
Mouse	CoolerMaster Devastator III Plus / CoolerMaster Devastator / Logitech
Keyboard	CoolerMaster Devastator III Plus / CoolerMaster Devastator / Logitech
Software	Windows 10 / Windows 10&Windows 11 / Windows 10

System Name	stress-less
Processor	9800X3D @ 5.42GHZ
Motherboard	MSI PRO B650M-A Wifi
Cooling	Thermalright Phantom Spirit EVO
Memory	64GB DDR5 6000 1:1 CL30-36-36-96 FCLK 2000
Video Card(s)	RTX 4090 FE
Storage	2TB WD SN850, 4TB WD SN850X
Display(s)	Alienware 32" 4k 240hz OLED
Case	Jonsbo Z20
Audio Device(s)	Yes
Power Supply	RIP Corsair SF750... Waiting for SF1000
Mouse	DeathadderV2 X Hyperspeed
Keyboard	65% HE Keyboard
Software	Windows 11
Benchmark Scores	They're pretty good, nothing crazy.

System Name	GameStation
Processor	AMD R5 5600X
Motherboard	Gigabyte B550
Cooling	Artic Freezer II 120
Memory	16 GB
Video Card(s)	Sapphire Pulse 7900 XTX
Storage	2 TB SSD
Case	Cooler Master Elite 120

Processor	Intel i5-12600k
Motherboard	Asus H670 TUF
Cooling	Arctic Freezer 34
Memory	2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s)	EVGA GTX 1060 SC
Storage	500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s)	Dell U3219Q + HP ZR24w
Case	Raijintek Thetis
Audio Device(s)	Audioquest Dragonfly Red :D
Power Supply	Seasonic 620W M12
Mouse	Logitech G502 Proteus Core
Keyboard	G.Skill KM780R
Software	Arch Linux + Win10

System Name	Pioneer
Processor	Ryzen 9 9950X
Motherboard	GIGABYTE Aorus Elite X670 AX
Cooling	Noctua NH-D15 + A whole lotta Sunon, Phanteks and Corsair Maglev blower fans...
Memory	64GB (2x 32GB) G.Skill Flare X5 @ DDR5-6000 CL30
Video Card(s)	XFX RX 7900 XTX Speedster Merc 310
Storage	Intel 5800X Optane 800GB boot, +2x Crucial P5 Plus 2TB PCIe 4.0 NVMe SSDs
Display(s)	55" LG 55" B9 OLED 4K Display
Case	Thermaltake Core X31
Audio Device(s)	TOSLINK->Schiit Modi MB->Asgard 2 DAC Amp->AKG Pro K712 Headphones or HDMI->B9 OLED
Power Supply	FSP Hydro Ti Pro 850W
Mouse	Logitech G305 Lightspeed Wireless
Keyboard	WASD Code v3 with Cherry Green keyswitches + PBT DS keycaps
Software	Gentoo Linux x64 / Windows 11 Enterprise IoT 2024

AMD's Pain Point is ROCm Software, NVIDIA's CUDA Software is Still Superior for AI Development: Report

AleksandarK

News Editor

Daven

dj-electric

Timbaloo

john_

phanbuey

hsew

TPUnique

Neo_Morpheus

AMD MI300X vs. Nvidia H100 SXM: Performance Comparison on Mixtral 8x7B Inference

bug

ymdhis

R-T-B

Patriot

AnarchoPrimitiv

ScaLibBDP

remixedcat

MuhammedAbdo

Patriot

Cheeseball

Not a Potato

MuhammedAbdo

x.com

Dr. Dro

AMD Vega Frontier Edition VBIOS

Solaris17

Super Dainty Moderator

tommo1982

Visible Noise

Patriot

System Name	[H]arbringer
Processor	4x 61XX ES @3.5Ghz (48cores)
Motherboard	SM GL
Cooling	3x xspc rx360, rx240, 4x DT G34 snipers, D5 pump.
Memory	16x gskill DDR3 1600 cas6 2gb
Video Card(s)	blah bigadv folder no gfx needed
Storage	32GB Sammy SSD
Display(s)	headless
Case	Xigmatek Elysium (whats left of it)
Audio Device(s)	yawn
Power Supply	Antec 1200w HCP
Software	Ubuntu 10.10
Benchmark Scores	http://valid.canardpc.com/show_oc.php?id=1780855 http://www.hwbot.org/submission/2158678 http://ww

System Name	Lightbringer
Processor	Ryzen 7 2700X
Motherboard	Asus ROG Strix X470-F Gaming
Cooling	Enermax Liqmax Iii 360mm AIO
Memory	G.Skill Trident Z RGB 32GB (8GBx4) 3200Mhz CL 14
Video Card(s)	Sapphire RX 5700XT Nitro+
Storage	Hp EX950 2TB NVMe M.2, HP EX950 1TB NVMe M.2, Samsung 860 EVO 2TB
Display(s)	LG 34BK95U-W 34" 5120 x 2160
Case	Lian Li PC-O11 Dynamic (White)
Power Supply	BeQuiet Straight Power 11 850w Gold Rated PSU
Mouse	Glorious Model O (Matte White)
Keyboard	Royal Kludge RK71
Software	Windows 10

System Name	RemixedBeast-NX
Processor	Intel Xeon E5-2690 @ 2.9Ghz (8C/16T)
Motherboard	Dell Inc. 08HPGT (CPU 1)
Cooling	Dell Standard
Memory	24GB ECC
Video Card(s)	Gigabyte Nvidia RTX2060 6GB
Storage	2TB Samsung 860 EVO SSD//2TB WD Black HDD
Display(s)	Samsung SyncMaster P2350 23in @ 1920x1080 + Dell E2013H 20 in @1600x900
Case	Dell Precision T3600 Chassis
Audio Device(s)	Beyerdynamic DT770 Pro 80 // Fiio E7 Amp/DAC
Power Supply	630w Dell T3600 PSU
Mouse	Logitech G700s/G502
Keyboard	Logitech K740
VR HMD	Linktr.ee/remixedcat // for my music ♡♡
Software	Linux Mint 20
Benchmark Scores	Network: APs: Ubiquiti Unifi AP-AC-LR and Lite Router/Sw:Meraki MX64 MS220-8P

System Name	Titan
Processor	AMD Ryzen™ 7 7950X3D
Motherboard	ASRock X870 Taichi Lite
Cooling	Thermalright Phantom Spirit 120 EVO CPU
Memory	TEAMGROUP T-Force Delta RGB 2x16GB DDR5-6000 CL30
Video Card(s)	ASRock Radeon RX 7900 XTX 24 GB GDDR6 (MBA)
Storage	Crucial T500 2TB x 3
Display(s)	LG 32GS95UE-B, ASUS ROG Swift OLED (PG27AQDP), LG C4 42" (OLED42C4PUA)
Case	Cooler Master QUBE 500 Flatpack Macaron
Audio Device(s)	Kanto Audio YU2 and SUB8 Desktop Speakers and Subwoofer, Cloud Alpha Wireless
Power Supply	Corsair SF1000
Mouse	Logitech Pro Superlight 2 (White), G303 Shroud Edition
Keyboard	Keychron K2 HE Wireless / 8BitDo Retro Mechanical Keyboard (N Edition) / NuPhy Air75 v2
VR HMD	Meta Quest 3 512GB
Software	Windows 11 Pro 64-bit 24H2 Build 26100.2605

System Name	"Icy Resurrection"
Processor	13th Gen Intel Core i9-13900KS
Motherboard	ASUS ROG Maximus Z790 Apex Encore
Cooling	Noctua NH-D15S upgraded with 2x NF-F12 iPPC-3000 fans and Honeywell PTM7950 TIM
Memory	32 GB G.SKILL Trident Z5 RGB F5-6800J3445G16GX2-TZ5RK @ 7600 MT/s 36-44-44-52-96 1.4V
Video Card(s)	NVIDIA RTX A2000
Storage	500 GB WD Black SN750 SE NVMe SSD + 4 TB WD Red Plus WD40EFPX HDD
Display(s)	55-inch LG G3 OLED
Case	Pichau Mancer CV500 White Edition
Power Supply	EVGA 1300 G2 1.3kW 80+ Gold
Mouse	Microsoft Classic IntelliMouse (2017)
Keyboard	IBM Model M type 1391405
Software	Windows 10 Pro 22H2
Benchmark Scores	I pulled a Qiqi~

System Name	RogueOne
Processor	Xeon W9-3495x
Motherboard	ASUS w790E Sage SE
Cooling	SilverStone XE360-4677
Memory	128gb Gskill Zeta R5 DDR5 RDIMMs
Video Card(s)	MSI SUPRIM Liquid X 4090
Storage	1x 2TB WD SN850X \| 2x 8TB GAMMIX S70
Display(s)	49" Philips Evnia OLED (49M2C8900)
Case	Thermaltake Core P3 Pro Snow
Audio Device(s)	Moondrop S8's on schitt Gunnr
Power Supply	Seasonic Prime TX-1600
Mouse	Razer Viper mini signature edition (mercury white)
Keyboard	Monsgeek M3 Lavender, Moondrop Luna lights
VR HMD	Quest 3
Software	Windows 11 Pro Workstation
Benchmark Scores	I dont have time for that.

Processor	Ryzen 5 5600
Motherboard	ASRock B450M Steel Legend
Cooling	bequiet! Pure Rock Slim (BK008)
Memory	16GB DDR4 GoodRAM
Video Card(s)	ASUS Expedition RX570 4GB
Storage	WD Blue 500GB SSD
Display(s)	iiyama ProLite T2252MTS
Case	CoolerMaster Silencio 352
Power Supply	bequiet! Pure Power 12M 650W
Mouse	Logitech M590
Keyboard	Logitech K270
Software	Linux Mint

System Name	XPS, Lenovo and HP Laptops, HP Xeon Mobile Workstation, HP Servers, Dell Desktops
Processor	Everything from Turion to 13900kf
Motherboard	MSI - they own the OEM market
Cooling	Air on laptops, lots of air on servers, AIO on desktops
Memory	I think one of the laptops is 2GB, to 64GB on gamer, to 128GB on ZFS Filer
Video Card(s)	A pile up to my knee, with a RTX 4090 teetering on top
Storage	Rust in the closet, solid state everywhere else
Display(s)	Laptop crap, LG UltraGear of various vintages
Case	OEM and a 42U rack
Audio Device(s)	Headphones
Power Supply	Whole home UPS w/Generac Standby Generator
Software	ZFS, UniFi Network Application, Entra, AWS IoT Core, Splunk
Benchmark Scores	1.21 GigaBungholioMarks