• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

New AMD ROCm 6.1 Software for Radeon Release Offers More Choices to AI Developers

GFreeman

News Editor
Staff member
Joined
Mar 6, 2023
Messages
1,530 (2.43/day)
AMD has unveiled the latest release of its open software, AMD ROCm 6.1.3, marking the next step in its strategy to make ROCm software broadly available across its GPU portfolio, including AMD Radeon desktop GPUs. The new release gives developers broader support for Radeon GPUs to run ROCm AI workloads. "The new AMD ROCm release extends functional parity from data center to desktops, enabling AI research and development on readily available and accessible platforms," said Andrej Zdravkovic, senior vice president at AMD.

Key feature enhancements in this release focus on improving compatibility, accessibility, and scalability, and include:
  • Multi-GPU support to enable building scalable AI desktops for multi-serving, multi-user solutions.
  • Beta-level support for Windows Subsystem for Linux, allowing these solutions to work with ROCm on a Windows OS-based system.
  • TensorFlow Framework support offering more choice for AI development.



ROCm 6.1.3 now supports up to four qualified Radeon RX Series or Radeon PRO GPUs, allowing users to leverage configurations with data parallelism where each GPU independently computes inference and outputs the response. This enables client-based multi-user configurations powered by AMD ROCm software and Radeon GPUs.

With ROCm 6.1.3, we are making it even easier to develop for AI with Beta-level support for Windows Subsystem for Linux, also known as WSL 2. This means you can now run Linux-based AI tools on a Windows system. WSL 2 eliminates the need for a dedicated Linux system or a dual-boot setup.

Furthermore, we are announcing qualification of the TensorFlow framework, providing another choice for AI development aside from PyTorch and ONNX which were already supported, extending our robust open ecosystem of frameworks and libraries.

In addition, we are extending our client-based AI development offering with the introduction of the AMD Radeon PRO W7900 Dual Slot card which packs 192 AI accelerators and 48 GB of fast GDDR6 memory into a compact form factor for higher, system-level density.

AMD AI workstations equipped with a Radeon PRO W7900 GPU represent a new solution to fine-tune and run inference on large language models (LLMs) with high precision. For example, LLaMA-2 or LLaMA-3 with 70B parameters quantized at INT4 require at least 35 GB of local GPU memory, making the Radeon PRO W7900 GPU with 48 GB of fast on-board memory the right choice for workflows that make use of these models.



Generative AI for natural language processing (NLP) using such LLMs can help enterprises tailor interaction with customers, assist with development operations (DevOps), and improve the process of managing data and documents.

AMD is committed to building a highly scalable and open ecosystem with ROCm software. With the latest release of ROCm 6.1 software for Radeon desktop GPUs, AMD empowers system builders to take full advantage of our enhanced solution stack to create on-prem systems that add powerful AI performance to any IT infrastructure, making them ideal for mission-critical or low-latency projects - and allowing organizations to keep their sensitive data in-house.

With AMD ROCm 6.1 and the new AMD Radeon PRO W7900 Dual Slot card, which is now shipping, AI on desktops opens new opportunities for developers and enterprises to increase productivity, creativity, and innovation with performance and ease of use.

View at TechPowerUp Main Site | Source
 
Joined
Nov 3, 2014
Messages
264 (0.07/day)
They're just now beginning to add multi-GPU support? Halfway through 2024?

Am I crazy pilling takes?

So their datacenter/enterprise customers have been leaving what, 95% of the hardware sitting unused?
 
Joined
Apr 13, 2022
Messages
1,174 (1.23/day)
They're just now beginning to add multi-GPU support? Halfway through 2024?

Am I crazy pilling takes?

So their datacenter/enterprise customers have been leaving what, 95% of the hardware sitting unused?

This is for the workstation class cards not those you put in server racks.

Consumer < prosumer < workstation/pro < datacenter/enterprise

They are bringing it to pro. It was already in datacenter.
 
Joined
Jul 15, 2006
Messages
1,284 (0.19/day)
Location
Noir York
Processor AMD Ryzen 7 5700G
Motherboard ASUS A520M-K
Cooling Scythe Kotetsu Mark II
Memory 2 x 16GB SK Hynix CJR OEM DDR4-3200 @ 4000 20-22-20-48
Video Card(s) Colorful RTX 2060 SUPER 8GB GDDR6
Storage 250GB WD BLACK SN750 M.2 + 4TB WD Red Plus + 4TB WD Purple
Display(s) AOpen 27HC5R 27" 1080p 165Hz curved VA
Case AIGO Darkflash C285
Audio Device(s) Creative SoundBlaster Z + Kurtzweil KS-40A bookshelf / Sennheiser HD555
Power Supply Great Wall GW-EPS1000DA 1kW
Mouse Razer Deathadder Essential
Keyboard Cougar Attack2 Cherry MX Black
Software Windows 10 Pro x64 22H2
Anyone here use ROCm for stable diffusion? I'm thinking of buying Radeon cards but apart from Ray Tracing this is the other reason I don't wanna change to red team.
 
Joined
Oct 2, 2015
Messages
3,132 (0.94/day)
Location
Argentina
System Name Ciel / Akane
Processor AMD Ryzen R5 5600X / Intel Core i3 12100F
Motherboard Asus Tuf Gaming B550 Plus / Biostar H610MHP
Cooling ID-Cooling 224-XT Basic / Stock
Memory 2x 16GB Kingston Fury 3600MHz / 2x 8GB Patriot 3200MHz
Video Card(s) Gainward Ghost RTX 3060 Ti / Dell GTX 1660 SUPER
Storage NVMe Kingston KC3000 2TB + NVMe Toshiba KBG40ZNT256G + HDD WD 4TB / NVMe WD Blue SN550 512GB
Display(s) AOC Q27G3XMN / Samsung S22F350
Case Cougar MX410 Mesh-G / Generic
Audio Device(s) Kingston HyperX Cloud Stinger Core 7.1 Wireless PC
Power Supply Aerocool KCAS-500W / Gigabyte P450B
Mouse EVGA X15 / Logitech G203
Keyboard VSG Alnilam / Dell
Software Windows 11
Anyone here use ROCm for stable diffusion? I'm thinking of buying Radeon cards but apart from Ray Tracing this is the other reason I don't wanna change to red team.
All I heard from those that dared to try it is that it isn't great or stable enough yet.
 
Joined
Jul 13, 2016
Messages
3,279 (1.07/day)
Processor Ryzen 7800X3D
Motherboard ASRock X670E Taichi
Cooling Noctua NH-D15 Chromax
Memory 32GB DDR5 6000 CL30
Video Card(s) MSI RTX 4090 Trio
Storage Too much
Display(s) Acer Predator XB3 27" 240 Hz
Case Thermaltake Core X9
Audio Device(s) Topping DX5, DCA Aeon II
Power Supply Seasonic Prime Titanium 850w
Mouse G305
Keyboard Wooting HE60
VR HMD Valve Index
Software Win 10
All I heard from those that dared to try it is that it isn't great or stable enough yet.

Stability isn't an issue. Both TPU and Level1Techs have articles regarding using AMD with stable diffusion and they don't mention stability issues.

The issue is performance. Stable Diffusion on Linux is about twice as fast on AMD cards as compared to Windows and really the AMD cards need that boost to be competitive price / performance wise. That might change now that ROCm supports WSL. There's more hoops you have to jump through to configure SD for AMD cards on windows as well. It's getting better but there's still a lot of lot to be done to optimize performance and config.
 
Joined
Oct 2, 2015
Messages
3,132 (0.94/day)
Location
Argentina
System Name Ciel / Akane
Processor AMD Ryzen R5 5600X / Intel Core i3 12100F
Motherboard Asus Tuf Gaming B550 Plus / Biostar H610MHP
Cooling ID-Cooling 224-XT Basic / Stock
Memory 2x 16GB Kingston Fury 3600MHz / 2x 8GB Patriot 3200MHz
Video Card(s) Gainward Ghost RTX 3060 Ti / Dell GTX 1660 SUPER
Storage NVMe Kingston KC3000 2TB + NVMe Toshiba KBG40ZNT256G + HDD WD 4TB / NVMe WD Blue SN550 512GB
Display(s) AOC Q27G3XMN / Samsung S22F350
Case Cougar MX410 Mesh-G / Generic
Audio Device(s) Kingston HyperX Cloud Stinger Core 7.1 Wireless PC
Power Supply Aerocool KCAS-500W / Gigabyte P450B
Mouse EVGA X15 / Logitech G203
Keyboard VSG Alnilam / Dell
Software Windows 11
Stability isn't an issue. Both TPU and Level1Techs have articles regarding using AMD with stable diffusion and they don't mention stability issues.

The issue is performance. Stable Diffusion on Linux is about twice as fast on AMD cards as compared to Windows and really the AMD cards need that boost to be competitive price / performance wise. That might change now that ROCm supports WSL. There's more hoops you have to jump through to configure SD for AMD cards on windows as well. It's getting better but there's still a lot of lot to be done to optimize performance and config.
Oh, I mean stability in the software sense. If you don't use Ubuntu, and even if you do, ROCm sometimes just refuses to install/work. Plus, there's the issue of GPU compatibility, very few GPUs are compatible right now.
 

JLP

Joined
Jun 19, 2024
Messages
11 (0.07/day)
Location
Slovenia
System Name Tsiolkovsky
Processor AMD Ryzen 5 7600
Motherboard ASUS ROG STRIX B650E-F GAMING WIFI
Cooling AMD Wraith Stealth
Memory 32 GB G.Skill Trident Z5 NEO DDR5-6000 CL30
Video Card(s) PowerColor Hellhound Radeon RX 7900 GRE OC
Storage Samsung 980 Pro 2TB SSD, WD Black 4TB SATA 128MB 7200RPM
Case Corsair Carbide 330R Blackout Edition
Power Supply Seasonic Prime 80 Plus Titanium 850W
Mouse Logitech MX Vertical Advanced Ergonomic
Keyboard Das Keyboard 6 Pro
Software openSUSE Tumbleweed GNU/Linux with KDE Plasma
All I am waiting for is more official support for broader range of consumer GPUs and much better support for more recent and popular GNU/Linux distributions. Why oh why don't they just use the awesome Open Build Service and make it much easier to get this installed on more distros.
 
Joined
Oct 27, 2009
Messages
1,182 (0.21/day)
Location
Republic of Texas
System Name [H]arbringer
Processor 4x 61XX ES @3.5Ghz (48cores)
Motherboard SM GL
Cooling 3x xspc rx360, rx240, 4x DT G34 snipers, D5 pump.
Memory 16x gskill DDR3 1600 cas6 2gb
Video Card(s) blah bigadv folder no gfx needed
Storage 32GB Sammy SSD
Display(s) headless
Case Xigmatek Elysium (whats left of it)
Audio Device(s) yawn
Power Supply Antec 1200w HCP
Software Ubuntu 10.10
Benchmark Scores http://valid.canardpc.com/show_oc.php?id=1780855 http://www.hwbot.org/submission/2158678 http://ww
All I am waiting for is more official support for broader range of consumer GPUs and much better support for more recent and popular GNU/Linux distributions. Why oh why don't they just use the awesome Open Build Service and make it much easier to get this installed on more distros.
because... containers...
btw, there are lots a few distros picking up support for it that aren't on AMDs official list.
 
Last edited:
Joined
Nov 3, 2014
Messages
264 (0.07/day)
Stability isn't an issue. Both TPU and Level1Techs have articles regarding using AMD with stable diffusion and they don't mention stability issues.
I would argue their articles/videos are not truly representative of real world use.

If you keep to the base package and never touch extensions, sure, it works. As in, technically something gets generated.

However, a lot of SD workflows rely on extensions, which vary from "instantly bricks your entire install" to "doesn't work but doesn't affect anything else" to "runs but 20-200x slower than on an equivalent nvidia GPU"... And they tend to have very little documentation/discussion around AMD compatibility, so you might not find out until something gets bricked.

As of now, fiddling around with unofficial ZLUDA is your best bet, and that tends to work better on the high end 7000 series.
 
Joined
Jul 13, 2016
Messages
3,279 (1.07/day)
Processor Ryzen 7800X3D
Motherboard ASRock X670E Taichi
Cooling Noctua NH-D15 Chromax
Memory 32GB DDR5 6000 CL30
Video Card(s) MSI RTX 4090 Trio
Storage Too much
Display(s) Acer Predator XB3 27" 240 Hz
Case Thermaltake Core X9
Audio Device(s) Topping DX5, DCA Aeon II
Power Supply Seasonic Prime Titanium 850w
Mouse G305
Keyboard Wooting HE60
VR HMD Valve Index
Software Win 10
I would argue their articles/videos are not truly representative of real world use.

If you keep to the base package and never touch extensions, sure, it works. As in, technically something gets generated.

However, a lot of SD workflows rely on extensions, which vary from "instantly bricks your entire install" to "doesn't work but doesn't affect anything else" to "runs but 20-200x slower than on an equivalent nvidia GPU"... And they tend to have very little documentation/discussion around AMD compatibility, so you might not find out until something gets bricked.

As of now, fiddling around with unofficial ZLUDA is your best bet, and that tends to work better on the high end 7000 series.

What exactly is your source for this? I have never seen this claim before and I don't see how it makes much sense.

If you install the IP adapter extension for A1111 (webui) and your GPU is running inference via DirectML, the IP adapter would be running on directML as well. Functionality and stability should be equivalent.

Drawing the line at extensions seems arbitrary given that many stable diffusion front ends will have a lot of features rolled into them. SD.Next for example supports AMD GPUs via DirectML and has a ton of features rolled into the base software. You could install zero extensions and that would more than cover the vast majority of generation workloads.
 
Last edited:
Joined
Nov 3, 2014
Messages
264 (0.07/day)
What exactly is your source for this? I have never seen this claim before and I don't see how it makes much sense.

If you install the IP adapter extension for A1111 (webui) and your GPU is running inference via DirectML, the IP adapter would be running on directML as well. Functionality and stability should be equivalent.

Drawing the line at extensions seems arbitrary given that many stable diffusion front ends will have a lot of features rolled into them. SD.Next for example supports AMD GPUs via DirectML and has a ton of features rolled into the base software. You could install zero extensions and that would more than cover the vast majority of generation workloads.
My source??? My own hundreds of hours of working with SD and LLMs on both AMD and Nvidia hardware.

There are a lot more extensions than IPadapter out there. Seems even more arbitrary for you to only consider that single thing.

For a long time inpainting produced buggy/artefacting outputs/obvious seams with inpainting, and that's not even an extension. Also for a long time Codeformers and GFPGan was incompatible with DirectML.
 
Top