• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD Develops ROCm-based Solution to Run Unmodified NVIDIA's CUDA Binaries on AMD Graphics

AleksandarK

News Editor
Staff member
Joined
Aug 19, 2017
Messages
2,657 (0.99/day)
AMD has quietly funded an effort over the past two years to enable binary compatibility for NVIDIA CUDA applications on their ROCm stack. This allows CUDA software to run on AMD Radeon GPUs without adapting the source code. The project responsible is ZLUDA, which was initially developed to provide CUDA support on Intel graphics. The developer behind ZLUDA, Andrzej Janik, was contracted by AMD in 2022 to adapt his project for use on Radeon GPUs with HIP/ROCm. He spent two years bringing functional CUDA support to AMD's platform, allowing many real-world CUDA workloads to run without modification. AMD decided not to productize this effort for unknown reasons but did open-source it once funding ended per their agreement. Over at Phoronix, there were several benchmarks testing AMD's ZLUDA implementation over a wide variety of benchmarks.

Benchmarks found that proprietary CUDA renderers and software worked on Radeon GPUs out-of-the-box with the drop-in ZLUDA library replacements. CUDA-optimized Blender 4.0 rendering now runs faster on AMD Radeon GPUs than the native ROCm/HIP port, reducing render times by around 10-20%, depending on the scene. The implementation is surprisingly robust, considering it was a single-developer project. However, there are some limitations—OptiX and PTX assembly codes still need to be fully supported. Overall, though, testing showed very promising results. Over the generic OpenCL runtimes in Geekbench, CUDA-optimized binaries produce up to 75% better results. With the ZLUDA libraries handling API translation, unmodified CUDA binaries can now run directly on top of ROCm and Radeon GPUs. Strangely, the ZLUDA port targets AMD ROCm 5.7, not the newest 6.x versions. Only time will tell if AMD continues investing in this approach to simplify porting of CUDA software. However, the open-sourced project now enables anyone to contribute and help improve compatibility. For a complete review, check out Phoronix tests.



View at TechPowerUp Main Site | Source
 
Joined
Sep 30, 2019
Messages
406 (0.21/day)
Processor 7950X3D
Motherboard Gigabyte X670E Aorus Pro X
Cooling NZXT Kraken ELite 280
Memory 64GB Kingston FURY Beast RGB 6000MT/s C30
Video Card(s) GeForce RTX™ 4080 16GB AERO OC
Storage Crucial T700 1TB SSD
Display(s) Samsung Odyssey G9 Neo
Case Lian-Li O11D Mini
Audio Device(s) Mackie CM3 studio monitors
Power Supply Asus Loki 1kW
Mouse Razer Deathadder Pro 3
Keyboard Ducky Zero 6108 (silver switches)
VR HMD Vive XR Elite
Software Kubuntu 24.10
Nvidia lawsuit arriving in 3...2...1...
 
Joined
Aug 12, 2010
Messages
137 (0.03/day)
Location
Brazil
Processor Ryzen 7 7800X3D
Motherboard ASRock B650M PG Riptide
Cooling Wraith Max + 2x Noctua Redux NF-P12
Memory 2x16GB ADATA XPG Lancer Blade DDR5-6000 CL30
Video Card(s) Powercolor RX 7800 XT Fighter OC
Storage ADATA Legend 970 2TB PCIe 5.0
Display(s) Dell 32" S3222DGM - 1440P 165Hz + P2422H
Case HYTE Y40
Audio Device(s) Microsoft Xbox TLL-00008
Power Supply Cooler Master MWE 750 V2
Mouse Alienware AW320M
Keyboard Alienware AW510K
Software Windows 11 Pro
Joined
Jan 19, 2023
Messages
247 (0.35/day)
It's written on github that AMD also abandoned the project. That's why he is making it open source as it was part of the contract.
One interesting tidbit for us here is:


  • What's the future of the project?
    With neither Intel nor AMD interested, we've run out of GPU companies. I'm open though to any offers of that could move the project forward.
    Realistically, it's now abandoned and will only possibly receive updates to run workloads I am personally interested in (DLSS)

So who knows, DLSS on AMD GPUs soon? :roll:
 
Joined
Dec 29, 2021
Messages
67 (0.06/day)
Location
Colorado
Processor Ryzen 7 7800X3D
Motherboard Asrock x670E Steel Legend
Cooling Arctic Liquid Freezr II 420mm
Memory 64GB G.Skill DDR5 CAS30 fruity LED RAM
Video Card(s) Nvidia RTX 4080 (Gigabyte)
Storage 2x Samsung 980 Pros, 3x spinning rust disks for ~20TB total storage
Display(s) 2x Asus 27" 1440p 165hz IPS monitors
Case Thermaltake Level 20XT E-ATX
Audio Device(s) Onboard
Power Supply Super Flower Leadex VII 1000w
Mouse Logitech g502
Keyboard Logitech g915
Software Windows 11 Insider Preview
Nvidia lawsuit arriving in 3...2...1...

Possibly. The individual is just a contractor so they'd go after AMD, Intel, and possibly Microsoft (Github) for hosting it. They're kind of asking for the FTC to get involved in an anti-trust action if they get too uppity.

It's written on github that AMD also abandoned the project. That's why he is making it open source as it was part of the contract.
One interesting tidbit for us here is:

So who knows, DLSS on AMD GPUs soon? :roll:
Remember that big Nvidia leak where the DLSS source got leaked? AMD RDNA2 GPUs could reportedly run DLSS at a hardware-level but nothing ever came of it. Nobody in their right mind would expose themselves to that kind of civil legal jeaopardy.

Nvidia's lawyers could easily construe the software to be based on stolen source code, even if it was a clean-room implementation like ZLUDA.
 
Joined
Jan 19, 2023
Messages
247 (0.35/day)
Possibly. The individual is just a contractor so they'd go after AMD, Intel, and possibly Microsoft (Github) for hosting it. They're kind of asking for the FTC to get involved in an anti-trust action if they get too uppity.


Remember that big Nvidia leak where the DLSS source got leaked? AMD RDNA2 GPUs could reportedly run DLSS at a hardware-level but nothing ever came of it. Nobody in their right mind would expose themselves to that kind of civil legal jeaopardy.

Nvidia's lawyers could easily construe the software to be based on stolen source code, even if it was a clean-room implementation like ZLUDA.
Yeah I'm not holding my breath. Considering that DLSS basically became their most important feature that is always mentioned when someone is asking whether to go AMD or nvidia if there suddenly was a way to run DLSS on AMD GPU, nvidia would sue everybody that probably even downloaded that :D
 
Joined
Apr 24, 2020
Messages
2,723 (1.60/day)
OptiX and PTX assembly codes still need to be fully supported.

Um... OptiX compiles down into NVidia-specific hardware. PTX is similarly NVidia-specific pseudo-assembly (PTX compiles into SASS / true NVidia assembly), so... I'm not sure if anything could ever truly support all PTX code.

But a subset of PTX (ex: add, multiply, and other common operations) probably could be built for AMD. But no one writing PTX would do the common + or * commands of CUDA/C++. The only reason anyone would dip down to PTX assembly is to take advantage of hardware-specific features and/or hardware-specific performance characteristics.

Neither would ever get done IMO. So the project as it stands probably has hit an end. CUDA/C++ cross-compiled binaries running on AMD/Intel sounds "good enough" to me... while later stages of this coding project sound like a lot of work without much gains...
 
Joined
Jul 15, 2020
Messages
1,021 (0.63/day)
System Name Dirt Sheep | Silent Sheep
Processor i5-2400 | 13900K (-0.02mV offset)
Motherboard Asus P8H67-M LE | Gigabyte AERO Z690-G, bios F29e Intel baseline
Cooling Scythe Katana Type 1 | Noctua NH-U12A chromax.black
Memory G-skill 2*8GB DDR3 | Corsair Vengeance 4*32GB DDR5 5200Mhz C40 @4000MHz
Video Card(s) Gigabyte 970GTX Mini | NV 1080TI FE (cap at 50%, 800mV)
Storage 2*SN850 1TB, 230S 4TB, 840EVO 128GB, WD green 2TB HDD, IronWolf 6TB, 2*HC550 18TB in RAID1
Display(s) LG 21` FHD W2261VP | Lenovo 27` 4K Qreator 27
Case Thermaltake V3 Black|Define 7 Solid, stock 3*14 fans+ 2*12 front&buttom+ out 1*8 (on expansion slot)
Audio Device(s) Beyerdynamic DT 990 (or the screen speakers when I'm too lazy)
Power Supply Enermax Pro82+ 525W | Corsair RM650x (2021)
Mouse Logitech Master 3
Keyboard Roccat Isku FX
VR HMD Nop.
Software WIN 10 | WIN 11
Benchmark Scores CB23 SC: i5-2400=641 | i9-13900k=2325-2281 MC: i5-2400=i9 13900k SC | i9-13900k=37240-35500
The end of NV monopol in the professional, CUDA world?
 
Joined
Dec 17, 2020
Messages
140 (0.10/day)
I think it would be to nvidia's advantage to have such a project allowing anyone to use cudu.

Once your using their software you are halfway to being a customer.

Camel nose -> tent.
 
Joined
Nov 28, 2012
Messages
427 (0.10/day)
Could be big!
I wonder if it would be possible to make some old physx games run on AMD, would be nice to have it for the Batman Arkham games.
 
Joined
Dec 25, 2020
Messages
7,021 (4.81/day)
Location
São Paulo, Brazil
System Name "Icy Resurrection"
Processor 13th Gen Intel Core i9-13900KS Special Edition
Motherboard ASUS ROG Maximus Z790 Apex Encore
Cooling Noctua NH-D15S upgraded with 2x NF-F12 iPPC-3000 fans and Honeywell PTM7950 TIM
Memory 32 GB G.SKILL Trident Z5 RGB F5-6800J3445G16GX2-TZ5RK @ 7600 MT/s 36-44-44-52-96 1.4V
Video Card(s) ASUS ROG Strix GeForce RTX™ 4080 16GB GDDR6X White OC Edition
Storage 500 GB WD Black SN750 SE NVMe SSD + 4 TB WD Red Plus WD40EFPX HDD
Display(s) 55-inch LG G3 OLED
Case Pichau Mancer CV500 White Edition
Audio Device(s) Apple USB-C + Sony MDR-V7 headphones
Power Supply EVGA 1300 G2 1.3kW 80+ Gold
Mouse Microsoft Classic Intellimouse
Keyboard IBM Model M type 1391405 (distribución española)
Software Windows 11 IoT Enterprise LTSC 24H2
Benchmark Scores I pulled a Qiqi~
I wouldn't be surprised, judging by the amount of nvidia lawyers I see in many posts' comment section.

So anyone that doesn't have a strictly negative view of "the super evil green monster" is not only a shill, but also a lawyer for Nvidia? I wish it was that easy.

I think it would be to nvidia's advantage to have such a project allowing anyone to use cudu.

Once your using their software you are halfway to being a customer.

Camel nose -> tent.

Everyone already uses CUDA regardless, AMD's market share has always been near irrelevant for the compute market and the absence of CUDA compatibility a massive negative blemish in the Radeon business. It's only been recently with extremely powerful CDNA hardware and software being developed for it that Instinct began making a dent.

The simple fact that even this translation layer outperforms AMD's native OpenCL stack speaks volumes of how ridiculously bad their compute system has been. This will need to be maintained in order to continue to keep up with newer versions of CUDA, and the funding for that will likely have to come from AMD, but it's a massive step in the right direction.
 
Joined
Sep 6, 2013
Messages
3,393 (0.82/day)
Location
Athens, Greece
System Name 3 desktop systems: Gaming / Internet / HTPC
Processor Ryzen 5 7600 / Ryzen 5 4600G / Ryzen 5 5500
Motherboard X670E Gaming Plus WiFi / MSI X470 Gaming Plus Max (1) / MSI X470 Gaming Plus Max (2)
Cooling Aigo ICE 400SE / Segotep T4 / Νoctua U12S
Memory Kingston FURY Beast 32GB DDR5 6000 / 16GB JUHOR / 32GB G.Skill RIPJAWS 3600 + Aegis 3200
Video Card(s) ASRock RX 6600 + GT 710 (PhysX) / Vega 7 integrated / Radeon RX 580
Storage NVMes, ONLY NVMes / NVMes, SATA Storage / NVMe, SATA, external storage
Display(s) Philips 43PUS8857/12 UHD TV (120Hz, HDR, FreeSync Premium) / 19'' HP monitor + BlitzWolf BW-V5
Case Sharkoon Rebel 12 / CoolerMaster Elite 361 / Xigmatek Midguard
Audio Device(s) onboard
Power Supply Chieftec 850W / Silver Power 400W / Sharkoon 650W
Mouse CoolerMaster Devastator III Plus / CoolerMaster Devastator / Logitech
Keyboard CoolerMaster Devastator III Plus / CoolerMaster Devastator / Logitech
Software Windows 10 / Windows 10&Windows 11 / Windows 10
AMD decided not to productize this effort for unknown reasons but did open-source it once funding ended per their agreement.
Probably AMD doesn't want to be seen as supporting Nvidia's CUDA officially. So they funded the project and then did what they usually do. Open source it.
To be fair, if AMD wants to see CUDA going away, the last thing they need to do is make CUDA run on their GPUs officially. If they support it, then they will be like supporting the only reason today Nvidia makes billions every quarter.
 
Joined
Nov 26, 2021
Messages
1,705 (1.52/day)
Location
Mississauga, Canada
Processor Ryzen 7 5700X
Motherboard ASUS TUF Gaming X570-PRO (WiFi 6)
Cooling Noctua NH-C14S (two fans)
Memory 2x16GB DDR4 3200
Video Card(s) Reference Vega 64
Storage Intel 665p 1TB, WD Black SN850X 2TB, Crucial MX300 1TB SATA, Samsung 830 256 GB SATA
Display(s) Nixeus NX-EDG27, and Samsung S23A700
Case Fractal Design R5
Power Supply Seasonic PRIME TITANIUM 850W
Mouse Logitech
VR HMD Oculus Rift
Software Windows 11 Pro, and Ubuntu 20.04
So anyone that doesn't have a strictly negative view of "the super evil green monster" is not only a shill, but also a lawyer for Nvidia? I wish it was that easy.



Everyone already uses CUDA regardless, AMD's market share has always been near irrelevant for the compute market and the absence of CUDA compatibility a massive negative blemish in the Radeon business. It's only been recently with extremely powerful CDNA hardware and software being developed for it that Instinct began making a dent.

The simple fact that even this translation layer outperforms AMD's native OpenCL stack speaks volumes of how ridiculously bad their compute system has been. This will need to be maintained in order to continue to keep up with newer versions of CUDA, and the funding for that will likely have to come from AMD, but it's a massive step in the right direction.
There's a more likely explanation for the performance difference in many tests. Due to CUDA's dominance, more effort has been poured by developers into optimizing their applications for it.
 
Joined
Sep 6, 2013
Messages
3,393 (0.82/day)
Location
Athens, Greece
System Name 3 desktop systems: Gaming / Internet / HTPC
Processor Ryzen 5 7600 / Ryzen 5 4600G / Ryzen 5 5500
Motherboard X670E Gaming Plus WiFi / MSI X470 Gaming Plus Max (1) / MSI X470 Gaming Plus Max (2)
Cooling Aigo ICE 400SE / Segotep T4 / Νoctua U12S
Memory Kingston FURY Beast 32GB DDR5 6000 / 16GB JUHOR / 32GB G.Skill RIPJAWS 3600 + Aegis 3200
Video Card(s) ASRock RX 6600 + GT 710 (PhysX) / Vega 7 integrated / Radeon RX 580
Storage NVMes, ONLY NVMes / NVMes, SATA Storage / NVMe, SATA, external storage
Display(s) Philips 43PUS8857/12 UHD TV (120Hz, HDR, FreeSync Premium) / 19'' HP monitor + BlitzWolf BW-V5
Case Sharkoon Rebel 12 / CoolerMaster Elite 361 / Xigmatek Midguard
Audio Device(s) onboard
Power Supply Chieftec 850W / Silver Power 400W / Sharkoon 650W
Mouse CoolerMaster Devastator III Plus / CoolerMaster Devastator / Logitech
Keyboard CoolerMaster Devastator III Plus / CoolerMaster Devastator / Logitech
Software Windows 10 / Windows 10&Windows 11 / Windows 10
Could be big!
I wonder if it would be possible to make some old physx games run on AMD, would be nice to have it for the Batman Arkham games.
Nvidia's drivers are lock free for a number of years now. Buy a small Nvidia GPU, add it as a secondary GPU and CUDA/PhysX would be enabled and free to run on that GPU even if you have an AMD as primary. No patches needed anymore.
 
Joined
Dec 25, 2020
Messages
7,021 (4.81/day)
Location
São Paulo, Brazil
System Name "Icy Resurrection"
Processor 13th Gen Intel Core i9-13900KS Special Edition
Motherboard ASUS ROG Maximus Z790 Apex Encore
Cooling Noctua NH-D15S upgraded with 2x NF-F12 iPPC-3000 fans and Honeywell PTM7950 TIM
Memory 32 GB G.SKILL Trident Z5 RGB F5-6800J3445G16GX2-TZ5RK @ 7600 MT/s 36-44-44-52-96 1.4V
Video Card(s) ASUS ROG Strix GeForce RTX™ 4080 16GB GDDR6X White OC Edition
Storage 500 GB WD Black SN750 SE NVMe SSD + 4 TB WD Red Plus WD40EFPX HDD
Display(s) 55-inch LG G3 OLED
Case Pichau Mancer CV500 White Edition
Audio Device(s) Apple USB-C + Sony MDR-V7 headphones
Power Supply EVGA 1300 G2 1.3kW 80+ Gold
Mouse Microsoft Classic Intellimouse
Keyboard IBM Model M type 1391405 (distribución española)
Software Windows 11 IoT Enterprise LTSC 24H2
Benchmark Scores I pulled a Qiqi~
There's a more likely explanation for the performance difference in many tests. Due to CUDA's dominance, more effort has been poured by developers into optimizing their applications for it.

Indeed, but there is still a translation layer involved. It's not the first time we've seen translation layers outperform AMD's native solutions either, DXVK had widely been considered a way to fix performance on DX11 games until the new PAL abstraction layer UMD was developed. In some cases, still is.
 
Joined
Oct 6, 2021
Messages
1,605 (1.37/day)
I wouldn't be surprised, judging by the amount of nvidia lawyers I see in many posts' comment section.
Well, If I'm not mistaken, in the European Union there were discussions about CUDA being an anti-competitive monopoly tactic, and they should by law force Nvidia to open the ecosystem to competitors.

@topic, I find it strange as hell that this gimmick runs faster than AMD's own HIP, it's the same as saying that the company doesn't know its own hardware to create optimized code, in other words, they're underutilizing its capabilities. Incompetence? Lack of investment? What's it lol
 
Joined
Jun 27, 2011
Messages
6,772 (1.37/day)
Processor 7800x3d
Motherboard Gigabyte B650 Auros Elite AX
Cooling Custom Water
Memory GSKILL 2x16gb 6000mhz Cas 30 with custom timings
Video Card(s) MSI RX 6750 XT MECH 2X 12G OC
Storage Adata SX8200 1tb with Windows, Samsung 990 Pro 2tb with games
Display(s) HP Omen 27q QHD 165hz
Case ThermalTake P3
Power Supply SuperFlower Leadex Titanium
Software Windows 11 64 Bit
Benchmark Scores CB23: 1811 / 19424 CB24: 1136 / 7687
Possibly. The individual is just a contractor so they'd go after AMD, Intel, and possibly Microsoft (Github) for hosting it. They're kind of asking for the FTC to get involved in an anti-trust action if they get too uppity.
AMD hired the same contractor that Intel had hired to get Cuda running on Intel gpu's. If Nvidia was going to sue they would have done so already.
 
Last edited:
Joined
Aug 12, 2010
Messages
137 (0.03/day)
Location
Brazil
Processor Ryzen 7 7800X3D
Motherboard ASRock B650M PG Riptide
Cooling Wraith Max + 2x Noctua Redux NF-P12
Memory 2x16GB ADATA XPG Lancer Blade DDR5-6000 CL30
Video Card(s) Powercolor RX 7800 XT Fighter OC
Storage ADATA Legend 970 2TB PCIe 5.0
Display(s) Dell 32" S3222DGM - 1440P 165Hz + P2422H
Case HYTE Y40
Audio Device(s) Microsoft Xbox TLL-00008
Power Supply Cooler Master MWE 750 V2
Mouse Alienware AW320M
Keyboard Alienware AW510K
Software Windows 11 Pro
Well, If I'm not mistaken, in the European Union there were discussions about CUDA being an anti-competitive monopoly tactic, and they should by law force Nvidia to open the ecosystem to competitors.
Not gonna complain about the EU pushing for a more open-standard market. Look at what they did to apple, enforcing them to adopt USB-C, worldwide.
 

Count von Schwalbe

Nocturnus Moderatus
Staff member
Joined
Nov 15, 2021
Messages
3,179 (2.80/day)
Location
Knoxville, TN, USA
System Name Work Computer | Unfinished Computer
Processor Core i7-6700 | Ryzen 5 5600X
Motherboard Dell Q170 | Gigabyte Aorus Elite Wi-Fi
Cooling A fan? | Truly Custom Loop
Memory 4x4GB Crucial 2133 C17 | 4x8GB Corsair Vengeance RGB 3600 C26
Video Card(s) Dell Radeon R7 450 | RTX 2080 Ti FE
Storage Crucial BX500 2TB | TBD
Display(s) 3x LG QHD 32" GSM5B96 | TBD
Case Dell | Heavily Modified Phanteks P400
Power Supply Dell TFX Non-standard | EVGA BQ 650W
Mouse Monster No-Name $7 Gaming Mouse| TBD
@topic, I find it strange as hell that this gimmick runs faster than AMD's own HIP, it's the same as saying that the company doesn't know its own hardware to create optimized code, in other words, they're underutilizing its capabilities. Incompetence? Lack of investment? What's it lol
OpenCL is a serious afterthought in most applications. OpenCL/CUDA performance difference is a major thing even on Nvidia.

Imagine a racecar with a slightly worse transmission (translation layer) and the same engine (GPU) but weighing much less (application optimization). Still faster than one with a better transmission.
 
Joined
Jan 8, 2017
Messages
9,507 (3.27/day)
System Name Good enough
Processor AMD Ryzen R9 7900 - Alphacool Eisblock XPX Aurora Edge
Motherboard ASRock B650 Pro RS
Cooling 2x 360mm NexXxoS ST30 X-Flow, 1x 360mm NexXxoS ST30, 1x 240mm NexXxoS ST30
Memory 32GB - FURY Beast RGB 5600 Mhz
Video Card(s) Sapphire RX 7900 XT - Alphacool Eisblock Aurora
Storage 1x Kingston KC3000 1TB 1x Kingston A2000 1TB, 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) LG UltraGear 32GN650-B + 4K Samsung TV
Case Phanteks NV7
Power Supply GPS-750C
If Nvidia was going to sue they would have done so already.
If AMD is a tiny name on the map Intel is virtually nonexistent. No one cares if you can run CUDA on Intel GPUs, but they might if you can run it on these new MI300s.

But it doesn't matter because as far as I can tell this isn't actually backed by AMD.
 
Joined
Jul 5, 2013
Messages
28,263 (6.75/day)
Nvidia lawsuit arriving in 3...2...1...
I wouldn't be surprised, judging by the amount of nvidia lawyers I see in many posts' comment section.
Possibly. The individual is just a contractor so they'd go after AMD, Intel, and possibly Microsoft (Github) for hosting it.
NVidia's legal dept can do nothing about this. Every bit of it is covered by fair-use. NVidia would be fools to take any legal action whatsoever.

They likely don't care.
 
Last edited:
Joined
Apr 18, 2019
Messages
2,397 (1.15/day)
Location
Olympia, WA
System Name Sleepy Painter
Processor AMD Ryzen 5 3600
Motherboard Asus TuF Gaming X570-PLUS/WIFI
Cooling FSP Windale 6 - Passive
Memory 2x16GB F4-3600C16-16GVKC @ 16-19-21-36-58-1T
Video Card(s) MSI RX580 8GB
Storage 2x Samsung PM963 960GB nVME RAID0, Crucial BX500 1TB SATA, WD Blue 3D 2TB SATA
Display(s) Microboard 32" Curved 1080P 144hz VA w/ Freesync
Case NZXT Gamma Classic Black
Audio Device(s) Asus Xonar D1
Power Supply Rosewill 1KW on 240V@60hz
Mouse Logitech MX518 Legend
Keyboard Red Dragon K552
Software Windows 10 Enterprise 2019 LTSC 1809 17763.1757
AMD hired the same contractor that Intel had hired to get Cuda running on Intel gpu's. If Nvidia was going to sue they would have done so already.

NGL, I feel awful for the team/individual that worked on all this.
I might be overstating the complexity involved, but I see this as almost John Carmack tier,
and both Intel and AMD left them in the dust.



Looks like Intel and AMD both merely wanted a proof of concept.

Hopefully,
this will end up like MCM chips. Where, (eventually) companies pool resources together for a new platform/company-agnostic standard(s).

Esp. w/ Intel (GP)GPUs and AMD MI Accelerators in Datacenters, even 'the industry' might support efforts for a more-agnostic CUDA-descendant.
 
Joined
May 3, 2018
Messages
2,881 (1.19/day)
ZLUDA is done and dusted. There will be no more development. Neither Intel or AMD saw any upside in getting CUDA to run on their hardware. They would rather compete head-on with oneAPI, ROCm/HIP.

ZLUDA
 
Joined
May 31, 2016
Messages
4,443 (1.42/day)
Location
Currently Norway
System Name Bro2
Processor Ryzen 5800X
Motherboard Gigabyte X570 Aorus Elite
Cooling Corsair h115i pro rgb
Memory 32GB G.Skill Flare X 3200 CL14 @3800Mhz CL16
Video Card(s) Powercolor 6900 XT Red Devil 1.1v@2400Mhz
Storage M.2 Samsung 970 Evo Plus 500MB/ Samsung 860 Evo 1TB
Display(s) LG 27UD69 UHD / LG 27GN950
Case Fractal Design G
Audio Device(s) Realtec 5.1
Power Supply Seasonic 750W GOLD
Mouse Logitech G402
Keyboard Logitech slim
Software Windows 10 64 bit
I think it might be hard for NV to sue anyone. They are not creating CUDA cores, they just use blender with optimized CUDA and simulate them on any give GPU.
For the CUDA itself, these are not necessary to run apps with it. It is not hardware limited as advertised. Probably most NV stuff isn't but is being advertised as such and blocked so that you dont use it with other products. You all know why. I can bet same can be done to all those DLLS and FG etc.
If so, that means NV is charging extra for something that can work on any GPU. I really find hard to believe that a 3090Ti cant run FG or any given 3000 series card can't because of some hardware limitation.
 
Joined
Jan 14, 2019
Messages
12,586 (5.80/day)
Location
Midlands, UK
System Name Nebulon B
Processor AMD Ryzen 7 7800X3D
Motherboard MSi PRO B650M-A WiFi
Cooling be quiet! Dark Rock 4
Memory 2x 24 GB Corsair Vengeance DDR5-4800
Video Card(s) AMD Radeon RX 6750 XT 12 GB
Storage 2 TB Corsair MP600 GS, 2 TB Corsair MP600 R2
Display(s) Dell S3422DWG, 7" Waveshare touchscreen
Case Kolink Citadel Mesh black
Audio Device(s) Logitech Z333 2.1 speakers, AKG Y50 headphones
Power Supply Seasonic Prime GX-750
Mouse Logitech MX Master 2S
Keyboard Logitech G413 SE
Software Bazzite (Fedora Linux) KDE
For the CUDA itself, these are not necessary to run apps with it. It is not hardware limited as advertised. Probably most NV stuff isn't but is being advertised as such and blocked so that you dont use it with other products. You all know why. I can bet same can be done to all those DLLS and FG etc.
The same has been done on other GPUs - there's FSR. One can argue the minor differences in quality, but regardless of one's personal opinion, the technology is there, and it doesn't need no AI magic to run.
 
Top