
What local LLMs do you use?

Joined
Nov 23, 2023
Messages
315 (0.63/day)
I know, but it's like when you buy an expensive RWD sports car, it starts raining, and AWD cars that cost many times less suddenly become a lot faster. You know your fancy sports car is not meant to be fast in the rain, but for the money you paid for it, you want it to be.
It's more like bringing a Jeep to a racetrack and expecting it to beat a supercar. The Jeep is built for off-roading, after all - something a supercar can't do at all.
Well, when I started using LLMs, I picked the Q4 versions of the 32B models,
but after a lot of testing I liked the answers from Q6 more.
Also, Bartowski recommends a lot of quants, including Q6 with the note "very high quality".
Would be nice to have a 5090 to run this from VRAM ofc.
Some day we will! :D

If I need a quick answer, I have smaller models ready to load!
I'll take your word for it. I guess we just use our systems differently.
 
Joined
Mar 11, 2008
Messages
1,207 (0.19/day)
Location
Hungary / Budapest
System Name Kincsem
Processor AMD Ryzen 9 9950X
Motherboard ASUS ProArt X870E-CREATOR WIFI
Cooling Be Quiet Dark Rock Pro 5
Memory Kingston Fury KF560C32RSK2-96 (2×48GB 6GHz)
Video Card(s) Sapphire AMD RX 7900 XT Pulse
Storage Samsung 990PRO 2TB + Samsung 980PRO 2TB + FURY Renegade 2TB+ Adata 2TB + WD Ultrastar HC550 16TB
Display(s) Acer QHD 27"@144Hz 1ms + UHD 27"@60Hz
Case Cooler Master CM 690 III
Power Supply Seasonic 1300W 80+ Gold Prime
Mouse Logitech G502 Hero
Keyboard HyperX Alloy Elite RGB
Software Windows 10-64
Benchmark Scores https://valid.x86.fr/9qw7iq https://valid.x86.fr/4d8n02 X570 https://www.techpowerup.com/gpuz/g46uc
It's more like bringing a Jeep to a racetrack and expecting it to beat a supercar. The Jeep is built for off-roading, after all - something a supercar can't do at all.

I'll take your word for it. I guess we just use our systems differently.
Just completed testing with the new driver. Also, that previous number was measured while doing other browser stuff and watching YouTube.
LM Studio only gave this:
1741294438871.png

6.29 tks is more than double :D
 

johnspack

Here For Good!
Joined
Oct 6, 2007
Messages
6,069 (0.95/day)
Location
Nelson B.C. Canada
System Name System2 Blacknet , System1 Blacknet2
Processor System2 Threadripper 1920x, System1 2699 v3
Motherboard System2 Asrock Fatality x399 Professional Gaming, System1 Asus X99-A
Cooling System2 Noctua NH-U14 TR4-SP3 Dual 140mm fans, System1 AIO
Memory System2 64GBS DDR4 3000, System1 32gbs DDR4 2400
Video Card(s) System2 GTX 980Ti System1 GTX 970
Storage System2 4x SSDs + NVme= 2.250TB 2xStorage Drives=8TB System1 3x SSDs=2TB
Display(s) 1x27" 1440 display 1x 24" 1080 display
Case System2 Some Nzxt case with soundproofing...
Audio Device(s) Asus Xonar U7 MKII
Power Supply System2 EVGA 750 Watt, System1 XFX XTR 750 Watt
Mouse Logitech G900 Chaos Spectrum
Keyboard Ducky
Software Archlinux, Manjaro, Win11 Ent 24h2
Benchmark Scores It's linux baby!
If anyone is looking for uncensored models, I found another one. https://huggingface.co/bartowski/Qwen_QwQ-32B-GGUF
Here's the output for "What is Taiwan?" Unless all non-DeepSeek models are uncensored... I'm not even sure yet...
I'm preferring QwQ over DeepSeek now. It's just as fast and gives better details. The 2nd one took 23 secs on my ancient computer...

qwq.png
Screenshot 2025-03-10 180020.png
 
Joined
Mar 11, 2008
Messages
1,207 (0.19/day)
Location
Hungary / Budapest
System Name Kincsem
Processor AMD Ryzen 9 9950X
Motherboard ASUS ProArt X870E-CREATOR WIFI
Cooling Be Quiet Dark Rock Pro 5
Memory Kingston Fury KF560C32RSK2-96 (2×48GB 6GHz)
Video Card(s) Sapphire AMD RX 7900 XT Pulse
Storage Samsung 990PRO 2TB + Samsung 980PRO 2TB + FURY Renegade 2TB+ Adata 2TB + WD Ultrastar HC550 16TB
Display(s) Acer QHD 27"@144Hz 1ms + UHD 27"@60Hz
Case Cooler Master CM 690 III
Power Supply Seasonic 1300W 80+ Gold Prime
Mouse Logitech G502 Hero
Keyboard HyperX Alloy Elite RGB
Software Windows 10-64
Benchmark Scores https://valid.x86.fr/9qw7iq https://valid.x86.fr/4d8n02 X570 https://www.techpowerup.com/gpuz/g46uc
Well, it will take about 4 weeks before we see a QwQ-32B version like the actually de-censored DeepSeek.
On that note, DeepSeek v2 is coming on the 17th, according to the rumors.
 
Joined
Mar 11, 2008
Messages
1,207 (0.19/day)
Location
Hungary / Budapest
System Name Kincsem
Processor AMD Ryzen 9 9950X
Motherboard ASUS ProArt X870E-CREATOR WIFI
Cooling Be Quiet Dark Rock Pro 5
Memory Kingston Fury KF560C32RSK2-96 (2×48GB 6GHz)
Video Card(s) Sapphire AMD RX 7900 XT Pulse
Storage Samsung 990PRO 2TB + Samsung 980PRO 2TB + FURY Renegade 2TB+ Adata 2TB + WD Ultrastar HC550 16TB
Display(s) Acer QHD 27"@144Hz 1ms + UHD 27"@60Hz
Case Cooler Master CM 690 III
Power Supply Seasonic 1300W 80+ Gold Prime
Mouse Logitech G502 Hero
Keyboard HyperX Alloy Elite RGB
Software Windows 10-64
Benchmark Scores https://valid.x86.fr/9qw7iq https://valid.x86.fr/4d8n02 X570 https://www.techpowerup.com/gpuz/g46uc
Looks like Google was faster: we have Gemma before DeepSeek V2!
Gemma 3 27B Gemma 3 12B
The 7900XT does 39 tks while the rig is busy with other things!
Nice

It also has an image input option. Not bad, though a bit slow...
1741815397077.png
 

johnspack

Here For Good!
Joined
Oct 6, 2007
Messages
6,069 (0.95/day)
Location
Nelson B.C. Canada
System Name System2 Blacknet , System1 Blacknet2
Processor System2 Threadripper 1920x, System1 2699 v3
Motherboard System2 Asrock Fatality x399 Professional Gaming, System1 Asus X99-A
Cooling System2 Noctua NH-U14 TR4-SP3 Dual 140mm fans, System1 AIO
Memory System2 64GBS DDR4 3000, System1 32gbs DDR4 2400
Video Card(s) System2 GTX 980Ti System1 GTX 970
Storage System2 4x SSDs + NVme= 2.250TB 2xStorage Drives=8TB System1 3x SSDs=2TB
Display(s) 1x27" 1440 display 1x 24" 1080 display
Case System2 Some Nzxt case with soundproofing...
Audio Device(s) Asus Xonar U7 MKII
Power Supply System2 EVGA 750 Watt, System1 XFX XTR 750 Watt
Mouse Logitech G900 Chaos Spectrum
Keyboard Ducky
Software Archlinux, Manjaro, Win11 Ent 24h2
Benchmark Scores It's linux baby!
Joined
Mar 11, 2008
Messages
1,207 (0.19/day)
Location
Hungary / Budapest
System Name Kincsem
Processor AMD Ryzen 9 9950X
Motherboard ASUS ProArt X870E-CREATOR WIFI
Cooling Be Quiet Dark Rock Pro 5
Memory Kingston Fury KF560C32RSK2-96 (2×48GB 6GHz)
Video Card(s) Sapphire AMD RX 7900 XT Pulse
Storage Samsung 990PRO 2TB + Samsung 980PRO 2TB + FURY Renegade 2TB+ Adata 2TB + WD Ultrastar HC550 16TB
Display(s) Acer QHD 27"@144Hz 1ms + UHD 27"@60Hz
Case Cooler Master CM 690 III
Power Supply Seasonic 1300W 80+ Gold Prime
Mouse Logitech G502 Hero
Keyboard HyperX Alloy Elite RGB
Software Windows 10-64
Benchmark Scores https://valid.x86.fr/9qw7iq https://valid.x86.fr/4d8n02 X570 https://www.techpowerup.com/gpuz/g46uc
How are you running Gemma? Won't run under Koboldcpp, or maybe my system.
LM Studio 0.3.13 with ROCm llama.cpp v1.19.2 runtime.
 
Joined
Feb 12, 2025
Messages
25 (0.49/day)
Location
EU
Processor AMD 5600X
Motherboard ASUS TUF GAMING B550M-Plus WiFi
Cooling be quiet! Dark Rock 4
Memory G.Skill Ripjaws 2 x 32 GB DDR4-3600 CL18-22-22-42 1.35V F4-3600C18D-64GVK
Video Card(s) Sapphire Pulse RX 7800XT 16GB
Storage Kingston KC3000 2TB + QNAP TBS-464
Display(s) LG 35" LCD 35WN75C-B 3440x1440
Case Kolink Bastion RGB Midi-Tower
Power Supply Enermax Digifanless 550W
Mouse Razer Deathadder v2
Benchmark Scores phi4 - 42.00 tokens/s
How are you running Gemma? Won't run under Koboldcpp, or maybe my system.
It's now also available on Ollama. Can't wait to try it out, considering how good Gemma 2 is.
 

Easy Rhino

Linux Advocate
Staff member
Joined
Nov 13, 2006
Messages
15,676 (2.33/day)
Location
Mid-Atlantic
System Name Desktop
Processor i5 13600KF
Motherboard AsRock B760M Steel Legend Wifi
Cooling Noctua NH-U9S
Memory 4x 16 Gb Gskill S5 DDR5 @6000
Video Card(s) Gigabyte Gaming OC 6750 XT 12GB
Storage WD_BLACK 4TB SN850x
Display(s) Gigabye M32U
Case Corsair Carbide 400C
Audio Device(s) On Board
Power Supply EVGA Supernova 650 P2
Mouse MX Master 3s
Keyboard Logitech G915 Wireless Clicky
Software Fedora KDE Spin
It's now also available on Ollama. Can't wait to try it out, considering how good Gemma 2 is.

It is very good. I have it running with the updated Ollama version and new ROCm/Mesa libs, and it flies on the 6750 XT! Over 60 tps.
 
Joined
Mar 11, 2008
Messages
1,207 (0.19/day)
Location
Hungary / Budapest
System Name Kincsem
Processor AMD Ryzen 9 9950X
Motherboard ASUS ProArt X870E-CREATOR WIFI
Cooling Be Quiet Dark Rock Pro 5
Memory Kingston Fury KF560C32RSK2-96 (2×48GB 6GHz)
Video Card(s) Sapphire AMD RX 7900 XT Pulse
Storage Samsung 990PRO 2TB + Samsung 980PRO 2TB + FURY Renegade 2TB+ Adata 2TB + WD Ultrastar HC550 16TB
Display(s) Acer QHD 27"@144Hz 1ms + UHD 27"@60Hz
Case Cooler Master CM 690 III
Power Supply Seasonic 1300W 80+ Gold Prime
Mouse Logitech G502 Hero
Keyboard HyperX Alloy Elite RGB
Software Windows 10-64
Benchmark Scores https://valid.x86.fr/9qw7iq https://valid.x86.fr/4d8n02 X570 https://www.techpowerup.com/gpuz/g46uc
Joined
May 10, 2023
Messages
804 (1.16/day)
Location
Brazil
Processor 5950x
Motherboard B550 ProArt
Cooling Fuma 2
Memory 4x32GB 3200MHz Corsair LPX
Video Card(s) 2x RTX 3090
Display(s) LG 42" C2 4k OLED
Power Supply XPG Core Reactor 850W
Software I use Arch btw
I agree that Strix Halo's 256GB/s is puny, but then again that's the first-gen unified-memory PC platform from AMD. It has the potential to get much better over time.
DIGITS should get 900GB/s and a GPU to match that bandwidth. With 128GB of RAM, this will best many datacenter-class GPUs that cost a lot more money.
I've seen various Apple benchmarks on LLMs, and considering the hardware, they do not fare well; I suspect a lack of software optimization is partly to blame here.
Ultimately, merging system and GPU memory over a high-speed link offers better bang for the buck than just cranking up GDDR on a GPU each gen.

I've roamed around various datacenters for decades. Did so even last week. Low oxygen ones are "fun". Most high end compute in DCs is nowadays water cooled (DLC), especially GPUs, but we are going off topic here.
Turns out it's really disappointing for its price: it's 256-bit at 8533 MT/s, so 273GB/s

Bummer
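For anyone wondering where the 273GB/s figure comes from: it is just bus width times transfer rate. A minimal sketch of that arithmetic follows, plus a rough ceiling on decode speed that assumes every generated token has to read all the weights once. The function names and the 20 GB model size are made up for illustration and are not from the thread.

```python
def peak_bandwidth_gbs(bus_width_bits: int, transfer_rate_mts: float) -> float:
    """Theoretical peak memory bandwidth in GB/s (1 GB = 1e9 bytes)."""
    bytes_per_transfer = bus_width_bits / 8
    return bytes_per_transfer * transfer_rate_mts * 1e6 / 1e9


def decode_tokens_per_s_ceiling(bandwidth_gbs: float, model_size_gb: float) -> float:
    """Rough upper bound, assuming each token reads every weight once."""
    return bandwidth_gbs / model_size_gb


if __name__ == "__main__":
    bw = peak_bandwidth_gbs(256, 8533)
    print(f"256-bit @ 8533 MT/s: {bw:.0f} GB/s")
    # 20 GB is a hypothetical model size, not a number from the thread.
    ceiling = decode_tokens_per_s_ceiling(bw, 20)
    print(f"Ceiling for a 20 GB model: {ceiling:.1f} tok/s")
```

Real throughput will land below this ceiling, since it ignores compute, KV cache reads, and any software overhead.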
 

Easy Rhino

Linux Advocate
Staff member
Joined
Nov 13, 2006
Messages
15,676 (2.33/day)
Location
Mid-Atlantic
System Name Desktop
Processor i5 13600KF
Motherboard AsRock B760M Steel Legend Wifi
Cooling Noctua NH-U9S
Memory 4x 16 Gb Gskill S5 DDR5 @6000
Video Card(s) Gigabyte Gaming OC 6750 XT 12GB
Storage WD_BLACK 4TB SN850x
Display(s) Gigabye M32U
Case Corsair Carbide 400C
Audio Device(s) On Board
Power Supply EVGA Supernova 650 P2
Mouse MX Master 3s
Keyboard Logitech G915 Wireless Clicky
Software Fedora KDE Spin
Which one?
Gemma 3's 1B Q3 or 27B Q8?
Not irrelevant detail :D

Oops! The 4b one.

4B parameter model (128k context window)
With Q4

 
Joined
Apr 18, 2019
Messages
2,744 (1.26/day)
Location
PNW, USA
System Name Metalia
Processor AMD Ryzen 7 5800X3D
Motherboard Asus TuF Gaming X570-PLUS
Cooling ID Cooling 280mm AIO w/ Arctic P14s
Memory 2x32GB DDR4-3600
Video Card(s) Sapphire Pulse RX 9070 XT
Storage Optane P5801X 400GB, Samsung 990Pro 2TB
Display(s) LG ‎32GS95UV 32" OLED 240/480hz 4K/1080P Dual Mode
Case Geometric Future M8 Dharma
Audio Device(s) Xonar Essence STX
Power Supply Seasonic Focus GX-1000 Gold
Mouse Attack Shark R3 Magnesium - White
Keyboard Keychron K8 Pro - White - Tactile Brown Switch
Software Windows 10 IoT Enterprise LTSC 2021
Sorry if already answered, but I have a choice to make tomorrow. :p

For messing around with LLMs and Stable Diffusion,
7900XTX or 9070XT
Not enough room, cooling or PSU (1KW) for both

-And, would 128GB of sysRAM over the 64GB I have be worth it?
 
Joined
Mar 11, 2008
Messages
1,207 (0.19/day)
Location
Hungary / Budapest
System Name Kincsem
Processor AMD Ryzen 9 9950X
Motherboard ASUS ProArt X870E-CREATOR WIFI
Cooling Be Quiet Dark Rock Pro 5
Memory Kingston Fury KF560C32RSK2-96 (2×48GB 6GHz)
Video Card(s) Sapphire AMD RX 7900 XT Pulse
Storage Samsung 990PRO 2TB + Samsung 980PRO 2TB + FURY Renegade 2TB+ Adata 2TB + WD Ultrastar HC550 16TB
Display(s) Acer QHD 27"@144Hz 1ms + UHD 27"@60Hz
Case Cooler Master CM 690 III
Power Supply Seasonic 1300W 80+ Gold Prime
Mouse Logitech G502 Hero
Keyboard HyperX Alloy Elite RGB
Software Windows 10-64
Benchmark Scores https://valid.x86.fr/9qw7iq https://valid.x86.fr/4d8n02 X570 https://www.techpowerup.com/gpuz/g46uc
Sorry if already answered, but I have a choice to make tomorrow. :p

For messing around with LLMs and Stable Diffusion,
7900XTX or 9070XT
Not enough room, cooling or PSU (1KW) for both

-And, would 128GB of sysRAM over the 64GB I have be worth it?
For LLMs, the most important thing is VRAM.
You just can't have enough! So go with the 7900XTX, which has 24GB (8GB more than the 9070s).
The 9070 is also new tech, so the runtimes are not there yet; I see daily complaints on the LM Studio Discord asking when support will be available.
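To put rough numbers on the VRAM point, here is a back-of-the-envelope sketch for checking whether a quantized model fits on a card. The bits-per-weight values are ballpark assumptions for common llama.cpp quant types, the 2 GB overhead for KV cache and buffers is a guess, and the helper names are hypothetical. Treat the result as an estimate, not a guarantee.

```python
# Approximate bits per weight for common llama.cpp quant types.
# These are ballpark assumptions; real GGUF files vary by model.
BITS_PER_WEIGHT = {"Q4_K_M": 4.85, "Q6_K": 6.56, "Q8_0": 8.5}


def weights_size_gb(params_billion: float, quant: str) -> float:
    """Estimated size of the quantized weights in GB (1 GB = 1e9 bytes)."""
    return params_billion * BITS_PER_WEIGHT[quant] / 8


def fits_in_vram(params_billion: float, quant: str, vram_gb: float,
                 overhead_gb: float = 2.0) -> bool:
    """True if weights plus a flat allowance for KV cache/buffers fit."""
    return weights_size_gb(params_billion, quant) + overhead_gb <= vram_gb


if __name__ == "__main__":
    for quant in ("Q4_K_M", "Q6_K"):
        size = weights_size_gb(32, quant)
        for vram in (16, 24):
            verdict = "fits" if fits_in_vram(32, quant, vram) else "does not fit"
            print(f"32B {quant} (~{size:.1f} GB) {verdict} in {vram} GB")
```

By this estimate a 32B model at Q4 squeezes into 24GB but not 16GB, while Q6 spills out of both, which is where partial offload to system RAM comes in.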
 
Joined
Apr 18, 2019
Messages
2,744 (1.26/day)
Location
PNW, USA
System Name Metalia
Processor AMD Ryzen 7 5800X3D
Motherboard Asus TuF Gaming X570-PLUS
Cooling ID Cooling 280mm AIO w/ Arctic P14s
Memory 2x32GB DDR4-3600
Video Card(s) Sapphire Pulse RX 9070 XT
Storage Optane P5801X 400GB, Samsung 990Pro 2TB
Display(s) LG ‎32GS95UV 32" OLED 240/480hz 4K/1080P Dual Mode
Case Geometric Future M8 Dharma
Audio Device(s) Xonar Essence STX
Power Supply Seasonic Focus GX-1000 Gold
Mouse Attack Shark R3 Magnesium - White
Keyboard Keychron K8 Pro - White - Tactile Brown Switch
Software Windows 10 IoT Enterprise LTSC 2021
For LLMs, the most important thing is VRAM.
You just can't have enough! So go with the 7900XTX, which has 24GB (8GB more than the 9070s).
The 9070 is also new tech, so the runtimes are not there yet; I see daily complaints on the LM Studio Discord asking when support will be available.

Question then.
Is rDMA useable/useful?

I have some MI25s/WX9100s with 16GB HBM. Any way to utilize 'em? (I figure the 7900XTX is more powerful than 2 MI25s, just from generational improvements.)
 
Joined
May 10, 2023
Messages
804 (1.16/day)
Location
Brazil
Processor 5950x
Motherboard B550 ProArt
Cooling Fuma 2
Memory 4x32GB 3200MHz Corsair LPX
Video Card(s) 2x RTX 3090
Display(s) LG 42" C2 4k OLED
Power Supply XPG Core Reactor 850W
Software I use Arch btw
Sorry if already answered, but I have a choice to make tomorrow. :p

For messing around with LLMs and Stable Diffusion,
7900XTX or 9070XT
Not enough room, cooling or PSU (1KW) for both

-And, would 128GB of sysRAM over the 64GB I have be worth it?
As said above, the 7900XTX has both more memory and more memory bandwidth, so it'll end up faster in most cases.
Software support is kinda meh and you won't be extracting 100% of the theoretical performance of that GPU, but you can get things done nonetheless, especially if you don't mind tinkering around. The 9070 XT still has no ROCm support, but it seems like some things can be done with the Vulkan backend.

128GB of RAM would only be useful if you're offloading some layers from larger models into RAM, but then things would be really slow. If you already have 64GB, I'd say to continue with it until you feel the need to upgrade. Otherwise, you could try to start with 2x32/48/64GB and then add another pair of sticks down the road.
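As a rough illustration of the offloading trade-off, the sketch below estimates how many layers of a model could stay on the GPU, with the rest going to system RAM. It assumes all layers are the same size and reserves a flat 2 GB of VRAM for context and buffers; both are simplifications, and the function name and example numbers are hypothetical. The result is the kind of value you would pass as the GPU layer count in llama.cpp-based tools (`--n-gpu-layers` in llama.cpp itself).

```python
def gpu_layers_that_fit(model_size_gb: float, n_layers: int,
                        vram_gb: float, reserve_gb: float = 2.0) -> int:
    """Number of equally sized layers that fit in VRAM after a reserve."""
    per_layer_gb = model_size_gb / n_layers
    usable_gb = max(vram_gb - reserve_gb, 0.0)
    return min(n_layers, int(usable_gb // per_layer_gb))


if __name__ == "__main__":
    # Hypothetical example: a ~26 GB model with 64 layers on a 24 GB card.
    on_gpu = gpu_layers_that_fit(26.24, 64, 24)
    print(f"{on_gpu} layers on the GPU, {64 - on_gpu} in system RAM")
```

Every layer left in system RAM runs at system memory speed, which is why things get slow quickly once a model no longer fits.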
Is rDMA useable/useful?
Eh, not really, each GPU has to keep the layers it's working on within its own memory. Having access to the memory of another GPU is not really useful in this case.
I have some MI25s/WX9100s with 16GB HBM. Any way to utilize 'em? (I figure the 7900XTX is more powerful than 2 MI25s, just from generational improvements.)
Those would be kinda slow, even slower than something like a RTX 3060.
Nonetheless, you can try to get those working. I've seen some reports of that on other forums, even if performance isn't all that great:
 
Joined
Apr 18, 2019
Messages
2,744 (1.26/day)
Location
PNW, USA
System Name Metalia
Processor AMD Ryzen 7 5800X3D
Motherboard Asus TuF Gaming X570-PLUS
Cooling ID Cooling 280mm AIO w/ Arctic P14s
Memory 2x32GB DDR4-3600
Video Card(s) Sapphire Pulse RX 9070 XT
Storage Optane P5801X 400GB, Samsung 990Pro 2TB
Display(s) LG ‎32GS95UV 32" OLED 240/480hz 4K/1080P Dual Mode
Case Geometric Future M8 Dharma
Audio Device(s) Xonar Essence STX
Power Supply Seasonic Focus GX-1000 Gold
Mouse Attack Shark R3 Magnesium - White
Keyboard Keychron K8 Pro - White - Tactile Brown Switch
Software Windows 10 IoT Enterprise LTSC 2021
As said above, the 7900XTX has both more memory and more memory bandwidth, so it'll end up faster in most cases.
Software support is kinda meh and you won't be extracting 100% of the theoretical performance of that GPU, but you can get things done nonetheless, especially if you don't mind tinkering around. The 9070 XT still has no ROCm support, but it seems like some things can be done with the Vulkan backend.

128GB of RAM would only be useful if you're offloading some layers from larger models into RAM, but then things would be really slow. If you already have 64GB, I'd say to continue with it until you feel the need to upgrade. Otherwise, you could try to start with 2x32/48/64GB and then add another pair of sticks down the road.

Eh, not really, each GPU has to keep the layers it's working on within its own memory. Having access to the memory of another GPU is not really useful in this case.

Those would be kinda slow, even slower than something like a RTX 3060.
Nonetheless, you can try to get those working. I've seen some reports of that on other forums, even if performance isn't all that great:
In my current living situation, I'm looking for the most efficient and effective use of my hardware. -I can really only have 1 'rig' set up at any given time.

Now, I'm seriously considering how to get both the 9070xt and 7900xtx in my rig without the PSU tripping. (And, how to recover the covered-up slots)
 
Joined
May 10, 2023
Messages
804 (1.16/day)
Location
Brazil
Processor 5950x
Motherboard B550 ProArt
Cooling Fuma 2
Memory 4x32GB 3200MHz Corsair LPX
Video Card(s) 2x RTX 3090
Display(s) LG 42" C2 4k OLED
Power Supply XPG Core Reactor 850W
Software I use Arch btw
In my current living situation, I'm looking for the most efficient and effective use of my hardware. -I can really only have 1 'rig' set up at any given time.

Now, I'm seriously considering how to get both the 9070xt and 7900xtx in my rig without the PSU tripping. (And, how to recover the covered-up slots)
Isn't power limiting those a thing?
I currently run 2x3090s with an 850W PSU without any issues; each is set at 275W.
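The budget behind that setup can be sketched as simple arithmetic. The 142W CPU figure and the 50W allowance for everything else are assumptions for illustration, not measurements from this rig, and a power limit caps sustained draw rather than short transient spikes, so some headroom is still worth keeping.

```python
def psu_headroom_w(psu_w: float, gpu_limits_w: list[float],
                   cpu_w: float, other_w: float = 50.0) -> float:
    """Watts left on the PSU after GPUs, CPU, and a flat allowance."""
    return psu_w - (sum(gpu_limits_w) + cpu_w + other_w)


if __name__ == "__main__":
    # 850 W PSU and two GPUs capped at 275 W each, as in the post above.
    # cpu_w=142 is an assumed CPU package limit, not a measured value.
    print(f"Headroom: {psu_headroom_w(850, [275, 275], cpu_w=142):.0f} W")
```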
 
Joined
Apr 18, 2019
Messages
2,744 (1.26/day)
Location
PNW, USA
System Name Metalia
Processor AMD Ryzen 7 5800X3D
Motherboard Asus TuF Gaming X570-PLUS
Cooling ID Cooling 280mm AIO w/ Arctic P14s
Memory 2x32GB DDR4-3600
Video Card(s) Sapphire Pulse RX 9070 XT
Storage Optane P5801X 400GB, Samsung 990Pro 2TB
Display(s) LG ‎32GS95UV 32" OLED 240/480hz 4K/1080P Dual Mode
Case Geometric Future M8 Dharma
Audio Device(s) Xonar Essence STX
Power Supply Seasonic Focus GX-1000 Gold
Mouse Attack Shark R3 Magnesium - White
Keyboard Keychron K8 Pro - White - Tactile Brown Switch
Software Windows 10 IoT Enterprise LTSC 2021
Isn't power limiting those a thing?
I currently run 2x3090s with an 850W PSU without any issues; each is set at 275W.
It is, and I did have that in mind. However, I have 0 idea how any of that is configured in Linux.
NGL, Last time I tried tweaking a GPU in Linux, I gave up and reinstalled Windows :oops:

I've accepted that I'll have to dual-boot, for ROCm support...
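On the Linux configuration question: the amdgpu kernel driver exposes the board power cap through hwmon files in sysfs, with values in microwatts. Below is a minimal sketch that lists and sets the cap. The exact directory layout can differ between systems, writing needs root, and the helper names are made up, so check the paths on your own machine before relying on it.

```python
from pathlib import Path


def read_power_caps(drm_root: str = "/sys/class/drm") -> dict[str, float]:
    """Return {cap_file_path: watts} for every GPU exposing power1_cap."""
    caps = {}
    pattern = "card*/device/hwmon/hwmon*/power1_cap"
    for cap_file in sorted(Path(drm_root).glob(pattern)):
        microwatts = int(cap_file.read_text().strip())
        caps[str(cap_file)] = microwatts / 1_000_000
    return caps


def set_power_cap(cap_file: str, watts: float) -> None:
    """Write a new cap in microwatts; needs root on a real system."""
    Path(cap_file).write_text(str(int(watts * 1_000_000)))


if __name__ == "__main__":
    # Read-only by default: just list the current caps.
    for path, watts in read_power_caps().items():
        print(f"{path}: {watts:.0f} W")
```

The driver also publishes `power1_cap_min` and `power1_cap_max` next to `power1_cap`, which show the range it will accept.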
 
Joined
Mar 11, 2008
Messages
1,207 (0.19/day)
Location
Hungary / Budapest
System Name Kincsem
Processor AMD Ryzen 9 9950X
Motherboard ASUS ProArt X870E-CREATOR WIFI
Cooling Be Quiet Dark Rock Pro 5
Memory Kingston Fury KF560C32RSK2-96 (2×48GB 6GHz)
Video Card(s) Sapphire AMD RX 7900 XT Pulse
Storage Samsung 990PRO 2TB + Samsung 980PRO 2TB + FURY Renegade 2TB+ Adata 2TB + WD Ultrastar HC550 16TB
Display(s) Acer QHD 27"@144Hz 1ms + UHD 27"@60Hz
Case Cooler Master CM 690 III
Power Supply Seasonic 1300W 80+ Gold Prime
Mouse Logitech G502 Hero
Keyboard HyperX Alloy Elite RGB
Software Windows 10-64
Benchmark Scores https://valid.x86.fr/9qw7iq https://valid.x86.fr/4d8n02 X570 https://www.techpowerup.com/gpuz/g46uc
It is, and I did have that in mind. However, I have 0 idea how any of that is configured in Linux.
NGL, Last time I tried tweaking a GPU in Linux, I gave up and reinstalled Windows :oops:

I've accepted that I'll have to dual-boot, for ROCm support...
Just make sure the PSU you pick is capable of delivering the power written on its side.
I would recommend looking at Seasonic or Corsair PSUs first.
 
Joined
May 10, 2023
Messages
804 (1.16/day)
Location
Brazil
Processor 5950x
Motherboard B550 ProArt
Cooling Fuma 2
Memory 4x32GB 3200MHz Corsair LPX
Video Card(s) 2x RTX 3090
Display(s) LG 42" C2 4k OLED
Power Supply XPG Core Reactor 850W
Software I use Arch btw
It is, and I did have that in mind. However, I have 0 idea how any of that is configured in Linux.
NGL, Last time I tried tweaking a GPU in Linux, I gave up and reinstalled Windows :oops:

I've accepted that I'll have to dual-boot, for ROCm support...
Long time since I last used an AMD dGPU, but I hope one of those tools could be of use for that:
 
Joined
Apr 18, 2019
Messages
2,744 (1.26/day)
Location
PNW, USA
System Name Metalia
Processor AMD Ryzen 7 5800X3D
Motherboard Asus TuF Gaming X570-PLUS
Cooling ID Cooling 280mm AIO w/ Arctic P14s
Memory 2x32GB DDR4-3600
Video Card(s) Sapphire Pulse RX 9070 XT
Storage Optane P5801X 400GB, Samsung 990Pro 2TB
Display(s) LG ‎32GS95UV 32" OLED 240/480hz 4K/1080P Dual Mode
Case Geometric Future M8 Dharma
Audio Device(s) Xonar Essence STX
Power Supply Seasonic Focus GX-1000 Gold
Mouse Attack Shark R3 Magnesium - White
Keyboard Keychron K8 Pro - White - Tactile Brown Switch
Software Windows 10 IoT Enterprise LTSC 2021
Just make sure the PSU you pick is capable of delivering the power written on its side.
I would recommend looking at Seasonic or Corsair PSUs first.
Seasonic Focus Gold GX1000, IIRC. A friend sold it to me BNIB, for a Benjamin in January.

The Seasonic could probably sustain unmitigated full load from 5800X3D+7900XTX+9070XT, but I really really do not want to abuse it,
and my chassis already doubles as a highly-effective space heater. :eek:
Power Limiting would be a necessity, regardless of the PSU's capabilities

TBQH, I'm more 'upset' over the proposed loss of access to my expansion slots.
1742334730291.png

I'll just buy more flat risers, I suppose. -assuming the headers for USB and FP aren't in the way.
1742335816432.png
1742335882382.png


Long time since I last used an AMD dGPU, but I hope one of those tools could be of use for that:
Thank you!

Is there any workaround for No HDMI 2.1 support in Linux? My LG OLED has HDMI 2.1 or DP 1.4a.


Sorry if this is drifting slightly from the topic. -Kinda figure 'hardware setup' is a big part of running LLMs.
 
Joined
May 10, 2023
Messages
804 (1.16/day)
Location
Brazil
Processor 5950x
Motherboard B550 ProArt
Cooling Fuma 2
Memory 4x32GB 3200MHz Corsair LPX
Video Card(s) 2x RTX 3090
Display(s) LG 42" C2 4k OLED
Power Supply XPG Core Reactor 850W
Software I use Arch btw
Is there any workaround for No HDMI 2.1 support in Linux? My LG OLED has HDMI 2.1 or DP 1.4a.
Not that I know of, at least so far. AMD would need to add the HDMI handling stuff into their firmware for it to work.
However, you may be able to get it working with a DP to HDMI adapter. Some folks in the below thread reported some success with it:
 
Joined
Nov 23, 2023
Messages
315 (0.63/day)
Sorry if already answered, but I have a choice to make tomorrow. :p

For messing around with LLMs and Stable Diffusion,
7900XTX or 9070XT
Not enough room, cooling or PSU (1KW) for both

-And, would 128GB of sysRAM over the 64GB I have be worth it?
If you're buying something right now, RDNA 4 shouldn't even be an option for you. There's no ROCm support. More RAM is a good upgrade, you can run bigger models and partially offload or use larger contexts.
Is rDMA useable/useful?
No.
I have some MI25s/WX9100s with 16GB HBM. Any way to utilize 'em? (I figure the 7900XTX is more powerful than 2 MI25s, just from generational improvements.)
They're good for raw VRAM. They won't have flash attention so your context will fill up eventually and they're trash for SD, but inference speed for LLMs will be fine. Don't know if they'll do tensor parallelism though.
TBQH, I'm more 'upset' over the proposed loss of access to my expansion slots.
You should see the new X870 motherboards, they're even worse.
Is there any workaround for No HDMI 2.1 support in Linux? My LG OLED has HDMI 2.1 or DP 1.4a.
No. The HDMI Forum won't let Mesa support 2.1, and Radeon no longer has a proprietary driver for Linux. You'll have to use DP 1.4a.
 
Joined
Apr 18, 2019
Messages
2,744 (1.26/day)
Location
PNW, USA
System Name Metalia
Processor AMD Ryzen 7 5800X3D
Motherboard Asus TuF Gaming X570-PLUS
Cooling ID Cooling 280mm AIO w/ Arctic P14s
Memory 2x32GB DDR4-3600
Video Card(s) Sapphire Pulse RX 9070 XT
Storage Optane P5801X 400GB, Samsung 990Pro 2TB
Display(s) LG ‎32GS95UV 32" OLED 240/480hz 4K/1080P Dual Mode
Case Geometric Future M8 Dharma
Audio Device(s) Xonar Essence STX
Power Supply Seasonic Focus GX-1000 Gold
Mouse Attack Shark R3 Magnesium - White
Keyboard Keychron K8 Pro - White - Tactile Brown Switch
Software Windows 10 IoT Enterprise LTSC 2021
If you're buying something right now, RDNA 4 shouldn't even be an option for you. There's no ROCm support. More RAM is a good upgrade, you can run bigger models and partially offload or use larger contexts.
Already bought a 7900XTX (upgrading from a 7900 GRE) at the beginning of the year, and an RX 9070 XT a few days ago. I wanted the better RT perf. of Navi 48 and less heat while gaming this summer.
Good to know. rDMA is non-relevant, got it.
They're good for raw VRAM. They won't have flash attention so your context will fill up eventually and they're trash for SD, but inference speed for LLMs will be fine. Don't know if they'll do tensor parallelism though.
My hope was that they could be used as slot-in HBM cache (bottlenecked by PCIe latency/speed, ofc). IIRC, you cannot 'spread' LLM workloads across separate VRAM segments.
You should see the new X870 motherboards, they're even worse.
I know. I've given up on looking towards AM5 until a new PROMOTORY is released. The standardized fanout for PCIe on AM5 is unironically, literally, and un-amusingly "retarded".
No. The HDMI Forum won't let Mesa support 2.1, and Radeon no longer has a proprietary driver for Linux. You'll have to use DP 1.4a.
Since I'll likely game on Windows anyway, not a huge loss.
 