- Joined
- Jun 25, 2020
- Messages
- 153 (0.09/day)
System Name | The New, Improved, Vicious, Stable, Silent Gaming Space Heater |
---|---|
Processor | Ryzen 7 5800X3D |
Motherboard | MSI B450 Tomahawk Max |
Cooling | be quiet! DRP4 (w/ added SilentWings3), 4x Noctua A14x25G2 (3 @ front, 1 @ back) |
Memory | Teamgroup DDR4 3600 16GBx2 @18-22-22-22-42 -> 18-20-20-20-40 |
Video Card(s) | PowerColor RX7900XTX HellHound |
Storage | ADATA SX8200Pro 1TB, Crucial P3+ 4TB (w/riser, @Gen2x4), Seagate 3+1TB HDD, Micron 5300 7.68TB SATA |
Display(s) | Gigabyte M27U @4K150Hz, AOC 24G2 @1080p100Hz(Max144Hz) vertical, ASUS VP228H@1080p60Hz vertical |
Case | Phanteks P600S |
Audio Device(s) | Creative Katana V2X gaming soundbar |
Power Supply | Seasonic Vertex GX-1200 (ATX3.0 compliant) |
Mouse | Razer Deathadder V3 wired |
Keyboard | Keychron Q6Max |
System spec as in profile. Please move to appropriate subforum if needed.
I recently upgraded my GPU from a 3070 to the current 7900XTX. There are a few instabilities here and there, but because at the time I only played Forza Horizon 5 and Forza Motorsport 8, and FM8 is known to hate 7900XTX in its day one version, so I brushed it off as a game-specific thing. Also at the same time, The Crew Motorfest also throws some driver timeout crashes, but it is infrequent enough to not caught my attention.
More recently, as winter is coming, I started folding@home again, which depends on the the work units, will be completely fine, or, instantly causes reboot, sometimes repeatedly. So I tried a few things to ensure the system is stable, but to no avail, and potentially made things worse.
Everytime it reboots, there are no BSODs, and no Event Viewer informations other than the generic "System unexpectedly shutdown" stuff. (If there are other specific codes / keywords I need to search, tell me!)
Things I tried:
- Ensure power cords are properly plugged.
- DDU -> Clean install latest drivers. Tried "driver only" and "full install".
- optimized defaults on motherboard BIOS.
-- BIOS config only has the Kombo thing (which should mean CO -30), a quick and dirty RAM profile, fan curve related stuff, and PCIe ASPM settings (default [Disabled], tried [Auto] and [L0s and L1 states]. Currently [Auto].)
- Running whatever controllable fans at full blast.
- Uninstalled Afterburner and RTSS.
-- I have heard that it helps with crashes in FH5. It didn't help in my case. I will reinstall Afterburner if you guys say so.
- Flipping VBIOS switch on the GPU, and then DDU -> clean install drivers. Currently on Quiet VBIOS. RGBs on card is always off.
- Flipping the PCIe link state settings on Windows.
-- Switching from Moderately power saving to no power savings worsened the situation a lot. Almost all driver timeouts on the "Things that crashed" section can be read as instant reboot.
-- Switching to Maximum power savings doesn't help the situation. It also induces some frame rate instability.
- Forcing a 59.94Hz or 60Hz on all three monitors.
- Power limit -10% on AMD Adreanlin. It can only go that low.
- Disconnecting both 1080p monitors.
- Plugging the PC to a standalone socket instead of a power...strip? tower? thingy. (the power tower thingy is from a local/regional reputable brand. It has lots of sockets. It has the PC, three monitors, a PS4Pro and a XBox360 plugged. Both consoles are off, and the monitors aren't that hungry to begin with, but it's probably worth a try. It didn't work.
Things Iplan to won't try
- A previous driver version (23.11.1). Not sure how it fares on Folding. But it crashed on FH5 though.
Assuming the PCIe link state thing is set to Moderate power savings...
Things that the system currently will not crash
- General light, non-gaming usage
- "Lightweight" games (I tried WWE 2K23, TBP ~160W, nothing happens)
- Superposition benchmark
- Port Royal and Speedway stress test alone
- Port Royal and Speedway stress test + a CPU folding workload
-- Note that it finished the test. Of course it will not pass in such state. Also, Port Royal + CPU workload with no power savings may cause a reboot.
EDIT: Folding by GPU only won't crash. Or, it survived for 12hrs so it should be fine.
Furmark + CPU folding is also probably fine (survived 30mins)
Things that the system currently crashes
- Forza Motorsport 8 (4K, everything Ultra w/RT, not frame limited. It should be 30~100FPS depends on tracks.)
-- The lighting will sometimes glitch out to neon-like, which greatly increases the chance to driver timeout. Sometimes the lighting glitch will resolve by running the race as if nothing happened. Sometimes it will not. Crashes are rare, but will occur from outta nowhere anyway.
-- Lowering details and limit to 75FPS will help, but not completely resolve the situation.
- Forza Horizon 5 (4K, everything Ultra, 150FPS limited). Normally crashes 5~30mins after game launch.
-- Lowering details and further limit FPS to 75 will help, but not completely resolve the situation. Also the lighting glitch and "environment disappear" glitch in FM8 also occurs here.
- The Crew Motorfest (4K, everything Ultra, 60FPS limited by game engine). Normally crashes 15min~2hr after game launch.
-- I can't recall it caused a system reboot ever. But it will eventually freeze, throws a driver timeout error + a "GPU lost!" error message.
- CPU + GPU folding workloads. Sometimes it works fine. Sometimes it's crash city. Sometimes it reboots whenever the GPU tries to start a WU. Probably depends on WUs.
Other basic information:
- Temperatures are all fine. ~85C for CPU. ~92C for GPU hotspot and VRAM. Disks and Motherboard are <60C.
- Outdoor temperature was 6C at the lowest. And it was pretty dry. It is also cold AF in my room, but this level of coldness by itself shouldn't cause any issues. Statics might be, but I'm a potato in regards to such knowledge.
- BIOS is at the time latest version.
- GPU drivers is latest (23.12.1). Cannot be sure about other drivers, which probably means it isn't latest.
- I have not touched any GPU settings other than power limit. No overclock.
- I haven't properly tried underclock and undervolt. With such instability I can't be sure what settings are good.
- You may remember me worrying about idle power draw and fiddling with Custom Resolution Utility (CRU). DDU should remove all CRU settings (I need to re-enter the settings on CRU after DDU), and crashes occur without CRU settings.
- I've heard there are many versions of Corsair RM850. Judging from the fonts, it should be 2019,but I vaguely remembers that it existed in my household before it was purchased in March 2019. I will try to find exactly what it is.
- The GPU in question uses two 8-pin cables and have a power limit of 350W in quiet VBIOS. I have used two separate cables, using themiddle (EDIT: end; middle plug is not usable due to card overhang/ cable length) plug of the pigtail cables. It is plugged on the bottom right sockets on the PSU side. I might have missed the "correct" sockets to use in this situation.
-I won't have any way to test the 7900XTX on other computer. EDIT: My brother agreed to test with his PC, but he got a slightly more potato but kinda newer PSU. See post #5.
I have thoughts on what the issue is, but before me jumping onto conclusions, I would like to hear what your thoughts are, and if I have missed some other things which might be helpful.
Thanks in advance.
I recently upgraded my GPU from a 3070 to the current 7900XTX. There are a few instabilities here and there, but because at the time I only played Forza Horizon 5 and Forza Motorsport 8, and FM8 is known to hate 7900XTX in its day one version, so I brushed it off as a game-specific thing. Also at the same time, The Crew Motorfest also throws some driver timeout crashes, but it is infrequent enough to not caught my attention.
More recently, as winter is coming, I started folding@home again, which depends on the the work units, will be completely fine, or, instantly causes reboot, sometimes repeatedly. So I tried a few things to ensure the system is stable, but to no avail, and potentially made things worse.
Everytime it reboots, there are no BSODs, and no Event Viewer informations other than the generic "System unexpectedly shutdown" stuff. (If there are other specific codes / keywords I need to search, tell me!)
Things I tried:
- Ensure power cords are properly plugged.
- DDU -> Clean install latest drivers. Tried "driver only" and "full install".
- optimized defaults on motherboard BIOS.
-- BIOS config only has the Kombo thing (which should mean CO -30), a quick and dirty RAM profile, fan curve related stuff, and PCIe ASPM settings (default [Disabled], tried [Auto] and [L0s and L1 states]. Currently [Auto].)
- Running whatever controllable fans at full blast.
- Uninstalled Afterburner and RTSS.
-- I have heard that it helps with crashes in FH5. It didn't help in my case. I will reinstall Afterburner if you guys say so.
- Flipping VBIOS switch on the GPU, and then DDU -> clean install drivers. Currently on Quiet VBIOS. RGBs on card is always off.
- Flipping the PCIe link state settings on Windows.
-- Switching from Moderately power saving to no power savings worsened the situation a lot. Almost all driver timeouts on the "Things that crashed" section can be read as instant reboot.
-- Switching to Maximum power savings doesn't help the situation. It also induces some frame rate instability.
- Forcing a 59.94Hz or 60Hz on all three monitors.
- Power limit -10% on AMD Adreanlin. It can only go that low.
- Disconnecting both 1080p monitors.
- Plugging the PC to a standalone socket instead of a power...strip? tower? thingy. (the power tower thingy is from a local/regional reputable brand. It has lots of sockets. It has the PC, three monitors, a PS4Pro and a XBox360 plugged. Both consoles are off, and the monitors aren't that hungry to begin with, but it's probably worth a try. It didn't work.
Things I
- A previous driver version (23.11.1). Not sure how it fares on Folding. But it crashed on FH5 though.
Assuming the PCIe link state thing is set to Moderate power savings...
Things that the system currently will not crash
- General light, non-gaming usage
- "Lightweight" games (I tried WWE 2K23, TBP ~160W, nothing happens)
- Superposition benchmark
- Port Royal and Speedway stress test alone
- Port Royal and Speedway stress test + a CPU folding workload
-- Note that it finished the test. Of course it will not pass in such state. Also, Port Royal + CPU workload with no power savings may cause a reboot.
EDIT: Folding by GPU only won't crash. Or, it survived for 12hrs so it should be fine.
Furmark + CPU folding is also probably fine (survived 30mins)
Things that the system currently crashes
- Forza Motorsport 8 (4K, everything Ultra w/RT, not frame limited. It should be 30~100FPS depends on tracks.)
-- The lighting will sometimes glitch out to neon-like, which greatly increases the chance to driver timeout. Sometimes the lighting glitch will resolve by running the race as if nothing happened. Sometimes it will not. Crashes are rare, but will occur from outta nowhere anyway.
-- Lowering details and limit to 75FPS will help, but not completely resolve the situation.
- Forza Horizon 5 (4K, everything Ultra, 150FPS limited). Normally crashes 5~30mins after game launch.
-- Lowering details and further limit FPS to 75 will help, but not completely resolve the situation. Also the lighting glitch and "environment disappear" glitch in FM8 also occurs here.
- The Crew Motorfest (4K, everything Ultra, 60FPS limited by game engine). Normally crashes 15min~2hr after game launch.
-- I can't recall it caused a system reboot ever. But it will eventually freeze, throws a driver timeout error + a "GPU lost!" error message.
- CPU + GPU folding workloads. Sometimes it works fine. Sometimes it's crash city. Sometimes it reboots whenever the GPU tries to start a WU. Probably depends on WUs.
Other basic information:
- Temperatures are all fine. ~85C for CPU. ~92C for GPU hotspot and VRAM. Disks and Motherboard are <60C.
- Outdoor temperature was 6C at the lowest. And it was pretty dry. It is also cold AF in my room, but this level of coldness by itself shouldn't cause any issues. Statics might be, but I'm a potato in regards to such knowledge.
- BIOS is at the time latest version.
- GPU drivers is latest (23.12.1). Cannot be sure about other drivers, which probably means it isn't latest.
- I have not touched any GPU settings other than power limit. No overclock.
- I haven't properly tried underclock and undervolt. With such instability I can't be sure what settings are good.
- You may remember me worrying about idle power draw and fiddling with Custom Resolution Utility (CRU). DDU should remove all CRU settings (I need to re-enter the settings on CRU after DDU), and crashes occur without CRU settings.
- I've heard there are many versions of Corsair RM850. Judging from the fonts, it should be 2019,
- The GPU in question uses two 8-pin cables and have a power limit of 350W in quiet VBIOS. I have used two separate cables, using the
-
I have thoughts on what the issue is, but before me jumping onto conclusions, I would like to hear what your thoughts are, and if I have missed some other things which might be helpful.
Thanks in advance.
As I have a habit to really make the system quiet, first my thought was "something not in the sensors is too hot". Back when winter wasn't here, it throws some rather unpleasant hot air...but nothing horribly bad. As stated before, pushing all fans to 100% doesn't help.
When I still have the 3070, everytime it frequently reboots it is almost certainly due to the cable not plugged properly. Which drives my attention to the PSU. When I still used the 3070, it only reboots when there is no workload, and when I'm not using the computer. No, it is not Windows Update, but I don't know what it is. It normally happens in weeks, so whatever.
The situation resembles to the dreaded T-word...I mean transient spikes on 3090s / other 7900XTXs. And then I looked harder on HWINFO64.
There's a number, GPU Power Maximum, which I have seen to go as high as 580W. I know this is not a number that can be used in review (it does not state the time period of such maximum; and if such number is usable it devalues a lot of professional reviews), but this number does make me worry about the transients.
I know RM850 is supposed to be a super high quality unit, but I am pretty sure at this point that the PSU ate too much dust / degraded / was not good enough to start with. Attempts to open up the PSU to clean up only leads to the destruction of a "DANGER HIGH VOLTAGE" sticker, which probably also functions as a "Warranty void if broken" sticker. I failed to get inside it, which may be a good thing. I curse my stupidity .
Other possibilities I can think of include "It is actually a bad card" and "LOL AMD drivers".
If it turns into a "which PSU should I buy" thread, just to be ultra safe I will need your advice anyway.
When I still have the 3070, everytime it frequently reboots it is almost certainly due to the cable not plugged properly. Which drives my attention to the PSU. When I still used the 3070, it only reboots when there is no workload, and when I'm not using the computer. No, it is not Windows Update, but I don't know what it is. It normally happens in weeks, so whatever.
The situation resembles to the dreaded T-word...I mean transient spikes on 3090s / other 7900XTXs. And then I looked harder on HWINFO64.
There's a number, GPU Power Maximum, which I have seen to go as high as 580W. I know this is not a number that can be used in review (it does not state the time period of such maximum; and if such number is usable it devalues a lot of professional reviews), but this number does make me worry about the transients.
I know RM850 is supposed to be a super high quality unit, but I am pretty sure at this point that the PSU ate too much dust / degraded / was not good enough to start with. Attempts to open up the PSU to clean up only leads to the destruction of a "DANGER HIGH VOLTAGE" sticker, which probably also functions as a "Warranty void if broken" sticker. I failed to get inside it, which may be a good thing. I curse my stupidity .
Other possibilities I can think of include "It is actually a bad card" and "LOL AMD drivers".
If it turns into a "which PSU should I buy" thread, just to be ultra safe I will need your advice anyway.
Last edited: