• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Still getting WHEA 18 errors all stock

Joined
May 2, 2022
Messages
1,593 (1.80/day)
Location
G-City, UK
System Name AMDWeapon
Processor Ryzen 7 7800X3D
Motherboard X670E MSI Tomahawk WiFi
Cooling Thermalright Peerless Assassin 120 ARGB with Silverstone Air Blazer 2200rpm fans
Memory G-Skill Trident Z Neo RGB 6000 CL30 32GB@EXPO
Video Card(s) Powercolor 7900 GRE Red Devil
Storage Samsung 870 QVO 1TB x 2, Lexar 256 GB, TeamGroup MP44L 2TB, Crucial T700 1TB, Seagate Firecuda 2TB
Display(s) 32" LG UltraGear GN600-B
Case Montech 903 MAX AIR
Audio Device(s) Corsair void wireless/Sennheiser EPOS 670
Power Supply MSI MPG AGF 850 watt gold
Mouse Glorious Model D l Pad GameSir G7 SE
Keyboard Redragon Vara K551P
Software Windows 11 Pro 24H2
Benchmark Scores Fast Enough.
For troubleshooting you can try setting your VSOC to 1.1v then post another screenshot so we can see how your motherboard adjusts the remaining voltages, then do some testing.
VSOC? Is it called something else in bios?
 
Joined
Jul 30, 2019
Messages
3,098 (1.64/day)
System Name Still not a thread ripper but pretty good.
Processor Ryzen 9 7950x, Thermal Grizzly AM5 Offset Mounting Kit, Thermal Grizzly Extreme Paste
Motherboard ASRock B650 LiveMixer (BIOS/UEFI version P3.08, AGESA 1.2.0.2)
Cooling EK-Quantum Velocity, EK-Quantum Reflection PC-O11, D5 PWM, EK-CoolStream PE 360, XSPC TX360
Memory Micron DDR4-5600 ECC Unbuffered Memory (2 sticks, 64GB, MTC20C2085S1EC56BD1) + JONSBO NF-1
Video Card(s) XFX Radeon RX 5700 & EK-Quantum Vector Radeon RX 5700 +XT & Backplate
Storage Samsung 4TB 980 PRO, 2 x Optane 905p 1.5TB (striped), AMD Radeon RAMDisk
Display(s) 2 x 4K LG 27UL600-W (and HUANUO Dual Monitor Mount)
Case Lian Li PC-O11 Dynamic Black (original model)
Audio Device(s) Corsair Commander Pro for Fans, RGB, & Temp Sensors (x4)
Power Supply Corsair RM750x
Mouse Logitech M575
Keyboard Corsair Strafe RGB MK.2
Software Windows 10 Professional (64bit)
Benchmark Scores RIP Ryzen 9 5950x, ASRock X570 Taichi (v1.06), 128GB Micron DDR4-3200 ECC UDIMM (18ASF4G72AZ-3G2F1)
VSOC? Is it called something else in bios?
It might just be called SOC voltage. I don't have an MSI UEFI/BIOS to look at the moment. If I recall MSI gives you live voltage readouts as your looking at UEFI/BIOS. You can use that to help confirm the option you want to change because that voltage would match or be very close to your screenshot for vSOC. Perhaps someone here knows the exact name of the option for MSI. It should be near the other voltage settings for VDDP and VDDG. ( Just DO NOT modify the one that says CHIPSET soc voltage. )

If there is any concern take a pic of your UEFI/BIOS and post it before changing anything.
 
Joined
May 2, 2022
Messages
1,593 (1.80/day)
Location
G-City, UK
System Name AMDWeapon
Processor Ryzen 7 7800X3D
Motherboard X670E MSI Tomahawk WiFi
Cooling Thermalright Peerless Assassin 120 ARGB with Silverstone Air Blazer 2200rpm fans
Memory G-Skill Trident Z Neo RGB 6000 CL30 32GB@EXPO
Video Card(s) Powercolor 7900 GRE Red Devil
Storage Samsung 870 QVO 1TB x 2, Lexar 256 GB, TeamGroup MP44L 2TB, Crucial T700 1TB, Seagate Firecuda 2TB
Display(s) 32" LG UltraGear GN600-B
Case Montech 903 MAX AIR
Audio Device(s) Corsair void wireless/Sennheiser EPOS 670
Power Supply MSI MPG AGF 850 watt gold
Mouse Glorious Model D l Pad GameSir G7 SE
Keyboard Redragon Vara K551P
Software Windows 11 Pro 24H2
Benchmark Scores Fast Enough.
It might just be called SOC voltage. I don't have an MSI UEFI/BIOS to look at the moment. If I recall MSI gives you live voltage readouts as your looking at UEFI/BIOS. You can use that to help confirm the option you want to change because that voltage would match or be very close to your screenshot for vSOC. Perhaps someone here knows the exact name of the option for MSI. It should be near the other voltage settings for VDDP and VDDG. ( Just DO NOT modify the one that says CHIPSET soc voltage. )

If there is any concern take a pic of your UEFI/BIOS and post it before changing anything.
CPU NB SoC is right above what you said. Sound about right? So scared of nuking my computer.
 
Joined
Mar 10, 2009
Messages
190 (0.03/day)
Location
Mexico
System Name HTPC
Processor AMD Ryzen 7 5800X
Motherboard Gigabyte X570S Aorus Master
Cooling BeQuiet Dark Rock Pro 4
Memory Kingston Fury Renegade RGB 32GB Kit (2x16GB) 3600Mhz DDR4 CL16
Video Card(s) Sapphire NITRO+ RX 5700 XT
Storage Acer Predator GM7000 4TB / WD Blue SN570 1TB / TOSHIBA HDWD130 3TB
Display(s) LG Oled 42C3
Case Lian Li O11 Air Mini
Audio Device(s) Creative Labs Sound BlasterX G6
Power Supply EVGA SuperNOVA 850 P2
Mouse Logitech G900
Keyboard SteelSeries Apex Pro
Software Microsoft Windows 11 Pro
CPU NB SoC is right above what you said. Sound about right? So scared of nuking my computer.
Look like thats the one
 
Joined
Jan 29, 2023
Messages
1,296 (2.11/day)
System Name KLM
Processor 7800X3D
Motherboard B-650E-E Strix
Cooling Arctic Cooling III 280
Memory 16x2 Fury Renegade 6000-32
Video Card(s) 4070-ti PNY
Storage 512+512+1+2+2+2+2+6+500+256+4+4+4
Display(s) VA 32" 4K@60 - OLED 27" 2K@240
Case 4000D Airflow
Audio Device(s) Edifier 1280Ts
Power Supply Shift 1000
Mouse 502 Hero
Keyboard K68
Software EMDB
Benchmark Scores 0>1000
That leads to at least one question,

why the hell BIOS don't have one and only name for a setting instead confuse the entire world without reason by letting the dev choosse how to call it ?
 

tabascosauz

Moderator
Supporter
Staff member
Joined
Jun 24, 2015
Messages
8,032 (2.37/day)
Location
Western Canada
System Name ab┃ob
Processor 7800X3D┃5800X3D
Motherboard B650E PG-ITX┃X570 Impact
Cooling NH-U12A + T30┃AXP120-x67
Memory 64GB 6400CL32┃32GB 3600CL14
Video Card(s) RTX 4070 Ti Eagle┃RTX A2000
Storage 8TB of SSDs┃1TB SN550
Case Caselabs S3┃Lazer3D HT5
It's a whea 18 and it say's processor core.


Log Name: System
Source: Microsoft-Windows-WHEA-Logger
Date: 02/07/2024 20:53:45
Event ID: 18
Task Category: None
Level: Error
Keywords:
User: LOCAL SERVICE
Computer: BlaezaLite
Description:
A fatal hardware error has occurred.

Reported by component: Processor Core
Error Source: Machine Check Exception
Error Type: Cache Hierarchy Error
Processor APIC ID: 11

I was playing Xdefiant when this crash happened.

How would I go about lowering the boost clocks to 4400? See if that prevents it?

Does it happen with XMP/DOCP disabled? CHE is usually caused by memory/IF if I am not mistaken.

Cache Hierarchy is not uncore. Cache Hierarchy is cores. I would look at RMA first, unless there's some custom CO settings or Vcore offset in play which it doesn't seem like.

You're thinking of WHEA 19, Bus/interconnect.
 
Joined
May 2, 2022
Messages
1,593 (1.80/day)
Location
G-City, UK
System Name AMDWeapon
Processor Ryzen 7 7800X3D
Motherboard X670E MSI Tomahawk WiFi
Cooling Thermalright Peerless Assassin 120 ARGB with Silverstone Air Blazer 2200rpm fans
Memory G-Skill Trident Z Neo RGB 6000 CL30 32GB@EXPO
Video Card(s) Powercolor 7900 GRE Red Devil
Storage Samsung 870 QVO 1TB x 2, Lexar 256 GB, TeamGroup MP44L 2TB, Crucial T700 1TB, Seagate Firecuda 2TB
Display(s) 32" LG UltraGear GN600-B
Case Montech 903 MAX AIR
Audio Device(s) Corsair void wireless/Sennheiser EPOS 670
Power Supply MSI MPG AGF 850 watt gold
Mouse Glorious Model D l Pad GameSir G7 SE
Keyboard Redragon Vara K551P
Software Windows 11 Pro 24H2
Benchmark Scores Fast Enough.
Cache Hierarchy is not uncore. Cache Hierarchy is cores. I would look at RMA first, unless there's some custom CO settings or Vcore offset in play which it doesn't seem like.

You're thinking of WHEA 19, Bus/interconnect.
Should I try the 1.1V soc setting still? It is totally stock, yes.
 
Joined
Sep 10, 2018
Messages
6,732 (3.04/day)
Location
California
System Name His & Hers
Processor R7 5800X/ R7 7950X3D Stock
Motherboard X670E Aorus Pro X/ROG Crosshair VIII Hero
Cooling Corsair h150 elite/ Corsair h115i Platinum
Memory Trident Z5 Neo 6000/ 32 GB 3200 CL14 @3800 CL16 Team T Force Nighthawk
Video Card(s) Evga FTW 3 Ultra 3080ti/ Gigabyte Gaming OC 4090
Storage lots of SSD.
Display(s) A whole bunch OLED, VA, IPS.....
Case 011 Dynamic XL/ Phanteks Evolv X
Audio Device(s) Arctis Pro + gaming Dac/ Corsair sp 2500/ Logitech G560/Samsung Q990B
Power Supply Seasonic Ultra Prime Titanium 1000w/850w
Mouse Logitech G502 Lightspeed/ Logitech G Pro Hero.
Keyboard Logitech - G915 LIGHTSPEED / Logitech G Pro
Cache Hierarchy is not uncore. Cache Hierarchy is cores. I would look at RMA first, unless there's some custom CO settings or Vcore offset in play which it doesn't seem like.

You're thinking of WHEA 19, Bus/interconnect.

Yeah, that's what I was thinking for sure been a while haven't seen a whea error in years and only when pushing IF. Glad you commented was going to tag you originally but didn't want to bother figured you'd see the post and comment if you wanted to.

Oddly I have read people swapping out corsair ram and fixing this issue though.

I would try to RMA the CPU as well though.
 
Last edited:
Joined
May 2, 2022
Messages
1,593 (1.80/day)
Location
G-City, UK
System Name AMDWeapon
Processor Ryzen 7 7800X3D
Motherboard X670E MSI Tomahawk WiFi
Cooling Thermalright Peerless Assassin 120 ARGB with Silverstone Air Blazer 2200rpm fans
Memory G-Skill Trident Z Neo RGB 6000 CL30 32GB@EXPO
Video Card(s) Powercolor 7900 GRE Red Devil
Storage Samsung 870 QVO 1TB x 2, Lexar 256 GB, TeamGroup MP44L 2TB, Crucial T700 1TB, Seagate Firecuda 2TB
Display(s) 32" LG UltraGear GN600-B
Case Montech 903 MAX AIR
Audio Device(s) Corsair void wireless/Sennheiser EPOS 670
Power Supply MSI MPG AGF 850 watt gold
Mouse Glorious Model D l Pad GameSir G7 SE
Keyboard Redragon Vara K551P
Software Windows 11 Pro 24H2
Benchmark Scores Fast Enough.
1.1V
1.1v.png
 

tabascosauz

Moderator
Supporter
Staff member
Joined
Jun 24, 2015
Messages
8,032 (2.37/day)
Location
Western Canada
System Name ab┃ob
Processor 7800X3D┃5800X3D
Motherboard B650E PG-ITX┃X570 Impact
Cooling NH-U12A + T30┃AXP120-x67
Memory 64GB 6400CL32┃32GB 3600CL14
Video Card(s) RTX 4070 Ti Eagle┃RTX A2000
Storage 8TB of SSDs┃1TB SN550
Case Caselabs S3┃Lazer3D HT5
Should I try the 1.1V soc setting still? It is totally stock, yes.

Not really, because that has nothing to do with cores and this nothing to do with Cache Hierarchy. You sure can, but 1.1V is high as hell in relation to 3200. You really shouldn't even need 1.0V at your settings.
 
Joined
Sep 10, 2018
Messages
6,732 (3.04/day)
Location
California
System Name His & Hers
Processor R7 5800X/ R7 7950X3D Stock
Motherboard X670E Aorus Pro X/ROG Crosshair VIII Hero
Cooling Corsair h150 elite/ Corsair h115i Platinum
Memory Trident Z5 Neo 6000/ 32 GB 3200 CL14 @3800 CL16 Team T Force Nighthawk
Video Card(s) Evga FTW 3 Ultra 3080ti/ Gigabyte Gaming OC 4090
Storage lots of SSD.
Display(s) A whole bunch OLED, VA, IPS.....
Case 011 Dynamic XL/ Phanteks Evolv X
Audio Device(s) Arctis Pro + gaming Dac/ Corsair sp 2500/ Logitech G560/Samsung Q990B
Power Supply Seasonic Ultra Prime Titanium 1000w/850w
Mouse Logitech G502 Lightspeed/ Logitech G Pro Hero.
Keyboard Logitech - G915 LIGHTSPEED / Logitech G Pro


If you were wanting to try another kit this is what I would go with..... I don't think it will fix your issue though just looking around the interwebs the issue can be almost anything related to the CPU

Some people have reported fixing the issue by RMA the CPU, swapping out the ram, and swapping the motherboard.

I would start with the CPU then the Ram and Mobo last personally.

I've done over 100 Ryzen builds and never actually encountered this issue only the interconnect variant when pushing IF.


Maybe pick one of these up just to verify that is the issue.


Return after verifying ofc.
 
Joined
Jul 30, 2019
Messages
3,098 (1.64/day)
System Name Still not a thread ripper but pretty good.
Processor Ryzen 9 7950x, Thermal Grizzly AM5 Offset Mounting Kit, Thermal Grizzly Extreme Paste
Motherboard ASRock B650 LiveMixer (BIOS/UEFI version P3.08, AGESA 1.2.0.2)
Cooling EK-Quantum Velocity, EK-Quantum Reflection PC-O11, D5 PWM, EK-CoolStream PE 360, XSPC TX360
Memory Micron DDR4-5600 ECC Unbuffered Memory (2 sticks, 64GB, MTC20C2085S1EC56BD1) + JONSBO NF-1
Video Card(s) XFX Radeon RX 5700 & EK-Quantum Vector Radeon RX 5700 +XT & Backplate
Storage Samsung 4TB 980 PRO, 2 x Optane 905p 1.5TB (striped), AMD Radeon RAMDisk
Display(s) 2 x 4K LG 27UL600-W (and HUANUO Dual Monitor Mount)
Case Lian Li PC-O11 Dynamic Black (original model)
Audio Device(s) Corsair Commander Pro for Fans, RGB, & Temp Sensors (x4)
Power Supply Corsair RM750x
Mouse Logitech M575
Keyboard Corsair Strafe RGB MK.2
Software Windows 10 Professional (64bit)
Benchmark Scores RIP Ryzen 9 5950x, ASRock X570 Taichi (v1.06), 128GB Micron DDR4-3200 ECC UDIMM (18ASF4G72AZ-3G2F1)
Yeah, that's what I was thinking for sure been a while haven't seen a whea error in years and only when pushing IF. Glad you commented was going to tag you originally but didn't want to bother figured you'd see the post and comment if you wanted to.

Oddly I have read people swapping out corsair ram and fixing this issue though.
I had a really strange case where my 5950x was throwing cache hierarchy errors too (among other errors) but swapping it out into another motherboard it was just fine and the new CPU I swapped in the old motherboard started having it too! So the motherboard might be the issue but I wiped and redid the OS install and now everything works fine! :banghead: I still have more testing to do before ruling out the motherboard as the issue in my case. These issues are no fun to deal with. Amazon flagged B550 Phantom Gaming-ITX/ax as a commonly returned item so maybe there is something up this this motherboard although it was working great for years without an issue.
I would try to RMA the CPU as well though.
Yea RMA.
 
Last edited:
Joined
May 2, 2022
Messages
1,593 (1.80/day)
Location
G-City, UK
System Name AMDWeapon
Processor Ryzen 7 7800X3D
Motherboard X670E MSI Tomahawk WiFi
Cooling Thermalright Peerless Assassin 120 ARGB with Silverstone Air Blazer 2200rpm fans
Memory G-Skill Trident Z Neo RGB 6000 CL30 32GB@EXPO
Video Card(s) Powercolor 7900 GRE Red Devil
Storage Samsung 870 QVO 1TB x 2, Lexar 256 GB, TeamGroup MP44L 2TB, Crucial T700 1TB, Seagate Firecuda 2TB
Display(s) 32" LG UltraGear GN600-B
Case Montech 903 MAX AIR
Audio Device(s) Corsair void wireless/Sennheiser EPOS 670
Power Supply MSI MPG AGF 850 watt gold
Mouse Glorious Model D l Pad GameSir G7 SE
Keyboard Redragon Vara K551P
Software Windows 11 Pro 24H2
Benchmark Scores Fast Enough.
I'll try that ram first, the G-Skill. Then I'll try my old r5 3600. Then I'll have to magic money out my ass for a mobo, lol.

RMAing will be difficult as I am agoraphobic.
 
Joined
Jul 30, 2019
Messages
3,098 (1.64/day)
System Name Still not a thread ripper but pretty good.
Processor Ryzen 9 7950x, Thermal Grizzly AM5 Offset Mounting Kit, Thermal Grizzly Extreme Paste
Motherboard ASRock B650 LiveMixer (BIOS/UEFI version P3.08, AGESA 1.2.0.2)
Cooling EK-Quantum Velocity, EK-Quantum Reflection PC-O11, D5 PWM, EK-CoolStream PE 360, XSPC TX360
Memory Micron DDR4-5600 ECC Unbuffered Memory (2 sticks, 64GB, MTC20C2085S1EC56BD1) + JONSBO NF-1
Video Card(s) XFX Radeon RX 5700 & EK-Quantum Vector Radeon RX 5700 +XT & Backplate
Storage Samsung 4TB 980 PRO, 2 x Optane 905p 1.5TB (striped), AMD Radeon RAMDisk
Display(s) 2 x 4K LG 27UL600-W (and HUANUO Dual Monitor Mount)
Case Lian Li PC-O11 Dynamic Black (original model)
Audio Device(s) Corsair Commander Pro for Fans, RGB, & Temp Sensors (x4)
Power Supply Corsair RM750x
Mouse Logitech M575
Keyboard Corsair Strafe RGB MK.2
Software Windows 10 Professional (64bit)
Benchmark Scores RIP Ryzen 9 5950x, ASRock X570 Taichi (v1.06), 128GB Micron DDR4-3200 ECC UDIMM (18ASF4G72AZ-3G2F1)
I'll try that ram first, the G-Skill. Then I'll try my old r5 3600. Then I'll have to magic money out my ass for a mobo, lol.

RMAing will be difficult as I am agoraphobic.
If you find the 3600 works perfectly then your motherboard is probably just fine. Before you spend any money you might just try the 3600 first.
 
Joined
May 2, 2022
Messages
1,593 (1.80/day)
Location
G-City, UK
System Name AMDWeapon
Processor Ryzen 7 7800X3D
Motherboard X670E MSI Tomahawk WiFi
Cooling Thermalright Peerless Assassin 120 ARGB with Silverstone Air Blazer 2200rpm fans
Memory G-Skill Trident Z Neo RGB 6000 CL30 32GB@EXPO
Video Card(s) Powercolor 7900 GRE Red Devil
Storage Samsung 870 QVO 1TB x 2, Lexar 256 GB, TeamGroup MP44L 2TB, Crucial T700 1TB, Seagate Firecuda 2TB
Display(s) 32" LG UltraGear GN600-B
Case Montech 903 MAX AIR
Audio Device(s) Corsair void wireless/Sennheiser EPOS 670
Power Supply MSI MPG AGF 850 watt gold
Mouse Glorious Model D l Pad GameSir G7 SE
Keyboard Redragon Vara K551P
Software Windows 11 Pro 24H2
Benchmark Scores Fast Enough.
If you find the 3600 works perfectly then your motherboard is probably just fine. Before you spend any money you might just try the 3600 first.
I'm getting the ram anyway. But next will be trying the 3600.
 

tabascosauz

Moderator
Supporter
Staff member
Joined
Jun 24, 2015
Messages
8,032 (2.37/day)
Location
Western Canada
System Name ab┃ob
Processor 7800X3D┃5800X3D
Motherboard B650E PG-ITX┃X570 Impact
Cooling NH-U12A + T30┃AXP120-x67
Memory 64GB 6400CL32┃32GB 3600CL14
Video Card(s) RTX 4070 Ti Eagle┃RTX A2000
Storage 8TB of SSDs┃1TB SN550
Case Caselabs S3┃Lazer3D HT5
I had a really strange case where my 5950x was throwing cache hierarchy errors too (among other errors) but swapping it out into another motherboard it was just fine and the new CPU I swapped in the old motherboard started having it too! So the motherboard might be the issue but I wiped and redid the OS install and now everything works fine! :banghead: I still have more testing to do before ruling out the motherboard as the issue in my case. These issues are no fun to deal with. Amazon flagged B550 Phantom Gaming-ITX/ax as a commonly returned item so maybe there is something up this this motherboard although it was working great for years without an issue.

Yea RMA.

Yes, it can be cores defect (RMA only, really) but equally possible the motherboard is pushing bad Vcore settings esp. at idle.

I'll try that ram first, the G-Skill. Then I'll try my old r5 3600. Then I'll have to magic money out my ass for a mobo, lol.

RMAing will be difficult as I am agoraphobic.

Unless that also prevents you from going to the post office, I don't see how that prevents RMA?

Maybe things have changed in the last 2 years. But their depot was in Miami and you can ship via any carrier you want.
 
Joined
Jan 1, 2012
Messages
272 (0.06/day)
These issues are no fun to deal with.
Understatement of the year.

What might be my biggest heartache with PCs in about fifteen to twenty years specifically came from troubleshooting this exact issue around half a year ago (WHEA error 18, "cache hierarchy" type). During all of my research, I found basically everything suggested as a possible resolution. Some had success with changing the CPU, some with RAM, some with the motherboard, some with the PSU, and some with the video card.

The last might seem like the least likely given the error type. I also presumed it was also a CPU or maybe platform side issue (RAM or maybe even motherboard/power related). But in my case, the error showed up when changing the video card. And it only happened when the GPU was under light-medium or higher loads. It would be be working fine, only to lose video signal, and then some seconds later it would restart (power was never off in the entire process so the PC never shut down but simply restarted). No BSODs at all. Just event viewer logs (ID 18) and Windows WHEA logs (generic 0x124 for WHEA, and paired with them, four different video related Watchdog logs).

I was lost as to whether the video errors were happening as a result of whatever the real issue was, or the other way around. The big clue was it started after changing the video card, but I wanted to rule out what may have been a possible hidden issue with my existing hardware that the video card change somehow brought about, before sending back a possibly good video card.

I tried just about everything under the sun from updating BIOS multiple times, changing no end of BIOS settings, changing driver versions, even reinstalling Windows. No change from any of it.

I tried swapping the CPU from a 5800X3D to my old 3700X to see if it was a CPU-side issue. No change.

I tried running with half the RAM (unfortunately, I didn't have different RAM to test). No change.

I tried disabling RAM profile speeds and running at default. It got worse somehow. Things that didn't cause the restart before (League of Legends for example), now did. Things that did cause the restart, now did so more often. This part in particular made me cry because this suggested a platform-side issue, but the issue didn't exist on my old video card so I gave up platform troubleshooting for the time being and finally gave in to doing an RMA on the new video card since that where it started. I figured if the issue remained with an RMA (or if Sapphire said they found no issue), then I'd continue efforts on the platform at least knowing the GPU "should be" fine. These "Black screen restarts" were apparently very common on the 7800 XT from feedback online anyway despite the card being a mere two months old at the time, so... maybe there was something to it being the cause of all this afterall?

After the RMA, it was "almost entirely" resolved. I've had one reproducible case that still causes it (a particular version of Minecraft, and a modded one at that) and it seems... weird a game would cause a machine check exception if the hardware wasn't faulty to begin with, but... everything else that was doing it stopped. I'm still fearful the issue may return one day, but so far it hasn't. To say I'm still clueless as to what went on is an understatement.

So my main question to the thread starter would be this; have you changed anything recently?

If not, do you have spare parts you can swap as a test? In my case, this helped me rule things out. I swapped everything but the PSU (no spare), RAM (no spare), and motherboard (I did have my old one as a spare, which as an RMA return itself due to the original having a separate issue, but it'd be time and effort intensive to test that one so I relegated it to after trying the video card RMA since that's where the issue showed up).
 
Joined
May 2, 2022
Messages
1,593 (1.80/day)
Location
G-City, UK
System Name AMDWeapon
Processor Ryzen 7 7800X3D
Motherboard X670E MSI Tomahawk WiFi
Cooling Thermalright Peerless Assassin 120 ARGB with Silverstone Air Blazer 2200rpm fans
Memory G-Skill Trident Z Neo RGB 6000 CL30 32GB@EXPO
Video Card(s) Powercolor 7900 GRE Red Devil
Storage Samsung 870 QVO 1TB x 2, Lexar 256 GB, TeamGroup MP44L 2TB, Crucial T700 1TB, Seagate Firecuda 2TB
Display(s) 32" LG UltraGear GN600-B
Case Montech 903 MAX AIR
Audio Device(s) Corsair void wireless/Sennheiser EPOS 670
Power Supply MSI MPG AGF 850 watt gold
Mouse Glorious Model D l Pad GameSir G7 SE
Keyboard Redragon Vara K551P
Software Windows 11 Pro 24H2
Benchmark Scores Fast Enough.
Yes, it can be cores defect (RMA only, really) but equally possible the motherboard is pushing bad Vcore settings esp. at idle.



Unless that also prevents you from going to the post office, I don't see how that prevents RMA?

Maybe things have changed in the last 2 years. But their depot was in Miami and you can ship via any carrier you want.
I can go across the road to a corner shop and I'm a complete wreck mentally and emotionally when I get home, maybe 20 meters away.
Understatement of the year.

What might be my biggest heartache with PCs in about fifteen to twenty years specifically came from troubleshooting this exact issue around half a year ago (WHEA error 18, "cache hierarchy" type). During all of my research, I found basically everything suggested as a possible resolution. Some had success with changing the CPU, some with RAM, some with the motherboard, some with the PSU, and some with the video card.

The last might seem like the least likely given the error type. I also presumed it was also a CPU or maybe platform side issue (RAM or maybe even motherboard/power related). But in my case, the error showed up when changing the video card. And it only happened when the GPU was under light-medium or higher loads. It would be be working fine, only to lose video signal, and then some seconds later it would restart (power was never off in the entire process so the PC never shut down but simply restarted). No BSODs at all. Just event viewer logs (ID 18) and Windows WHEA logs (generic 0x124 for WHEA, and paired with them, four different video related Watchdog logs).

I was lost as to whether the video errors were happening as a result of whatever the real issue was, or the other way around. The big clue was it started after changing the video card, but I wanted to rule out what may have been a possible hidden issue with my existing hardware that the video card change somehow brought about, before sending back a possibly good video card.

I tried just about everything under the sun from updating BIOS multiple times, changing no end of BIOS settings, changing driver versions, even reinstalling Windows. No change from any of it.

I tried swapping the CPU from a 5800X3D to my old 3700X to see if it was a CPU-side issue. No change.

I tried running with half the RAM (unfortunately, I didn't have different RAM to test). No change.

I tried disabling RAM profile speeds and running at default. It got worse somehow. Things that didn't cause the restart before (League of Legends for example), now did. Things that did cause the restart, now did so more often. This part in particular made me cry because this suggested a platform-side issue, but the issue didn't exist on my old video card so I gave up platform troubleshooting for the time being and finally gave in to doing an RMA on the new video card since that where it started. I figured if the issue remained with an RMA (or if Sapphire said they found no issue), then I'd continue efforts on the platform at least knowing the GPU "should be" fine. These "Black screen restarts" were apparently very common on the 7800 XT from feedback online anyway despite the card being a mere two months old at the time, so... maybe there was something to it being the cause of all this afterall?

After the RMA, it was "almost entirely" resolved. I've had one reproducible case that still causes it (a particular version of Minecraft, and a modded one at that) and it seems... weird a game would cause a machine check exception if the hardware wasn't faulty to begin with, but... everything else that was doing it stopped. I'm still fearful the issue may return one day, but so far it hasn't. To say I'm still clueless as to what went on is an understatement.

So my main question to the thread starter would be this; have you changed anything recently?

If not, do you have spare parts you can swap as a test? In my case, this helped me rule things out. I swapped everything but the PSU (no spare), RAM (no spare), and motherboard (I did have my old one as a spare, which as an RMA return itself due to the original having a separate issue, but it'd be time and effort intensive to test that one so I relegated it to after trying the video card RMA since that's where the issue showed up).
I've changed nothing apart from a nvme drive, which I added about a month ago.
 
Joined
Jul 30, 2019
Messages
3,098 (1.64/day)
System Name Still not a thread ripper but pretty good.
Processor Ryzen 9 7950x, Thermal Grizzly AM5 Offset Mounting Kit, Thermal Grizzly Extreme Paste
Motherboard ASRock B650 LiveMixer (BIOS/UEFI version P3.08, AGESA 1.2.0.2)
Cooling EK-Quantum Velocity, EK-Quantum Reflection PC-O11, D5 PWM, EK-CoolStream PE 360, XSPC TX360
Memory Micron DDR4-5600 ECC Unbuffered Memory (2 sticks, 64GB, MTC20C2085S1EC56BD1) + JONSBO NF-1
Video Card(s) XFX Radeon RX 5700 & EK-Quantum Vector Radeon RX 5700 +XT & Backplate
Storage Samsung 4TB 980 PRO, 2 x Optane 905p 1.5TB (striped), AMD Radeon RAMDisk
Display(s) 2 x 4K LG 27UL600-W (and HUANUO Dual Monitor Mount)
Case Lian Li PC-O11 Dynamic Black (original model)
Audio Device(s) Corsair Commander Pro for Fans, RGB, & Temp Sensors (x4)
Power Supply Corsair RM750x
Mouse Logitech M575
Keyboard Corsair Strafe RGB MK.2
Software Windows 10 Professional (64bit)
Benchmark Scores RIP Ryzen 9 5950x, ASRock X570 Taichi (v1.06), 128GB Micron DDR4-3200 ECC UDIMM (18ASF4G72AZ-3G2F1)
Yes, it can be cores defect (RMA only, really) but equally possible the motherboard is pushing bad Vcore settings esp. at idle.
If the motherboard was pushing bad Vcore would increasing CPU LLC possibly take care of that issue?
 
Joined
May 2, 2022
Messages
1,593 (1.80/day)
Location
G-City, UK
System Name AMDWeapon
Processor Ryzen 7 7800X3D
Motherboard X670E MSI Tomahawk WiFi
Cooling Thermalright Peerless Assassin 120 ARGB with Silverstone Air Blazer 2200rpm fans
Memory G-Skill Trident Z Neo RGB 6000 CL30 32GB@EXPO
Video Card(s) Powercolor 7900 GRE Red Devil
Storage Samsung 870 QVO 1TB x 2, Lexar 256 GB, TeamGroup MP44L 2TB, Crucial T700 1TB, Seagate Firecuda 2TB
Display(s) 32" LG UltraGear GN600-B
Case Montech 903 MAX AIR
Audio Device(s) Corsair void wireless/Sennheiser EPOS 670
Power Supply MSI MPG AGF 850 watt gold
Mouse Glorious Model D l Pad GameSir G7 SE
Keyboard Redragon Vara K551P
Software Windows 11 Pro 24H2
Benchmark Scores Fast Enough.
Getting MSI B550 Tomahawk and 5700X3D end of the month. Problem will be fixed.
 
Joined
Sep 10, 2018
Messages
6,732 (3.04/day)
Location
California
System Name His & Hers
Processor R7 5800X/ R7 7950X3D Stock
Motherboard X670E Aorus Pro X/ROG Crosshair VIII Hero
Cooling Corsair h150 elite/ Corsair h115i Platinum
Memory Trident Z5 Neo 6000/ 32 GB 3200 CL14 @3800 CL16 Team T Force Nighthawk
Video Card(s) Evga FTW 3 Ultra 3080ti/ Gigabyte Gaming OC 4090
Storage lots of SSD.
Display(s) A whole bunch OLED, VA, IPS.....
Case 011 Dynamic XL/ Phanteks Evolv X
Audio Device(s) Arctis Pro + gaming Dac/ Corsair sp 2500/ Logitech G560/Samsung Q990B
Power Supply Seasonic Ultra Prime Titanium 1000w/850w
Mouse Logitech G502 Lightspeed/ Logitech G Pro Hero.
Keyboard Logitech - G915 LIGHTSPEED / Logitech G Pro
If you can do a B650/7600 combo it would give your platform way more legs. Unless the AM4 system is way cheaper it isn't worth investing in.

Screenshot (1).png
 
Last edited:
Joined
Jan 1, 2012
Messages
272 (0.06/day)
I've changed nothing apart from a nvme drive, which I added about a month ago.
If nothing (major) changed, then I would personally start troubleshooting attempts in the area of the platform parts. So, the CPU, motherboard, and RAM namely. And it might not be a bad part, but a part that is merely faulting as a lack of power somewhere along the frequency/voltage curve?

Odd question, but the title says this is at stock; what happens if you apply the RAM profile? I ask because when I was having the issue with my first graphics card, I found it was less stable at stock (but still unstable at RAM profile speeds). If stock and profile speeds behave differently, it might give you a clue as to where to focus your troubleshooting efforts?

Also, look at the "Processor APIC ID" number in all of your event logs. Is this number the same, or is it always different? What the number indicates is "this is the CPU core where the machine check exception was caught". That doesn't mean the issue originates from the CPU though. If the number is sometimes different, it won't help much, but if it's the same, it might strongly indicate a particular core (or thread) is unstable. That could be because it's actually going bad, or just lacking voltage maybe.
 
Joined
May 2, 2022
Messages
1,593 (1.80/day)
Location
G-City, UK
System Name AMDWeapon
Processor Ryzen 7 7800X3D
Motherboard X670E MSI Tomahawk WiFi
Cooling Thermalright Peerless Assassin 120 ARGB with Silverstone Air Blazer 2200rpm fans
Memory G-Skill Trident Z Neo RGB 6000 CL30 32GB@EXPO
Video Card(s) Powercolor 7900 GRE Red Devil
Storage Samsung 870 QVO 1TB x 2, Lexar 256 GB, TeamGroup MP44L 2TB, Crucial T700 1TB, Seagate Firecuda 2TB
Display(s) 32" LG UltraGear GN600-B
Case Montech 903 MAX AIR
Audio Device(s) Corsair void wireless/Sennheiser EPOS 670
Power Supply MSI MPG AGF 850 watt gold
Mouse Glorious Model D l Pad GameSir G7 SE
Keyboard Redragon Vara K551P
Software Windows 11 Pro 24H2
Benchmark Scores Fast Enough.
If you can do a B650/7600 combo it would give your platform way more legs. Unless the AM4 system is way cheaper it isn't worth investing
My limited budget has decided for me.
If nothing (major) changed, then I would personally start troubleshooting attempts in the area of the platform parts. So, the CPU, motherboard, and RAM namely. And it might not be a bad part, but a part that is merely faulting as a lack of power somewhere along the frequency/voltage curve?

Odd question, but the title says this is at stock; what happens if you apply the RAM profile? I ask because when I was having the issue with my first graphics card, I found it was less stable at stock (but still unstable at RAM profile speeds). If stock and profile speeds behave differently, it might give you a clue as to where to focus your troubleshooting efforts?

Also, look at the "Processor APIC ID" number in all of your event logs. Is this number the same, or is it always different? What the number indicates is "this is the CPU core where the machine check exception was caught". That doesn't mean the issue originates from the CPU though. If the number is sometimes different, it won't help much, but if it's the same, it might strongly indicate a particular core (or thread) is unstable. That could be because it's actually going bad, or just lacking voltage maybe.
The apics are 0, 3, 8 and 11. I can use xmp all day and be fine playing games. Then once it does this I panic and come here for help.
 
Joined
Sep 10, 2018
Messages
6,732 (3.04/day)
Location
California
System Name His & Hers
Processor R7 5800X/ R7 7950X3D Stock
Motherboard X670E Aorus Pro X/ROG Crosshair VIII Hero
Cooling Corsair h150 elite/ Corsair h115i Platinum
Memory Trident Z5 Neo 6000/ 32 GB 3200 CL14 @3800 CL16 Team T Force Nighthawk
Video Card(s) Evga FTW 3 Ultra 3080ti/ Gigabyte Gaming OC 4090
Storage lots of SSD.
Display(s) A whole bunch OLED, VA, IPS.....
Case 011 Dynamic XL/ Phanteks Evolv X
Audio Device(s) Arctis Pro + gaming Dac/ Corsair sp 2500/ Logitech G560/Samsung Q990B
Power Supply Seasonic Ultra Prime Titanium 1000w/850w
Mouse Logitech G502 Lightspeed/ Logitech G Pro Hero.
Keyboard Logitech - G915 LIGHTSPEED / Logitech G Pro
Joined
May 2, 2022
Messages
1,593 (1.80/day)
Location
G-City, UK
System Name AMDWeapon
Processor Ryzen 7 7800X3D
Motherboard X670E MSI Tomahawk WiFi
Cooling Thermalright Peerless Assassin 120 ARGB with Silverstone Air Blazer 2200rpm fans
Memory G-Skill Trident Z Neo RGB 6000 CL30 32GB@EXPO
Video Card(s) Powercolor 7900 GRE Red Devil
Storage Samsung 870 QVO 1TB x 2, Lexar 256 GB, TeamGroup MP44L 2TB, Crucial T700 1TB, Seagate Firecuda 2TB
Display(s) 32" LG UltraGear GN600-B
Case Montech 903 MAX AIR
Audio Device(s) Corsair void wireless/Sennheiser EPOS 670
Power Supply MSI MPG AGF 850 watt gold
Mouse Glorious Model D l Pad GameSir G7 SE
Keyboard Redragon Vara K551P
Software Windows 11 Pro 24H2
Benchmark Scores Fast Enough.
Only mentioned it because out here they are about the same price +/- 10%
It's not much more £100 or so but I don't have that. Thanks though!
 
Top