• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD Processor ECC Memory Support: Why So Hinky?

Joined
Jul 30, 2019
Messages
3,398 (1.68/day)
System Name Still not a thread ripper but pretty good.
Processor Ryzen 9 7950x, Thermal Grizzly AM5 Offset Mounting Kit, Thermal Grizzly Extreme Paste
Motherboard ASRock B650 LiveMixer (BIOS/UEFI version P3.08, AGESA 1.2.0.2)
Cooling EK-Quantum Velocity, EK-Quantum Reflection PC-O11, D5 PWM, EK-CoolStream PE 360, XSPC TX360
Memory Micron DDR5-5600 ECC Unbuffered Memory (2 sticks, 64GB, MTC20C2085S1EC56BD1) + JONSBO NF-1
Video Card(s) XFX Radeon RX 5700 & EK-Quantum Vector Radeon RX 5700 +XT & Backplate
Storage Samsung 4TB 980 PRO, 2 x Optane 905p 1.5TB (striped), AMD Radeon RAMDisk
Display(s) 2 x 4K LG 27UL600-W (and HUANUO Dual Monitor Mount)
Case Lian Li PC-O11 Dynamic Black (original model)
Audio Device(s) Corsair Commander Pro for Fans, RGB, & Temp Sensors (x4)
Power Supply Corsair RM750x
Mouse Logitech M575
Keyboard Corsair Strafe RGB MK.2
Software Windows 10 Professional (64bit)
Benchmark Scores RIP Ryzen 9 5950x, ASRock X570 Taichi (v1.06), 128GB Micron DDR4-3200 ECC UDIMM (18ASF4G72AZ-3G2F1)
According to online sources 5 = Single-bit ECC, 6 = Multi-bit ECC

If you motherboard UEFI/BIOS supports error injection MemTest86 pro can be used also it seems they have a piece of DDR4 hardware that can help with that as well. (https://www.memtest86.com/ecc.htm)
 

adata

New Member
Joined
Feb 10, 2025
Messages
1 (1.00/day)
Perhaps others will find this informative:
On a platform with ECC implemented (CPU + Chipset + Motherboard + RAM) this is what can be observed in the Windows Hardware Error Architecture log in the event viewer:
2025-02-09.jpg


There are many kinds of errors that may appear in the WHEA-Logger, but this screenshot is all for correctable memory errors. Ideally, they would not be so frequent. There are numerous possibilities that could be the cause, but the ECC on this particular platform is doing its job preventing it from crashing.

Window Server 2022 Standard (16-core)
ASUS TUF GAMING B550-PRO
AMD Ryzen 5 5600
(4x) NEMIX 16GB DDR4-3200 ECC UDIMM

The most likely reason for this system having such frequent ECC events is that it has all 4 memory slots populated, giving the memory controller the most amount of work to keep them all in sync. Also I have not adjusted anything in BIOS. Everything is on Auto.

On this particular platform, PassMark MemTest86 does not detect any errors during a full 4-pass screening. If it did, I would make the effort to manually reduce the speed, increase the voltage, adjust the CAS timings, etc. I could also switch to 32GB modules, which would probably reduce the frequency of ECC events because it would be less effort for the memory controller to keep only 2 modules in sync instead of 4.

I'm currently fine with it the way it is since it never crashes. I only have to reboot once a month on Patch Tuesday.
 
Joined
Jul 30, 2019
Messages
3,398 (1.68/day)
System Name Still not a thread ripper but pretty good.
Processor Ryzen 9 7950x, Thermal Grizzly AM5 Offset Mounting Kit, Thermal Grizzly Extreme Paste
Motherboard ASRock B650 LiveMixer (BIOS/UEFI version P3.08, AGESA 1.2.0.2)
Cooling EK-Quantum Velocity, EK-Quantum Reflection PC-O11, D5 PWM, EK-CoolStream PE 360, XSPC TX360
Memory Micron DDR5-5600 ECC Unbuffered Memory (2 sticks, 64GB, MTC20C2085S1EC56BD1) + JONSBO NF-1
Video Card(s) XFX Radeon RX 5700 & EK-Quantum Vector Radeon RX 5700 +XT & Backplate
Storage Samsung 4TB 980 PRO, 2 x Optane 905p 1.5TB (striped), AMD Radeon RAMDisk
Display(s) 2 x 4K LG 27UL600-W (and HUANUO Dual Monitor Mount)
Case Lian Li PC-O11 Dynamic Black (original model)
Audio Device(s) Corsair Commander Pro for Fans, RGB, & Temp Sensors (x4)
Power Supply Corsair RM750x
Mouse Logitech M575
Keyboard Corsair Strafe RGB MK.2
Software Windows 10 Professional (64bit)
Benchmark Scores RIP Ryzen 9 5950x, ASRock X570 Taichi (v1.06), 128GB Micron DDR4-3200 ECC UDIMM (18ASF4G72AZ-3G2F1)
Perhaps others will find this informative:
On a platform with ECC implemented (CPU + Chipset + Motherboard + RAM) this is what can be observed in the Windows Hardware Error Architecture log in the event viewer:
View attachment 384141

There are many kinds of errors that may appear in the WHEA-Logger, but this screenshot is all for correctable memory errors. Ideally, they would not be so frequent. There are numerous possibilities that could be the cause, but the ECC on this particular platform is doing its job preventing it from crashing.

Window Server 2022 Standard (16-core)
ASUS TUF GAMING B550-PRO
AMD Ryzen 5 5600
(4x) NEMIX 16GB DDR4-3200 ECC UDIMM

The most likely reason for this system having such frequent ECC events is that it has all 4 memory slots populated, giving the memory controller the most amount of work to keep them all in sync. Also I have not adjusted anything in BIOS. Everything is on Auto.

On this particular platform, PassMark MemTest86 does not detect any errors during a full 4-pass screening. If it did, I would make the effort to manually reduce the speed, increase the voltage, adjust the CAS timings, etc. I could also switch to 32GB modules, which would probably reduce the frequency of ECC events because it would be less effort for the memory controller to keep only 2 modules in sync instead of 4.

I'm currently fine with it the way it is since it never crashes. I only have to reboot once a month on Patch Tuesday.

I could never get 128GB Nemix to work 100% properly regardless of scaling voltages and speeds and ultimately ended up switching to Crucial/Micron instead which worked perfectly in both of my 5950x systems. (see post below, in particular Micron DDR4-3200 ECC UDIMM 18ASF4G72AZ-3G2F1)

You shouldn't settle for it throwing that many errors because either a dimm is going bad or it's on the edge of stability for some reason.
 
Top