DDR5 Thermal Testing & Analysis

ir_cow · Jan 2, 2025

Join us to explore the factors that drive DRAM temperatures and their role in system stability after tuning. We go up to DDR5-8000 and uncover key insights to help sidestep common pitfalls and achieve maximum system performance along the way.

Show full review

Baba · Jan 2, 2025

We go up to DDR5-800

*8000

Carlyle2020hs · Jan 2, 2025

So if comparison is the name of the game i´d like to see aftermarket heat spreaders tested to find the best / even tempered one.

Same goes for aftermarket ram-fans and ram watercooling solutions. With a sprinkle of simple/cheap DIY solutions if you don´t mind (zip ties and vhb tape ftw).

That prepares to test the 10 most bought sticks 3x ways to see how much they are beeing held back by their OEM cooling solution (hotspots anyone?).

For that you´d have to come up with an even more robust universal OC testing strategy / procedure. That includes finding out how to make meaningfull IR pictures.

I advise to cooperate with somebody since it would not hurt and it certainly would make the series even more interesting and the procedure, as well as the results, more well-grounded.

Thanks for this interesting article!

arestavo · Jan 2, 2025

Very cool to see testing like this! I've been dabbling a little with my own DDR5 kit, but resetting the CMOS is a bit of a chore with my X670E Steel Legend (nature of the beast by going best bang for the buck for an X670E), so I keep around Buildzoid's AMD 7000 Series Hynix A (and M) Die "every CPU / kit can do this" settings. It did help quite a bit as it's a 96GB 6400 C32 G.Skill kit, and it's great to see how temperatures can have an effect should I decide to push further.

_roman_ · Jan 2, 2025

I wonder if Igor considered the reflection factor for optical thermal measurements.

I trust those thermal couples more as some thermal camera. Proven stuff. When the measurement electronics is a decent one, it should be very accurate to use thermal couples.

-- Very interesting article

ir_cow · Jan 2, 2025

_roman_ said:
I wonder if Igor considered the reflection factor for optical thermal measurements.

That was my thinking too, but my IR FLIR doesn't work anymore. I found out that once the battery died in those phone versions, it cannot be replaced or even powered through the phone.

So without checking myself, I left it out of the article.

Drash · Jan 3, 2025

ir_cow said:
That was my thinking too, but my IR FLIR doesn't work anymore. I found out that once the battery died in those phone versions, it cannot be replaced or even powered through the phone.

So without checking myself, I left it out of the article.

Not FLIR, but modern IR thermometers have a reflectivity adjustment. Project Farm on YT tested a bunch and it didn't make a difference worth a damn.

Braegnok · Jan 3, 2025

BartX Heatshields, BX2 block running chilled water.

Metroid · Jan 3, 2025

anything over 1.4v adds a lot of stress, 1.35v seems the sweet spot.

LateDevonian · Jan 3, 2025

ir_cow said:
Your feedback and suggestions will help guide the next steps in this ongoing investigation of DRAM temperature and the variables that go into optimizing system performance.

This is a great first revisit of an often neglected topic. While I'd +1 for thermal imaging, the important core coverage of temperature distribution across the DIMM and heatspreader, clock, voltage, and app effects is all here.

The main future suggestion I'd make is an open bench, custom loop test with a single 1R DIMM departs from common use. More on application, more useful to most readers data seems likely to come from testing in a typical ATX case (Lancool 207, 216, II, North, Torrent Compact, something like that) with denser DIMM configs such as 2x48 and 4x48 GB. Other variables I see regularly come up for DDR thermals are

1. CPU cooler: AIO with fanless block, AIO with a block fan, setback dual tower (Fuma 3, Royal Knight 120) or single tower with DIMMs in front of the fan, and dual tower with DIMMs under the front fan.
2. Crossflow: top intake fan cooling (if it's not a top exhaust AIO config) and potential for GPU passthrough heating.
3. Lighting: RGB on temperatures versus off versus the non-RGB version of the same DIMMs.

I like the CL and tREFI testing but it seems unclear from the current text what active cooler was used and what tRFC was set to. A related difficulty's stress tools (including also y-cruncher FFTv4 and Prime95 long) lack a benchmark component. So a common miss is all this work we do for stability and thermals rarely gets tied back to the question of whether it's actually worth it functionally as opposed to just for highmarking numbers. IMO y-cruncher timings or other memory intensive benches would be good data towards articulating a value proposition for CL24, 65+k tREFI, and such.

I'm not set up to probe low CL but FWIW it's been my experience extending DDR5's default 3.9 μs tREFI has little effect on real world compute throughput once tRFC's tightened. I'm looking mainly at runtime shifts in working apps that max out dual channel DDR for like eight hours solid. But y-cruncher picks up on this too.

ir_cow said:
With a limited sample size, it is unclear whether this behavior is exclusive to SK Hynix Rev A-Die, a flawed testing methodology or is an expected outcome.

This is interesting. I've pushed M-die tRFC to values low enough I've backed it off after black screens and OCCT errors but not to a clear breaking point and not in single variable testing where instability could be unambiguously attributed to tRFC. I need the rig up for probably somewhere in the range of 42-56 hours of compute this weekend but will try leaning on tRFC more if some slack time opens up.

Enterprise24 · Jan 3, 2025

My previous trident z neo dual ranks b-die needs 1.61v for 4600 mt/s but it'd require sub 33c (room temp 25c) to be fully stable which is not possible with watercooling that shared a heat with 10900kf and 3080 ti. So I decided to create a separate loop just for ram alone with a small ddc pump+tank combo and a single slim 120mm radiator. I think ddr5 is still sensitive to heat especially with max trefi and low trfc so I'll continue with this method for the upcoming 9800x3d build.

ir_cow · Jan 3, 2025

LateDevonian said:
The main future suggestion I'd make is an open bench, custom loop test with a single 1R DIMM departs from common use.

Your saying move away from a single DIMM setup? This was done with a single DIMM to keep the variables limited.

LateDevonian said:
More on application, more useful to most readers data seems likely to come from testing in a typical ATX case (Lancool 207, 216, II, North, Torrent Compact, something like that) with denser DIMM configs such as 2x48 and 4x48 GB. Other variables I see regularly come up for DDR thermals are

1. CPU cooler: AIO with fanless block, AIO with a block fan, setback dual tower (Fuma 3, Royal Knight 120) or single tower with DIMMs in front of the fan, and dual tower with DIMMs under the front fan.
2. Crossflow: top intake fan cooling (if it's not a top exhaust AIO config) and potential for GPU passthrough heating.
3. Lighting: RGB on temperatures versus off versus the non-RGB version of the same DIMMs.

Good ideas. #3 is the easiest. Case airflow is a complicated one though. Like I've gotten my memory to error out just with a Nvidia FE card before because it blows directly onto the memory.

LateDevonian said:
I like the CL and tREFI testing but it seems unclear from the current text what active cooler was used and what tRFC was set to. A related difficulty's stress tools (including also y-cruncher FFTv4 and Prime95 long) lack a benchmark component. So a common miss is all this work we do for stability and thermals rarely gets tied back to the question of whether it's actually worth it functionally as opposed to just for highmarking numbers. IMO y-cruncher timings or other memory intensive benches would be good data towards articulating a value proposition for CL24, 65+k tREFI, and such.

Active cooling is just a fan - will update to mention that. I also don't see the point in using y-cruncher or prime95 over a strictly memory stress test. It yet another factor introduced by putting the CPU into the mix. It can also be offset by just lowering the CPU frequency, negating the "stress" if would add.

The tests in the article were designed / setup to explore the characteristics of the memory itself, not the platform it is used with. Partially why a lower frequency was primary used. Not pushing the limits of the IMC so if errors came out, it was a likely memory related problem. Still lots of things that can be explored like all the other secondaries. That is know changes based on the CPU and motherboard.

LateDevonian said:
I'm not set up to probe low CL but FWIW it's been my experience extending DDR5's default 3.9 μs tREFI has little effect on real world compute throughput once tRFC's tightened. I'm looking mainly at runtime shifts in working apps that max out dual channel DDR for like eight hours solid. But y-cruncher picks up on this too.

This is interesting. I've pushed M-die tRFC to values low enough I've backed it off after black screens and OCCT errors but not to a clear breaking point and not in single variable testing where instability could be unambiguously attributed to tRFC. I need the rig up for probably somewhere in the range of 42-56 hours of compute this weekend but will try leaning on tRFC more if some slack time opens up.

I was at tRFC2 376 tRFCSB 270 for DDR5-5600. Could not trigger a error even at 1.6v. didn't seem to matter if it was 1.25v or higher, that was the lowest it would go to boot. Changing it in windows below this would instantly BSOD or freeze outright.

Still haven't fully explored other factors. But knowing lowest tRFC is tied to frequency, it can still be played with. Higher CAS needs less voltage and to extent frequency x CAS are linked together.

So inclusive. All I found out is at 376-270, that is the lowest it could be stable at for 5600 regardless of the voltage and corresponding CAS linked to the voltage. For two different DIMMs using this specific SK Hynix A die. Larger sample is needed to narrow down if this is abnormal.

UPDATE:
I had some nice in person feedback from a Data Analyst. He pointed out the names of my graphs are incorrectly labeled because its not titled based on X&Y. This does not affect the data and can still be read as is.

Secondly it was assumed that when the graph flatline at the end, this was understood that is was showing equilibrium, ie it will not rise further in temperature due to the thermal dissipation from heat spreader out performing the thermal output of the memory.

Starting temperature is not the reason why one frequency or voltage ends up above or below another. To prove this I will need to make another chart where the temperature starts out at 60+ by using a hair dryer and plot the decline to the same equilibrium as previously shown.

Both will be done after I return from vacation.

progste · Jan 3, 2025

Baba said:
We go up to DDR5-800

*8000

DDR 2.5

JustBenching · Jan 4, 2025

Metroid said:
anything over 1.4v adds a lot of stress, 1.35v seems the sweet spot.

It's not voltage that contributes that much to dim temperatures, it's trefi and trfc. Basically these 2 affect how frequent (TREFI) and how long (TRFC) the timeouts are. By increasing trefi and lowering trfc you basically don't give the dims much time to cool down. Voltage has much less of an impact unless you start pushing something crazy like 1.6+ volts.

ir_cow · Jan 4, 2025

JustBenching said:
It's not voltage that contributes that much to dim temperatures, it's trefi and trfc.

Hmm this would be good to test to explore more. Though from the limited testing 65k, 132k and 256k tREFI all has similar temperature in my tests at 1.5v vs default 7k.

Same goes for when I tried 1.25v, that those 4 were hitting the same temperatures for 1.25v. this would point to that at least at 5600, what your saying about tREFI is not true and voltage is the driver of temp in this example.

What I'm seeing is higher vs low tREFi will error out once it passes a threshold. For example I could run sustain 256k at 1.25v, but not at any higher voltage - because the voltage is lower, this lower temp.

Cannot comment on tRFC for another week as I'm not home to check the raw data I didn't make graphs with.

Metroid · Jan 4, 2025

ir_cow said:
Hmm this would be good to test to explore more. Though from the limited testing 65k, 132k and 256k tREFI all has similar temperature in my tests at 1.5v vs default 7k.

Same goes for when I tried 1.25v, that those 4 were hitting the same temperatures for 1.25v. this would point to that at least at 5600, what your saying about tREFI is not true and voltage is the driver of temp in this example.

What I'm seeing is higher vs low tREFi will error out once it passes a threshold. For example I could run sustain 256k at 1.25v, but not at any higher voltage.

Cannot comment on tRFC for another week as I'm not home to check the raw data I didn't make graphs with.

I would like to know more about it.

mechtech · Jan 4, 2025

"In mid 2024 JEDEC finalized the DDR5-8800 standard"

Does JEDEC still use the 1.100V all the way to this frequency for the standard?

ir_cow · Jan 4, 2025

mechtech said:
"In mid 2024 JEDEC finalized the DDR5-8800 standard"

Does JEDEC still use the 1.100V all the way to this frequency for the standard?

I believe so, but not certain. 8800 CL62 is quite high otherwise

JEDEC Updates DDR5 Specification for Increased Security Against Rowhammer Attacks, New DDR5-8800 Reference Speed

JEDEC Solid State Technology Association, the global leader in standards development for the microelectronics industry, today announced publication of the JESD79-5C DDR5 SDRAM standard. This important update to the JEDEC DDR5 SDRAM standard includes features designed to improve reliability and...

www.techpowerup.com

JustBenching · Jan 4, 2025

ir_cow said:
Hmm this would be good to test to explore more. Though from the limited testing 65k, 132k and 256k tREFI all has similar temperature in my tests at 1.5v vs default 7k.

Same goes for when I tried 1.25v, that those 4 were hitting the same temperatures for 1.25v. this would point to that at least at 5600, what your saying about tREFI is not true and voltage is the driver of temp in this example.

What I'm seeing is higher vs low tREFi will error out once it passes a threshold. For example I could run sustain 256k at 1.25v, but not at any higher voltage - because the voltage is lower, this lower temp.

Cannot comment on tRFC for another week as I'm not home to check the raw data I didn't make graphs with.

That's because you went into diminishing returns territory. Try default TRFC (900 or whatever it is) and then do 10k vs 65k trefi. There Then add in a tightened TRFC with 65k trefi. There should be a huge increase in temperature. Preferably do all that without active cooling.

ir_cow · Jan 4, 2025

JustBenching said:
That's because you went into diminishing returns territory. Try default TRFC (900 or whatever it is) and then do 10k vs 65k trefi. There Then add in a tightened TRFC with 65k trefi. There should be a huge increase in temperature. Preferably do all that without active cooling.

It will be a good follow-up for sure. Though I'm willing to bet the results will be disappointing for one of us since we are at odd here.

Ruru · Jan 4, 2025

1.5 volts? Damm, feels high for even DDR4.

Wirko · Jan 4, 2025

_roman_ said:
I wonder if Igor considered the reflection factor for optical thermal measurements.

I trust those thermal couples more as some thermal camera. Proven stuff. When the measurement electronics is a decent one, it should be very accurate to use thermal couples.

-- Very interesting article

If you measure the black matte plastic surface of a chip package and choose an emissivity factor of 0.90, while ignoring reflections, how much wrong can you be? I don't know how to calculate an estimate but ~20°C error at just ~20°C above ambient seems huge here. The shiny PCB surface is more tricky and the heatsink even more so, but you can't use a formula to account for reflections; you must minimise them.

This article by FLIR says that most flat-finish paints have an emissivity around 0.90. Also, for higher emissivity objects, reflected temperature has less influence. For highly reflective surfaces (heatsink and probably PCB too) it advises to place a piece of black tape over the surface, then measure temperature at that point.

@ir_cow How did you attach the thermocouples to the surface? Did you use a goop of TIM or temporary glue? If you're using only tape to make the sensor touch the surface, I'd say it's insufficient.

mechtech said:
"In mid 2024 JEDEC finalized the DDR5-8800 standard"

Does JEDEC still use the 1.100V all the way to this frequency for the standard?

I'd also say yes, because JEDEC is meant for serious and conservative stuff (read: servers). 8800 MT/s at 1.1 V will probably become possible with MRDIMM where the memory chips will operate at 4400 MT/s and only the multiplexer will work at full data rate.

LateDevonian · Jan 5, 2025

ir_cow said:
You're saying move away from a single DIMM setup?

Depends what you want to cover. If the data I have is anything to go by, a single 1R DIMM'll be the coolest and thus easiest to highmark with. But if perf on memory-liking apps is important, dual DIMM's needed to utilize both DDR channels. And, if 2x48's not enough, then 4x32 or 4x48's necessary. I see the highest temperatures and least airflow response in 2DPC 2R, which is unsurprising as it's the densest config.

For any app, including memory stress, the CPU's in the mix. I understand wanting to minimize its effects but I'm not sure that's helpful to understanding thermal requirements for a build. I don't have an Arrow Lake to test on as yet but Intel's hitting ~120 GB/s of DDR bandwidth for perf broadly comparable to what Zen 5 does at ~70 GB/s. It seems plausible either departure from the 13900K's ~100 GB/s influences DIMM temperatures.

ir_cow said:
It will be a good follow-up for sure.

I'd suggest including auto refresh settings as a control on the tightened values as that's what folks doing EXPO/XMP or just putting up clocks and voltage are going to be running. For example, I tightened the 2x48GB M-die I'm working with from

3.90 μs tREFI, tRFC-tRFC2-tRFCsb 1145-615-531 to
5.85 μs, 480-288-244

The tRFC changes bench several percent higher. The longer tREFI barely increases bandwidth, hardly reduces latency, almost negligibly improves benchmarks, and lowers active power by 2%. It does reduce idle power by ~13% with the tightened tRFC-tRFC2-tRFCsb. I can't measure any difference putting tREFI above 5.85 μs, so there doesn't appear to be functional value in doing the cooling for 20+ μs.

Also, if I tighten tRFC-tRFC2-tRFCsb to 448-244-200 I can boot M-die to an instant BSOD in OCCT.

Cowboystrekk · Jan 5, 2025

Interesting! I like that you tested tREFI and temp!

freeagent · Jan 5, 2025

Ruru said:
1.5 volts? Damm, feels high for even DDR4

Nah 1.6 is ok 24/7 on DDR4 (B-Die)

System Name	My PC
Processor	AMD 9800X3D
Motherboard	MSI MPG B850 Edge TI Wifi
Cooling	Deepcool AK620 White, 4 x 140 PWM case fans
Memory	2 x 16GB Corsair Vengeance 6000MHz C28 EXPO DDR5
Video Card(s)	MSI RX 6900 XT Gaming X Trio
Storage	WD SN7100 2TB, MX500 2TB x 2, 3TB WD Blue
Display(s)	27" curved 165Hz VA 1080p (Gigabyte)
Case	Montech Air 903 Max (white)
Audio Device(s)	Creative X4, Onkyo AVR + Monitor Audio MASS 5.1, GigaByte Aorus G5 headphones, AKG K550 headphones
Power Supply	NZXT C850 ATX3.1
Mouse	Deathadder 2
Keyboard	Xtrfy K4
Software	W11 Pro
Benchmark Scores	TBD

Processor	AMD Ryzen 9 9950X
Motherboard	Asus ROG Crosshair X670E Gene
Cooling	Full Custom Water
Memory	G.SKILL F5-8000J3848F24GX2-TZ5K
Video Card(s)	XFX Mercury 9070 XT OC
Storage	Crucial T700 2TB Gen5 SSD
Display(s)	ASUS ROG Swift OLED PG27UCDM
Case	BC1-V2 Titanium Edition
Audio Device(s)	SteelSeries Arctis GameBuds
Power Supply	SeaSonic Prime SSR-1300TR2
Mouse	Viper V3 Pro
Keyboard	Keychron Q1 Max, Drop + Matt3o MT3 Susuwatari, Gateron Milky Yellow Pro.
Software	Windows 11 Pro 24H2
Benchmark Scores	http://www.3dmark.com/pcm10b/2184577

System Name	Can I run it
Processor	AMD Ryzen 9 7950X3D @ 2200Mhz FCLK (The rest is still tuning)
Motherboard	Gigabyte B650E Aorus Master
Cooling	Thermaltake TH420 V2 White
Memory	KLEVV CRAS V RGB DDR5 48GB (2x24GB)7200 MT/s 34-44-44-84 @ 8000 MT/s 36-49-46-76 1.52V VDD/1.4V VDDQ
Video Card(s)	ASUS Strix RTX 4090 LC OC with two more T30 @ +100mv +150Mhz core +1963Mhz mem (~3045Mhz core)
Storage	990 Pro 4TB (Game) Transcend 220S 1TB (Win) WD 250GB (Linux) Galax 120GB (OC test) Seagate HDD 4TB
Display(s)	Samsung Odyssey OLED G9 49" 5120x1440 240Hz calibrated by X-Rite i1 Display Pro Plus
Case	Coolermaster HAF 700 White with 9x Phanteks T30
Audio Device(s)	Q Acoustics M20 HD speakers with Q Acoustics QB12 subwoofer
Power Supply	Thermaltake PF3 1200W 80+ Platinum
Mouse	Logitech G Pro Wireless
Keyboard	Logitech G913 (GL Linear)
VR HMD	Logitech G923 with Logitech Driving Force Shifter
Software	Windows 11, Ubuntu 24.10

System Name	Mean machine
Processor	AMD 6900HS
Memory	2x16 GB 4800C40
Video Card(s)	AMD Radeon 6700S

Processor	Ryzen 5700x
Motherboard	Gigabyte X570S Aero G R1.1 Bios F7g
Cooling	Noctua NH-C12P SE14 w/ NF-A15 HS-PWM Fan 1500rpm
Memory	Micron DDR4-3200 2x32GB D.S. D.R. (CT2K32G4DFD832A)
Video Card(s)	AMD RX 6800 - Asus Tuf
Storage	Kingston KC3000 1TB & 2TB & 4TB Corsair MP600 Pro LPX
Display(s)	LG 27UL550-W (27" 4k)
Case	Be Quiet Pure Base 600 (no window)
Audio Device(s)	Realtek ALC1220-VB
Power Supply	SuperFlower Leadex V Gold Pro 850W ATX Ver2.52
Mouse	Mionix Naos Pro
Keyboard	Corsair Strafe with browns
Software	W10 22H2 Pro x64

DDR5 Thermal Testing & Analysis

ir_cow

Baba

Carlyle2020hs

arestavo

_roman_

ir_cow

Drash

Braegnok

Metroid

LateDevonian

New Member

Enterprise24

ir_cow

progste

JustBenching

ir_cow

Metroid

mechtech

ir_cow

JEDEC Updates DDR5 Specification for Increased Security Against Rowhammer Attacks, New DDR5-8800 Reference Speed

JustBenching

ir_cow

Ruru

S.T.A.R.S.

Wirko

LateDevonian

New Member

Cowboystrekk

freeagent

Moderator

System Name	4K-gaming / console
Processor	5800X @ PBO +200 / i5-8600K @ 4.6GHz
Motherboard	ROG Crosshair VII Hero / ROG Strix Z370-F
Cooling	Custom loop CPU+GPU / Custom loop CPU
Memory	32GB DDR4-3466 / 16GB DDR4-3600
Video Card(s)	Asus RTX 3080 TUF / Powercolor RX 6700 XT
Storage	3TB SSDs + 3TB / 372GB SSDs + 750GB
Display(s)	4K120 IPS + 4K60 IPS / 1080p projector @ 90"
Case	Corsair 4000D AF White / DeepCool CC560 WH
Audio Device(s)	Sony WH-CH720N / Hecate G1500
Power Supply	EVGA G2 750W / Seasonic FX-750
Mouse	MX518 remake / Ajazz i303 Pro
Keyboard	Roccat Vulcan 121 AIMO / Obinslab Anne 2 Pro
VR HMD	Oculus Rift CV1
Software	Windows 11 Pro / Windows 11 Pro
Benchmark Scores	They run Crysis

Processor	i5-6600K
Motherboard	Asus Z170A
Cooling	some cheap Cooler Master Hyper 103 or similar
Memory	16GB DDR4-2400
Video Card(s)	IGP
Storage	Samsung 850 EVO 250GB
Display(s)	2x Oldell 24" 1920x1200
Case	Bitfenix Nova white windowless non-mesh
Audio Device(s)	E-mu 1212m PCI
Power Supply	Seasonic G-360
Mouse	Logitech Marble trackball, never had a mouse
Keyboard	Key Tronic KT2000, no Win key because 1994
Software	Oldwin

Processor	7800X3D 2x16GB CO
Motherboard	Asrock B650m HDV
Cooling	Peerless Assassin SE
Memory	2x16GB DR A-die@6000c30 tuned
Video Card(s)	Asus 4070 dual OC 2610@915mv
Storage	WD blue 1TB nvme
Display(s)	Lenovo G24-10 144Hz
Case	Corsair D4000 Airflow
Power Supply	EVGA GQ 650W
Software	Windows 10 home 64
Benchmark Scores	Superposition 8k 5267 Aida64 58.5ns

System Name	Step_Sis Rodeo
Processor	AMD R9 9900X @ PBO
Motherboard	Asus Strix X670E -F
Cooling	Thermalright FW PRO 360, 3x TL-H12-X28-S, 3x TL-P12-S
Memory	2x 16GB Lexar Ares @ 6400 30-36-36-68 1.55v
Video Card(s)	Zotac 4070 Ti Trinity OC @ 3045/1500
Storage	WD SN850 1TB, SN850X 2TB, 3x SN770 1TB
Display(s)	LG 50UP7100
Case	Asus ProArt PA602
Audio Device(s)	JBL Bar 700
Power Supply	Seasonic Vertex GX-1000, Monster HDP1800
Mouse	Logitech G502 Hero
Keyboard	Logitech G213
VR HMD	Oculus 3
Software	Yes
Benchmark Scores	Yes