• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Dual-CCD Ryzen 5 5600X and Ryzen 7 5800X In the Wild

Status
Not open for further replies.
Joined
Jan 18, 2020
Messages
750 (0.45/day)
Interesting find by Igor, there are other weirdies on some of the older Ryzens as well.

E.g my generation one Ryzen 5 1600 with dual channel 2666mhz ram benches at 50GB/s memory transfer rate ~ in AIDA and shows higher than expected bench in others too. Only when I overclock it on the fly using the original version of Ryzen Master. That should be impossible, theoretical max bandwidth with only 2 channels of this ram is 21.3 x 2=42.6GB.

I had wondered if there was either a bug in the benches or if that version of Ryzen Master accidentally has some code AMD should have taken out of it...
 
Joined
Jan 3, 2021
Messages
3,106 (2.34/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
IIRC it has been said on-die CCX-to-CCX communication basically goes through IOD.
Given that the latencies are around 80 ns, it doesn't go just through IOD, it goes through DRAM. Well, not exactly "through" but at least it looks like the data needs to be written to main memory before it can be transferred from one CCX to another (Ryzen 2) or from one CCD to another (Ryzen 3).
 
Joined
May 2, 2017
Messages
7,762 (2.91/day)
Location
Back in Norway
System Name Hotbox
Processor AMD Ryzen 7 5800X, 110/95/110, PBO +150Mhz, CO -7,-7,-20(x6),
Motherboard ASRock Phantom Gaming B550 ITX/ax
Cooling LOBO + Laing DDC 1T Plus PWM + Corsair XR5 280mm + 2x Arctic P14
Memory 32GB G.Skill FlareX 3200c14 @3800c15
Video Card(s) PowerColor Radeon 6900XT Liquid Devil Ultimate, UC@2250MHz max @~200W
Storage 2TB Adata SX8200 Pro
Display(s) Dell U2711 main, AOC 24P2C secondary
Case SSUPD Meshlicious
Audio Device(s) Optoma Nuforce μDAC 3
Power Supply Corsair SF750 Platinum
Mouse Logitech G603
Keyboard Keychron K3/Cooler Master MasterKeys Pro M w/DSA profile caps
Software Windows 10 Pro
Given that the latencies are around 80 ns, it doesn't go just through IOD, it goes through DRAM. Well, not exactly "through" but at least it looks like the data needs to be written to main memory before it can be transferred from one CCX to another (Ryzen 2) or from one CCD to another (Ryzen 3).
I very sincerely doubt that - that would be an extreme bottleneck, and would likely cause some serious power draw issues too. Besides, wouldn't that cause a lot more latency than plain DRAM latency, seeing how it would essentially be at least two operations (one write, one read)?
 
Joined
Jan 3, 2021
Messages
3,106 (2.34/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
I very sincerely doubt that - that would be an extreme bottleneck, and would likely cause some serious power draw issues too. Besides, wouldn't that cause a lot more latency than plain DRAM latency, seeing how it would essentially be at least two operations (one write, one read)?
Sure it is an extreme bottleneck, one that core-to-core latency tests are designed to create and measure. No, I don't think data is read from DRAM immediately after being written to it. But I do think that when two sections of L3 cache on different CCDs need to be synchronised, this can't be done without also writing the changed data to DRAM. This results in an 80 ns latency.
 
Joined
May 2, 2017
Messages
7,762 (2.91/day)
Location
Back in Norway
System Name Hotbox
Processor AMD Ryzen 7 5800X, 110/95/110, PBO +150Mhz, CO -7,-7,-20(x6),
Motherboard ASRock Phantom Gaming B550 ITX/ax
Cooling LOBO + Laing DDC 1T Plus PWM + Corsair XR5 280mm + 2x Arctic P14
Memory 32GB G.Skill FlareX 3200c14 @3800c15
Video Card(s) PowerColor Radeon 6900XT Liquid Devil Ultimate, UC@2250MHz max @~200W
Storage 2TB Adata SX8200 Pro
Display(s) Dell U2711 main, AOC 24P2C secondary
Case SSUPD Meshlicious
Audio Device(s) Optoma Nuforce μDAC 3
Power Supply Corsair SF750 Platinum
Mouse Logitech G603
Keyboard Keychron K3/Cooler Master MasterKeys Pro M w/DSA profile caps
Software Windows 10 Pro
Sure it is an extreme bottleneck, one that core-to-core latency tests are designed to create and measure. No, I don't think data is read from DRAM immediately after being written to it. But I do think that when two sections of L3 cache on different CCDs need to be synchronised, this can't be done without also writing the changed data to DRAM. This results in an 80 ns latency.
Hm. Given that the L3 is non-inclusive that would make sense, but would that even be a part of the latency measurement? If data needs to be ejected from the L3 and into DRAM, wouldn't that need to happen before the CCX-to-CCX measurement can even take place?
 
Joined
Jan 3, 2021
Messages
3,106 (2.34/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
I'm hoping that someone with greater knowledge joins in here.

Inclusive vs. exclusive describes the relation between various levels of cache, not between parts of L3 that reside on separate dies, so it shouldn't matter here.
This is how a core-to-core latency measurement is done, I suppose:
1. A thread running on core A writes some data to memory location X
2. A thread running on core B reads data from memory location X a couple nanoseconds later
3. The control program measures how long core B has waited before it got the data
4. Everything is repeated many times to make sure all write buffers are overflown etc. At the end, some statistical calculations are performed.
Now if A and B are on separate dies, X needs to be copied from A's caches to B's caches (L3 may not even be involved here). To do this, it't not absolutely necessary to write X back to DRAM; however, that's what a Ryzen with two CCDs does, and that makes core-to-core latency as high as DRAM latency. This sounds bad but it probably allows the cache sync logic to be less complex and faster.

Some more info is here (for Ryzen 1), the most interesting bit is "This means that the true LLC for the entire chip is actually DRAM".
 
Joined
May 2, 2017
Messages
7,762 (2.91/day)
Location
Back in Norway
System Name Hotbox
Processor AMD Ryzen 7 5800X, 110/95/110, PBO +150Mhz, CO -7,-7,-20(x6),
Motherboard ASRock Phantom Gaming B550 ITX/ax
Cooling LOBO + Laing DDC 1T Plus PWM + Corsair XR5 280mm + 2x Arctic P14
Memory 32GB G.Skill FlareX 3200c14 @3800c15
Video Card(s) PowerColor Radeon 6900XT Liquid Devil Ultimate, UC@2250MHz max @~200W
Storage 2TB Adata SX8200 Pro
Display(s) Dell U2711 main, AOC 24P2C secondary
Case SSUPD Meshlicious
Audio Device(s) Optoma Nuforce μDAC 3
Power Supply Corsair SF750 Platinum
Mouse Logitech G603
Keyboard Keychron K3/Cooler Master MasterKeys Pro M w/DSA profile caps
Software Windows 10 Pro
I'm hoping that someone with greater knowledge joins in here.

Inclusive vs. exclusive describes the relation between various levels of cache, not between parts of L3 that reside on separate dies, so it shouldn't matter here.
This is how a core-to-core latency measurement is done, I suppose:
1. A thread running on core A writes some data to memory location X
2. A thread running on core B reads data from memory location X a couple nanoseconds later
3. The control program measures how long core B has waited before it got the data
4. Everything is repeated many times to make sure all write buffers are overflown etc. At the end, some statistical calculations are performed.
Now if A and B are on separate dies, X needs to be copied from A's caches to B's caches (L3 may not even be involved here). To do this, it't not absolutely necessary to write X back to DRAM; however, that's what a Ryzen with two CCDs does, and that makes core-to-core latency as high as DRAM latency. This sounds bad but it probably allows the cache sync logic to be less complex and faster.

Some more info is here (for Ryzen 1), the most interesting bit is "This means that the true LLC for the entire chip is actually DRAM".
I'm by no means an expert here, but I don't think this is right. I think it's important to include the entirety of that sentence you quoted at the end:
This means that the true LLC for the entire chip is actually DRAM, although AMD states that the two CCXes can communicate with each other through the custom fabric which connects both the complexes, the memory controller, the IO, the PCIe lanes etc.
What I take from that: "true LLC" for AnandTech is a cache that is equally available/shared across all compute resources (in this case: CCXes). Since the L3 is bound to its CCX, that makes DRAM the "true LLC" in this sense.

As for the above, I don't think your reasoning towards the end is right - that still feels like that would add far more latency than just DRAM latency, as there are more steps involved in a core-to-core measurement than in a straight DRAM access. The part of the sentence from the 1800X review also supports this, by saying that the two CCXes can communicate (implied: directly) through IF, bypassing memory. It's clear that cores on the same CCX can access each other through the L3, so I would assume the access pattern for intra-CCX access to be something like core-L3-IF-[some sort of management system on the IOD]-IF-L3-core.

But I entirely agree on your first point, I hope someone with greater knowledge could jump in and explain this to us :)
 

RdGBoY

New Member
Joined
Feb 14, 2021
Messages
1 (0.00/day)
i have a 3600 and show "Active CCD#2"
 

Attachments

  • Sem título.png
    Sem título.png
    129.8 KB · Views: 125
Joined
Aug 26, 2016
Messages
5 (0.00/day)
Probably not ,start a thread for your issue , if the other CCD is dead ,Lazer cut or removed via other means it's still not there to the system and can't get involved in issues.
Although this isn't impossible it's unlikely.
Actually, It is the reason why I cannot see my cores just like other 2 CCD users of 5600x and 5800x - This will be fixed apparently this month in new version of Ryzen master.
 

TheChester

New Member
Joined
Feb 17, 2021
Messages
6 (0.00/day)
The likelihood of these being somehow unlockable is essentially zero. I would be shocked if anyone could pull that off. If a CCD is disabled, nothing in the system knows it's there, after all.
I'm not so sure of that. I was having trouble getting ryzen master to work. I then opened Clock tuner and it says "You have a unique processor. Active CCD#2."
 

craxton

New Member
Joined
Dec 30, 2019
Messages
10 (0.01/day)
The likelihood of these being somehow unlockable is essentially zero. I would be shocked if anyone could pull that off. If a CCD is disabled, nothing in the system knows it's there, after all.
idk bud best take a "look" at this as it can read that CCD etc.
and tell you if ya gotta "decent" overclocker or not on the core side of things.

my 5600x is a 80 sil quality lowest ive seen out of 100s of other users using it. but
i can run 4000 1:1:1 DAILY c16 flat tuned without issues, overclocking a static core clock however
could only run 4600 at 1.35v and that wasnt prime stable. (did get CO dialed in tho)
this is that tool tho, https://drive.google.com/file/d/1p3ilWHGaVd_Afso4KqpjSsAVpfq88M85/view?usp=sharing

now my 5800x i get 5000 WHEA-19s an hour or did, now i get around 500 an hour with some IOD, CCD adjusting.
but its got 120 sil quality and hits 5ghz on most cores no issues.
none the less ill possibly just shut the dam logger up with Mannaitx's WHEA suppressor from OCN.
but yea its been stated that it can happen by a few who knows what theyre speaking on. not 100% sure or not.
but its being worked on ;)
 

Attachments

  • Screenshot (82).png
    Screenshot (82).png
    367.6 KB · Views: 89
  • Screenshot (97).png
    Screenshot (97).png
    1.7 MB · Views: 97
  • Screenshot (86).png
    Screenshot (86).png
    157.7 KB · Views: 92
  • Screenshot (239).png
    Screenshot (239).png
    242.1 KB · Views: 87
  • Screenshot (236).png
    Screenshot (236).png
    1.7 MB · Views: 99

TheChester

New Member
Joined
Feb 17, 2021
Messages
6 (0.00/day)
Am I using the AMD V/F tool correctly for my 5800x? It's show my Sil Quality as 125.278. Also what is the Cooler Score?
 

Attachments

  • 5800x.JPG
    5800x.JPG
    25.3 KB · Views: 100

PvtMaYhEm

New Member
Joined
Aug 8, 2022
Messages
1 (0.00/day)
Is there any update on this?

I apparently have a Dual CCD 5600X.

Wondering if I should try the lottery again, or just go with a 5900X or wait for Zen4 launch.

It still OC's to 4.85GHz with AutoOC/Curve Optimizer enabled w/ -30 per core and 200 boost. But it is warm at that speed. lol.
 
Status
Not open for further replies.
Top