• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD Ryzen 9 9950X3D Carries 3D V-Cache on a Single CCD, 5.6 GHz Clock Speed, and 170 Watt TDP

Joined
Jun 1, 2010
Messages
398 (0.07/day)
System Name Very old, but all I've got ®
Processor So old, you don't wanna know... Really!
For the people who keep quoting Latency graphs to argue there is no penalty, consider this. Those tests you are quoting are 1 core accessing 1 core on the 2nd CCD, I have linked a test that goes into further details where they load down CCDS from single thread to fully loaded and measured its latency and in actually splitting threads across the two CCDs and with Zen 4 it is bad!!!


Zen 4 has a hardware limitation that a dual X3D setup would have been absolutly HORRENDOUS in performance as accessing the 2nd CCDs cache would have been only as fast as accessing DRAM in certain worst case scenarios and can very easily see 2-3 times the latency penalty rising to nearly 10 times in the worst case. I suspect Zen 3/5xxx series parts would have seen similar issues due to the design of the IO Die etc

Zen 5 has seemingly fixed this issue as well as having the high clock speeds due to the relocated X3D. I wonder if we AMD are holding back dual X3D parts in case Intel pulls something out of the bag ala Nvidias origianl Ti/Super variants of a few years ago? I mean the Single CCD parts are completely handing Intel the L in gaming by quite a margin currently.

Also are they trying to prevent confusion as the dual x3d parts would segregate the market even futher again as you now have 3 different SKUs for each core count and with desktop parts probably pushing up towards the $/£1k mark again for the top end non HEDT part. How much would it cut into their lower end HEDT/Workstation sales.
My bet, is that this is just the margin/business thing. They want to milk with limited SKU, for as long as possible. This is "reasonable" from the business standpoint, but atrocious, from every other. Including amount of e-waste, produced for the sole purpose of very temporary sales boost, while intel has nothing on the table, yet. As these heterohgenous SKUs might be as avoided, as Zen4 ones, despite they could be a solid solution from the get go. These hybrid stuff might be eclipsed and avoided again in favor of either mono-3D, single 9800X3D, or any ppotential dual CCD 3D. Money...
 
Last edited:
Joined
Jul 20, 2018
Messages
129 (0.05/day)
System Name Multiple desktop/server builds
Processor Desktops: 13900K, 5800X3D, 12900K | Servers: 2 x 3900X, 2 x 5950X, 3950X, 2950X, 8700K
Motherboard Z690 Apex, X570 Aorus Xtreme, Z690-I Strix
Cooling All watercooled
Memory DDR5-6400C32, DDR4-3600C14, DDR5-6000C36
Video Card(s) 4090 Gaming OC, 4090 TUF OC, 2 x 3090, 2 x 2080Ti, 1080Ti Gaming X EK, 2 x 1070, 2 x 1060
Storage dozens of TBs of SSDs, 112TB NAS, 140TB NAS
Display(s) Odyssey Neo G9, PG35VQ, P75QX-H1
Case Caselabs S8, Enthoo Elite, Meshlicious, Cerberus X, Cerberus, 2 x Velka 7, MM U2-UFO, Define C
Audio Device(s) Schiit Modius + SMSL SP200, Grace DAC + Drop THX AAA, Sony HT-A9, Nakamichi 9.2.4
Power Supply AX1200, Dark Power Pro 12 1500W
Mouse G Pro X Superlight Black + White
Keyboard Wooting 60HE, Moonlander
VR HMD Index, Oculus CV1
Sigh.... Still with the assymentric cache arrangement....

Was kind of hoping they would do that as a 'lower' tier Ryzen 9 part and release a dual-CCD X3D top-tier Ryzen 9 CPU....
It's a pain having to use windows gamebar or set process affinity but if a game is crossing CCDs that latency penalty is so huge the vcache won't matter anyways.
 
Joined
Jan 14, 2023
Messages
856 (1.19/day)
System Name Asus G16
Processor i9 13980HX
Motherboard Asus motherboard
Cooling 2 fans
Memory 32gb 4800mhz
Video Card(s) 4080 laptop
Storage 16tb, x2 8tb SSD
Display(s) QHD+ 16in 16:10 (2560x1600, WQXGA) 240hz
Power Supply 330w psu
What they really need is OS support so programmers can manually assign threads to the CCD of choice. Each CCD in the non-x3d cache version already had a massive cache and dividing workloads among CCD's can be a significant boost at the application level when the application doesn't need all cores.

Saving 20 seconds on compile is another 20 seconds to sip coffee. (No joke.)
For me, its 20 more sec of TPU, watching porn, then green tea, all in that order.
 
Joined
Jan 3, 2021
Messages
3,649 (2.50/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
For me, its 20 more sec of TPU, watching porn, then green tea, all in that order.
None of these activities should be rationed by the second but to each his own.
 
Top