Thursday, January 5th 2023
AMD Confirms Ryzen 9 7950X3D and 7900X3D Feature 3DV Cache on Only One of the Two Chiplets
AMD today announced its new Ryzen 7000X3D high-end desktop processors to much fanfare, with availability slated for February 2023, you can read all about them in our older article. In our coverage, we noticed something odd about the cache sizes of the 12-core 7900X3D and 16-core 7950X3D. Whereas the 8-core, single-CCD 7800X3D comes with 104 MB of total cache (L2+L3), which works out to 1 MB L2 cache per core and 96 MB of L3 cache (32 MB on-die + 64 MB stacked 3DV cache); the dual-CCD 7900X3D and 7950X3D was shown with total caches of 140 MB and 144 MB, while they should have been 204 MB or 208 MB, respectively.
In our older article, we explored two possibilities—one that the 3DV cache is available on both CCDs but halved in size for whatever reason; and the second more outlandish possibility that only one of the two CCDs has stacked 3DV cache, while the other is a normal planar CCD with just the on-die 32 MB L3 cache. As it turns out, the latter theory is right! AMD put out high-resolution renders of the dual-CCD 7000X3D processors, where only one of the two CCDs is shown having the L3D (L3 cache die) stacked on top. Even real-world pictures of the older "Zen 3" 3DV cache CCDs from the 5800X3D or EPYC "Milan-X" processors show CCDs with 3DV caches having a distinct appearance with dividing lines between the L3D and the structural substrates over the regions of the CCD that have the CPU cores. In these renders, we see these lines drawn on only one of the two CCDs.It shouldn't be hard for such an asymmetric cache setup to work in the real world from a software perspective, given that we are now firmly in the era of hybrid-core processors thanks to Intel and Arm. Even way before "Alder Lake," when AMD started shipping dual-CCD client processors with the Ryzen 3000 "Matisse" based on "Zen 2," the company closely collaborated with Microsoft to optimize OS scheduling such that high-performance and less-parallelized workloads such as games, are localized to just one of the two CCDs, to minimize DDR4 memory roundtrips.
Even before "Matisse," AMD and Microsoft confronted multi-threaded workload optimization challenges with dual-CCX architectures such as "Zen" and "Zen 2," where the OS scheduler would ideally want to localize gaming workload to a single CCX before saturating both CCXs on a single CCD, and then onward to the next CCD. This is achieved using methods such as CPPC2 preferred-core flagging, and which is why AMD highly recommends you to use their "Ryzen Balanced" Windows power-plan included with their Chipset drivers.
We predict that something similar is happening with the 12-core and 16-core 7000X3D processors—where gaming workloads can benefit from being localized to the 3DV cache-enabled CCD, and any spillover workloads (such as audio stack, network stack, background services, etc) are handled by the second CCD. In non-gaming workloads that scale across all 16 cores, the processor works like any other multi-core chip, it's just that the cores in the 3DV-enabled CCD have better performance from the larger victim cache. There shouldn't be any runtime errors arising from ISA mismatch, as the CPU core types on both CCDs are the same "Zen 4."
AMD Ryzen 7000X3D processors go on sale in February 2023.
In our older article, we explored two possibilities—one that the 3DV cache is available on both CCDs but halved in size for whatever reason; and the second more outlandish possibility that only one of the two CCDs has stacked 3DV cache, while the other is a normal planar CCD with just the on-die 32 MB L3 cache. As it turns out, the latter theory is right! AMD put out high-resolution renders of the dual-CCD 7000X3D processors, where only one of the two CCDs is shown having the L3D (L3 cache die) stacked on top. Even real-world pictures of the older "Zen 3" 3DV cache CCDs from the 5800X3D or EPYC "Milan-X" processors show CCDs with 3DV caches having a distinct appearance with dividing lines between the L3D and the structural substrates over the regions of the CCD that have the CPU cores. In these renders, we see these lines drawn on only one of the two CCDs.It shouldn't be hard for such an asymmetric cache setup to work in the real world from a software perspective, given that we are now firmly in the era of hybrid-core processors thanks to Intel and Arm. Even way before "Alder Lake," when AMD started shipping dual-CCD client processors with the Ryzen 3000 "Matisse" based on "Zen 2," the company closely collaborated with Microsoft to optimize OS scheduling such that high-performance and less-parallelized workloads such as games, are localized to just one of the two CCDs, to minimize DDR4 memory roundtrips.
Even before "Matisse," AMD and Microsoft confronted multi-threaded workload optimization challenges with dual-CCX architectures such as "Zen" and "Zen 2," where the OS scheduler would ideally want to localize gaming workload to a single CCX before saturating both CCXs on a single CCD, and then onward to the next CCD. This is achieved using methods such as CPPC2 preferred-core flagging, and which is why AMD highly recommends you to use their "Ryzen Balanced" Windows power-plan included with their Chipset drivers.
We predict that something similar is happening with the 12-core and 16-core 7000X3D processors—where gaming workloads can benefit from being localized to the 3DV cache-enabled CCD, and any spillover workloads (such as audio stack, network stack, background services, etc) are handled by the second CCD. In non-gaming workloads that scale across all 16 cores, the processor works like any other multi-core chip, it's just that the cores in the 3DV-enabled CCD have better performance from the larger victim cache. There shouldn't be any runtime errors arising from ISA mismatch, as the CPU core types on both CCDs are the same "Zen 4."
AMD Ryzen 7000X3D processors go on sale in February 2023.
164 Comments on AMD Confirms Ryzen 9 7950X3D and 7900X3D Feature 3DV Cache on Only One of the Two Chiplets
So you're saying for content creation 13700kf is better than 7800x3d?
5800x CineBench R23 single-core 1619
5800x3d score 1491 (8% lower)
13700kf score 2126
7900x score 2041
Assuming 7800x3d won't have single-core score of more than 7900x which is already lower than 13700kf which is 420$
7800x3d perhaps 450$ at least
edit: forgot the link to the article - www.phoronix.com/review/amd-5800x3d-linux/8
12700kf (currently 275$) is 15% faster in photoshop than 5800x3d (current price 350$)
12700kf is 50% faster but 27% cheaper in Unreal Engine than 5800x3d
53% faster in Adobe Premiere Pro I'm guessing single-core score of 7800x3d is 1688 (because 7700x is 5.4GHz vs 7800x3d 5GHz)
13700kf@394$ is 2126 which is more than 7950x@568$ 2072
so with 13700kf@~400$ we get 25% higher CineBench single-core score than 7800x3d@450$
But I would not buy any of these CPUs anyway. And the 7800X3D will probably be too expensive.
Maybe AMD did "solve" the voltage "issues" for OC'ing with V-cache.
AMD adds cache for only half the cores, "that's ok for games", "let's wait for benchmarks"... Lovely, just lovely.
Also, Zen 4c seems pretty much specialised for certain server tasks, not really intended to become a universal core. We've seen Alder Lake. Expect a rough start, then an improvement. Also, Microsoft may say we need a new Windows 11 update with major UI changes for that. That's for certain ... sadly.
The 95° were “normal” and expected, and now magically the TDP for the 3D models has been lowered to 120W (from 170W).
Newer article:
www.phoronix.com/review/amd-5800x3d-linux6/4
16 September 2022. The 970 was claimed to be a 4GB card, not 3.5 (fast) + 0.5 (slow).
AMD claims 16 cores and 144MB of L2+L3 cache - and that's 16 cores with 144MB of L2+L3 cache.
If you'd like 64MB of 3D-cache on each CCD, write that explicitly.