• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD Ryzen 9 9950X

Joined
Jun 10, 2014
Messages
3,019 (0.78/day)
Processor AMD Ryzen 9 5900X ||| Intel Core i7-3930K
Motherboard ASUS ProArt B550-CREATOR ||| Asus P9X79 WS
Cooling Noctua NH-U14S ||| Be Quiet Pure Rock
Memory Crucial 2 x 16 GB 3200 MHz ||| Corsair 8 x 8 GB 1333 MHz
Video Card(s) MSI GTX 1060 3GB ||| MSI GTX 680 4GB
Storage Samsung 970 PRO 512 GB + 1 TB ||| Intel 545s 512 GB + 256 GB
Display(s) Asus ROG Swift PG278QR 27" ||| Eizo EV2416W 24"
Case Fractal Design Define 7 XL x 2
Audio Device(s) Cambridge Audio DacMagic Plus
Power Supply Seasonic Focus PX-850 x 2
Mouse Razer Abyssus
Keyboard CM Storm QuickFire XT
Software Ubuntu
Even though this is mostly simulations, raytracing, "AI" stuff etc., it gives a taste of what we might have in store when more common productive applications will start to offer support. I'm thinking of things like Photoshop, Premiere, DaVinci Resolve, various CADs and modellers, etc.

Especially worth mentioning here is the difference of AVX2 vs. AVX-512 on Zen 4 vs. AVX-512 on Zen 5. Overall, there is still a lot of gain for the "double-pumped" 256-bit on Zen 4, and most of the difference over AVX2 here showcases how much instruction overhead (directly and indirectly) there is.

AMDs new ZEN5 CPUs weren't ready for prime time. They should have waited till the software and new motherboards were "Officially" ready. So now AMD needs to play catch-up because these CPUs should be at a min 10-12% IPC over the previous gen.

ZEN5 suffers from Windows Scheduling issues. The same happened with ZEN4 when they first came out.
IPC wouldn't change from software improvements.

Developing a scheduler is Microsoft's responsibility, not AMD's.
But specify what do you mean here; do you expect an optimized scheduler for generic CPU capabilities (like SMT, hybrid cores, etc.?), or one per microarchitecture?
I'm not aware of significant changes which should cause Zen 5 to behave significantly different from Zen 4, are you?

And in case you are expecting specialized schedulers then that's not going to be very sustainable in the long run. A couple of generations down the line, and Zen 5 owners will be "screwed" again, as sooner or later they will either have to drop something or make a hard choice which benefits the newer hardware at the cost of the old.

If hardware really needs this much specializing in the OS kernel, then something is heading down the wrong track. We need more robust hardware designs than this.

AMD did have an opportunity to really give it to Intel and all their major design issues. But nope lol, not yet I suppose.
Major design issues?
Are you talking about the issue of voltage causing prematurely worn CPUs? That's not a design/architectural issue at all.

It's not like any of them can observe the other and 6 months later release a product based on those decisions. The major design of Zen 5 was done years ago. They have two or three generations in development at any time.

I think many are just having unrealistic expectations. As I said months ago, it would take a lot to catch up with and even beat Raptor Lake in performance per core. This is not helped by clickbait YouTube channels and rumor sites trying to make up the most exciting and controversial news.

Overall, I'm not so disappointed in Zen 5 as most of the people in here seem to be. There are certainly use cases where it shines, and others where Intel shines. If AMD would get their act together and create Threadripper variants for these very quickly, at least down to 12 cores (8 cores too, preferably), I'd be very tempted to buy one.
 
Joined
Aug 25, 2023
Messages
446 (0.86/day)
System Name No.1
Processor Ryzen 9 9900X with custom PBO + 2200 FCLK fully stable
Motherboard B650 Gigabyte Aorus Elite v1.0
Cooling Thermaltake toughair 710 + Thermal Grizzly Kryonaut extreme
Memory Patriot Viper PVV532G740C36K @ 6200MT/s 30-36-36-63 1:1
Video Card(s) Asus TUF gaming RX 7900 XTX OC edition
Storage 1TB T-Force Z44A7 + 2TB T-Force A440 Pro
Display(s) 34 " Asus TUF Gaming VG3A series
Case Antec C8 constellation white edition
Audio Device(s) Asus Xonar AE 7.1 + Logitech Z906
Power Supply Corsair RM1000x V2
Mouse MSI Clutch GM20 Elite
Keyboard Logitech G512 Carbon
Zen 5 shares the EXACT same IMC as Zen 4, it's the same physical piece of silicon. There's some software changes though that have only a minor effect.

I couldn't even get 6000 stable at stock on my 9900X. What do you suggest how to proceed with the review if 6400 is the comparison baseline? I suspect that at least half my AM5 CPUs will not run 6400 1:1

This is from the AMD Ryzen 9000 reviewer's guide
View attachment 359106


Inded, I'm planning to switch to Zelda for the next rebench. Will probably hve to play a substantial part of the game though, to find a decent scene
Your definition of Exact is different to mine then, if you claim software changes have had minor effects, then they are by definition not exact.

I just ran prime 95 blend yesterday for over 6 hrs with 1:1 & 6000MT/S with my old Hynix M die kit, stable no problems. The chip you tested with must be a poor sample thanks to the silicon lottery perhaps?
PBO is enabled & tuned without the new curve shaper mechanism, I leave that on auto for time being hence why the power consumption is approx doubled from 65w TDP - Asrock's implementation of the current agesa.
6.45Hrs_PBOc_6000CL30_FLCK2133_blend - Copy.jpg


But I can't be bothered arguing anymore, just going to leave this here & move on to testing at 6200MT/S, maybe post results in another thread.
 
Joined
Nov 6, 2008
Messages
10 (0.00/day)
It is just not clocking well because too thick IHS and tight Wattage.
I doubt the IHS is limiting maximum clock speed, though it might be limiting in heavily threaded loads. Zen 5 has a much wider execution pipeline and higher density than you would expect from the node change, both of which can reduce the safe operating voltage. The maximum automatic VID decreased from 1.475v to 1.4v.
I just ran prime 95 blend yesterday for over 6 hrs with 1:1 & 6000MT/S with my old Hynix M die kit, stable no problems. The chip you tested with must be a poor sample thanks to the silicon lottery perhaps?
P95 blend isn’t a useful stability test. I can run it indefinitely on my 5800X3D at -30 all core, 3800MT/s, CL16, Cmd 1T, and GDM off. At those settings OCCT memtest fails in less than a second and the system crashes when idle.
 
Joined
Jun 10, 2014
Messages
3,019 (0.78/day)
Processor AMD Ryzen 9 5900X ||| Intel Core i7-3930K
Motherboard ASUS ProArt B550-CREATOR ||| Asus P9X79 WS
Cooling Noctua NH-U14S ||| Be Quiet Pure Rock
Memory Crucial 2 x 16 GB 3200 MHz ||| Corsair 8 x 8 GB 1333 MHz
Video Card(s) MSI GTX 1060 3GB ||| MSI GTX 680 4GB
Storage Samsung 970 PRO 512 GB + 1 TB ||| Intel 545s 512 GB + 256 GB
Display(s) Asus ROG Swift PG278QR 27" ||| Eizo EV2416W 24"
Case Fractal Design Define 7 XL x 2
Audio Device(s) Cambridge Audio DacMagic Plus
Power Supply Seasonic Focus PX-850 x 2
Mouse Razer Abyssus
Keyboard CM Storm QuickFire XT
Software Ubuntu
People probably have very different standards for what they regard as "stable". There is a big difference in what you can achieve on a brand new chip running well enough to complete some batches of benchmarks vs. during years of actual usage. So if the purpose is a theoretical exercise to see the potential and locate any sweetspots, then running overclocked memory is fine. But if the purpose is to showcase the real world performance the buyer can expect to achieve throughout the product's lifespan, then better stick with the speeds and voltage the memory controller is designed and certified for.

When I regard a system as stable, I don't expect a single system crash during at least 3 months of uptime. What I've seen in development environments, system crashes are extremely rare even with heavily used workstations, especially if they're running Linux. That kind of stability would not be achievable with overclocked memory, not only due to system crashes, but also file corruption.

But validating this level of system stability isn't trivial either. You generally need many different types of tests, and for a prolonged time to draw final conclusions. I've seen for instance what overclocked memory can do to code compilation with gcc and llvm, with just occasional absurd behavior, while passing memory tests and various CPU tests (removing the OC obviously resolved the issue). It was in fact gcc and llvm which were the most reliable workloads to pinpoint the stability issues with Zen 1, and not with a particularly "stressful" load either, which goes to show that hardware issues can absolutely cause problems during light to medium load too.
 
Joined
Apr 30, 2011
Messages
2,722 (0.54/day)
Location
Greece
Processor AMD Ryzen 5 5600@80W
Motherboard MSI B550 Tomahawk
Cooling ZALMAN CNPS9X OPTIMA
Memory 2*8GB PATRIOT PVS416G400C9K@3733MT_C16
Video Card(s) Sapphire Radeon RX 6750 XT Pulse 12GB
Storage Sandisk SSD 128GB, Kingston A2000 NVMe 1TB, Samsung F1 1TB, WD Black 10TB
Display(s) AOC 27G2U/BK IPS 144Hz
Case SHARKOON M25-W 7.1 BLACK
Audio Device(s) Realtek 7.1 onboard
Power Supply Seasonic Core GC 500W
Mouse Sharkoon SHARK Force Black
Keyboard Trust GXT280
Software Win 7 Ultimate 64bit/Win 10 pro 64bit/Manjaro Linux
Software is the only problem with Zen5. Linux review on the topic of Linux VS Windows performance (https://www.phoronix.com/review/ryzen-9950x-windows11-ubuntu/8) proved that definately. 14% difference between the OSes on average of +70 tests doesn't need any more words. So, the Zen5 arch is clearly a progress over Zen4, the software (win11 updates, chipset drivers) and maybe the AGESA code needs more work. AMD's fault? For sure! But the letter had good news, the post office sent it earlier and couldn't be comprehended properly due to circumstances. ;)
 
Joined
Aug 6, 2023
Messages
33 (0.06/day)
It's not good, but it is not bad either. There is a specific market for this CPU. The average person are not gonna pay 15k+ ( I am expecting the price to be close to that) in South Africa, When you can get a 7800x3d for a lot cheaper. Productivity yes maybe but there are also cheaper options. What is the purpose of shaving off 2,3 seconds? If you are in a crunch working that needs it done as soon as possible then yes, Otherwise, I think for the price here it won't be worth it.
 
Joined
May 10, 2023
Messages
524 (0.84/day)
Location
Brazil
Processor 5950x
Motherboard B550 ProArt
Cooling Fuma 2
Memory 4x32GB 3200MHz Corsair LPX
Video Card(s) 2x RTX 3090
Display(s) LG 42" C2 4k OLED
Power Supply XPG Core Reactor 850W
Software I use Arch btw
Software is the only problem with Zen5. Linux review on the topic of Linux VS Windows performance (https://www.phoronix.com/review/ryzen-9950x-windows11-ubuntu/8) proved that definately. 14% difference between the OSes on average of +70 tests doesn't need any more words. So, the Zen5 arch is clearly a progress over Zen4, the software (win11 updates, chipset drivers) and maybe the AGESA code needs more work. AMD's fault? For sure! But the letter had good news, the post office sent it earlier and couldn't be comprehended properly due to circumstances. ;)
I wonder how much input those CPU manufacturers have on Window's core parts, specially the scheduler.
I'm going to believe "not much" since it's not the first time AMD had issues with the NT scheduler, and Intel had to resort to a hw scheduler (something that I personally deem way unnecessary and overkill, but apparently needed).

We have many other hybrid CPUs in other spaces (see ARM mobile devices), and Linux seems to not have much of an issue (Intel's addition of their thread director support barely made a difference in performance), so it indeed seems to be just a Windows issue.
I wonder if that's why Qcom decided to go with a single core type for their Elite SoC (or if it's mostly because Oryon was not designed to be used like so).

Well, I hope MS manages to catch up and allow their scheduler to properly work, after all, this also has implications with their future adoption with ARM devices.
 
Joined
Oct 30, 2020
Messages
319 (0.21/day)
Location
Toronto
System Name GraniteXT
Processor Ryzen 9950X
Motherboard ASRock B650M-HDV
Cooling 2x360mm custom loop
Memory 2x24GB Team Xtreem DDR5-8000 [M die]
Video Card(s) RTX 3090 FE underwater
Storage Intel P5800X 800GB + Samsung 980 Pro 2TB
Display(s) MSI 342C 34" OLED
Case O11D Evo RGB
Audio Device(s) DCA Aeon 2 w/ SMSL M200/SP200
Power Supply Superflower Leadex VII XG 1300W
Mouse Razer Basilisk V3
Keyboard Steelseries Apex Pro V2 TKL
Faster Ram is not better.

7800 C38 isn't supposed to be faster than even 6000. The problem with 7800 is that UCLK sits at 1900 with the 2:1 ratio, which causes a bottleneck, compared to 3000 for the 1:1 6000 setup. 1900 is way too limiting and doesn't allow the 'other half' of the transmission to compensate enough.

If you want more speed than 6200 (or even tight 6000), you have to go 8000 with tighter timings at the minimum. That isn't possible with 2DPC boards and there are very few options for 1DPC boards for x670. This is what i'm hoping x870 will bring (and looks like there's at least one option coming up), as from my earlier tests it's only possible to go faster than tight 6200 is to have a relatively tight 8000 tune.

I wonder why they can't have a 3:2 ratio for MCLK:UCLK. The choice of only 1:1 or 2:1 is limiting and a 3:2 ratio at 8000 would alleviate a lot of Zen 5's bottlenecks especially in gaming.

I'll quote chips and cheese, who mirror my thoughts on the Zen 5 arch "Much of the potential throughput offered by Zen 5’s wider pipeline is lost to latency, either with backend memory accesses or frontend delays."

They really need a new IO, or that divider at the least. For gaming, it's mostly the backend memory access that's really limiting them. Good thing is the X3D variant should, in theory, be very fast and lose a lot less core width to memory latency. If they can combine with the VCache with the rumoured higher clocks (compared to prior X3D implementations), we might have a winner.

7800 C38 isn't supposed to be faster than even 6000. The problem with 7800 is that UCLK sits at 1900 with the 2:1 ratio, which causes a bottleneck, compared to 3000 for the 1:1 6000 setup. 1900 is way too limiting and doesn't allow the 'other half' of the transmission to compensate enough.

If you want more speed than 6200 (or even tight 6000), you have to go 8000 with tighter timings at the minimum. That isn't usually possible with 2DPC boards and there are very few options for 1DPC boards for x670. This is what i'm hoping x870 will bring (and looks like there's at least one option coming up), as from my earlier tests it's only possible to go faster than tight 6200 is to have a relatively tight 8000 tune.

I wonder why they can't have a 3:2 ratio for MCLK:UCLK. The choice of only 1:1 or 2:1 is limiting and a 3:2 ratio at 8000 would alleviate a lot of Zen 5's bottlenecks especially in gaming.

I'll quote chips and cheese, who mirror my thoughts on the Zen 5 arch "Much of the potential throughput offered by Zen 5’s wider pipeline is lost to latency, either with backend memory accesses or frontend delays."

They really need a new IO, or that divider at the least. For gaming, it's mostly the backend memory access that's really limiting them. Good thing is the X3D variant should, in theory, be very fast and lose a lot less core width to memory latency. If they can combine with the VCache with the rumoured higher clocks (compared to prior X3D implementations), we might have a winner.
 
Joined
Jul 30, 2019
Messages
3,380 (1.69/day)
System Name Still not a thread ripper but pretty good.
Processor Ryzen 9 7950x, Thermal Grizzly AM5 Offset Mounting Kit, Thermal Grizzly Extreme Paste
Motherboard ASRock B650 LiveMixer (BIOS/UEFI version P3.08, AGESA 1.2.0.2)
Cooling EK-Quantum Velocity, EK-Quantum Reflection PC-O11, D5 PWM, EK-CoolStream PE 360, XSPC TX360
Memory Micron DDR5-5600 ECC Unbuffered Memory (2 sticks, 64GB, MTC20C2085S1EC56BD1) + JONSBO NF-1
Video Card(s) XFX Radeon RX 5700 & EK-Quantum Vector Radeon RX 5700 +XT & Backplate
Storage Samsung 4TB 980 PRO, 2 x Optane 905p 1.5TB (striped), AMD Radeon RAMDisk
Display(s) 2 x 4K LG 27UL600-W (and HUANUO Dual Monitor Mount)
Case Lian Li PC-O11 Dynamic Black (original model)
Audio Device(s) Corsair Commander Pro for Fans, RGB, & Temp Sensors (x4)
Power Supply Corsair RM750x
Mouse Logitech M575
Keyboard Corsair Strafe RGB MK.2
Software Windows 10 Professional (64bit)
Benchmark Scores RIP Ryzen 9 5950x, ASRock X570 Taichi (v1.06), 128GB Micron DDR4-3200 ECC UDIMM (18ASF4G72AZ-3G2F1)
Joined
Apr 30, 2020
Messages
1,028 (0.59/day)
System Name S.L.I + RTX research rig
Processor Ryzen 7 5800X 3D.
Motherboard MSI MEG ACE X570
Cooling Corsair H150i Cappellx
Memory Corsair Vengeance pro RGB 3200mhz 32Gbs
Video Card(s) 2x Dell RTX 2080 Ti in S.L.I
Storage Western digital Sata 6.0 SDD 500gb + fanxiang S660 4TB PCIe 4.0 NVMe M.2
Display(s) HP X24i
Case Corsair 7000D Airflow
Power Supply EVGA G+1600watts
Mouse Corsair Scimitar
Keyboard Cosair K55 Pro RGB
Your definition of Exact is different to mine then, if you claim software changes have had minor effects, then they are by definition not exact.

I just ran prime 95 blend yesterday for over 6 hrs with 1:1 & 6000MT/S with my old Hynix M die kit, stable no problems. The chip you tested with must be a poor sample thanks to the silicon lottery perhaps?
PBO is enabled & tuned without the new curve shaper mechanism, I leave that on auto for time being hence why the power consumption is approx doubled from 65w TDP - Asrock's implementation of the current agesa.
View attachment 359308

But I can't be bothered arguing anymore, just going to leave this here & move on to testing at 6200MT/S, maybe post results in another thread.
What's with the weird 4.6ghz L3 Cache speed while memory controller is sitting a 3.0ghz???? & infinity fabric is 2133mhz???? :confused:
Nothings in line with each other the cores are 5.7ghz+
 
Joined
Nov 6, 2008
Messages
10 (0.00/day)
What's with the weird 4.6ghz L3 Cache speed while memory controller is sitting a 3.0ghz???? & infinity fabric is 2133mhz???? :confused:
Nothings in line with each other the cores are 5.7ghz+
Look at the top left entry, the average core clock is 4.687Ghz. L3 clock is always equal to the highest core.
 
Joined
Jun 10, 2014
Messages
3,019 (0.78/day)
Processor AMD Ryzen 9 5900X ||| Intel Core i7-3930K
Motherboard ASUS ProArt B550-CREATOR ||| Asus P9X79 WS
Cooling Noctua NH-U14S ||| Be Quiet Pure Rock
Memory Crucial 2 x 16 GB 3200 MHz ||| Corsair 8 x 8 GB 1333 MHz
Video Card(s) MSI GTX 1060 3GB ||| MSI GTX 680 4GB
Storage Samsung 970 PRO 512 GB + 1 TB ||| Intel 545s 512 GB + 256 GB
Display(s) Asus ROG Swift PG278QR 27" ||| Eizo EV2416W 24"
Case Fractal Design Define 7 XL x 2
Audio Device(s) Cambridge Audio DacMagic Plus
Power Supply Seasonic Focus PX-850 x 2
Mouse Razer Abyssus
Keyboard CM Storm QuickFire XT
Software Ubuntu
Software is the only problem with Zen5. Linux review on the topic of Linux VS Windows performance (https://www.phoronix.com/review/ryzen-9950x-windows11-ubuntu/8) proved that definately. 14% difference between the OSes on average of +70 tests doesn't need any more words. So, the Zen5 arch is clearly a progress over Zen4, the software (win11 updates, chipset drivers) and maybe the AGESA code needs more work. AMD's fault? For sure! But the letter had good news, the post office sent it earlier and couldn't be comprehended properly due to circumstances. ;)
There should be little doubt that the antiquated "New Technology" kernel is long overdue for a major overhaul, and I'm thinking beyond just tweaking the scheduler. There is also stability issues, security issues and performance issues, the latter of which is more than just being a few percent slower than Linux on average. The performance inconsistency is far more annoying which and the reason why so many people "feel" Linux is faster and less jerky on their computer (especially if it's not the latest and greatest).

But I do appreciate the challenge MS is facing though; the mountain of obscure enterprise software which is very closely knit to archaic APIs. But with long-term software maintenance, if you don't prioritize modernizing continuously, it will only get harder to catch up.

As for AMD's firmware, I doubt that is going to make a big difference. Fixing Windows' kernel is MS' job, not AMD's. (assuming there is no design flaw in hardware)

I'll quote chips and cheese, who mirror my thoughts on the Zen 5 arch "Much of the potential throughput offered by Zen 5’s wider pipeline is lost to latency, either with backend memory accesses or frontend delays."

They really need a new IO, or that divider at the least. For gaming, it's mostly the backend memory access that's really limiting them. Good thing is the X3D variant should, in theory, be very fast and lose a lot less core width to memory latency. If they can combine with the VCache with the rumoured higher clocks (compared to prior X3D implementations), we might have a winner.
First of all, saturating wider execution and larger vector width is primarily a memory bandwidth issue, not a L3 cache hit issue. L3 is after all discarded L2 cache lines, and the cache hits here are primarily instruction cache lines. With optimized workloads we generally don't see heavy L3 cache sensitivity (which is why only select workloads benefits from lots of L3 cache), and especially a heavy AVX workload wouldn't saturate the CPU very well if the code was this bloated.

I will be curious to see whether upcoming Epyc and Threadripper(?) models with more memory channels will scale better in AVX512 workloads, then we'll probably see if the memory bus or the CPU frontend is the bigger bottleneck. I'd expect Phoronix to do a good job exploring this sort of stuff as usual.
 
Joined
Oct 30, 2020
Messages
319 (0.21/day)
Location
Toronto
System Name GraniteXT
Processor Ryzen 9950X
Motherboard ASRock B650M-HDV
Cooling 2x360mm custom loop
Memory 2x24GB Team Xtreem DDR5-8000 [M die]
Video Card(s) RTX 3090 FE underwater
Storage Intel P5800X 800GB + Samsung 980 Pro 2TB
Display(s) MSI 342C 34" OLED
Case O11D Evo RGB
Audio Device(s) DCA Aeon 2 w/ SMSL M200/SP200
Power Supply Superflower Leadex VII XG 1300W
Mouse Razer Basilisk V3
Keyboard Steelseries Apex Pro V2 TKL
First of all, saturating wider execution and larger vector width is primarily a memory bandwidth issue, not a L3 cache hit issue. L3 is after all discarded L2 cache lines, and the cache hits here are primarily instruction cache lines. With optimized workloads we generally don't see heavy L3 cache sensitivity (which is why only select workloads benefits from lots of L3 cache), and especially a heavy AVX workload wouldn't saturate the CPU very well if the code was this bloated.

I will be curious to see whether upcoming Epyc and Threadripper(?) models with more memory channels will scale better in AVX512 workloads, then we'll probably see if the memory bus or the CPU frontend is the bigger bottleneck. I'd expect Phoronix to do a good job exploring this sort of stuff as usual.

I agree with you that it's a mem bandwidth issue when trying to saturate the wider execution engines. I'm aware that poorly written code (eg. games) causes L2 spillover which ends up in L3. What I was stating is that, when executing poor code, a larger L3 reduces the number of memory trips so it loses less core width to memory latency/bandwidth in that scenario.

I'm waiting for the Turin tests as well, a lot of info will come out of that one.
 
Joined
Nov 26, 2021
Messages
1,730 (1.50/day)
Location
Mississauga, Canada
Processor Ryzen 7 5700X
Motherboard ASUS TUF Gaming X570-PRO (WiFi 6)
Cooling Noctua NH-C14S (two fans)
Memory 2x16GB DDR4 3200
Video Card(s) Reference Vega 64
Storage Intel 665p 1TB, WD Black SN850X 2TB, Crucial MX300 1TB SATA, Samsung 830 256 GB SATA
Display(s) Nixeus NX-EDG27, and Samsung S23A700
Case Fractal Design R5
Power Supply Seasonic PRIME TITANIUM 850W
Mouse Logitech
VR HMD Oculus Rift
Software Windows 11 Pro, and Ubuntu 20.04
I agree with you that it's a mem bandwidth issue when trying to saturate the wider execution engines. I'm aware that poorly written code (eg. games) causes L2 spillover which ends up in L3. What I was stating is that, when executing poor code, a larger L3 reduces the number of memory trips so it loses less core width to memory latency/bandwidth in that scenario.

I'm waiting for the Turin tests as well, a lot of info will come out of that one.
Even exceptionally well optimized code can benefit from more cache if the working set is large enough. Note the higher IPC for the 3D cache CCD of the 7950X3D.

1724271327341.png
 
Joined
Oct 30, 2020
Messages
319 (0.21/day)
Location
Toronto
System Name GraniteXT
Processor Ryzen 9950X
Motherboard ASRock B650M-HDV
Cooling 2x360mm custom loop
Memory 2x24GB Team Xtreem DDR5-8000 [M die]
Video Card(s) RTX 3090 FE underwater
Storage Intel P5800X 800GB + Samsung 980 Pro 2TB
Display(s) MSI 342C 34" OLED
Case O11D Evo RGB
Audio Device(s) DCA Aeon 2 w/ SMSL M200/SP200
Power Supply Superflower Leadex VII XG 1300W
Mouse Razer Basilisk V3
Keyboard Steelseries Apex Pro V2 TKL
Joined
Apr 30, 2020
Messages
1,028 (0.59/day)
System Name S.L.I + RTX research rig
Processor Ryzen 7 5800X 3D.
Motherboard MSI MEG ACE X570
Cooling Corsair H150i Cappellx
Memory Corsair Vengeance pro RGB 3200mhz 32Gbs
Video Card(s) 2x Dell RTX 2080 Ti in S.L.I
Storage Western digital Sata 6.0 SDD 500gb + fanxiang S660 4TB PCIe 4.0 NVMe M.2
Display(s) HP X24i
Case Corsair 7000D Airflow
Power Supply EVGA G+1600watts
Mouse Corsair Scimitar
Keyboard Cosair K55 Pro RGB
That CCD to CCD latency is like old zen+ architechture that didn't have an i/o die. The CCD had go all the way to ram from the CCD's memoery controller and through the ram back over through the other dies memoery controller.

The only bonus amd got from those exrta memory controlers that was threadrippers 2000's 144gbs ram bandwidth on qaud chaneel with numa and up to 288gbs on EPYC octo channel.
 
Joined
Dec 17, 2020
Messages
144 (0.10/day)
There should be little doubt that the antiquated "New Technology" kernel is long overdue for a major overhaul, and I'm thinking beyond just tweaking the scheduler. There is also stability issues, security issues and performance issues, the latter of which is more than just being a few percent slower than Linux on average. The performance inconsistency is far more annoying which and the reason why so many people "feel" Linux is faster and less jerky on their computer (especially if it's not the latest and greatest).

But I do appreciate the challenge MS is facing though; the mountain of obscure enterprise software which is very closely knit to archaic APIs. But with long-term software maintenance, if you don't prioritize modernizing continuously, it will only get harder to catch up.

As for AMD's firmware, I doubt that is going to make a big difference. Fixing Windows' kernel is MS' job, not AMD's. (assuming there is no design flaw in hardware)


First of all, saturating wider execution and larger vector width is primarily a memory bandwidth issue, not a L3 cache hit issue. L3 is after all discarded L2 cache lines, and the cache hits here are primarily instruction cache lines. With optimized workloads we generally don't see heavy L3 cache sensitivity (which is why only select workloads benefits from lots of L3 cache), and especially a heavy AVX workload wouldn't saturate the CPU very well if the code was this bloated.

I will be curious to see whether upcoming Epyc and Threadripper(?) models with more memory channels will scale better in AVX512 workloads, then we'll probably see if the memory bus or the CPU frontend is the bigger bottleneck. I'd expect Phoronix to do a good job exploring this sort of stuff as usual.
That 14% slowdown is a feature of all the embedded spyware. It's not going anywhere.

 
Joined
Apr 17, 2021
Messages
580 (0.42/day)
System Name Jedi Survivor Gaming PC
Processor AMD Ryzen 7800X3D
Motherboard Asus TUF B650M Plus Wifi
Cooling ThermalRight CPU Cooler
Memory G.Skill 32GB DDR5-5600 CL28
Video Card(s) MSI RTX 3080 10GB
Storage 2TB Samsung 990 Pro SSD
Display(s) MSI 32" 4K OLED 240hz Monitor
Case Asus Prime AP201
Power Supply FSP 1000W Platinum PSU
Mouse Logitech G403
Keyboard Asus Mechanical Keyboard
Zen 5 shares the EXACT same IMC as Zen 4, it's the same physical piece of silicon. There's some software changes though that have only a minor effect.

I couldn't even get 6000 stable at stock on my 9900X. What do you suggest how to proceed with the review if 6400 is the comparison baseline? I suspect that at least half my AM5 CPUs will not run 6400 1:1

This is from the AMD Ryzen 9000 reviewer's guide
View attachment 359106


Inded, I'm planning to switch to Zelda for the next rebench. Will probably hve to play a substantial part of the game though, to find a decent scene

That's awesome news. Mario Kart was a Wii U port and runs easily. Zelda is a lot more challenging! Cheers!
 
Joined
Apr 13, 2023
Messages
317 (0.49/day)
System Name Can it run Warhammer 3?
Processor 7800X3D @ 5Ghz
Motherboard Gigabyte B650 Aorus Elite AX
Cooling Enermax Liqmax III 360mm
Memory Teamgroup DDR5 CL30 6000Mhz 32GB
Video Card(s) Gigabyte 4090
Storage Silicon Power XS70, Corsair T700
Display(s) BenQ EX2710Q, BenQEX270M
Case NZXT H7 Flow
Audio Device(s) AudioTechnica M50xBT
Power Supply SuperFlower Leadex III 850W
Wonder if TPU will do their own comparison

 
Joined
Dec 3, 2014
Messages
348 (0.09/day)
Location
Marabá - Pará - Brazil
System Name KarymidoN TitaN
Processor AMD Ryzen 7 5700X
Motherboard ASUS TUF X570
Cooling Custom Watercooling Loop
Memory 2x Kingston FURY RGB 16gb @ 3200mhz 18-20-20-39
Video Card(s) MSI GTX 1070 GAMING X 8GB
Storage Kingston NV2 1TB| 4TB HDD
Display(s) 4X 1080P LG Monitors
Case Aigo Darkflash DLX 4000 MESH
Power Supply Corsair TX 600
Mouse Logitech G300S
Joined
Apr 13, 2023
Messages
317 (0.49/day)
System Name Can it run Warhammer 3?
Processor 7800X3D @ 5Ghz
Motherboard Gigabyte B650 Aorus Elite AX
Cooling Enermax Liqmax III 360mm
Memory Teamgroup DDR5 CL30 6000Mhz 32GB
Video Card(s) Gigabyte 4090
Storage Silicon Power XS70, Corsair T700
Display(s) BenQ EX2710Q, BenQEX270M
Case NZXT H7 Flow
Audio Device(s) AudioTechnica M50xBT
Power Supply SuperFlower Leadex III 850W
Top