• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

We found the Missing Performance: Zen 5 Tested with SMT Disabled

Joined
Sep 10, 2018
Messages
6,321 (2.91/day)
Location
California
System Name His & Hers
Processor R7 5800X/ R7 7950X3D Stock
Motherboard X670E Aorus Pro X/ROG Crosshair VIII Hero
Cooling Corsair h150 elite/ Corsair h115i Platinum
Memory Trident Z5 Neo 6000/ 32 GB 3200 CL14 @3800 CL16 Team T Force Nighthawk
Video Card(s) Evga FTW 3 Ultra 3080ti/ Gigabyte Gaming OC 4090
Storage lots of SSD.
Display(s) A whole bunch OLED, VA, IPS.....
Case 011 Dynamic XL/ Phanteks Evolv X
Audio Device(s) Arctis Pro + gaming Dac/ Corsair sp 2500/ Logitech G560/Samsung Q990B
Power Supply Seasonic Ultra Prime Titanium 1000w/850w
Mouse Logitech G502 Lightspeed/ Logitech G Pro Hero.
Keyboard Logitech - G915 LIGHTSPEED / Logitech G Pro
Thanks @W1zzard the amount of work you put into this is amazing.

Hats off to you!

Awesome content as always.
 
Joined
Jan 5, 2011
Messages
23 (0.00/day)
Disabling SMT to increase performance in the gaming community has been around for a long time and something I've personally done for years, however it never was this big of a gap. Disabling SMT will almost always reduce microstuttering/stuttering at the loss of a bit of average FPS. The most important distinction here however is W11. The thread scheduler in W11 was the biggest change from W10 and while Intel said you had to swap to W11 to utilize their CPUs properly, they work under W10 too.

While In a lot of cases loading up the 'best' cores the most helps for overall throughput, it's not a best case scenario for latency sensitive applications. Games are very much latency sensitive as well as throughput sensitive.

Taking this further, you'll actually find that having CPPC on also reduces performance. While it might help with one specific application that is utilizing one core, windows will over load the 'best' cores further exasperating this situation. While you can't turn CPPC off anymore in modern Ryzens specifically, you can by disabling other power management options which has pros/cons to it. CPPC is under a power management feature called PSS Support (which is a few features, but CPPC is what we want).

After disabling CPPC you'll see thread loading to be more uniform across all cores. This is important for multithreaded and even single threaded applications. So a bunch of extra threads aren't being thrown on a CPU that might be all of 5% faster, while other cores have plenty of headroom available.

CPPC/SMT/W11 thread scheduler are all things at work here and need to be tweaked.

Also going to point out using Process Lasso or other applications to 'band aid' a game to every other core is NOT the same thing as natively disabling SMT. Windows will still try to utilize the other half of the virtual core creating thread contention. There are a lot of windows processes that can't be touched, if people using lasso even bother with everything that can outside of the game.
 
Joined
Oct 31, 2022
Messages
175 (0.27/day)
I use a 5800X3D and I have SMT disabled, because Elden Ring has stutter and disabeling it reduces those by a lot.
 
Joined
Nov 15, 2020
Messages
905 (0.66/day)
System Name 1. Glasshouse 2. Odin OneEye
Processor 1. Ryzen 9 5900X (manual PBO) 2. Ryzen 9 7900X
Motherboard 1. MSI x570 Tomahawk wifi 2. Gigabyte Aorus Extreme 670E
Cooling 1. Noctua NH D15 Chromax Black 2. Custom Loop 3x360mm (60mm) rads & T30 fans/Aquacomputer NEXT w/b
Memory 1. G Skill Neo 16GBx4 (3600MHz 16/16/16/36) 2. Kingston Fury 16GBx2 DDR5 CL36
Video Card(s) 1. Asus Strix Vega 64 2. Powercolor Liquid Devil 7900XTX
Storage 1. Corsair Force MP600 (1TB) & Sabrent Rocket 4 (2TB) 2. Kingston 3000 (1TB) and Hynix p41 (2TB)
Display(s) 1. Samsung U28E590 10bit 4K@60Hz 2. LG C2 42 inch 10bit 4K@120Hz
Case 1. Corsair Crystal 570X White 2. Cooler Master HAF 700 EVO
Audio Device(s) 1. Creative Speakers 2. Built in LG monitor speakers
Power Supply 1. Corsair RM850x 2. Superflower Titanium 1600W
Mouse 1. Microsoft IntelliMouse Pro (grey) 2. Microsoft IntelliMouse Pro (black)
Keyboard Leopold High End Mechanical
Software Windows 11
These processors are clearly still a work in process.
 
Joined
Sep 15, 2011
Messages
6,600 (1.40/day)
Processor Intel® Core™ i7-13700K
Motherboard Gigabyte Z790 Aorus Elite AX
Cooling Noctua NH-D15
Memory 32GB(2x16) DDR5@6600MHz G-Skill Trident Z5
Video Card(s) ZOTAC GAMING GeForce RTX 3080 AMP Holo
Storage 2TB SK Platinum P41 SSD + 4TB SanDisk Ultra SSD + 500GB Samsung 840 EVO SSD
Display(s) Acer Predator X34 3440x1440@100Hz G-Sync
Case NZXT PHANTOM410-BK
Audio Device(s) Creative X-Fi Titanium PCIe
Power Supply Corsair 850W
Mouse Logitech Hero G502 SE
Software Windows 11 Pro - 64bit
Benchmark Scores 30FPS in NFS:Rivals
I didn't know those *X3D CPUs are so good for gaming. Where is the 9800X3D?
 
Joined
Jan 27, 2015
Messages
1,682 (0.48/day)
System Name Legion
Processor i7-12700KF
Motherboard Asus Z690-Plus TUF Gaming WiFi D5
Cooling Arctic Liquid Freezer 2 240mm AIO
Memory PNY MAKO DDR5-6000 C36-36-36-76
Video Card(s) PowerColor Hellhound 6700 XT 12GB
Storage WD SN770 512GB m.2, Samsung 980 Pro m.2 2TB
Display(s) Acer K272HUL 1440p / 34" MSI MAG341CQ 3440x1440
Case Montech Air X
Power Supply Corsair CX750M
Mouse Logitech MX Anywhere 25
Keyboard Logitech MX Keys
Software Lots
Disabling SMT to increase performance in the gaming community has been around for a long time and something I've personally done for years, however it never was this big of a gap. Disabling SMT will almost always reduce microstuttering/stuttering at the loss of a bit of average FPS. The most important distinction here however is W11. The thread scheduler in W11 was the biggest change from W10 and while Intel said you had to swap to W11 to utilize their CPUs properly, they work under W10 too.

While In a lot of cases loading up the 'best' cores the most helps for overall throughput, it's not a best case scenario for latency sensitive applications. Games are very much latency sensitive as well as throughput sensitive.

Taking this further, you'll actually find that having CPPC on also reduces performance. While it might help with one specific application that is utilizing one core, windows will over load the 'best' cores further exasperating this situation. While you can't turn CPPC off anymore in modern Ryzens specifically, you can by disabling other power management options which has pros/cons to it. CPPC is under a power management feature called PSS Support (which is a few features, but CPPC is what we want).

After disabling CPPC you'll see thread loading to be more uniform across all cores. This is important for multithreaded and even single threaded applications. So a bunch of extra threads aren't being thrown on a CPU that might be all of 5% faster, while other cores have plenty of headroom available.

CPPC/SMT/W11 thread scheduler are all things at work here and need to be tweaked.

Also going to point out using Process Lasso or other applications to 'band aid' a game to every other core is NOT the same thing as natively disabling SMT. Windows will still try to utilize the other half of the virtual core creating thread contention. There are a lot of windows processes that can't be touched, if people using lasso even bother with everything that can outside of the game.


Actually the problem here is that if you are inclined to disable SMT on a 9700X to get it up to 14700K levels of gaming performance and thus cripple its anemic productivity performance even more, then you didn't care about productivity performance in the first place and prize gaming performance above all.

And that just means you should have bought a 7800X3D or 7950X3D in the first place. They are crippled in productivity as well, but nothing matches them for gaming.

None of this helps the case for the 9700X/9600X. They just don't have a place in the market IMO.
 
Joined
Sep 10, 2018
Messages
6,321 (2.91/day)
Location
California
System Name His & Hers
Processor R7 5800X/ R7 7950X3D Stock
Motherboard X670E Aorus Pro X/ROG Crosshair VIII Hero
Cooling Corsair h150 elite/ Corsair h115i Platinum
Memory Trident Z5 Neo 6000/ 32 GB 3200 CL14 @3800 CL16 Team T Force Nighthawk
Video Card(s) Evga FTW 3 Ultra 3080ti/ Gigabyte Gaming OC 4090
Storage lots of SSD.
Display(s) A whole bunch OLED, VA, IPS.....
Case 011 Dynamic XL/ Phanteks Evolv X
Audio Device(s) Arctis Pro + gaming Dac/ Corsair sp 2500/ Logitech G560/Samsung Q990B
Power Supply Seasonic Ultra Prime Titanium 1000w/850w
Mouse Logitech G502 Lightspeed/ Logitech G Pro Hero.
Keyboard Logitech - G915 LIGHTSPEED / Logitech G Pro
I've never let windows schedule my cpus it's dumb as shite....

Been using process lasso for a while
On my 7950X3D these are my profiles

1 with cache only CCD for games
2 without smt on the cache ccd for games
3 for background process on the secondary ccd
4 for things that benefit from all core workloads.

Once setup it takes all of 1m to set a profile to a particular program.

Should anyone have to do that no but if you want the most out of your hardware it's just the reality.
 
Joined
Mar 16, 2017
Messages
1,943 (0.72/day)
Location
Tanagra
System Name Budget Box
Processor Xeon E5-2667v2
Motherboard ASUS P9X79 Pro
Cooling Some cheap tower cooler, I dunno
Memory 32GB 1866-DDR3 ECC
Video Card(s) XFX RX 5600XT
Storage WD NVME 1GB
Display(s) ASUS Pro Art 27"
Case Antec P7 Neo
I've never let windows schedule my cpus it's dumb as shite....

Been using process lasso for a while
On my 7950X3D these are my profiles

1 with cache only CCD for games
2 without smt on the cache ccd for games
3 for background process on the secondary ccd
4 for things that benefit from all core workloads.

Once setup it takes all of 1m to set a profile to a particular program.

Should anyone have to do that no but if you want the most out of your hardware it's just the reality.
You shouldn't, but let's face it, now that we're in the heterogeneous CPU world, more effort probably needs to go into this part of the tweaking process. The way CPUs boost now and essentially overclock themselves, what you're doing sounds like more the thing enthusiasts should be considering to get the best performance. Maybe someday, devs will work this "preferred CPU setup" into their products, but there's a lot of CPU variation to plan for, and it's probably not worth the trouble for the small gains most will get out it.
 
Joined
Apr 9, 2024
Messages
69 (0.52/day)
System Name Crapostrophic
Processor AMD Ryzen Z1 Extreme
Motherboard ASUS Custom PCB
Cooling Stock Asus Fan and Cooler Design
Memory 16GB of LPDDR5 running 6400mhz with tweaked timings
Video Card(s) AMD Radeon 780M APU
Storage 2TB Aorus 7300 Gen 4
Display(s) 7 Inch IPS Display @120hz
Case Plastic Shell Case designed by Asus
Audio Device(s) Asus ROG Delta
Power Supply 40WHrs, 4S1P, 4-cell Li-ion with a 65W PD Charger
Mouse Asus ROG Keris Wireless
Keyboard AKKO 3098B hotswapped to speed silver pro switches
Software Windows 11 Home (Debloated and tweaked)
AMD needs the chipset driver magic once again for scheduling games properly..lol..
 
Joined
Feb 1, 2019
Messages
3,052 (1.51/day)
Location
UK, Midlands
System Name Main PC
Processor 13700k
Motherboard Asrock Z690 Steel Legend D4 - Bios 13.02
Cooling Noctua NH-D15S
Memory 32 Gig 3200CL14
Video Card(s) 4080 RTX SUPER FE 16G
Storage 1TB 980 PRO, 2TB SN850X, 2TB DC P4600, 1TB 860 EVO, 2x 3TB WD Red, 2x 4TB WD Red
Display(s) LG 27GL850
Case Fractal Define R4
Audio Device(s) Soundblaster AE-9
Power Supply Antec HCG 750 Gold
Software Windows 10 21H2 LTSC
Games run better with SMT disabled? What!? This does not compute! :eek:

SMT uses power, more power budget with 65/88w, higher clocks, better performance?

I disable HTT on my main gaming power schema (its actually off on most of my schemas now), as games usually run better and I save power/heat. Wizzards findings dont surprise me. But I do have one gaming schema that allows HTT cores to be scheduled, so I dont need to reboot to toggle, I can do it on the fly.

The hay day for this type of tech on gaming was on quad core CPU's for 8 threaded games.

I have observed it can still help though, it helps on 3dmark CPU parts of tests, but I havent seen a game that performs better with it on yet.
 
Last edited:
Joined
Jan 14, 2019
Messages
11,019 (5.39/day)
Location
Midlands, UK
System Name Nebulon B
Processor AMD Ryzen 7 7800X3D
Motherboard MSi PRO B650M-A WiFi
Cooling be quiet! Dark Rock 4
Memory 2x 24 GB Corsair Vengeance DDR5-4800
Video Card(s) AMD Radeon RX 6750 XT 12 GB
Storage 2 TB Corsair MP600 GS, 2 TB Corsair MP600 R2, 4 + 8 TB Seagate Barracuda 3.5"
Display(s) Dell S3422DWG, 7" Waveshare touchscreen
Case Kolink Citadel Mesh black
Audio Device(s) Logitech Z333 2.1 speakers, AKG Y50 headphones
Power Supply Seasonic Prime GX-750
Mouse Logitech MX Master 2S
Keyboard Logitech G413 SE
Software Windows 10 Pro
SMT uses power, more power budget with 65/88w, higher clocks, better performance?
It does not explain the results with CPUs like the 7800X3D which runs way below its power target even in multi-threaded work.
 

W1zzard

Administrator
Staff member
Joined
May 14, 2004
Messages
27,426 (3.70/day)
Processor Ryzen 7 5700X
Memory 48 GB
Video Card(s) RTX 4080
Storage 2x HDD RAID 1, 3x M.2 NVMe
Display(s) 30" 2560x1600 + 19" 1280x1024
Software Windows 10 64-bit
I guess what I'm asking is... Why aren't we seeing the double-digit performance increases that AMD purported in many of their leaked slides? What's going on here? Is it power? Thermals? Or something else?
It's not power or thermals, these are eliminated by the "PBO max" run, also by the PBO max run in the regular review, which is SMT on. I didn't want to add yet another colored bar. Disabling SMT shows surprisingly good gains, bigger than on Zen 4, which is the basis for this article. I'm not claiming I have a fix, and blindly turning SMT off is definitely not the fix. I'm saying "here's some unexpected results, AMD and the community, please look into it"
 
Joined
Feb 1, 2019
Messages
3,052 (1.51/day)
Location
UK, Midlands
System Name Main PC
Processor 13700k
Motherboard Asrock Z690 Steel Legend D4 - Bios 13.02
Cooling Noctua NH-D15S
Memory 32 Gig 3200CL14
Video Card(s) 4080 RTX SUPER FE 16G
Storage 1TB 980 PRO, 2TB SN850X, 2TB DC P4600, 1TB 860 EVO, 2x 3TB WD Red, 2x 4TB WD Red
Display(s) LG 27GL850
Case Fractal Define R4
Audio Device(s) Soundblaster AE-9
Power Supply Antec HCG 750 Gold
Software Windows 10 21H2 LTSC
I've never let windows schedule my cpus it's dumb as shite....

Been using process lasso for a while
On my 7950X3D these are my profiles

1 with cache only CCD for games
2 without smt on the cache ccd for games
3 for background process on the secondary ccd
4 for things that benefit from all core workloads.

Once setup it takes all of 1m to set a profile to a particular program.

Should anyone have to do that no but if you want the most out of your hardware it's just the reality.
I do similar but with the free process hacker (system informer).

Here is my notes what I have discovered with CPU scheduling on Windows (10).

  1. Power schema settings control/manipulate how Windows will schedule threads. However this only applies when 'no' affinity is configured, this just doesnt mean from the admin of the machine, but software devs themselves can code in affinity in to their software.
  2. If I add a affinity in process hacker to allow use all cores (same as default in theory), it actually changes the scheduling behaviour from having no affinity configured at all, thats interesting.
  3. Any affinity configured will override the windows scheduling behaviour, so e.g. if you set the power schema settings to park half of the cores and then run a cinebench all core benchmark, it will ignore the schema and use all cores.
  4. Windows core parking is only a soft park, it doesnt actually hardware (C6) park a core, thats handled independently, soft park just will prevent threads been scheduled to the core, but of course if affinity forces use of the core, the software parking is overridden.
  5. A process that launches child processes, the child processes will by default inherit the affinity from the parent, but can be overridden with their own affinity using something like process hacker or process lasso.
  6. Back to the power schema, it isnt possible to park specific cores of your choosing, instead Windows has a kind of priority system. On my 13700k. The 2 highest priority cores are the first logical core of the 2 high clocking cores. (this only applies if C6 state is enabled, without it these cores dont clock higher). This will of course be very likely the same for AMD as that also has favoured cores that clock higher. The next cores in priority are the rest of the cores in the same performance class, first logical core only. Then after that the 2nd logical core for the highest clocking cores (HTT/SMT), and last the 2nd logical core for rest of cores in same performance class (HTT/SMT).
  7. So you could tell Windows on the power schema to only keep 50% of cores unparked, and that soft disables HTT/SMT. It will prevent most software from using the 2nd logical core, but it will be overridden by affinity overrides, software requesting use of all cores, that type of stuff, so it wont block something like an all core cinebench run.
  8. Interestingly if you have all cores unparked, the priority for assigning thread workloads is in a different order, Windows will prefer to use the 2nd logical core on the highest clocking cores before loading the first logical core on the other cores. This is the flaw I think that might be causing loss of performance issues in games and some other software with HTT/SMT enabled. Soft parking is usually enough to fix this, no need for bios disable. Affinity configuration can also of course fix it.
 
Last edited:
Joined
Dec 6, 2022
Messages
194 (0.31/day)
Location
NYC
System Name GameStation
Processor AMD R5 5600X
Motherboard Gigabyte B550
Cooling Artic Freezer II 120
Memory 16 GB
Video Card(s) Sapphire Pulse 7900 XTX
Storage 2 TB SSD
Case Cooler Master Elite 120
So, lazy game developers strike again?

Because if the issue was on AMD side, ALL programs would be affected, correct?

Hell, maybe we could check if the culprit is Windows proven-to-suck-thread scheduler.
 
Last edited:
Joined
Feb 1, 2019
Messages
3,052 (1.51/day)
Location
UK, Midlands
System Name Main PC
Processor 13700k
Motherboard Asrock Z690 Steel Legend D4 - Bios 13.02
Cooling Noctua NH-D15S
Memory 32 Gig 3200CL14
Video Card(s) 4080 RTX SUPER FE 16G
Storage 1TB 980 PRO, 2TB SN850X, 2TB DC P4600, 1TB 860 EVO, 2x 3TB WD Red, 2x 4TB WD Red
Display(s) LG 27GL850
Case Fractal Define R4
Audio Device(s) Soundblaster AE-9
Power Supply Antec HCG 750 Gold
Software Windows 10 21H2 LTSC
It's not power or thermals, these are eliminated by the "PBO max" run, also by the PBO max run in the regular review, which is SMT on. I didn't want to add yet another colored bar. Disabling SMT shows surprisingly good gains, bigger than on Zen 4, which is the basis for this article. I'm not claiming I have a fix, and blindly turning SMT off is definitely not the fix. I'm saying "here's some unexpected results, AMD and the community, please look into it"

It might be down to a flaw I discovered in the windows scheduler, I posted above, the relevant line is highlighted in bold. In short if all the logical cores are unparked, Windows prefers to load up the 2nd logical thread on the favoured physical cores before loading up the first logical cores on the other physical cores, for a workload that doesnt need loads of heavy threads it can slow things down, as loading the 2nd logical core even on a faster core will be slower than loading a logical core on a idle physical core.
 
Joined
May 3, 2018
Messages
2,683 (1.17/day)
So Intel was on to something for Arrow Lake, ditching HT...
Well, they'll also have vastly more powerful e-cores too this time around. If Intel don't fcuk up Arrow Lake, they do actually deliver on reduced power consumption, then I would take an i5, let alone i7 over a 9700X any day of the week. i5 will have 14 real cores so 14 threads vs 9700X's 8 cores and 16 threads. It will be rare those extra 2 threads will net them a win. I know most people are hyped for X3D models, but for me gaming is 25% max what I do, so unless they have degimped productivity with Zen v-cache models hard to see why one wouldn't choose Intel unless gaming is all you care about.

Still it's Intel, and who knows if Arrow lake will now even ship this year.

Missing Zen 5 performance wasn't found, there's a mere 2.5% between SMT off vs on in games and it's slower in applications.

That's only a slightly larger gain that the 7700X obtained in the charts and not nearly enough to make up for the difference between AMD's marketing claims and reality.
Yeah, I doubt TPU will have convinced Hardware Unboxed to call off the hounds LOL.

I agree, I am disappointed with Hardware Unboxed, they were saying a lot of crap about AMD products,
They didn't even wait for 9900, or 9950 to conclude that Zen5 is shit. They never read the Tomshardware or Phronix review...
Steve actually alluded to other reviews and basically dismissed their results. He's likes to jab the needle into AMD whenever he gets the chance. Maybe Lisa touched him inappropriately when he was a child.

So, maybe AMD should come out with a chip that's like the 9700X with one chiplet where it's an eight-core chiplet that has no SMT support and another chiplet that a whole bunch of Zen 5c cores to make up for the loss of SMT.
Well we can try that on Strix as it's two ccd's one with 4 Zen 5 cores and one with up to 8 Zen 5c cores and you can turn off SMT in the bios I presume. It's a shame we don't have a Strix model with 6 + 6 config to compare to 9600X at same power.
 
Joined
Nov 18, 2010
Messages
7,342 (1.46/day)
Location
Rīga, Latvia
System Name HELLSTAR
Processor AMD RYZEN 9 5950X
Motherboard ASUS Strix X570-E
Cooling 2x 360 + 280 rads. 3x Gentle Typhoons, 3x Phanteks T30, 2x TT T140 . EK-Quantum Momentum Monoblock.
Memory 4x8GB G.SKILL Trident Z RGB F4-4133C19D-16GTZR 14-16-12-30-44
Video Card(s) Sapphire Pulse RX 7900XTX. Water block. Crossflashed.
Storage Optane 900P[Fedora] + WD BLACK SN850X 4TB + 750 EVO 500GB + 1TB 980PRO+SN560 1TB(W11)
Display(s) Philips PHL BDM3270 + Acer XV242Y
Case Lian Li O11 Dynamic EVO
Audio Device(s) SMSL RAW-MDA1 DAC
Power Supply Fractal Design Newton R3 1000W
Mouse Razer Basilisk
Keyboard Razer BlackWidow V3 - Yellow Switch
Software FEDORA 41
As many already stated... is it AMD fault here? Look at the Intel patch fix, and why Hitman takes a toll, shit code, single thread, what multicore and SMT/HT etc?

Michael did benchmarks with EPYC 128 core CPU's where actually those things if they do good or harm are seen more, not this children play and some toy benchmarks. It had mixed results, but at tasks when it does take advantage, basically is coded right, it is useful and for 128 core EPYC 9754 it ate 15W ~ 3% more when enabled.

Okay the recent review with the Ryzen AI 9 HX 370 shows no penalty only gains at all, so reading this causes a headscratcher.

To be fair Linux got great amount of patches with the upcoming Zen5 AMD cpu's and the AMD-PSTATE drivers, that is enabled by default works much better in Linux, it is even a landmark, that the code is almost ready for Zen5 and even RDNA4 even before the products are out. W1z could courage up and include some Linux Proton gaming benchmarks. Now with even nvidia more proper drivers, I cannot see any toothache in doing that.

The point is, that it would clarify better where to point the finger when something really does not work as it should. I point my finger at Redmond here.
 
Joined
Feb 1, 2019
Messages
3,052 (1.51/day)
Location
UK, Midlands
System Name Main PC
Processor 13700k
Motherboard Asrock Z690 Steel Legend D4 - Bios 13.02
Cooling Noctua NH-D15S
Memory 32 Gig 3200CL14
Video Card(s) 4080 RTX SUPER FE 16G
Storage 1TB 980 PRO, 2TB SN850X, 2TB DC P4600, 1TB 860 EVO, 2x 3TB WD Red, 2x 4TB WD Red
Display(s) LG 27GL850
Case Fractal Define R4
Audio Device(s) Soundblaster AE-9
Power Supply Antec HCG 750 Gold
Software Windows 10 21H2 LTSC
I do similar but with the free process hacker (system informer).

Here is my notes what I have discovered with CPU scheduling on Windows (10).

  1. Power schema settings control/manipulate how Windows will schedule threads. However this only applies when 'no' affinity is configured, this just doesnt mean from the admin of the machine, but software devs themselves can code in affinity in to their software.
  2. If I add a affinity in process hacker to allow use all cores (same as default in theory), it actually changes the scheduling behaviour from having no affinity configured at all, thats interesting.
  3. Any affinity configured will override the windows scheduling behaviour, so e.g. if you set the power schema settings to park half of the cores and then run a cinebench all core benchmark, it will ignore the schema and use all cores.
  4. Windows core parking is only a soft park, it doesnt actually hardware (C6) park a core, thats handled independently, soft park just will prevent threads been scheduled to the core, but of course if affinity forces use of the core, the software parking is overridden.
  5. A process that launches child processes, the child processes will by default inherit the affinity from the parent, but can be overridden with their own affinity using something like process hacker or process lasso.
  6. Back to the power schema, it isnt possible to park specific cores of your choosing, instead Windows has a kind of priority system. On my 13700k. The 2 highest priority cores are the first logical core of the 2 high clocking cores. (this only applies if C6 state is enabled, without it these cores dont clock higher). This will of course be very likely the same for AMD as that also has favoured cores that clock higher. The next cores in priority are the rest of the cores in the same performance class, first logical core only. Then after that the 2nd logical core for the highest clocking cores (HTT/SMT), and last the 2nd logical core for rest of cores in same performance class (HTT/SMT).
  7. So you could tell Windows on the power schema to only keep 50% of cores unparked, and that soft disables HTT/SMT. It will prevent most software from using the 2nd logical core, but it will be overridden by affinity overrides, software requesting use of all cores, that type of stuff, so it wont block something like an all core cinebench run.
  8. Interestingly if you have all cores unparked, the priority for assigning thread workloads is in a different order, Windows will prefer to use the 2nd logical core on the highest clocking cores before loading the first logical core on the other cores. This is the flaw I think that might be causing loss of performance issues in games and some other software with HTT/SMT enabled. Soft parking is usually enough to fix this, no need for bios disable. Affinity configuration can also of course fix it.
An additional note I forgot to add. Possibly relevant to gaming so will post it.

I mentioned cinebench effectively forces all cores to be used by applying some kind of all core affinity when the test is started, However the CPU tests in 3d mark like the physics test and the other test in one of the other benchies just set a load of threads, but dont apply any kind of affinity so e.g. if you run a 3d mark bench, and have soft parked all the HTT logical cores, then it will affect the bench how it behaves and the final score (it will not utilise those logical cores), this is representative of how most games run, they seem to rely on Windows itself to schedule the threads. (at least the games I play and have tested).
 

doc7000

New Member
Joined
Mar 6, 2022
Messages
22 (0.02/day)
It does not explain the results with CPUs like the 7800X3D which runs way below its power target even in multi-threaded work.

I feel like the low power usage of the X3D parts was explained some time ago, they use power differently to protect their stacked cache giving them lower power use.
 
Joined
Apr 19, 2018
Messages
1,220 (0.53/day)
Processor AMD Ryzen 9 5950X
Motherboard Asus ROG Crosshair VIII Hero WiFi
Cooling Arctic Liquid Freezer II 420
Memory 32Gb G-Skill Trident Z Neo @3806MHz C14
Video Card(s) MSI GeForce RTX2070
Storage Seagate FireCuda 530 1TB
Display(s) Samsung G9 49" Curved Ultrawide
Case Cooler Master Cosmos
Audio Device(s) O2 USB Headphone AMP
Power Supply Corsair HX850i
Mouse Logitech G502
Keyboard Cherry MX
Software Windows 11
This basically just shows you how stupid the Windows scheduler actually is. It makes no sense to assign a heavy workload to a fully occupied physical core's virtual core. Microsoft should know and detect the difference between a physical core and a virtual one. They should at least make this an option in the power settings or something.

I can't help but wonder if all this anti-SMT stuff is a result of Intel's push to remove SMT from their CPU's, and Microsoft is deliberately nerfing performance to help make a case in the minds of consumers to get rid of it.

But another thing wouldn't surprise me, AMD knows their architecture is cache starved, and that enabling SMT also puts more pressure on the tiny L2 cache. 1MB is a joke.
 
Last edited:
Joined
Jul 9, 2021
Messages
36 (0.03/day)
I’ve searched for cpu lasso amd gaming benchmarks and I couldn’t find any. If it’s so good, amd should give a free voucher to buy a lifetime licence. And where is a windows threadt director for amd? They should also encourage retailers to sell linux usb stick combo because of superior performance.
What I can confirm working is setting power plan to ultimate performance, set priority maximum and for old games like ac ii disable core zero then re-enable.
Bottomline is with same settings, IPC isn’t great for average Joe on windows 10.
 
Joined
Dec 16, 2021
Messages
214 (0.22/day)
Location
Denmark
Processor AMD Ryzen 7 3800X
Motherboard ASUS Prime X470-Pro
Cooling bequiet! Dark Rock Slim
Memory 64 GB ECC DDR4 2666 MHz (Samsung M391A2K43BB1-CTD)
Video Card(s) eVGA GTX 1080 SC Gaming, 8 GB
Storage 1 TB Samsung 970 EVO Plus, 1 TB Samsung 850 EVO, 4 TB Lexar NM790, 12 TB WD HDDs
Display(s) Acer Predator XB271HU
Case Corsair Obsidian 550D
Audio Device(s) Creative X-Fi Fatal1ty
Power Supply Seasonic X-Series 560W
Mouse Logitech G502
Keyboard Glorious GMMK
I wonder if Microsoft will use this opportunity to release Windows 12. After all, they pretty much did the same thing with Windows 11 when Intel's Alder Lake architecture couldn't run properly on Windows 10. Naaaaaah....
 

rodrigorras

New Member
Joined
Aug 10, 2024
Messages
3 (0.33/day)
The 7800x3d only uses 88w on all-core workloads via GN. Same as the non-x 7000 chips and the 9000X chips we have so far.

You didn't find the missing performance, you found meaningless 2% which will probably be an issue with the Windows scheduler and Agesa.
Windows 11 scheduler was tweaked for Intel thread director. It's a piece of SHIT...
 
Joined
May 3, 2019
Messages
169 (0.09/day)
I really don't understand some of you. This is the same as with the zen1 and zen 2 when you disable smt you gain a few %

It's just reaching at this point. Instead of admitting it's a shite product you keep on trying to prove it's not.
 
Joined
Jul 9, 2021
Messages
36 (0.03/day)
I wonder if Microsoft will use this opportunity to release Windows 12. After all, they pretty much did the same thing with Windows 11 when Intel's Alder Lake architecture couldn't run properly on Windows 10. Naaaaaah....
Because Intel cooperated with Microsoft both on windows 10 and 11. This is why win10 is great.Now isn’t the case at Intel, because they fired the people who did this, even after got lots of money from us chip acts. And we didn’t saw news of ms-amd cooperations.
 
Top