• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Intel Raptor Lake Processor with 34 P-Cores Spotted

Joined
Dec 17, 2020
Messages
141 (0.10/day)
No. I hope you remember what Phi was?
I still have one squired away.
It was the surviving part from the Sandy Bridge E5-2687 machine that I burned out designing FIR filters.


Except for having to write my own code,
what part of throwing lots of vector operations at lots of cores is different?
Or am I being too general?
 
Joined
May 2, 2017
Messages
7,762 (2.78/day)
Location
Back in Norway
System Name Hotbox
Processor AMD Ryzen 7 5800X, 110/95/110, PBO +150Mhz, CO -7,-7,-20(x6),
Motherboard ASRock Phantom Gaming B550 ITX/ax
Cooling LOBO + Laing DDC 1T Plus PWM + Corsair XR5 280mm + 2x Arctic P14
Memory 32GB G.Skill FlareX 3200c14 @3800c15
Video Card(s) PowerColor Radeon 6900XT Liquid Devil Ultimate, UC@2250MHz max @~200W
Storage 2TB Adata SX8200 Pro
Display(s) Dell U2711 main, AOC 24P2C secondary
Case SSUPD Meshlicious
Audio Device(s) Optoma Nuforce μDAC 3
Power Supply Corsair SF750 Platinum
Mouse Logitech G603
Keyboard Keychron K3/Cooler Master MasterKeys Pro M w/DSA profile caps
Software Windows 10 Pro
Are you willing to do a benchmarking experiment on your 5800X? I can't , my i5-6600K has had HT disabled forever by its creator. You'd need to run two instances of a single-threaded benchmarking program such as Super Pi at the same time, one pinned (via affinity settings) to virtual CPU 0, the other to virtual CPU 1. Or any pair that belong to the same core. What results do you get, is one instance slower than the other?

I can't find any technical documents or discussion or benchmarking results that would confirm that there exists a "main thread" with higher priority and a "HT thread" with lower priority, so that the main thread would never be slowed down substantially, but the HT thread would "take whatever remains" of execution units and run very slowly. AnandTech has had some great articles on HT since 2002, with this one being the most recent, and there's no mention of the two threads being unequal.

The OS scheduled clearly knows the consequences of dispatching threads to fewer physical cores (preferring HT) vs. more physical cores (preferring no HT), but this is a different matter.
Unfortunately I'm not at home these days, so I don't have access to my main system, and all I've got where I am right now is an old i5-4670S. Otherwise I would definitely test this. I'm reasonably sure the Windows scheduler actively treats HT threads as "lower performance" somehow and mainly allocates lighter loads to these unless fully saturated with high performance tasks - but unfortunately I'm not able to test it until I can go back to Sweden and get my stuff moved to where I am now, which will take a while.
 
Joined
Jun 26, 2022
Messages
240 (0.26/day)
Processor 7950X, PBO CO -15
Motherboard Gigabyte X670 AORUS Elite AX (rev. 1.0)
Cooling EVGA CLC 360 w/Arctic P12 PWM PST A-RGB fans
Memory 64GB G.Skill Trident Z5 RGB F5-6000J3040G32GA2-TZ5RK
Video Card(s) ASUS TUF Gaming GeForce RTX 3070
Storage 970 EVO Plus 2TB x2, 970 EVO 1TB; SATA: 850 EVO 500GB (HDD cache), HDDs: 6TB Seagate, 1TB Samsung
Display(s) ASUS 32" 165Hz IPS (VG32AQL1A), ASUS 27" 144Hz TN (MG278Q)
Case Corsair 4000D Airflow
Audio Device(s) Razer BlackShark V2 Pro
Power Supply Corsair RM1000x
Mouse Logitech M720
Keyboard G.Skill KM780R MX
Software Win10 Pro, PrimoCache, VMware Workstation Pro 16
What do you mean here? When two threads are running on the same core, they have the same performance. The OS cannot change that because a core has no concept of thread priority. Here is some discussion about that.

As you can see in "slide2" image, after you get past the 8th thread on 12900K where it starts to use hyper threading, the 7950x starts beating the snot out of it. Then the 7950x gets to it's 15th/16th thread and stops leaving it in the dust as it has to start using HT. At the 24th thread, the 12900k runs out of E cores and the 7950x continues its lead.
 

Attachments

  • slide1.PNG
    slide1.PNG
    249.1 KB · Views: 44
  • slide2.PNG
    slide2.PNG
    173.1 KB · Views: 39
Joined
Sep 15, 2011
Messages
6,772 (1.40/day)
Processor Intel® Core™ i7-13700K
Motherboard Gigabyte Z790 Aorus Elite AX
Cooling Noctua NH-D15
Memory 32GB(2x16) DDR5@6600MHz G-Skill Trident Z5
Video Card(s) ZOTAC GAMING GeForce RTX 3080 AMP Holo
Storage 2TB SK Platinum P41 SSD + 4TB SanDisk Ultra SSD + 500GB Samsung 840 EVO SSD
Display(s) Acer Predator X34 3440x1440@100Hz G-Sync
Case NZXT PHANTOM410-BK
Audio Device(s) Creative X-Fi Titanium PCIe
Power Supply Corsair 850W
Mouse Logitech Hero G502 SE
Software Windows 11 Pro - 64bit
Benchmark Scores 30FPS in NFS:Rivals
Games wont ever need more than 12 cores/threads, ever. Many game devs are still struggling with using more than 4 cores.
Latest Unreal, Frostbite and Unity engines can all use up to 32 Cores/Threads. Just FYI.
 
Joined
Jul 16, 2014
Messages
8,220 (2.15/day)
Location
SE Michigan
System Name Dumbass
Processor AMD Ryzen 7800X3D
Motherboard ASUS TUF gaming B650
Cooling Artic Liquid Freezer 2 - 420mm
Memory G.Skill Sniper 32gb DDR5 6000
Video Card(s) GreenTeam 4070 ti super 16gb
Storage Samsung EVO 500gb & 1Tb, 2tb HDD, 500gb WD Black
Display(s) 1x Nixeus NX_EDG27, 2x Dell S2440L (16:9)
Case Phanteks Enthoo Primo w/8 140mm SP Fans
Audio Device(s) onboard (realtek?) - SPKRS:Logitech Z623 200w 2.1
Power Supply Corsair HX1000i
Mouse Steeseries Esports Wireless
Keyboard Corsair K100
Software windows 10 H
Benchmark Scores https://i.imgur.com/aoz3vWY.jpg?2
Latest Unreal, Frostbite and Unity engines can all use up to 32 Cores/Threads. Just FYI.
Unity I knew used up to 16 but it still needs to coded for anything more. None of them by default, but that could have changed in this last year.
 
Joined
Jan 3, 2021
Messages
3,623 (2.49/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
As you can see in "slide2" image, after you get past the 8th thread on 12900K where it starts to use hyper threading, the 7950x starts beating the snot out of it. Then the 7950x gets to it's 15th/16th thread and stops leaving it in the dust as it has to start using HT. At the 24th thread, the 12900k runs out of E cores and the 7950x continues its lead.
Thanks, it's just what I've been looking for. Although the 12900K starts using E-cores before HT. Intel revealed the formula P > E > HT (performance wise), and it's pretty logical, as an E-core has about 60-65% of performance of a P-core without HT. Difference due to HT is less.

Looking just at 12900K results: if I normalize the 8T performance to 100%, next 8 threads (E, not HT) add 45% more, and then next 8 threads (HT) add another 37%.
Looking just at 7950X results: if I normalize the 8T performance to 100%, next 8 threads (not HT) add 80% more, and then next 8 threads (HT) add another 41%.

+45% seems little for E-cores but even on AMD it doesn't scale to +100%, there must be some contention elsewhere in both systems (cache, RAM, boost limits, I don't know).
+37 is quite close to +41%, indicating a similar efficency of HT. Very good actually, and if there was some contention before, there's more now. From what I've read and understood, the effect of HT is the greatest if the two threads are doing very different things, thus competing less for the same execution units. But in MT benchmarks, all threads are given exactly the same code to crunch, so it's a bad case for HT to show its strength.

But there's a but. MT benchmarks just record the total performance of all running threads. If one runs slower than the other on the same core, it won't show anywhere. So these results don't either confirm or deny what I stated before: that both threads on the same core run equally fast (and each is about 30-35% slower than a thread with no HT).
 
Joined
Jun 26, 2022
Messages
240 (0.26/day)
Processor 7950X, PBO CO -15
Motherboard Gigabyte X670 AORUS Elite AX (rev. 1.0)
Cooling EVGA CLC 360 w/Arctic P12 PWM PST A-RGB fans
Memory 64GB G.Skill Trident Z5 RGB F5-6000J3040G32GA2-TZ5RK
Video Card(s) ASUS TUF Gaming GeForce RTX 3070
Storage 970 EVO Plus 2TB x2, 970 EVO 1TB; SATA: 850 EVO 500GB (HDD cache), HDDs: 6TB Seagate, 1TB Samsung
Display(s) ASUS 32" 165Hz IPS (VG32AQL1A), ASUS 27" 144Hz TN (MG278Q)
Case Corsair 4000D Airflow
Audio Device(s) Razer BlackShark V2 Pro
Power Supply Corsair RM1000x
Mouse Logitech M720
Keyboard G.Skill KM780R MX
Software Win10 Pro, PrimoCache, VMware Workstation Pro 16
Although the 12900K starts using E-cores before HT. Intel revealed the formula P > E > HT (performance wise),
If that's the case then it does indeed look like the HT threads are very close in performance to the primary thread under this workload.
 
Joined
Jan 3, 2021
Messages
3,623 (2.49/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
If that's the case then it does indeed look like the HT threads are very close in performance to the primary thread under this workload.
CB would have to keep each of its threads on the same hardware thread throughout the test, then spit out detailed results by thread, then we'd know much more.
 
Top