Monday, July 1st 2024

Intel "Arrow Lake-S" to See a Rearrangement of P-cores and E-cores Along the Ringbus

Intel's first three generations of client processors implementing hybrid CPU cores, namely "Alder Lake," "Raptor Lake," and "Meteor Lake," have them arranged along a ringbus, sharing an L3 cache. This usually sees the larger P-cores to one region of the die, and the E-core clusters to the other region. From the perspective of the bidirectional ringbus, the ring-stops would follow the order: one half of the P-cores, one half of the E-core clusters, iGPU, the other half of E-cores, the other half of the P-cores, and the Uncore, as shown in the "Raptor Lake" die-shot, below. Intel plans to rearrange the P-cores and E-core clusters in "Arrow Lake-S."

With "Arrow Lake," Intel plans to disperse the E-core clusters between the P-cores. This would see a P-core followed by an E-core cluster, followed by two P-cores, and then another E-core cluster, then a lone P-core, and a repeat of this pattern. Kepler_L2 illustrated what "Raptor Lake" would have looked like, had Intel applied this arrangement on it. Dispersing the E-core clusters among the P-cores has two possible advantages. For one, the average latency between a P-core ring-stop and an E-core cluster ring-stop would reduce; and secondly, there will also be certain thermal advantages, particularly when gaming, as it reduces the concentration of heat in a region of the die.
Every P-core would be no more than one ring-stop away from an E-core cluster, which should benefit migration of threads between the two core types. Thread Director prefers E-cores, and when a workload overwhelms an E-core, it is graduated to a P-core. This E-core to P-core migration should see reduced latencies under the new arrangement.
Source: Kepler_L2 (Twitter)
Add your own comment

100 Comments on Intel "Arrow Lake-S" to See a Rearrangement of P-cores and E-cores Along the Ringbus

#1
Daven
Are all P cores and E cores on one tile in Arrow Lake or are E cores and P cores on separate tiles?
Posted on Reply
#2
btarunr
Editor & Senior Moderator
DavenAre all P cores and E cores on one tile in Arrow Lake or are E cores and P cores on separate tiles?
From what I gather, all cores are on the same die.
Posted on Reply
#3
dgianstefani
TPU Proofreader
btarunrFrom what I gather, all cores are on the same die.
Should be nice for latency compared to Zen, fingers crossed, considering it's single chiplet for CPU cores and also on Foveros.
Posted on Reply
#4
rv8000
and secondly, there will also be certain thermal advantages, particularly when gaming, as it reduces the concentration of heat in a region of the die.
How does this provide thermal advantages? Localizing p-cores into a smaller area creates more of a hot spot/heat dense area. Less immediate surface area when transferring to the integrated heat spreader, on top of a denser process node.
Posted on Reply
#5
piloponth
Intel using this ringbus thing since Nehalem. What a shame on their R&D not to bring any lowlatency, lowenergy & low silicon budget solution till this day.
Posted on Reply
#6
P4-630
It doesn't seem intel rushed out this time around, they have clearly better thought about things.
Posted on Reply
#7
dgianstefani
TPU Proofreader
piloponthIntel using this ringbus thing since Nehalem. What a shame on their R&D not to bring any lowlatency, lowenergy & low silicon budget solution till this day.
Compared to what?
Posted on Reply
#8
P4-630
dgianstefanifingers crossed
Right!
Posted on Reply
#9
JustBenching
Was thinking about skipping arrow lake but it seems intel is going to pull another wow moment like alderlake.
Posted on Reply
#10
evernessince
dgianstefaniShould be nice for latency compared to Zen, fingers crossed, considering it's single chiplet for CPU cores and also on Foveros.
Intel's chiplets being physically closer together should also help latency. I'm not sure how the two cores on Intel's low power island will interact with the whole setup but I assume they will only be in use in non-demanding scenarios and thus won't impact any demanding app latency.

Looks pretty promising.
Posted on Reply
#11
Daven
fevgatosWas thingink about skipping arrow lake but it seems intel is going to pull another wow moment like alderlake.
Everything points to this being another architecture realignment since Alder Lake through Raptor Lake refresh hit another dead end with power budgets similar to how Intel hit a dead end with Comet Lake and had to scale back with Rocket Lake.

Lower clocks, no hyperthreading, etc should help with performance per watt but I’m not holding my breath on absolute performance being higher than a 13900k.
Posted on Reply
#12
Crazybc
I am hoping that Arrow Lake desktop works out well .I so badly want to upgrade this year my system is 10 years old and I am hoping to go Intel again but if it turns out to be a dud I'll look at AMD. I like the idea of spreading out the cores for decreased latency and not centralizing all that heat . I will be watching the forums closely and I'm waiting for sept for way more information on the upcoming Z890 platform and Arrow lake desktop.
Posted on Reply
#13
JustBenching
DavenEverything points to this being another architecture realignment since Alder Lake through Raptor Lake refresh hit another dead end with power budgets similar to how Intel hit a dead end with Comet Lake and had to scale back with Rocket Lake.

Lower clocks, no hyperthreading, etc should help with performance per watt but I’m not holding my breath on absolute performance being higher than a 13900k.
Performance per watt was already a strong point for intel. That's the only thing they really don't need to improve.
Posted on Reply
#14
usiname
fevgatosPerformance per watt was already a strong point for intel. That's the only thing they really don't need to improve.
:roll: :roll: :roll: :roll:
Posted on Reply
#15
TristanX
Intel presented ARL compute tile glimpse in their materials, and arrangement is unchanged. Ring bus is necessary, because cores must synchronize cache changes to ensure data integrity, so it is pare of cache system, not cores itself.
Posted on Reply
#16
Daven
fevgatosPerformance per watt was already a strong point for intel. That's the only thing they really don't need to improve.
Posted on Reply
#17
Denver
If the L3 cache is shared between all cores: 1.5MB cache/core... less than Lunar Lake.
Posted on Reply
#18
JustBenching
usiname:roll: :roll: :roll: :roll:
Daven
Those are not ISO power lads. Let's not go over this again. If you care about efficiency you don't run 4096 power limits, 512 amp limits etc.

The point is on the vast majority of segments Intel has a much more efficient CPU at iso power in both ST and MT workloads. It's really not even competitive. R5 vs i5, R7 vs i7 etc.

Take your 7600x, it consumes 100w in CBR23 according to the same tests. Is there any Intel CPU that isn't actually faster than your 7600x while consuming the same watts? The answer is, maybe an i3 or something. Everything else handily beats the 7600x - in both performance and efficiency.
Posted on Reply
#19
Daven
fevgatosThose are not ISO power lads. Let's not go over this again. If you care about efficiency you don't run 4096 power limits, 512 amp limits etc.

The point is on the vast majority of segments Intel has a much more efficient CPU at iso power in both ST and MT workloads. It's really not even competitive. R5 vs i5, R7 vs i7 etc.
Look we can have a discussion about tech but we cannot have a discussion about your favorite company. Everyone here knows about the power draw and real world use of these products. It’s why close to 80% of TPU readers buy AMD and Nvidia for their builds.

We all see the same numbers and 99.9999% of us know Intel’s product are hugely inefficient. So don’t forgive Intel by ignoring their faults, wish for them to do better.
Posted on Reply
#20
TumbleGeorge
fevgatosanother wow moment like alderlake.
I seem to have missed this. Was there really such a moment? I mean positive.
Posted on Reply
#21
dgianstefani
TPU Proofreader
TumbleGeorgeI seem to have missed this. Was there really such a moment? I mean positive.
12900K

12900KS

13900K


So in answer to your question, yes.

Overall the 12-13th gen were massive improvements in IPC from the previous Skylake based systems, are still faster core for core against Zen, and didn't have the core count/process issues of Rocket Lake.
Posted on Reply
#22
JustBenching
DavenLook we can have a discussion about tech but we cannot have a discussion about your favorite company. Everyone here knows about the power draw and real world use of these products. It’s why close to 80% of TPU readers buy AMD and Nvidia for their builds.

We all see the same numbers and 99.9999% of us know Intel’s product are hugely inefficient. So don’t forgive Intel by ignoring their faults, wish for them to do better.
I don't, my favorite company is the one that makes the most efficient products. Currently it's Intel - it doesn't idle at 30-40 watts and it's incredibly fast in MT at low power. Let's say a 13700k / 14700k vs a 7800x 3d, both set at the same 90w, the i7 parts will literally fly past the 7800x 3d in MT workloads, being both faster and more efficient. Same goes for the i5 13600k vs 7600x and the 7700x.

If you strictly care about out of the box efficiency then TPU has tested that too, here is a simulated T and non k intel chip. Nothing is nowhere near in efficiency.

TumbleGeorgeI seem to have missed this. Was there really such a moment? I mean positive.
Alderlake was a 40% jump in ST performance from cometlake within 15 months. It was also a huge uplift in MT performance, again during the same timespan. If you weren't impressed with that, I don't know man.
Posted on Reply
#23
Daven
fevgatosI don't, my favorite company is the one that makes the most efficient products. Currently it's Intel - it doesn't idle at 30-40 watts and it's incredibly fast in MT at low power. Let's say a 13700k / 14700k vs a 7800x 3d, both set at the same 90w, the i7 parts will literally fly past the 7800x 3d in MT workloads, being both faster and more efficient. Same goes for the i5 13600k vs 7600x and the 7700x.

If you strictly care about out of the box efficiency then TPU has tested that too, here is a simulated T and non k intel chip. Nothing is nowhere near in efficiency.




Alderlake was a 40% jump in ST performance from cometlake within 15 months. It was also a huge uplift in MT performance, again during the same timespan. If you weren't impressed with that, I don't know man.

This is from almost a year ago. It's probably close to 80% of AMD users on TPU now. It will probably hit 90% by next year. We DIYers like to buy AMD because of the efficiency and I don't see AMD letting go of this advantage anytime soon. How you see the exact opposite of what we see is mind boggling.
Posted on Reply
#24
TumbleGeorge
fevgatosAlderlake was a 40% jump in ST performance from cometlake within 15 months
Yes, but Intel did a bad job with the pre-Alder generations, I consider this to be back on the train tracks where it should have been the norm. That's why it's not a wow effect for me.
Posted on Reply
#25
P4-630
TumbleGeorgeYes, but Intel did a bad job with the pre-Alder generations
AMD's were never bad were they?... :D
Posted on Reply
Add your own comment
Dec 21st, 2024 14:51 EST change timezone

New Forum Posts

Popular Reviews

Controversial News Posts