Intel Meteor Lake Could Bring Back L4 Caches

AleksandarK · Apr 12, 2023

In the latest Linux Kernel patches, Intel engineers are submitting initial support for Meteor Lake processor generation, with some interesting potential features. In a patch submitted yesterday, the Intel engineer noted, "On MTL, GT can no longer allocate on LLC - only the CPU can. This, along with the addition of support for ADM/L4 cache, calls a MOCS/PAT table update." What this translates to is that starting from Meteor Lake, the integrated graphics can no longer allocate on the last-level cache (LLC), the highest numbered cache accessed by the cores before fetching from memory. Instead, only the CPU cores can allocate to it. Even more interesting is the mention of the Meteor Lake platform's level 4 (L4) cache. For the first time since Haswell and Broadwell, Intel may be planning to bring back the L4 cache and integrate it into the CPU.

Usually, modern processors use L1, L2, and L3 caches where the L1 version is the fastest and smallest, while the others are larger but slower. The inclusion of L4 caches often is unnecessary, as this type of cache can consume a big area on the processor die while bringing little benefit, translating to the cost of manufacturing drastically soaring. However, with Meteor Lake and its multi-die tile design, we wonder where the L4 cache will end up. We could see integration into the base tile, which holds the compute cores and essential compute elements. This makes the most sense since the logic needs access to fast memory, and L4 could improve the performance in specific applications.

View at TechPowerUp Main Site | Source

Daven · Apr 12, 2023

Intel could also stack cache on top like AMD but consider it level 4 instead of an extension of level 3 cache.

SOAREVERSOR · Apr 12, 2023

Daven said:
Intel could also stack cache on top like AMD but consider it level 4 instead of an extension of level 3 cache.

Given the drawbacks of AMDs solution for everything but gaming (and even then not all games can truly use it) there's little point in that approach especially if you want to sell to business, enterprise, content creators, and everyone not a gamer.

Daven · Apr 12, 2023

SOAREVERSOR said:
Given the drawbacks of AMDs solution for everything but gaming (and even then not all games can truly use it) there's little point in that approach especially if you want to sell to business, enterprise, content creators, and everyone not a gamer.

Stacking chips has nothing to do with what you said. The limited improvement you describe in some apps like games is due to more cache and the premise of the article is that Intel is adding more cache. I’m just guessing how they would add more cache given limited space on the die.

Are you arguing that the L4 cache premise of the article is not happening because its not a good solution and Intel will NOT increase cache sizes or add cache levels whether stacked or not?

Vya Domus · Apr 12, 2023

SOAREVERSOR said:
Given the drawbacks of AMDs solution for everything but gaming (and even then not all games can truly use it) there's little point in that approach especially if you want to sell to business, enterprise, content creators, and everyone not a gamer.

There is nothing wrong with the 3D v-cache designs for products targeted outside gaming, 7950X3D for example is still very much on the top of the charts in professional applications.

BoboOOZ · Apr 12, 2023

SOAREVERSOR said:
Given the drawbacks of AMDs solution for everything but gaming (and even then not all games can truly use it) there's little point in that approach especially if you want to sell to business, enterprise, content creators, and everyone not a gamer.

Cache is cache, whether you stack it vertically or place it horizontally it works just the same, and it benefits some applications more than others, as all types of optimizations. Some games benefit a lot from a large cache, but not all do, and some other applications also benefit, but not all do.

hs4 · Apr 12, 2023

Read the Chips and Cheese's article titled "Hot Chips 34 – Intel’s Meteor Lake Chiplets, Compared to AMD’s". It was expected that new kind of chache would be placed in the Meteor lake because iGPU no longer share the ring bus and L3 with other CPU cores as a result of division of tiles. In other words, this is forced.

As of Hot Chips 34 last summer, Intel had not been decided what would be placed on the base tile. So that many people thought it would be a waste to make it just an interposer, and there have been many predictions that a new cache would be placed here. However, Foveros with solder ball bonding are not fast enough to place L3, so their use will be quite limited (V-Cache is bonded with a copper pillar that is a generation ahead of solder ball, and Intel will not make it available until later this year.) The base tile is 22FFL, so cache density will also be an issue.

Instead, the new cache to be placed in the GPU tile may be treated as L4. Also, since the media slice is presumed to be on the SoC tile, we cannot rule out the possibility that L4 is also on the SoC tile.

AMD's 780M has low performance for the number of CUs and DDR5 seems to be the bottleneck, caches like Infinity Cache will solve that to some extent.

After next year, Foveros Direct could get fast enough to join L3, so Intel could get a virtually free VCache on the base tile. However, I do not expect that to happen with Arrow lake.

Daven · Apr 12, 2023

hs4 said:
Read the Chips and Cheese's article titled "Hot Chips 34 – Intel’s Meteor Lake Chiplets, Compared to AMD’s". It was expected that new kind of chache would be placed in the Meteor lake because iGPU no longer share the ring bus and L3 with other CPU cores as a result of division of tiles. In other words, this is forced.

As of Hot Chips 34 last summer, Intel had not been decided what would be placed on the base tile. So that many people thought it would be a waste to make it just an interposer, and there have been many predictions that a new cache would be placed here. However, Foveros with solder ball bonding are not fast enough to place L3, so their use will be quite limited (V-Cache is bonded with a copper pillar that is a generation ahead of solder ball, and Intel will not make it available until later this year.) The base tile is 22FFL, so cache density will also be an issue.

Instead, the new cache to be placed in the GPU tile may be treated as L4. Also, since the media slice is presumed to be on the SoC tile, we cannot rule out the possibility that L4 is also on the SoC tile.

AMD's 780M has low performance for the number of CUs and DDR5 seems to be the bottleneck, caches like Infinity Cache will solve that to some extent.

After next year, Foveros Direct could get fast enough to join L3, so Intel could get a virtually free VCache on the base tile. However, I do not expect that to happen with Arrow lake.

Now that’s the most informative comment I have ever read on the internet. Thanks for the info!

Minus Infinity · Apr 13, 2023

Vya Domus said:
There is nothing wrong with the 3D v-cache designs for products targeted outside gaming, 7950X3D for example is still very much on the top of the charts in professional applications.

In a few apps like cryptography and some video encoding etc. It's still behind in most cases of relevance to normal users unless running in PBO max mode. For the 7950X I could care less about the improved fps in a few games, but the efficiency is really good and probably worth the loss in productivity scores overall.

R0H1T · Apr 13, 2023

SOAREVERSOR said:
Given the drawbacks of AMDs solution for everything but gaming (and even then not all games can truly use it) there's little point in that approach especially if you want to sell to business, enterprise, content creators, and everyone not a gamer.

What drawbacks? Azure was the first one to employ these chips & they work great in DC/HPC just as well :rolleyes:

System Name	Good enough
Processor	AMD Ryzen R9 7900 - Alphacool Eisblock XPX Aurora Edge
Motherboard	ASRock B650 Pro RS
Cooling	2x 360mm NexXxoS ST30 X-Flow, 1x 360mm NexXxoS ST30, 1x 240mm NexXxoS ST30
Memory	32GB - FURY Beast RGB 5600 Mhz
Video Card(s)	Sapphire RX 7900 XT - Alphacool Eisblock Aurora
Storage	1x Kingston KC3000 1TB 1x Kingston A2000 1TB, 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s)	LG UltraGear 32GN650-B + 4K Samsung TV
Case	Phanteks NV7
Power Supply	GPS-750C

System Name	Home
Processor	Ryzen 3600X
Motherboard	MSI Tomahawk 450 MAX
Cooling	Noctua NH-U14S
Memory	16GB Crucial Ballistix 3600 MHz DDR4 CAS 16
Video Card(s)	MSI RX 5700XT EVOKE OC
Storage	Samsung 970 PRO 512 GB
Display(s)	ASUS VA326HR + MSI Optix G24C4
Case	MSI - MAG Forge 100M
Power Supply	Aerocool Lux RGB M 650W

Intel Meteor Lake Could Bring Back L4 Caches

AleksandarK

News Editor

Daven

SOAREVERSOR

Daven

Vya Domus

BoboOOZ

hs4

Daven

Minus Infinity

R0H1T

AMD Releases Milan-X CPUs With 3D V-Cache: EPYC 7003 Up to 64 Cores and 768 MB L3 Cache

Intel Meteor Lake Could Bring Back L4 Caches

News Editor

AMD Releases Milan-X CPUs With 3D V-Cache: EPYC 7003 Up to 64 Cores and 768 MB L3 Cache​

AMD Releases Milan-X CPUs With 3D V-Cache: EPYC 7003 Up to 64 Cores and 768 MB L3 Cache