• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Intel Meteor Lake Technical Deep Dive

W1zzard

Administrator
Staff member
Joined
May 14, 2004
Messages
27,833 (3.71/day)
Processor Ryzen 7 5700X
Memory 48 GB
Video Card(s) RTX 4080
Storage 2x HDD RAID 1, 3x M.2 NVMe
Display(s) 30" 2560x1600 + 19" 1280x1024
Software Windows 10 64-bit
Today Intel is taking the wraps off their Meteor Lake Architecture. Our tech preview tells you everything you need to know about Intel's new ideas that will power the company's processors for years to come. Just like AMD, Intel is betting on chiplets, which combine multiple silicon dies into a single CPU to build faster, more-energy efficient designs that are cheaper to manufacture.

Show full review
 
Joined
Oct 6, 2021
Messages
1,605 (1.40/day)
Wouldn't Meteor Lake be focused on laptops? I don't understand the reason for comparing the iGPU with the desktop version which only has 256shaders (768shaders on the laptop). Bigger numbers to show investors, of course...

1695140710377.png
 
Joined
Nov 26, 2021
Messages
1,648 (1.51/day)
Location
Mississauga, Canada
Processor Ryzen 7 5700X
Motherboard ASUS TUF Gaming X570-PRO (WiFi 6)
Cooling Noctua NH-C14S (two fans)
Memory 2x16GB DDR4 3200
Video Card(s) Reference Vega 64
Storage Intel 665p 1TB, WD Black SN850X 2TB, Crucial MX300 1TB SATA, Samsung 830 256 GB SATA
Display(s) Nixeus NX-EDG27, and Samsung S23A700
Case Fractal Design R5
Power Supply Seasonic PRIME TITANIUM 850W
Mouse Logitech
VR HMD Oculus Rift
Software Windows 11 Pro, and Ubuntu 20.04
I like the low power island which clocks the E cores more sensibly. I was expecting them to use smaller cores than their fairly large Crestmont cores for the low power island. Let's see when it starts shipping. The claims for power efficiency are lower than I expected: more than 20% compared to Raptor Lake mobile. Intel's presentation was rather light on microarchitectural details. The only hard numbers were cache sizes: I cache for the P core has doubled to 64 KB. Other caches stay at the same size as Raptor Lake. It's also interesting that they have opted to use TSMC's N5 rather than their own process for the GPU tile.
 
Joined
Aug 12, 2022
Messages
248 (0.30/day)
Maybe there was more buried in the slides but I feel like very little new information was presented here. I had heard that there was a 6P+8E core configuration plus 2 e-cores in the SoC that would only be used if the CPU tile was powered down. But now I know it's 4 cores on the SoC and they're available for use even when the CPU tile is active. Maybe this was in the slides but the article only said that the GPU tile used something from TSMC which is better than what Arc uses, which implies N5, N4, or N3 but still nothing more specific, which pretty much confirms what we already surmised. The article says the interposer has some logic, but what is that logic? Is it the rumored L4 cache? And what about the ISA, is AVX-512 supported?
 
  • Like
Reactions: bug
Joined
Nov 26, 2021
Messages
1,648 (1.51/day)
Location
Mississauga, Canada
Processor Ryzen 7 5700X
Motherboard ASUS TUF Gaming X570-PRO (WiFi 6)
Cooling Noctua NH-C14S (two fans)
Memory 2x16GB DDR4 3200
Video Card(s) Reference Vega 64
Storage Intel 665p 1TB, WD Black SN850X 2TB, Crucial MX300 1TB SATA, Samsung 830 256 GB SATA
Display(s) Nixeus NX-EDG27, and Samsung S23A700
Case Fractal Design R5
Power Supply Seasonic PRIME TITANIUM 850W
Mouse Logitech
VR HMD Oculus Rift
Software Windows 11 Pro, and Ubuntu 20.04
Maybe there was more buried in the slides but I feel like very little new information was presented here. I had heard that there was a 6P+8E core configuration plus 2 e-cores in the SoC that would only be used if the CPU tile was powered down. But now I know it's 4 cores on the SoC and they're available for use even when the CPU tile is active. Maybe this was in the slides but the article only said that the GPU tile used something from TSMC which is better than what Arc uses, which implies N5, N4, or N3 but still nothing more specific, which pretty much confirms what we already surmised. The article says the interposer has some logic, but what is that logic? Is it the rumored L4 cache? And what about the ISA, is AVX-512 supported?
They only claimed VNNI which suggests 256 bit AVX.
 
Joined
Apr 12, 2013
Messages
7,528 (1.77/day)
AVX 512 is practically dead for Intel, at least post TGL till now. Don't expect anything like that till AMD also moves to AVX 512 minus the double pumping.
 
Joined
Nov 26, 2021
Messages
1,648 (1.51/day)
Location
Mississauga, Canada
Processor Ryzen 7 5700X
Motherboard ASUS TUF Gaming X570-PRO (WiFi 6)
Cooling Noctua NH-C14S (two fans)
Memory 2x16GB DDR4 3200
Video Card(s) Reference Vega 64
Storage Intel 665p 1TB, WD Black SN850X 2TB, Crucial MX300 1TB SATA, Samsung 830 256 GB SATA
Display(s) Nixeus NX-EDG27, and Samsung S23A700
Case Fractal Design R5
Power Supply Seasonic PRIME TITANIUM 850W
Mouse Logitech
VR HMD Oculus Rift
Software Windows 11 Pro, and Ubuntu 20.04
AVX 512 is practically dead for Intel, at least post TGL till now. Don't expect anything like that till AMD also moves to AVX 512 minus the double pumping.
Zen 4 has AVX-512 which means that for the foreseeable future, its successors will support it as well.
 
Joined
Aug 12, 2022
Messages
248 (0.30/day)
I'm not really sure what it is for, except I've heard some AI workloads can benefit from it. My reason for wondering is because Golden/Raptor Cove includes it, but Gracemont does not. So I'm wondering if Redwood Cove and Crestmont were actually designed with the same ISA instead of non-shared instructions being disabled.

From Anand Tech, it sounds like the GPU tile is made with TSMC N5 and the SoC with TSMC N6.

I wonder today if Intel's new reliance on TSMC has to do with node optimization. Intel's newest nodes get used first for CPUs, so they're frequency-optimized. But TSMC favors density-optimized nodes because their early adopters make smartphone and graphics processors. So even if Intel and TSMC were keeping pace with one another, it'd still make more sense to build the CPU tile at Intel and the GPU and SoC tiles at TSMC. (I believe both companies tend to make alternate versions of their nodes that are optimized differently, but those come later and might be a little more expensive.)

Anand Tech also said that the Crestmont cores in the SoC are optimized with a lower voltage-frequency curve, perhaps that's because the TSMC N6 process they're built with is density-optimized?

Source:
 
Last edited:
Joined
Nov 26, 2021
Messages
1,648 (1.51/day)
Location
Mississauga, Canada
Processor Ryzen 7 5700X
Motherboard ASUS TUF Gaming X570-PRO (WiFi 6)
Cooling Noctua NH-C14S (two fans)
Memory 2x16GB DDR4 3200
Video Card(s) Reference Vega 64
Storage Intel 665p 1TB, WD Black SN850X 2TB, Crucial MX300 1TB SATA, Samsung 830 256 GB SATA
Display(s) Nixeus NX-EDG27, and Samsung S23A700
Case Fractal Design R5
Power Supply Seasonic PRIME TITANIUM 850W
Mouse Logitech
VR HMD Oculus Rift
Software Windows 11 Pro, and Ubuntu 20.04
I'm not really sure what it is for, except I've heard some AI workloads can benefit from it. My reason for wondering is because Golden/Raptor Cove includes it, but Gracemont does not. So I'm wondering if Redwood Cove and Crestmont were actually designed with the same ISA instead of non-shared instructions being disabled.

From Anand Tech, it sounds like the GPU tile is made with TSMC N5 and the SoC with TSMC N6.

I wonder today if Intel's new reliance on TSMC has to do with node optimization. Intel's newest nodes get used first for CPUs, so they're frequency-optimized. But TSMC favors density-optimized nodes because their early adopters make smartphone and graphics processors. So even if Intel and TSMC were keeping pace with one another, it'd still make more sense to build the CPU tile at Intel and the GPU and SoC tiles at TSMC. (I believe both companies tend to make alternate versions of their nodes that are optimized differently, but those come later and might be a little more expensive.)

Anand Tech also said that the Crestmont cores in the SoC are optimized with a lower voltage-frequency curve, perhaps that's because the TSMC N6 process they're built with is density-optimized?

Source:
I suspect you're right about N5 being denser than Intel 4 for GPUs which don't need them to clock very high. As far as VNNI is concerned, it's useful for algorithms used for AI. I'm quoting the relevant part from the link in the previous sentence
Platforms not using VNNI require the vpmaddubsw, vpmaddwd and vpaddd instructions to complete the multiply-accumulate operations in INT8 convolution operation:

DL int-8


Platforms using VNNI require only one instruction, “vpdpbusd”, to complete the INT8 convolution operation:

DL int-8
 
Joined
May 25, 2022
Messages
117 (0.13/day)
Wouldn't Meteor Lake be focused on laptops? I don't understand the reason for comparing the iGPU with the desktop version which only has 256shaders (768shaders on the laptop). Bigger numbers to show investors, of course...

View attachment 314302
Not sure where you are getting that it's desktop Raptorlake. It's a 1:1 comparison against mobile Raptorlake. Or did you forget Raptorlake is also in mobile?

I like the low power island which clocks the E cores more sensibly. I was expecting them to use smaller cores than their fairly large Crestmont cores for the low power island. Let's see when it starts shipping. The claims for power efficiency are lower than I expected: more than 20% compared to Raptor Lake mobile. Intel's presentation was rather light on microarchitectural details. The only hard numbers were cache sizes: I cache for the P core has doubled to 64 KB.
That's is just for the process, not Meteorlake. Also, the reason they were light on uarch details is that neither P nor the E cores advance that much. It's basically:

P: Doubles L1i cache to 64KB
E: rename/allocate goes from 5 to 6

Anand Tech also said that the Crestmont cores in the SoC are optimized with a lower voltage-frequency curve, perhaps that's because the TSMC N6 process they're built with is density-optimized?
We always knew that. Intel 4 for Compute, TSMC N5 for GPU and N6 for IO and SoC.

The LP E cores in the SoC are just optimized for lower power, nothing to do with the process.
 
Last edited:
Joined
Aug 12, 2019
Messages
2,180 (1.13/day)
Location
LV-426
System Name Custom
Processor i9 9900k
Motherboard Gigabyte Z390 arous master
Cooling corsair h150i
Memory 4x8 3200mhz corsair
Video Card(s) Galax RTX 3090 EX Gamer White OC
Storage 500gb Samsung 970 Evo PLus
Display(s) MSi MAG341CQ
Case Lian Li Pc-011 Dynamic
Audio Device(s) Arctis Pro Wireless
Power Supply 850w Seasonic Focus Platinum
Mouse Logitech G403
Keyboard Logitech G110
Cool! Can’t wait to see what meteor lake can do, as always I will be getting the second generation, more optimization etc
 
Joined
May 25, 2022
Messages
117 (0.13/day)
It looks like Server and PC cores are diverging further.

Redwood Cove server in Granite Rapids: Enhanced branch prediction, double L1i
Sierra Glen(Crestmont counterpart) in Sierra Forest: Rename/Allocate is still at 5, same as Gracemont.

Redwood Cove client in Meteorlake: Double L1i
Crestmont in Meteorlake: Rename/Allocate is at 6, Enhanced branch prediction

Server E core is optimizing for higher frequency hence the slightly narrower uarch.
 
Joined
Oct 6, 2021
Messages
1,605 (1.40/day)
Not sure where you are getting that it's desktop Raptorlake. It's a 1:1 comparison against mobile Raptorlake. Or did you forget Raptorlake is also in mobile?


That's is just for the process, not Meteorlake. Also, the reason they were light on uarch details is that neither P nor the E cores advance that much. It's basically:

P: Doubles L1i cache to 64KB
E: rename/allocate goes from 5 to 6


We always knew that. Intel 4 for Compute, TSMC N5 for GPU and N6 for IO and SoC.

The LP E cores in the SoC are just optimized for lower power, nothing to do with the process.
You're right, I didn't even notice that they refreshed the Alder-Lake mobile and called it RaptorLake.
 
Joined
May 25, 2022
Messages
117 (0.13/day)
Maybe there was more buried in the slides but I feel like very little new information was presented here. I had heard that there was a 6P+8E core configuration plus 2 e-cores in the SoC that would only be used if the CPU tile was powered down. But now I know it's 4 cores on the SoC and they're available for use even when the CPU tile is active.
It's 2x LP E cores not four. https://www.techpowerup.com/review/intel-meteor-lake-technical-deep-dive/3.html

On the slide "Meteorlake Low Power Island".

With Crestmont you could have 2 core clusters rather than 4 only as with Alder and Raptor.

@AnotherReader Also specifically for Intel 4 it only support HP libraries and doesn't have enough to support a full IO block. Don't know why they would continue to heavily rely on TSMC for future though.
 
Joined
Nov 26, 2021
Messages
1,648 (1.51/day)
Location
Mississauga, Canada
Processor Ryzen 7 5700X
Motherboard ASUS TUF Gaming X570-PRO (WiFi 6)
Cooling Noctua NH-C14S (two fans)
Memory 2x16GB DDR4 3200
Video Card(s) Reference Vega 64
Storage Intel 665p 1TB, WD Black SN850X 2TB, Crucial MX300 1TB SATA, Samsung 830 256 GB SATA
Display(s) Nixeus NX-EDG27, and Samsung S23A700
Case Fractal Design R5
Power Supply Seasonic PRIME TITANIUM 850W
Mouse Logitech
VR HMD Oculus Rift
Software Windows 11 Pro, and Ubuntu 20.04
It's 2x LP E cores not four. https://www.techpowerup.com/review/intel-meteor-lake-technical-deep-dive/3.html

On the slide "Meteorlake Low Power Island".

With Crestmont you could have 2 core clusters rather than 4 only as with Alder and Raptor.

@AnotherReader Also specifically for Intel 4 it only support HP libraries and doesn't have enough to support a full IO block. Don't know why they would continue to heavily rely on TSMC for future though.
Yes, Intel 3 should be the full featured version of Intel 4.
 
Joined
Apr 18, 2019
Messages
935 (0.46/day)
Location
The New England region of the United States
System Name Gaming Rig
Processor Ryzen 7 3800X
Motherboard Gigabyte X570 Aurus Pro Wifi
Cooling Noctua NH-D15 chromax.black
Memory 32GB(2x16GB) Patriot Viper DDR4-3200C16
Video Card(s) EVGA RTX 3060 Ti
Storage Samsung 970 EVO Plus 1TB (Boot/OS)|Hynix Platinum P41 2TB (Games)
Display(s) Gigabyte G27F
Case Corsair Graphite 600T w/mesh side
Audio Device(s) Logitech Z625 2.1 | cheapo gaming headset when mic is needed
Power Supply Corsair HX850i
Mouse Redragon M808-KS Storm Pro (Great Value)
Keyboard Redragon K512 Shiva replaced a Corsair K70 Lux - Blue on Black
VR HMD Nope
Software Windows 11 Pro x64
Benchmark Scores Nope
I bet we'll see this in a lot of NUC like devices. That new iGPU might finally give AMD some competition on that front.
 
Joined
Mar 18, 2023
Messages
870 (1.41/day)
System Name Never trust a socket with less than 2000 pins
Is there actually any machine learning software that uses Intel neural accelerator?
 
Joined
Jan 14, 2019
Messages
12,337 (5.76/day)
Location
Midlands, UK
System Name Nebulon B
Processor AMD Ryzen 7 7800X3D
Motherboard MSi PRO B650M-A WiFi
Cooling be quiet! Dark Rock 4
Memory 2x 24 GB Corsair Vengeance DDR5-4800
Video Card(s) AMD Radeon RX 6750 XT 12 GB
Storage 2 TB Corsair MP600 GS, 2 TB Corsair MP600 R2
Display(s) Dell S3422DWG, 7" Waveshare touchscreen
Case Kolink Citadel Mesh black
Audio Device(s) Logitech Z333 2.1 speakers, AKG Y50 headphones
Power Supply Seasonic Prime GX-750
Mouse Logitech MX Master 2S
Keyboard Logitech G413 SE
Software Bazzite (Fedora Linux) KDE
So if I get it right, the purpose of moving some e-cores onto the SoC tile is to save on power consumption by disabling the tile interconnects - which is basically what consumes a lot of power on Ryzen when idle. Interesting.
 
Joined
Aug 12, 2022
Messages
248 (0.30/day)
It's 2x LP E cores not four. https://www.techpowerup.com/review/intel-meteor-lake-technical-deep-dive/3.html

On the slide "Meteorlake Low Power Island".

With Crestmont you could have 2 core clusters rather than 4 only as with Alder and Raptor.
Ah you're right. The article says, "Based on the same 'Crestmont' core architecture as the E-cores on the Compute tile although not being part of its ringbus or sharing its L3 cache; this E-core cluster has its own L2 cache shared among four cores." But I can see the slide you referred to from Intel which says 2 cores.

I bet we'll see this in a lot of NUC like devices. That new iGPU might finally give AMD some competition on that front.
Tiger Lake's iGPU was actually a bit faster than AMD's Vega iGPU at the time. But Alder Lake and Raptor Lake use the exact same iGPU as Tiger Lake so yeah this will be the first time in a while it's improved and with a 2x improvement it should rival the current RDNA3 iGPU. Intel's lower trims are also less cut down than AMD's, so certain i5 and i3 models I think still compete very favorably against AMD.
 
Joined
Dec 12, 2012
Messages
773 (0.18/day)
Location
Poland
System Name THU
Processor Intel Core i5-13600KF
Motherboard ASUS PRIME Z790-P D4
Cooling SilentiumPC Fortis 3 v2 + Arctic Cooling MX-2
Memory Crucial Ballistix 2x16 GB DDR4-3600 CL16 (dual rank)
Video Card(s) MSI GeForce RTX 4070 Ventus 3X OC 12 GB GDDR6X (2610/21000 @ 0.91 V)
Storage Lexar NM790 2 TB + Corsair MP510 960 GB + PNY XLR8 CS3030 500 GB + Toshiba E300 3 TB
Display(s) LG OLED C8 55" + ASUS VP229Q
Case Fractal Design Define R6
Audio Device(s) Yamaha RX-V381 + Monitor Audio Bronze 6 + Bronze FX | FiiO E10K-TC + Sony MDR-7506
Power Supply Corsair RM650
Mouse Logitech M705 Marathon
Keyboard Corsair K55 RGB PRO
Software Windows 10 Home
Benchmark Scores Benchmarks in 2024?
A very mobile-focused architecture, but it's pretty cool that we're finally getting rid of the chipset (even though technically in mobile CPUs the PCH was on package).

Will they do that for desktops too, or will there still be a PCH on the motherboard?
 
Joined
Jul 13, 2016
Messages
3,279 (1.07/day)
Processor Ryzen 7800X3D
Motherboard ASRock X670E Taichi
Cooling Noctua NH-D15 Chromax
Memory 32GB DDR5 6000 CL30
Video Card(s) MSI RTX 4090 Trio
Storage Too much
Display(s) Acer Predator XB3 27" 240 Hz
Case Thermaltake Core X9
Audio Device(s) Topping DX5, DCA Aeon II
Power Supply Seasonic Prime Titanium 850w
Mouse G305
Keyboard Wooting HE60
VR HMD Valve Index
Software Win 10
So if I get it right, the purpose of moving some e-cores onto the SoC tile is to save on power consumption by disabling the tile interconnects - which is basically what consumes a lot of power on Ryzen when idle. Interesting.

From what I read it's only the compute titles:

it allows Intel to power down the Compute tile when not needed

In essence the SoC tile acts as a smaller low end chip within a chip that only taps the compute / IO tiles as needed. Should reduce power consumption for idle or very light tasks.
 
Joined
Jun 14, 2023
Messages
40 (0.08/day)
Intel calls a "disaggregated chiplet-based processor" a SoP or System-on-Package chip architecture.

This article fails to mention, even once, the core IDM 2.0 components as Intel defined them

Dis-aggregated nonsense writing... Better check with DerBuyer.. lulz.
 
Joined
May 25, 2022
Messages
117 (0.13/day)
A very mobile-focused architecture, but it's pretty cool that we're finally getting rid of the chipset (even though technically in mobile CPUs the PCH was on package).

Will they do that for desktops too, or will there still be a PCH on the motherboard?
AMD had an on-die chipset since the Bulldozer-derivative chip right before Zen. The I/O features are cut down so you could call them PCH-lite. The desktops come with additional chipset for more IO.

Intel could conceivably do the same thing.

Intel says 4-6% improvement for Crestmont E cores. Likely less for the P.
 
Last edited:
Joined
Aug 23, 2013
Messages
581 (0.14/day)
Disappointing that they neutered the AV1 encoder to 4:2:0. I was expecting them to make AV1 ubiquitous but they left room for improvement for future generations in classic Apple style.
 
Joined
Sep 7, 2010
Messages
854 (0.16/day)
Location
Nairobi, Kenya
Processor Intel Core i7-14700K
Motherboard ASUS ROG STRIX Z790-H
Cooling DeepCool AK500 WH
Memory Crucial Pro 32GB Kit (16GB x 2) DDR5-5600 (CP2K16G56C46U5)
Video Card(s) Intel ARC A770 Limited Edition
Storage Solidigm P44 Pro (2TB x 2) / PNY CS3140 2TB
Display(s) Philips 32M1N5800A
Case Lian Li O11 Air Mini (White)
Power Supply Seasonic Prime Fanless Titanium 600W
Keyboard Dell KM714 Wireless
Software Windows 11 Pro x64
Very nice article, I enjoyed reading this.
 
Top