• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Intel Lunar Lake-MX SoC with On-Package LPDDR5X Memory Detailed

Joined
Aug 12, 2022
Messages
250 (0.29/day)
... or, at the very least, have some choice when ordering a laptop. With RAM on processor package you don't even get that.
Given that LPCAMM is coming next year, it'd be nice to move toward more configurable laptops and not less.
 
Joined
May 25, 2022
Messages
128 (0.14/day)
Intel attempted on-package DRAM in 2014, but never shipped, IIRC: unfortunately typical for Intel, but Intel will gladly explain the benefits of on-package DRAM to you:
Intel sold Tablet Atoms with on-package DRAM in 2014. And I can even attest for this personally as my Dell Venue 8 Pro I used at that time had the chip. I took it apart few years later once I stopped using it due to screen damage and there's no DRAM chip anywhere. Of course people don't remember as for Tablets on-package DRAM is granted while for their main line of chips it's some magic.

The first on-package Atom was even before that with the Smartphone oriented Medfield in 2012: https://www.anandtech.com/show/5365/intels-medfield-atom-z2460-arrive-for-smartphones/1

Meteorlake will also have some on-package DRAM variants, but unlike Lunar Lake it doesn't claim to be a big advance in mobility, so it'll be only available in certain variants while for Lunar Lake it'll have it for all.

Given that LPCAMM is coming next year, it'd be nice to move toward more configurable laptops and not less.
LPCAMM is nowhere near low power or compact enough for what's going in Lunar Lake. It will be more than before sure.

One aspect might be specialization of the nodes. Intel 4 is frequency-optimized, so it's the logical choice for Meteor Lake's CPU tile. Intel 3 can be density-optimized, making it the logical choice for the GPU and SoC tiles, but Intel 3 isn't ready and TSMC N5 and N6 are, and as it happens these are being used for the GPU and SoC tiles of Meteor Lake. Intel 20A will be frequency-optimized and Intel 18A will be able to be density-optimized.

I'm not sure why Arrow Lake would have TSMC parts. It seems to me that an Intel 20A CPU tile with Intel 3 GPU and SoC tiles ought to do quite well if 3 and 20A can be produced in enough quantity. And if Lunar Lake has access to all these and Intel 18A, then there oughtn't be a need for TSMC in that product.
Intel 4 doesn't have enough libraries for Lunar Lake, period. Not fully sure about Intel 3, but they admit that they won't really get ahead until 18A, and Intel 3 is going to be using a ton of wafers because both E and P core Xeons are going to be using them and they are all very large dies. Granite Rapids-SP has two compute tiles with over 500mm2 size, while -AP version has three. Sierra Forest-SP has 1x 500mm2+ die, while -AP has 2x.

20A is same as Intel 4 in that it doesn't have enough libraries. If Intel keeps using TSMC for main chip compute tile by 18A, then it's a problem, but until then, there are legitimate reasons for doing what they are doing. When it comes to Intel 3 and 18A being foundry-oriented process, the reason is same. Only those two have all the libraries required, while 4 and 20A contains enough blocks for CPU to rapidly iterate to next generation. With Lunar Lake coming sometime next year and 18A not being production-ready until end of next year, 18A is a no-go.

Lunar Lake was conceived around the time when Apple abandoned Intel, and future of their manufacturing was murky. But as of right now multiple reports state manufacturing is now going full steam while design team is still meh. The design team had been hiding behind massive process lead, and this loss exposes them.

Gelsinger is bringing changes and new metrics to bring accountability for the design team, so hopefully they'll do better. The design team was doing appalling things such as directing the process team to change the metrics to fit their needs, rather than working with what they have. Now, the goal is the design team is essentially treated little more than 3rd party.
If 4. 3 and 20A will be on time ind successful, they should have plenty of free capacity in 10/7nm fabs. Is it better to use them to produce for 3rd parties and produce their own at TSMC? I think it would be better to convert these fabs to 18A and produce their own CPUs.
Leaked slides show that I/O die for Granite Rapids and Sierra Forest use Intel 7, while compute tile uses Intel 3. As I said the compute tiles for both are over 500mm2. Rough analysis got 510mm2. So it's in the ballpark of 500mm2. Yea they have three compute size variants but regardless these tiles are massive.

The Intel 3 and 18A is more than a plus. It's more like 10nm SF and Intel 7, but without the density loss SF and 7 brought. The numerous plusses in 14nm only brought about 4-5% gains, while 3 gains 18% alone, and will have density advantage for high performance libraries(so for successors to Meteorlake), while 3 has full set of libraries for I/O, GPU, CPU, and so on, while 4 is bare minimum.
 
Last edited:
Joined
Jan 3, 2021
Messages
3,605 (2.49/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
Given that LPCAMM is coming next year, it'd be nice to move toward more configurable laptops and not less.
No one yet mentioned the thickness of CAMM and LPCAMM connectors and modules but they look like they're going to require a massive compression bracket to hold the contacts together reliably. And spring contacts of course, similar to LGA sockets.
 
Joined
May 25, 2022
Messages
128 (0.14/day)
Intel attempted on-package DRAM in 2014, but never shipped, IIRC: unfortunately typical for Intel, but Intel will gladly explain the benefits of on-package DRAM to you:
What do you mean it "never shipped"?

You do know Knights Landing has come out and shipped to many units right? You could even get a socketed variant in a ATX tower format and boot itself and run Cinebench on it.

Everyone:
On-package memory for Meteor and Lunar is for space savings for 99% of the scenarios. I doubt it even saves noticeable amount of standby power. Yes, you need on-package/soldered for such fast memory, but the power savings come from being the LPDDR standard.

There are basically three "on-package memory" variants.
1. eDRAM as with Broadwell, with proprietary connections and very low latency and high bandwidth
2. HBM, which is for bandwidth and space savings
3. On package system memory. Mostly for space savings.

The three are very, very different.
 
Joined
Aug 12, 2022
Messages
250 (0.29/day)
I had it in my head that Intel's next consumer generations were Meteor Lake (IFS 4), Arrow Lake (IFS 20A), and Lunar Lake (IFS 18A). But Intel demoed Lunar Lake working in September but hasn't (in my memory) demoed Arrow Lake, and Meteor Lake wasn't even ready then. More recently I've been thinking that Lunar Lake and Arrow Lake will be released around the same time but for different markets, and Lunar Lake will not use 18A. And now that I've read this article again, Intel's slides seem to confirm this. One slide describes Arrow Lake as "optimized for enthusiast" and Lunar Lake as "ultra low power mobility". The same slide depicts both as if they were direct advisors m successors to Meteor Lake and coming in 2024.

These slides also show the CPU and a relatively large Battlemage GPU sharing the same tile. That rules out IFS 20A because it won't have the high density libraries needed to make a GPU and IFS 3 won't be efficient enough and IFS 18A won't be ready in time. TSMC N4 also won't be good enough. So the only possible option is one of the TSMC N3 nodes, which fits because the Battlemage GPU is also likely to be made on that node. Now that SoC tile could be a little more flexible.

This also explains why TSMC is making the CPU tile. Even if Intel delivers all of their new nodes on the schedule they claim, none of them can build this particular CPU tile by the end of 2024.

Now I think I can infer a little more. (Everything I say after this point is speculation.) Aside from the cost of designing new microarchitectures and new manufacturing nodes, there's also a bit of cost designing and making each final package. The Lunar Lake tiles are completely incompatible with the Meteor Lake package, but the Arrow Lake image here looks just like Meteor Lake's package. So I think Lunar Lake will cover all of the 8-15W market (4P+4E, some with on-package memory and some with off-package memory), and Arrow Lake will take the 28W market (2LPE+6P+8E) and up (2LPE+8P+16E). Lunar Lake alone will have Battlemage graphics, as Arrow Lake will reuse the Meteor Lake GPU tiles. Arrow Lake in the 28W market will reuse every tile from Meteor Lake except the CPU tile, which means it'll have 2 low power Crestmont cores, 6 Lion Cove performance cores, and 8 Skymont "efficiency" cores. Arrow Lake for desktops will inherit the 8P+16E configuration from Raptor Lake, probably with a new SoC, either built with TSMC N6 without the low power cores or IFS 3 with 2 Skymont cores.

Panther Lake is next in the lineup, probably for 2025 when IFS 18A is ready.

The article mentions that Lunar Lake MX has a 160-bit memory interface, which would be very helpful for the GPU, but the slides only say dual channel which normally is 128 bit. Software stores things in memory in 64-bit chunks, so 160-bits would be a little unconventional for CPU memory.

Another interesting detail in the slides is that the Core 7 is "8 Core 12M". A similar Meteor Lake configuration would be 8 cores and 12 threads. So what's "12M"? Does this relate to the rumor that Intel is replacing Hyperthreading?
 
Last edited:
Joined
Jan 3, 2021
Messages
3,605 (2.49/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
The article mentions that Lunar Lake MX has a 160-bit memory interface, which would be very helpful for the GPU, but the slides only say dual channel which normally is 128 bit. Software stores things in memory in 64-bit chunks, so 160-bits would be a little unconventional for CPU memory.
It's even harder to fit exactly 16 GB or 32 GB of memory to a 160-bit bus ...

I think the excess bits are for ECC even though, going by this article from Synopsys, ECC in LPDDR is usually not implemented by adding bits to the memory bus ("side-band ECC") but rather in other ways.
 
Joined
Nov 24, 2007
Messages
51 (0.01/day)
Everyone:
On-package memory for Meteor and Lunar is for space savings for 99% of the scenarios. I doubt it even saves noticeable amount of standby power. Yes, you need on-package/soldered for such fast memory, but the power savings come from being the LPDDR standard.

Remembered this comment and, back then, I was too tired to find the public sources to fact-check.

No, memory-on-package (MoP) is genuine necessary for larger memory power savings than just LPDDR. Soldered, off-package memory is just in another class.

This slide allegedly from Intel lists the multi-domain power savings of MoP. So, Intel in late 2024 is finding the same benefits as Apple in late 2020: power savings. Apple just shipped it in volume on laptops four years prior.

That's what people are missing: MoP is not just saving power within the module, but across the entire SoC & system.
1715874009403.png
 
Joined
Nov 24, 2007
Messages
51 (0.01/day)
Everyone:
On-package memory for Meteor and Lunar is for space savings for 99% of the scenarios. I doubt it even saves noticeable amount of standby power. Yes, you need on-package/soldered for such fast memory, but the power savings come from being the LPDDR standard.

There are basically three "on-package memory" variants.
1. eDRAM as with Broadwell, with proprietary connections and very low latency and high bandwidth
2. HBM, which is for bandwidth and space savings
3. On package system memory. Mostly for space savings.

The three are very, very different.

Finally, Intel has made the slides public (not that this wasn't already well-known).

Everyone: On-package DRAM has significant power savings. ;)

1717471334371.png

The presence of on-package memory lowers the memory physical-layer PHY power by 40% versus having memory chips on the motherboard or socketed as SO-DIMMs or CAMM2 modules.
 
Joined
Jan 3, 2021
Messages
3,605 (2.49/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
Those LPDDR5 packages are 4-high or even 8-high stacks of dies, connected by plain old wire bonding. How do I know? Because TSV stacked dies would be nearly as expensive as HBM, and equally unavailable in retail products.

The presence of on-package memory lowers the memory physical-layer PHY power by 40% versus having memory chips on the motherboard or socketed as SO-DIMMs or CAMM2 modules.
Well yes, kinda, because those three types don't even compare well between themselves. Whoever stated that (Intel?) was trying to be as inexact as possible.
 
Top