• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Intel Releases Arrow Lake and Lunar Lake Instruction-set Reference Guide

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
47,297 (7.53/day)
Location
Hyderabad, India
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard ASUS ROG Strix B450-E Gaming
Cooling DeepCool Gammax L240 V2
Memory 2x 8GB G.Skill Sniper X
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
In a bid to prepare its ISV ecosystem for emerging technologies with future processor microarchitectures, Intel periodically releases instruction-set reference guides. The latest of these was leaked to the web, making their first references to the upcoming "Arrow Lake" and "Lunar Lake" client processor microarchitectures. From the looks of it, Intel is planning a massive push into the client AI acceleration space, starting with the upcoming "Meteor Lake" architecture that debuts later this year. The processor is expected to feature hardware acceleration for AI, with the new AI Boost feature.

The company could build on AI Boost with even more capabilities in the subsequent "Arrow Lake" and "Lunar Lake" microarchitectures. Among the instruction sets relevant to AI deep-learning neural net building and training, are AVX VNNI with INT8, AVX VNNI with INT16, AVX-IFMA, and AVX-NE Convert. There are several new security-relevant instructions, including SHA512, SM3, and SM4. "Lunar Lake" will introduce TSE-PBNDKB (total storage encryption). The ISA Reference Guide can be accessed here.



View at TechPowerUp Main Site | Source
 
Joined
Jul 30, 2019
Messages
3,338 (1.69/day)
System Name Still not a thread ripper but pretty good.
Processor Ryzen 9 7950x, Thermal Grizzly AM5 Offset Mounting Kit, Thermal Grizzly Extreme Paste
Motherboard ASRock B650 LiveMixer (BIOS/UEFI version P3.08, AGESA 1.2.0.2)
Cooling EK-Quantum Velocity, EK-Quantum Reflection PC-O11, D5 PWM, EK-CoolStream PE 360, XSPC TX360
Memory Micron DDR5-5600 ECC Unbuffered Memory (2 sticks, 64GB, MTC20C2085S1EC56BD1) + JONSBO NF-1
Video Card(s) XFX Radeon RX 5700 & EK-Quantum Vector Radeon RX 5700 +XT & Backplate
Storage Samsung 4TB 980 PRO, 2 x Optane 905p 1.5TB (striped), AMD Radeon RAMDisk
Display(s) 2 x 4K LG 27UL600-W (and HUANUO Dual Monitor Mount)
Case Lian Li PC-O11 Dynamic Black (original model)
Audio Device(s) Corsair Commander Pro for Fans, RGB, & Temp Sensors (x4)
Power Supply Corsair RM750x
Mouse Logitech M575
Keyboard Corsair Strafe RGB MK.2
Software Windows 10 Professional (64bit)
Benchmark Scores RIP Ryzen 9 5950x, ASRock X570 Taichi (v1.06), 128GB Micron DDR4-3200 ECC UDIMM (18ASF4G72AZ-3G2F1)
It will be interesting to see how AMD responds. Will we see an AI chiplet or AI stacked die like x3d cache in the future?
 
Joined
Mar 17, 2011
Messages
159 (0.03/day)
Location
Christchurch, New Zealand
I had a look at the CPUID instruction in that manual. Just the massive amount of information that can be returned by the CPUID instruction given the value in EAX really does draw a picture of the sheer abundance of accumulated changes in Intel CPU micro-architectures over the years. So long as there's still an instruction set common to all CPUs since the very first Pentium then it matters not. It's just fascinating to me as an assembly programmer from way back.
 

bug

Joined
May 22, 2015
Messages
13,843 (3.95/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
I had a look at the CPUID instruction in that manual. Just the massive amount of information that can be returned by the CPUID instruction given the value in EAX really does draw a picture of the sheer abundance of accumulated changes in Intel CPU micro-architectures over the years. So long as there's still an instruction set common to all CPUs since the very first Pentium then it matters not. It's just fascinating to me as an assembly programmer from way back.
I remember when I was dabbling with assembly, I was overwhelmed by the changes going from 16 to 32bit. I don't imagine me keeping track of whatever came since.
 
Joined
Jul 3, 2022
Messages
7 (0.01/day)
Processor 13900K ES@55 i5-9600KF@52 i9-9900K
Motherboard MSI MAG Z390 TOMAHAWK MAG B660M Mortar WiFi DDR4
Cooling FROZEN WARFRAME 360 WHITE ARGB
Memory 8GX2 4000C14 32GX2 3200C16
Video Card(s) GIGABYTE AORUS GTX 1080 Ti
Storage Samsung 970PRO 512GB+750G+750GRAID0 HDD
Display(s) 360hz
Audio Device(s) HyperX Cloud Alpha
Power Supply SAMA 750W
Mouse TUF M4 Air
Benchmark Scores CPU-Z 592/3409
It will be interesting to see how AMD responds. Will we see an AI chiplet or AI stacked die like x3d cache in the future?
In the future, 3D caching can be widely applied, with the addition of L4 caching from the 14th generation Core. Currently, AMD's caching technology X3D product line is one step ahead of Intel's
 
Joined
Feb 15, 2020
Messages
38 (0.02/day)
Location
Slovakia
Processor Intel Core i9 14900K
Motherboard Gigabyte Z790 Aorus Elite X W7
Cooling Direct-die, custom loop
Memory 2x24GiB G.Skill Trident Z5 6400 CL32
Video Card(s) Gigabyte RTX 4090 WF3
Storage Sabrent Rocket 4.0 1TB, 4x4TB Samsung 860 EVO
Display(s) Acer XV273K
Case none
Audio Device(s) Creative SoundBlasterX G5
Power Supply Seasonic Prime Ultra Titanium 850W
Mouse Microsoft Pro IntelliMouse
Keyboard AJAZZ AKP846 RWB
AVX512 where?
Intel shooting itself in foot once again...
 
Joined
Jul 30, 2019
Messages
3,338 (1.69/day)
System Name Still not a thread ripper but pretty good.
Processor Ryzen 9 7950x, Thermal Grizzly AM5 Offset Mounting Kit, Thermal Grizzly Extreme Paste
Motherboard ASRock B650 LiveMixer (BIOS/UEFI version P3.08, AGESA 1.2.0.2)
Cooling EK-Quantum Velocity, EK-Quantum Reflection PC-O11, D5 PWM, EK-CoolStream PE 360, XSPC TX360
Memory Micron DDR5-5600 ECC Unbuffered Memory (2 sticks, 64GB, MTC20C2085S1EC56BD1) + JONSBO NF-1
Video Card(s) XFX Radeon RX 5700 & EK-Quantum Vector Radeon RX 5700 +XT & Backplate
Storage Samsung 4TB 980 PRO, 2 x Optane 905p 1.5TB (striped), AMD Radeon RAMDisk
Display(s) 2 x 4K LG 27UL600-W (and HUANUO Dual Monitor Mount)
Case Lian Li PC-O11 Dynamic Black (original model)
Audio Device(s) Corsair Commander Pro for Fans, RGB, & Temp Sensors (x4)
Power Supply Corsair RM750x
Mouse Logitech M575
Keyboard Corsair Strafe RGB MK.2
Software Windows 10 Professional (64bit)
Benchmark Scores RIP Ryzen 9 5950x, ASRock X570 Taichi (v1.06), 128GB Micron DDR4-3200 ECC UDIMM (18ASF4G72AZ-3G2F1)
In the future, 3D caching can be widely applied, with the addition of L4 caching from the 14th generation Core. Currently, AMD's caching technology X3D product line is one step ahead of Intel's
My only issue with the 3D cache is it seems very limited at least it's marketed toward games and many games that are not cache bound it doesn't help.
On the flip side the current limitations of integrated 3d cache end up forcing AMD to release incredibly power efficient CPU's because they just can't juice them to edge out intel without them exploding.
Perhaps I just don't know what it's being used for if anything other than boosting CPU bound cache sensitive games.
 

bug

Joined
May 22, 2015
Messages
13,843 (3.95/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
My only issue with the 3D cache is it seems very limited at least it's marketed toward games and many games that are not cache bound it doesn't help.
On the flip side the current limitations of integrated 3d cache end up forcing AMD to release incredibly power efficient CPU's because they just can't juice them to edge out intel without them exploding.
Perhaps I just don't know what it's being used for if anything other than boosting CPU bound cache sensitive games.
Well, you've just described the limit for every cache: if your stuff fits, you get stellar performance, if it doesn't, you're back to slow mode.
The difference here is that when a new level of caching is added to a CPU it usually helps 80-90% of the workloads, whereas AMD's 3D cache is squarely aimed at gaming. It seems to be doing its job well, even if it comes with a few drawbacks, so I don't have anything against it.
 
Top