• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Intel Releases Arrow Lake and Lunar Lake Instruction-set Reference Guide

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
46,766 (7.63/day)
Location
Hyderabad, India
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard ASUS ROG Strix B450-E Gaming
Cooling DeepCool Gammax L240 V2
Memory 2x 8GB G.Skill Sniper X
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
In a bid to prepare its ISV ecosystem for emerging technologies with future processor microarchitectures, Intel periodically releases instruction-set reference guides. The latest of these was leaked to the web, making their first references to the upcoming "Arrow Lake" and "Lunar Lake" client processor microarchitectures. From the looks of it, Intel is planning a massive push into the client AI acceleration space, starting with the upcoming "Meteor Lake" architecture that debuts later this year. The processor is expected to feature hardware acceleration for AI, with the new AI Boost feature.

The company could build on AI Boost with even more capabilities in the subsequent "Arrow Lake" and "Lunar Lake" microarchitectures. Among the instruction sets relevant to AI deep-learning neural net building and training, are AVX VNNI with INT8, AVX VNNI with INT16, AVX-IFMA, and AVX-NE Convert. There are several new security-relevant instructions, including SHA512, SM3, and SM4. "Lunar Lake" will introduce TSE-PBNDKB (total storage encryption). The ISA Reference Guide can be accessed here.



View at TechPowerUp Main Site | Source
 
Joined
Jul 30, 2019
Messages
2,687 (1.48/day)
System Name Not a thread ripper but pretty good.
Processor Ryzen 9 5950x
Motherboard ASRock X570 Taichi (revision 1.06, BIOS/UEFI version P5.50)
Cooling EK-Quantum Velocity, EK-Quantum Reflection PC-O11, EK-CoolStream PE 360, XSPC TX360
Memory Micron DDR4-3200 ECC Unbuffered Memory (4 sticks, 128GB, 18ASF4G72AZ-3G2F1)
Video Card(s) XFX Radeon RX 5700 & EK-Quantum Vector Radeon RX 5700 +XT & Backplate
Storage Samsung 2TB 980 PRO 2TB Gen4x4 NVMe, 2 x Samsung 2TB 970 EVO Plus Gen3x4 NVMe, AMD Radeon RAMDisk
Display(s) 2 x 4K LG 27UL600-W (and HUANUO Dual Monitor Mount)
Case Lian Li PC-O11 Dynamic Black (original model)
Power Supply Corsair RM750x
Mouse Logitech M575
Keyboard Corsair Strafe RGB MK.2
Software Windows 10 Professional (64bit)
Benchmark Scores Typical for non-overclocked CPU.
It will be interesting to see how AMD responds. Will we see an AI chiplet or AI stacked die like x3d cache in the future?
 
Joined
Mar 17, 2011
Messages
159 (0.03/day)
Location
Christchurch, New Zealand
I had a look at the CPUID instruction in that manual. Just the massive amount of information that can be returned by the CPUID instruction given the value in EAX really does draw a picture of the sheer abundance of accumulated changes in Intel CPU micro-architectures over the years. So long as there's still an instruction set common to all CPUs since the very first Pentium then it matters not. It's just fascinating to me as an assembly programmer from way back.
 

bug

Joined
May 22, 2015
Messages
13,454 (4.02/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
I had a look at the CPUID instruction in that manual. Just the massive amount of information that can be returned by the CPUID instruction given the value in EAX really does draw a picture of the sheer abundance of accumulated changes in Intel CPU micro-architectures over the years. So long as there's still an instruction set common to all CPUs since the very first Pentium then it matters not. It's just fascinating to me as an assembly programmer from way back.
I remember when I was dabbling with assembly, I was overwhelmed by the changes going from 16 to 32bit. I don't imagine me keeping track of whatever came since.
 
Joined
Jul 3, 2022
Messages
6 (0.01/day)
Processor i5-9600KF i9-9900K
Motherboard MAG Z390 TOMAHAWK
Cooling h150iPro water-cooling
Memory 8GX2 4000C14 32GX2 3200C16
Video Card(s) GIGABYTE AORUS GTX 1080 Ti
Storage Samsung 970PRO 512GB+Micron 5200Pro 2tb
Display(s) 360hz
Audio Device(s) HyperX Cloud Alpha
Power Supply SAMA 750W
Mouse TUF M4 Air
Benchmark Scores CPU-Z 592/3409
It will be interesting to see how AMD responds. Will we see an AI chiplet or AI stacked die like x3d cache in the future?
In the future, 3D caching can be widely applied, with the addition of L4 caching from the 14th generation Core. Currently, AMD's caching technology X3D product line is one step ahead of Intel's
 
Joined
Feb 15, 2020
Messages
36 (0.02/day)
Processor AMD Ryzen 9 5950X
Motherboard ASRock B550 Phantom Gaming ITX/ax
Cooling Arctic Liquid Freezer II 420 rev.5 + Conductonaut
Memory 2x16GiB G.SKILL TridentZ DDR4-3200 CL14
Video Card(s) Sapphire RX 6900XT TOXIC Limited Edition
Storage Sabrent Rocket 4.0 1TB, addlink S70 1TB
Audio Device(s) Creative SoundBlasterX G5
Power Supply Seasonic Prime Ultra Titanium 850W
AVX512 where?
Intel shooting itself in foot once again...
 
Joined
Jul 30, 2019
Messages
2,687 (1.48/day)
System Name Not a thread ripper but pretty good.
Processor Ryzen 9 5950x
Motherboard ASRock X570 Taichi (revision 1.06, BIOS/UEFI version P5.50)
Cooling EK-Quantum Velocity, EK-Quantum Reflection PC-O11, EK-CoolStream PE 360, XSPC TX360
Memory Micron DDR4-3200 ECC Unbuffered Memory (4 sticks, 128GB, 18ASF4G72AZ-3G2F1)
Video Card(s) XFX Radeon RX 5700 & EK-Quantum Vector Radeon RX 5700 +XT & Backplate
Storage Samsung 2TB 980 PRO 2TB Gen4x4 NVMe, 2 x Samsung 2TB 970 EVO Plus Gen3x4 NVMe, AMD Radeon RAMDisk
Display(s) 2 x 4K LG 27UL600-W (and HUANUO Dual Monitor Mount)
Case Lian Li PC-O11 Dynamic Black (original model)
Power Supply Corsair RM750x
Mouse Logitech M575
Keyboard Corsair Strafe RGB MK.2
Software Windows 10 Professional (64bit)
Benchmark Scores Typical for non-overclocked CPU.
In the future, 3D caching can be widely applied, with the addition of L4 caching from the 14th generation Core. Currently, AMD's caching technology X3D product line is one step ahead of Intel's
My only issue with the 3D cache is it seems very limited at least it's marketed toward games and many games that are not cache bound it doesn't help.
On the flip side the current limitations of integrated 3d cache end up forcing AMD to release incredibly power efficient CPU's because they just can't juice them to edge out intel without them exploding.
Perhaps I just don't know what it's being used for if anything other than boosting CPU bound cache sensitive games.
 

bug

Joined
May 22, 2015
Messages
13,454 (4.02/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
My only issue with the 3D cache is it seems very limited at least it's marketed toward games and many games that are not cache bound it doesn't help.
On the flip side the current limitations of integrated 3d cache end up forcing AMD to release incredibly power efficient CPU's because they just can't juice them to edge out intel without them exploding.
Perhaps I just don't know what it's being used for if anything other than boosting CPU bound cache sensitive games.
Well, you've just described the limit for every cache: if your stuff fits, you get stellar performance, if it doesn't, you're back to slow mode.
The difference here is that when a new level of caching is added to a CPU it usually helps 80-90% of the workloads, whereas AMD's 3D cache is squarely aimed at gaming. It seems to be doing its job well, even if it comes with a few drawbacks, so I don't have anything against it.
 
Top