• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AVX-512 Doubles Intel 5th Gen "Emerald Rapids" Xeon Processor Performance, Up to 10x Improvement in AI Workloads

AleksandarK

News Editor
Staff member
Joined
Aug 19, 2017
Messages
2,299 (0.92/day)
According to the latest round of tests by Phoronix, we are seeing proof of substantial performance gains Intel's 5th Gen Xeon Emerald Rapids server CPUs deliver when employing AVX-512 vector instructions. Enabling AVX-512 doubled throughput on average across a range of workloads, with specific AI tasks accelerating over 10x versus having it disabled. Running on the top-end 64-core Platinum 8592+ SKU, benchmarks saw minimal frequency differences between AVX-512 on and off states. However, the specialized 512-bit vector processing unlocked dramatic speedups, exemplified in the OpenVINO AI framework. Specifically, weld porosity detection, which has real-world applications, showed the biggest speedups. Power draw also increased moderately - the expected tradeoff for such an unconstrained performance upside.

With robust optimizations, the vector engine potential has now been fully demonstrated. Workloads spanning AI, visualization, simulation, and analytics could multiply speed by upgrading to Emerald Rapids. Of course, developer implementation work remains non-trivial. But for the data center applications that can take advantage, AVX-512 enables Intel to partially close raw throughput gaps versus AMD's core count leadership. Whether those targeted acceleration gains offset EPYC's wider general-purpose value depends on customer workloads. But with tests proving dramatic upside, Intel is betting big on vector acceleration as its ace card. AMD also supports the AVX-512 instruction set. Below, you can find the geometric mean of all test results, and check the review with benchmarks here.



View at TechPowerUp Main Site | Source
 
Joined
Dec 25, 2020
Messages
4,992 (3.95/day)
Location
São Paulo, Brazil
System Name Project Kairi Mk. IV "Eternal Thunder"
Processor 13th Gen Intel Core i9-13900KS Special Edition
Motherboard MSI MEG Z690 ACE (MS-7D27) BIOS 1G
Cooling Noctua NH-D15S + NF-F12 industrialPPC-3000 w/ Thermalright BCF and NT-H1
Memory G.SKILL Trident Z5 RGB 32GB DDR5-6800 F5-6800J3445G16GX2-TZ5RK @ 6400 MT/s 30-38-38-38-70-2
Video Card(s) ASUS ROG Strix GeForce RTX™ 4080 16GB GDDR6X White OC Edition
Storage 1x WD Black SN750 500 GB NVMe + 4x WD VelociRaptor HLFS 300 GB HDDs
Display(s) 55-inch LG G3 OLED
Case Cooler Master MasterFrame 700
Audio Device(s) EVGA Nu Audio (classic) + Sony MDR-V7 cans
Power Supply EVGA 1300 G2 1.3kW 80+ Gold
Mouse Microsoft Ocean Plastic Mouse
Keyboard Galax Stealth
Software Windows 10 Enterprise 22H2
Benchmark Scores "Speed isn't life, it just makes it go faster."
AI this, AI that, and AI then some

no one bloody cares

Get to actually matching AMD where it matters, Intel. You're slacking.
 
Joined
Jun 27, 2017
Messages
256 (0.10/day)
Processor Intel i5-13600k
Motherboard MSI MEG Z690i Unify
Cooling Noctua NH-C14S
Memory G.Skill Ripjaws V F5-5600J2834F32GX2-RS5W 64GB
Video Card(s) Asus RX6800XT TUF
Storage Samsung 980 Pro 500GB x2
Display(s) Samsung U32H850
Case Streacom DA6 XL chrome
Audio Device(s) Denon PMA-50
Power Supply Corsair SF750
Mouse Logitech MX Master 3
Keyboard Microsoft Surface
Software Win 11 Pro
If you think that
no one bloody cares
you are sorely mistaken. Does a average hobbyist care? No. But there's a butt ton of demand for performant AI hardware (for example see nvidia)
 
Joined
Mar 13, 2021
Messages
427 (0.36/day)
Processor AMD 7600x
Motherboard Asrock x670e Steel Legend
Cooling Silver Arrow Extreme IBe Rev B with 2x 120 Gentle Typhoons
Memory 4x16Gb Patriot Viper Non RGB @ 6000 30-36-36-36-40
Video Card(s) XFX 6950XT MERC 319
Storage 2x Crucial P5 Plus 1Tb NVME
Display(s) 3x Dell Ultrasharp U2414h
Case Coolermaster Stacker 832
Power Supply Thermaltake Toughpower PF3 850 watt
Mouse Logitech G502 (OG)
Keyboard Logitech G512
If AMD also supports AVX-512, what is the actual Intel advantage here?
Its about Upgrading from an existing intel platform to the latest.

Dont talk about the competition if you cant compete sort of scenario so big yourself up against previous/last gen stuff.
 
Joined
Nov 6, 2016
Messages
1,618 (0.58/day)
Location
NH, USA
System Name Lightbringer
Processor Ryzen 7 2700X
Motherboard Asus ROG Strix X470-F Gaming
Cooling Enermax Liqmax Iii 360mm AIO
Memory G.Skill Trident Z RGB 32GB (8GBx4) 3200Mhz CL 14
Video Card(s) Sapphire RX 5700XT Nitro+
Storage Hp EX950 2TB NVMe M.2, HP EX950 1TB NVMe M.2, Samsung 860 EVO 2TB
Display(s) LG 34BK95U-W 34" 5120 x 2160
Case Lian Li PC-O11 Dynamic (White)
Power Supply BeQuiet Straight Power 11 850w Gold Rated PSU
Mouse Glorious Model O (Matte White)
Keyboard Royal Kludge RK71
Software Windows 10
Its about Upgrading from an existing intel platform to the latest.

Dont talk about the competition if you cant compete sort of scenario so big yourself up against previous/last gen stuff.
Ohhhhh
 
Joined
Jan 8, 2017
Messages
9,127 (3.37/day)
System Name Good enough
Processor AMD Ryzen R9 7900 - Alphacool Eisblock XPX Aurora Edge
Motherboard ASRock B650 Pro RS
Cooling 2x 360mm NexXxoS ST30 X-Flow, 1x 360mm NexXxoS ST30, 1x 240mm NexXxoS ST30
Memory 32GB - FURY Beast RGB 5600 Mhz
Video Card(s) Sapphire RX 7900 XT - Alphacool Eisblock Aurora
Storage 1x Kingston KC3000 1TB 1x Kingston A2000 1TB, 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) LG UltraGear 32GN650-B + 4K Samsung TV
Case Phanteks NV7
Power Supply GPS-750C
you are sorely mistaken. Does a average hobbyist care? No. But there's a butt ton of demand for performant AI hardware
ML on CPUs is relegated to some very niche use cases, for all intents and purposes they're wasting their time, no one is going to buy thousands of Xeons for ML.
 
Joined
Sep 1, 2020
Messages
2,072 (1.50/day)
Location
Bulgaria
ML on CPUs is relegated to some very niche use cases, for all intents and purposes they're wasting their time, no one is going to buy thousands of Xeons for ML.
It would be interesting if one could build a supercomputer made up of just the instinct mi300a APU's without any separate CPUs. But for now, such do not exist and are hardly possible.
 
Joined
Dec 25, 2020
Messages
4,992 (3.95/day)
Location
São Paulo, Brazil
System Name Project Kairi Mk. IV "Eternal Thunder"
Processor 13th Gen Intel Core i9-13900KS Special Edition
Motherboard MSI MEG Z690 ACE (MS-7D27) BIOS 1G
Cooling Noctua NH-D15S + NF-F12 industrialPPC-3000 w/ Thermalright BCF and NT-H1
Memory G.SKILL Trident Z5 RGB 32GB DDR5-6800 F5-6800J3445G16GX2-TZ5RK @ 6400 MT/s 30-38-38-38-70-2
Video Card(s) ASUS ROG Strix GeForce RTX™ 4080 16GB GDDR6X White OC Edition
Storage 1x WD Black SN750 500 GB NVMe + 4x WD VelociRaptor HLFS 300 GB HDDs
Display(s) 55-inch LG G3 OLED
Case Cooler Master MasterFrame 700
Audio Device(s) EVGA Nu Audio (classic) + Sony MDR-V7 cans
Power Supply EVGA 1300 G2 1.3kW 80+ Gold
Mouse Microsoft Ocean Plastic Mouse
Keyboard Galax Stealth
Software Windows 10 Enterprise 22H2
Benchmark Scores "Speed isn't life, it just makes it go faster."
If you think that

you are sorely mistaken. Does a average hobbyist care? No. But there's a butt ton of demand for performant AI hardware (for example see nvidia)

I'm well aware mate. It's sadly the "thing" right now. It's called hyperbole/conjecture to cope with frustration.

That EMR IS, on average, worse than equivalent Ryzen Threadripper Pro and EPYC CPUs stands however
 
Joined
Jan 10, 2011
Messages
1,347 (0.27/day)
Location
[Formerly] Khartoum, Sudan.
System Name 192.168.1.1~192.168.1.100
Processor AMD Ryzen5 5600G.
Motherboard Gigabyte B550m DS3H.
Cooling AMD Wraith Stealth.
Memory 16GB Crucial DDR4.
Video Card(s) Gigabyte GTX 1080 OC (Underclocked, underpowered).
Storage Samsung 980 NVME 500GB && Assortment of SSDs.
Display(s) LG 24MK430 primary && Samsung S24D590 secondary
Case Corsair Graphite 780T.
Audio Device(s) On-Board.
Power Supply SeaSonic CORE GM-650.
Mouse Coolermaster MM530.
Keyboard Kingston HyperX Alloy FPS.
VR HMD A pair of OP spectacles.
Software Ubuntu 22.04 LTS.
Benchmark Scores Me no know English. What bench mean? Bench like one sit on?
10x gains instead of 32x? Damn you, Amdahl! *shakes fist at a graveyard*

Seriously, did anyone carefully read the article and figured out whether the "disabled/off" scenario is AVX2/AVX/SSE, or is off for all vectorization sets? Compiler flags aren't telling much.
 
Joined
Sep 1, 2020
Messages
2,072 (1.50/day)
Location
Bulgaria
10x gains instead of 32x? Damn you, Amdahl! *shakes fist at a graveyard*

Seriously, did anyone carefully read the article and figured out whether the "disabled/off" scenario is AVX2/AVX/SSE, or is off for all vectorization sets? Compiler flags aren't telling much.
Oh, please don't challenge the marketing messages from Intel so seriously, or there might be layoffs on this team. :(
 
Joined
Jun 29, 2018
Messages
475 (0.22/day)
This is not as clear cut as one might think. If you read the comments there's a supposition that the 10x increase is actually caused by the usage of AMX instruction set, and not AVX-512 per se. This result is somewhat consistent with earlier benchmarks of AMX on Phoronix.

10x gains instead of 32x? Damn you, Amdahl! *shakes fist at a graveyard*

Seriously, did anyone carefully read the article and figured out whether the "disabled/off" scenario is AVX2/AVX/SSE, or is off for all vectorization sets? Compiler flags aren't telling much.
OpenVINO x86 CPU mode requires at least SSE4.2, so trying to use it with all SIMD disabled wouldn't work.
 
Joined
Mar 18, 2023
Messages
635 (1.41/day)
System Name Never trust a socket with less than 2000 pins
Up to 10x Improvement in AI Workloads

Who cares when GPUs are even faster for those workloads?

I'm serious, who does? Is it because of easier scheduling and division of hardware to different users?
 
Joined
Mar 31, 2020
Messages
50 (0.03/day)
Correction:
AMD also supports some of the AVX-512 instructions split up to use the AVX-256 units.
 
Joined
Jun 29, 2018
Messages
475 (0.22/day)
Joined
Feb 20, 2020
Messages
9,340 (5.94/day)
Location
Louisiana
System Name Ghetto Rigs z490|x99|Acer 17 Nitro 7840hs/ 5600c40-2x16/ 4060/ 1tb acer stock m.2/ 4tb sn850x
Processor 10900k w/Optimus Foundation | 5930k w/Black Noctua D15
Motherboard z490 Maximus XII Apex | x99 Sabertooth
Cooling oCool D5 res-combo/280 GTX/ Optimus Foundation/ gpu water block | Blk D15
Memory Trident-Z Royal 4000c16 2x16gb | Trident-Z 3200c14 4x8gb
Video Card(s) Titan Xp-water | evga 980ti gaming-w/ air
Storage 970evo+500gb & sn850x 4tb | 860 pro 256gb | Acer m.2 1tb/ sn850x 4tb| Many2.5" sata's ssd 3.5hdd's
Display(s) 1-AOC G2460PG 24"G-Sync 144Hz/ 2nd 1-ASUS VG248QE 24"/ 3rd LG 43" series
Case D450 | Cherry Entertainment center on Test bench
Audio Device(s) Built in Realtek x2 with 2-Insignia 2.0 sound bars & 1-LG sound bar
Power Supply EVGA 1000P2 with APC AX1500 | 850P2 with CyberPower-GX1325U
Mouse Redragon 901 Perdition x3
Keyboard G710+x3
Software Win-7 pro x3 and win-10 & 11pro x3
Benchmark Scores Are in the benchmark section
Joined
Aug 12, 2022
Messages
227 (0.34/day)
This isn't news. The performance increase from using AVX-512 in Emerald Rapids is inline with the increase offered by Sapphire Rapids, and similar to the increase offered by Genoa. I don't see what's changed. Emerald Rapids is faster than Sapphire Rapids so Phoronix's testing should be useful to those considering Emerald Rapids.
 
Joined
Dec 25, 2020
Messages
4,992 (3.95/day)
Location
São Paulo, Brazil
System Name Project Kairi Mk. IV "Eternal Thunder"
Processor 13th Gen Intel Core i9-13900KS Special Edition
Motherboard MSI MEG Z690 ACE (MS-7D27) BIOS 1G
Cooling Noctua NH-D15S + NF-F12 industrialPPC-3000 w/ Thermalright BCF and NT-H1
Memory G.SKILL Trident Z5 RGB 32GB DDR5-6800 F5-6800J3445G16GX2-TZ5RK @ 6400 MT/s 30-38-38-38-70-2
Video Card(s) ASUS ROG Strix GeForce RTX™ 4080 16GB GDDR6X White OC Edition
Storage 1x WD Black SN750 500 GB NVMe + 4x WD VelociRaptor HLFS 300 GB HDDs
Display(s) 55-inch LG G3 OLED
Case Cooler Master MasterFrame 700
Audio Device(s) EVGA Nu Audio (classic) + Sony MDR-V7 cans
Power Supply EVGA 1300 G2 1.3kW 80+ Gold
Mouse Microsoft Ocean Plastic Mouse
Keyboard Galax Stealth
Software Windows 10 Enterprise 22H2
Benchmark Scores "Speed isn't life, it just makes it go faster."
Hi,
Nice intel about kills avx-512 because it creates to much heat plus not much uses it anyway
Now it's needed for AI boost ?


AVX-512 VNNI was backported into AVX2 and is supported in Alder and Raptor Lake sans EVEX prefixes.

Sapphire Rapids uses the same Golden Cove P-cores from Alder while Emerald uses Raptor Cove. The decision to remove AVX-512 was from a segmentation perspective.
 
Joined
Jun 29, 2018
Messages
475 (0.22/day)
AVX-512 VNNI was backported into AVX2 and is supported in Alder and Raptor Lake sans EVEX prefixes.
While AVX-VNNI along with IFMA were ported they did not bring all of AVX-512's features, regardless of the width change to 256-bit. That will happen only with AVX10.1, which still isn't going to be equal to AVX-512 on its own. Only AVX10.1/512 will have full 512-bit width, while AVX10.1/256 is going to be 256-bit. The former is expected to be available on servers (just like AVX-512 is now), and the latter on everything including future E-cores.
Sapphire Rapids uses the same Golden Cove P-cores from Alder while Emerald uses Raptor Cove. The decision to remove AVX-512 was from a segmentation perspective.
I'm not sure about that last part. 10th/11th generation Core in laptops and 11th gen. in desktops had AVX-512.
The main reason for removing AVX-512 was the inability of E-cores to execute those instructions. I'm not aware of any mainstream operating system that can deal with non-uniform instruction sets, so P-cores had to match them.
12th gen. desktop CPUs were also capable of running AVX-512 by disabling E-cores, but that possibility got taken away with a microcode update, and by physical modification in later stepping/generations.
 
Last edited:
Joined
Aug 12, 2022
Messages
227 (0.34/day)
12th gen. desktop CPUs were also capable of running AVX-512 by disabling E-cores, but that possibility got taken away with a microcode update, and by physical modification in later stepping/generations.
I don't think Intel had any reason to take away AVX-512 when E-cores are disabled other than product segmentation, unless it was literally removed from the die design to make for a smaller die.
 
Joined
Dec 25, 2020
Messages
4,992 (3.95/day)
Location
São Paulo, Brazil
System Name Project Kairi Mk. IV "Eternal Thunder"
Processor 13th Gen Intel Core i9-13900KS Special Edition
Motherboard MSI MEG Z690 ACE (MS-7D27) BIOS 1G
Cooling Noctua NH-D15S + NF-F12 industrialPPC-3000 w/ Thermalright BCF and NT-H1
Memory G.SKILL Trident Z5 RGB 32GB DDR5-6800 F5-6800J3445G16GX2-TZ5RK @ 6400 MT/s 30-38-38-38-70-2
Video Card(s) ASUS ROG Strix GeForce RTX™ 4080 16GB GDDR6X White OC Edition
Storage 1x WD Black SN750 500 GB NVMe + 4x WD VelociRaptor HLFS 300 GB HDDs
Display(s) 55-inch LG G3 OLED
Case Cooler Master MasterFrame 700
Audio Device(s) EVGA Nu Audio (classic) + Sony MDR-V7 cans
Power Supply EVGA 1300 G2 1.3kW 80+ Gold
Mouse Microsoft Ocean Plastic Mouse
Keyboard Galax Stealth
Software Windows 10 Enterprise 22H2
Benchmark Scores "Speed isn't life, it just makes it go faster."
While AVX-VNNI along with IFMA were ported they did not bring all of AVX-512's features, regardless of the width change to 256-bit. That will happen only with AVX10.1, which still isn't going to be equal to AVX-512 on its own. Only AVX10.1/512 will have full 512-bit width, while AVX10.1/256 is going to be 256-bit. The former is expected to be available on servers (just like AVX-512 is now), and the latter on everything including future E-cores.

I'm not sure about that last part. 10th/11th generation Core in laptops and 11th gen. in desktops had AVX-512.
The main reason for removing AVX-512 was the inability of E-cores to execute those instructions. I'm not aware of any mainstream operating system that can deal with non-uniform instruction sets, so P-cores had to match them.
12th gen. desktop CPUs were also capable of running AVX-512 by disabling E-cores, but that possibility got taken away with a microcode update, and by physical modification in later stepping/generations.

Aware of other -512 features being missing, it's just VNNI and IFMA that got an AVX2 version.

I'm fairly sure that Linux can, but not Windows - and I don't think that's specifically the reason why, or they could simply let us use it with the E-cores turned off. Intel's never been a stranger to market segmentation, they've always done it even when AMD was exerting full pressure on them.

I don't think Intel had any reason to take away AVX-512 when E-cores are disabled other than product segmentation, unless it was literally removed from the die design to make for a smaller die.

There was no physical shrinkage of later batches of Alder Lake - and Raptor Lake "refresh", aka the 14th Gen scam, has no physical differences whatsoever from 13th Gen
 
Joined
Feb 20, 2020
Messages
9,340 (5.94/day)
Location
Louisiana
System Name Ghetto Rigs z490|x99|Acer 17 Nitro 7840hs/ 5600c40-2x16/ 4060/ 1tb acer stock m.2/ 4tb sn850x
Processor 10900k w/Optimus Foundation | 5930k w/Black Noctua D15
Motherboard z490 Maximus XII Apex | x99 Sabertooth
Cooling oCool D5 res-combo/280 GTX/ Optimus Foundation/ gpu water block | Blk D15
Memory Trident-Z Royal 4000c16 2x16gb | Trident-Z 3200c14 4x8gb
Video Card(s) Titan Xp-water | evga 980ti gaming-w/ air
Storage 970evo+500gb & sn850x 4tb | 860 pro 256gb | Acer m.2 1tb/ sn850x 4tb| Many2.5" sata's ssd 3.5hdd's
Display(s) 1-AOC G2460PG 24"G-Sync 144Hz/ 2nd 1-ASUS VG248QE 24"/ 3rd LG 43" series
Case D450 | Cherry Entertainment center on Test bench
Audio Device(s) Built in Realtek x2 with 2-Insignia 2.0 sound bars & 1-LG sound bar
Power Supply EVGA 1000P2 with APC AX1500 | 850P2 with CyberPower-GX1325U
Mouse Redragon 901 Perdition x3
Keyboard G710+x3
Software Win-7 pro x3 and win-10 & 11pro x3
Benchmark Scores Are in the benchmark section
Hi,
Dang I thought they killed it because it was skewing core temperatures and voltage usage to high against amd :p
My bad :slap:
 
Joined
Jun 29, 2018
Messages
475 (0.22/day)
I don't think Intel had any reason to take away AVX-512 when E-cores are disabled other than product segmentation, unless it was literally removed from the die design to make for a smaller die.
As far as I can see from Locuza's annotated die photos they did not remove it physically even from Raptor Lake. It wouldn't really make much sense to remove it since those execution resources are shared between ports 0 and 1 (Intel does fused operations on 256-bit units, but in a different way from AMD), in contrast to server versions which have one additional full AVX-512 unit on port 5, and that one was cut from desktop variants. Maybe the changes were so small that we can't see them directly or maybe they just disabled it permanently via microcode.

It would have been very difficult to properly support mixed configurations, both from end user and professional perspectives.

"AVX-512 is supported**,
** - but only if you disable some cores"

- would not have been good PR, and a source of constant support issues ;)

Compilers contain architectural optimizations, so it would require having two different levels for this setup. Doable, but I don't think Intel wanted to commit to such complexity.

I'm fairly sure that Linux can, but not Windows - and I don't think that's specifically the reason why, or they could simply let us use it with the E-cores turned off. Intel's never been a stranger to market segmentation, they've always done it even when AMD was exerting full pressure on them.
Certain enough to provide an example of such computer? I am not aware of any.
I would also like for Intel to allow this, but I understand the burdens it would bring. I'm just not convinced they explicitly wanted to segment AVX-512 out after investing so much time and money into mainstream hardware and software enablement in their awesome libraries.
 
Joined
Dec 25, 2020
Messages
4,992 (3.95/day)
Location
São Paulo, Brazil
System Name Project Kairi Mk. IV "Eternal Thunder"
Processor 13th Gen Intel Core i9-13900KS Special Edition
Motherboard MSI MEG Z690 ACE (MS-7D27) BIOS 1G
Cooling Noctua NH-D15S + NF-F12 industrialPPC-3000 w/ Thermalright BCF and NT-H1
Memory G.SKILL Trident Z5 RGB 32GB DDR5-6800 F5-6800J3445G16GX2-TZ5RK @ 6400 MT/s 30-38-38-38-70-2
Video Card(s) ASUS ROG Strix GeForce RTX™ 4080 16GB GDDR6X White OC Edition
Storage 1x WD Black SN750 500 GB NVMe + 4x WD VelociRaptor HLFS 300 GB HDDs
Display(s) 55-inch LG G3 OLED
Case Cooler Master MasterFrame 700
Audio Device(s) EVGA Nu Audio (classic) + Sony MDR-V7 cans
Power Supply EVGA 1300 G2 1.3kW 80+ Gold
Mouse Microsoft Ocean Plastic Mouse
Keyboard Galax Stealth
Software Windows 10 Enterprise 22H2
Benchmark Scores "Speed isn't life, it just makes it go faster."
As far as I can see from Locuza's annotated die photos they did not remove it physically even from Raptor Lake. It wouldn't really make much sense to remove it since those execution resources are shared between ports 0 and 1 (Intel does fused operations on 256-bit units, but in a different way from AMD), in contrast to server versions which have one additional full AVX-512 unit on port 5, and that one was cut from desktop variants. Maybe the changes were so small that we can't see them directly or maybe they just disabled it permanently via microcode.

It would have been very difficult to properly support mixed configurations, both from end user and professional perspectives.

"AVX-512 is supported**,
** - but only if you disable some cores"

- would not have been good PR, and a source of constant support issues ;)

Compilers contain architectural optimizations, so it would require having two different levels for this setup. Doable, but I don't think Intel wanted to commit to such complexity.


Certain enough to provide an example of such computer? I am not aware of any.
I would also like for Intel to allow this, but I understand the burdens it would bring. I'm just not convinced they explicitly wanted to segment AVX-512 out after investing so much time and money into mainstream hardware and software enablement in their awesome libraries.

Most Android phones nowadays have SoCs containing mixed architectures, and well, I am aware Android is very heavily customized, but end of the day still largely a Linux kernel under all of that, so I would place a safe bet on Linux being much better suited for a mixed architecture CPU.

I think both sides of this argument provide good points, by removing this from chips such as the i9-13900K they give a reason for the 8- and 10-core SPR and EMR CPUs to exist, while playing safe on the compatibility front, although compiler-wise, I don't think it'd be more of a burden, the configuration is already there on Xeons anyway
 
Joined
Jun 29, 2018
Messages
475 (0.22/day)
Most Android phones nowadays have SoCs containing mixed architectures, and well, I am aware Android is very heavily customized, but end of the day still largely a Linux kernel under all of that, so I would place a safe bet on Linux being much better suited for a mixed architecture CPU.
I'm not sure if I explained myself well enough. I meant differing instruction sets, in context of this discussion, as x86 cores with AVX-512 (P-cores) and without (E-cores).
Android phones, and most other ARM SoCs can have cores based on differing microarchitectures, for example Snapdragon 8 Gen 3 has ARM Cortex-X4, ARM Cortex-A720 and ARM Cortex-A520 cores, but they all adhere to the same AArch64 specification level - ARMv9.2-A. Thus the kernel can freely move processes between them. I'm not aware of any that do mix levels, but I haven't looked very hard.
I think both sides of this argument provide good points, by removing this from chips such as the i9-13900K they give a reason for the 8- and 10-core SPR and EMR CPUs to exist, while playing safe on the compatibility front, although compiler-wise, I don't think it'd be more of a burden, the configuration is already there on Xeons anyway
They would have existed anyway since "bigger" CPUs bring more RAM channels and PCIe lanes along with core count increases, and well as Intel did that in the past. Ice Lake was shared by mobile, workstation and server segments including lower core count SKUs.
As for compilers, it's true that they contain optimization for Xeons, but they do differ in significant ways - "big" Golden Cove cores have an additional full 512-bit AVX-512 execution port (it matters for fine tuning of instruction ordering), and also have SGX and TSX which were removed from desktop/mobile processors.
 
Top