• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Intel Xeon Scalable "Ice Lake-SP" 28-core Die Detailed at Hot Chips - 18% IPC Increase

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
47,257 (7.54/day)
Location
Hyderabad, India
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard ASUS ROG Strix B450-E Gaming
Cooling DeepCool Gammax L240 V2
Memory 2x 8GB G.Skill Sniper X
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
Intel in the opening presentation of the Hot Chips 32 virtual conference detailed its next-generation Xeon Scalable "Ice Lake-SP" enterprise processor. Built on the company's 10 nm silicon fabrication process, "Ice Lake-SP" sees the first non-client and non-mobile deployment of the company's new "Sunny Cove" CPU core that introduces higher IPC than the "Skylake" core that's been powering Intel microarchitectures since 2015. While the "Sunny Cove" core itself is largely unchanged from its implementation in 10th Gen Core "Ice Lake-U" mobile processors, it conforms to the cache hierarchy and tile silicon topology of Intel's enterprise chips.

The "Ice Lake-SP" die Intel talked about in its Hot Chips 32 presentation had 28 cores. The "Sunny Cove" CPU core is configured with the same 48 KB L1D cache as its client-segment implementation, but a much larger 1280 KB (1.25 MB) dedicated L2 cache. The core also receives a second fused multiply/add (FMA-512) unit, which the client-segment implementation lacks. It also receives a handful new instruction sets exclusive to the enterprise segment, including AVX-512 VPMADD52, Vector-AES, Vector Carry-less Multiply, GFNI, SHA-NI, Vector POPCNT, Bit Shuffle, and Vector BMI. In one of the slides, Intel also detailed the performance uplifts from the new instructions compared to "Cascade Lake-SP".



Besides higher IPC from the "Sunny Cove" cores, an equally big selling point for "Ice Lake-SP" is its improved and modernized I/O. To begin with, the processor comes with an 8-channel DDR4 interface for the 28-core die. This is accomplished using four independent memory controller tiles, each with two memory channels. The previous-generation 28-core "Cascade Lake-SP" die features two memory controller tiles, each with three memory channels, making up its 6-channel interface. Each of the memory controllers features fixed-function hardware for total memory encryption using the AES-XTS 128-bit algorithm without CPU overhead. Improvements were also made to the per-channel scheduling.


Besides the 8-channel memory interface, the other big I/O feature is PCI-Express gen 4.0 bus. The 28-core "Ice Lake-SP" die features four independent PCIe Gen4 root-complex tiles. Intel enabled all enterprise-relevant features of PCIe Gen4, including I/O virtualization, and a P2P credit fabric to improve P2P bandwidth. The "Ice Lake-SP" silicon is also ready for Intel's Optane Persistent Memory 200 "Barlow Pass" series. The die also features three UPI links for inter-socket or inter-die communication, put out by independent UPI controllers that have their own mesh interconnect ring stops.


The main interconnect on the "Ice Lake-SP" die is a refined second-generation mesh interconnect, which coupled with the improved floor-planning and new features, works to significantly reduce interconnect-level latencies over the previous generation dies. Intel also worked to improve memory access and latencies beyond simply dialing up memory channels and frequency.


Besides benefiting from the switch to the 10 nm silicon fabrication process, "Ice Lake-SP" introduces several new power management latency reduction features, including Fast Core Frequency Change, and Coherent Mesh Fabric Drainless Frequency Change. Effort also appears to be put in the area of improving CPU core frequencies in response to AVX-512 workloads.


Intel Speed Select Technology is a unique new feature that allows admins to power-gate CPU cores to spend the processor's TDP budget to bump up frequency on fewer cores. This allows you to reconfigure the 28-core die to something like a 16-core or even 8-core die, on the fly, with aggressive clock speed profile. This feature is also used to power-gate cores when not needed, to minimize the server's power draw.


Intel is expected to launch its Xeon Scalable "Ice Lake-SP" processor within 2H-2020.

View at TechPowerUp Main Site
 
Joined
Feb 15, 2019
Messages
1,661 (0.78/day)
System Name Personal Gaming Rig
Processor Ryzen 7800X3D
Motherboard MSI X670E Carbon
Cooling MO-RA 3 420
Memory 32GB 6000MHz
Video Card(s) RTX 4090 ICHILL FROSTBITE ULTRA
Storage 4x 2TB Nvme
Display(s) Samsung G8 OLED
Case Silverstone FT04
SKUs core count ?
Frequency ?
TDP ?
Price ?
 

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
47,257 (7.54/day)
Location
Hyderabad, India
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard ASUS ROG Strix B450-E Gaming
Cooling DeepCool Gammax L240 V2
Memory 2x 8GB G.Skill Sniper X
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
SKUs core count ?
Frequency ?
TDP ?
Price ?
SKUs determined by Intel at launch, as are all other segmentation details like frequency, TDP, price.

As for core count, I predict Intel only has up to 28 cores per die, with which it will achieve SKUs with core counts ≤28 using a single die, and >28 cores using a 2-die MCM for up to 56 cores.

Hot Chips is more of a technical deep-dive conference. They seldom talk productization/segmentation in these sessions.
 
Joined
Nov 4, 2005
Messages
11,988 (1.72/day)
System Name Compy 386
Processor 7800X3D
Motherboard Asus
Cooling Air for now.....
Memory 64 GB DDR5 6400Mhz
Video Card(s) 7900XTX 310 Merc
Storage Samsung 990 2TB, 2 SP 2TB SSDs, 24TB Enterprise drives
Display(s) 55" Samsung 4K HDR
Audio Device(s) ATI HDMI
Mouse Logitech MX518
Keyboard Razer
Software A lot.
Benchmark Scores Its fast. Enough.
Intel. Hey look guys we can do it too..... also you need a new board.... and our TDP numbers are where the darts landed
 
Joined
Dec 26, 2016
Messages
290 (0.10/day)
Processor Ryzen 3900x
Motherboard B550M Steel Legend
Cooling XPX (custom loop)
Memory 32GB 3200MHz cl16
Video Card(s) 3080 with Bykski block (custom loop)
Storage 980 Pro
Case Fractal 804
Power Supply Focus Plus Gold 750FX
Mouse G603
Keyboard G610 brown
Software yes, lots!
What the hell is vector-aes?
 
Joined
Nov 6, 2016
Messages
1,755 (0.59/day)
Location
NH, USA
System Name Lightbringer
Processor Ryzen 7 2700X
Motherboard Asus ROG Strix X470-F Gaming
Cooling Enermax Liqmax Iii 360mm AIO
Memory G.Skill Trident Z RGB 32GB (8GBx4) 3200Mhz CL 14
Video Card(s) Sapphire RX 5700XT Nitro+
Storage Hp EX950 2TB NVMe M.2, HP EX950 1TB NVMe M.2, Samsung 860 EVO 2TB
Display(s) LG 34BK95U-W 34" 5120 x 2160
Case Lian Li PC-O11 Dynamic (White)
Power Supply BeQuiet Straight Power 11 850w Gold Rated PSU
Mouse Glorious Model O (Matte White)
Keyboard Royal Kludge RK71
Software Windows 10
SKUs determined by Intel at launch, as are all other segmentation details like frequency, TDP, price.

As for core count, I predict Intel only has up to 28 cores per die, with which it will achieve SKUs with core counts ≤28 using a single die, and >28 cores using a 2-die MCM for up to 56 cores.

Hot Chips is more of a technical deep-dive conference. They seldom talk productization/segmentation in these sessions.

Are you talking about a real 56 core CPU? Or like in the past with those BGA options that had huge power draw and basically didn't exist for all intents and purposes since you could only buy them in a system from Intel with no price offered? I'm sincerely asking
 
Joined
Jan 8, 2017
Messages
9,462 (3.28/day)
System Name Good enough
Processor AMD Ryzen R9 7900 - Alphacool Eisblock XPX Aurora Edge
Motherboard ASRock B650 Pro RS
Cooling 2x 360mm NexXxoS ST30 X-Flow, 1x 360mm NexXxoS ST30, 1x 240mm NexXxoS ST30
Memory 32GB - FURY Beast RGB 5600 Mhz
Video Card(s) Sapphire RX 7900 XT - Alphacool Eisblock Aurora
Storage 1x Kingston KC3000 1TB 1x Kingston A2000 1TB, 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) LG UltraGear 32GN650-B + 4K Samsung TV
Case Phanteks NV7
Power Supply GPS-750C
What the hell is vector-aes?

Some of the steps in AES can be parallelized, I assume that those will use vector instructions.
 
Joined
Feb 11, 2009
Messages
5,561 (0.96/day)
System Name Cyberline
Processor Intel Core i7 2600k -> 12600k
Motherboard Asus P8P67 LE Rev 3.0 -> Gigabyte Z690 Auros Elite DDR4
Cooling Tuniq Tower 120 -> Custom Watercoolingloop
Memory Corsair (4x2) 8gb 1600mhz -> Crucial (8x2) 16gb 3600mhz
Video Card(s) AMD RX480 -> RX7800XT
Storage Samsung 750 Evo 250gb SSD + WD 1tb x 2 + WD 2tb -> 2tb MVMe SSD
Display(s) Philips 32inch LPF5605H (television) -> Dell S3220DGF
Case antec 600 -> Thermaltake Tenor HTCP case
Audio Device(s) Focusrite 2i4 (USB)
Power Supply Seasonic 620watt 80+ Platinum
Mouse Elecom EX-G
Keyboard Rapoo V700
Software Windows 10 Pro 64bit
completely unrelated and irrelevant, I really like the yellowed background of these slides, makes it look old, like 80s old, like old paper or so.
 
Joined
Sep 17, 2014
Messages
22,498 (6.03/day)
Location
The Washing Machine
System Name Tiny the White Yeti
Processor 7800X3D
Motherboard MSI MAG Mortar b650m wifi
Cooling CPU: Thermalright Peerless Assassin / Case: Phanteks T30-120 x3
Memory 32GB Corsair Vengeance 30CL6000
Video Card(s) ASRock RX7900XT Phantom Gaming
Storage Lexar NM790 4TB + Samsung 850 EVO 1TB + Samsung 980 1TB + Crucial BX100 250GB
Display(s) Gigabyte G34QWC (3440x1440)
Case Lian Li A3 mATX White
Audio Device(s) Harman Kardon AVR137 + 2.1
Power Supply EVGA Supernova G2 750W
Mouse Steelseries Aerox 5
Keyboard Lenovo Thinkpad Trackpoint II
VR HMD HD 420 - Green Edition ;)
Software W11 IoT Enterprise LTSC
Benchmark Scores Over 9000
completely unrelated and irrelevant, I really like the yellowed background of these slides, makes it look old, like 80s old, like old paper or so.

I also like it for not burning my retina's out like white does.
 
Joined
Feb 20, 2019
Messages
8,302 (3.93/day)
System Name Bragging Rights
Processor Atom Z3735F 1.33GHz
Motherboard It has no markings but it's green
Cooling No, it's a 2.2W processor
Memory 2GB DDR3L-1333
Video Card(s) Gen7 Intel HD (4EU @ 311MHz)
Storage 32GB eMMC and 128GB Sandisk Extreme U3
Display(s) 10" IPS 1280x800 60Hz
Case Veddha T2
Audio Device(s) Apparently, yes
Power Supply Samsung 18W 5V fast-charger
Mouse MX Anywhere 2
Keyboard Logitech MX Keys (not Cherry MX at all)
VR HMD Samsung Oddyssey, not that I'd plug it into this though....
Software W10 21H1, barely
Benchmark Scores I once clocked a Celeron-300A to 564MHz on an Abit BE6 and it scored over 9000.
Oooof.

Intel is still pursuing monolithic die. At this point, having experienced a couple of years of EPYC servers, and seeing where the server market is headed (even if that's not currently the direction my servers are headed) it seems boneheaded to be chasing monolithic dies at this point.

I don't know the exact details but surely the cost of making a single HUGE die with 28 cores and that much cache without defects is exponentially difficult. Meanwhile AMD is successfully glueing lots of dirt-cheap 8C chiplets together with excellent yields, a fantastic interconnect, and doing so with linear cost and power increases. If you have a real need for cores, 28C simply doesn't cut it, and the minute you move to a 2P solution, all of the performance benefits of monolithic dies vanish, because the inter-socket interconnects and RAM latency are way worse than the inter-die interconnects on an Epyc chip.

There's a non-zero chance that by the time Intel goes mainstream with Ice Lake-SP 28-core products, AMD will be taping out their 128-core next-gen.
 
Joined
Jan 6, 2013
Messages
350 (0.08/day)
Having 1.25MB L2 cache it matches Tigerlake with its Willow Cove core, so IPC should be similar, 20-25% better than Skylake. So say this has the same frequency as Skylake parts (which I seriously doubt), then a 28 core Ice Lake SP would be similar to a 35 core Skylake. Not bad, but not nearly enough to equalize with Rome.

As for the guy before me, you'll need some time to see what Intel actually has in store...Sapphire Rapids will be the first actual competitive product. Ice Lake SP is just so 2018 and they know it. As for 128 core next-gen from AMD, that won't happen any time soon. I am pretty sure Milan will still be 64 cores max and maybe Zen 4 will afford to up the core count thanks to TSMC 5nm.
 
Last edited:
Joined
Jun 8, 2020
Messages
37 (0.02/day)
Having 1.25MB L2 cache it matches Tigerlake with its Willow Cove core, so IPC should be similar, 20-25% better than Skylake. So say this has the same frequency as Skylake parts (which I seriously doubt), then a 28 core Ice Lake SP would be similar to a 35 core Skylake. Not bad, but not nearly enough to equalize with Rome.

As for the guy before me, you'll need some time to see what Intel actually has in store...Sapphire Rapids will be the first actual competitive product. Ice Lake SP is just so 2018 and they know it. As for 128 core next-gen from AMD, that won't happen any time soon. I am pretty sure Milan will still be 64 cores max and maybe Zen 4 will afford to up the core count thanks to TSMC 5nm.

This uses Sunny Cove cores, and they are claiming 18% IPC increase (so worth about 33 Skylake cores at the same clock).

However, this isn't using 10SF process, but 10+, which has clock issues at the high-end - not such a problem for servers where base clock matters more. Might be a yield problem however. Let's see when it hits the market in quantity - I doubt this year except maybe a limited launch to tell the investors they managed it.
 
Joined
Feb 20, 2019
Messages
8,302 (3.93/day)
System Name Bragging Rights
Processor Atom Z3735F 1.33GHz
Motherboard It has no markings but it's green
Cooling No, it's a 2.2W processor
Memory 2GB DDR3L-1333
Video Card(s) Gen7 Intel HD (4EU @ 311MHz)
Storage 32GB eMMC and 128GB Sandisk Extreme U3
Display(s) 10" IPS 1280x800 60Hz
Case Veddha T2
Audio Device(s) Apparently, yes
Power Supply Samsung 18W 5V fast-charger
Mouse MX Anywhere 2
Keyboard Logitech MX Keys (not Cherry MX at all)
VR HMD Samsung Oddyssey, not that I'd plug it into this though....
Software W10 21H1, barely
Benchmark Scores I once clocked a Celeron-300A to 564MHz on an Abit BE6 and it scored over 9000.
you'll need some time to see what Intel actually has in store...Sapphire Rapids will be the first actual competitive product. Ice Lake SP is just so 2018 and they know it. As for 128 core next-gen from AMD, that won't happen any time soon. I am pretty sure Milan will still be 64 cores max and maybe Zen 4 will afford to up the core count thanks to TSMC 5nm.
I know AMD is going to remain at 8C/16T per chiplet for Milan, that's already been confirmed.

My speculation about Milan's successor is based on a hunch, but a realistic one - here's my 'logic' if you want to call it that.

Zen2 scales very well from 1, 2, 4, 8 chiplets. Sure, there are losses and latencies, but in the grand scheme of things the infinity fabric works well enough that scaling to 16 chiplets is plausible.
Zen3's major change is that a chiplet contains a unified CCD that combines both 4-core CCX of Zen2 into a single 8-core block. There's nothing to stop AMD reverting Zen4 to a pair of 8-core Zen3 CCXs to make a 16-core CCD, and then having 8 of those.

Speculation, take with a pinch of salt etc - but the core count race has been on for a few years now and if AMD feels there's a market for 128-core parts, it doesn't have to do anything technically radical to achieve that. Also, the 8C chiplet for Zen2 has spent a year serving the entire market by itself, so the 3100/3300X wouldn't be possible with a 16C chiplet but that's changed now with Renoir (and the upcoming Cezanne APUs) to serve the lower-half of the consumer market, freeing up AMD the potential to double the size of its chiplet without making big gaps in its product stack. We will see 4C and 6C Zen3 chips but we may get double core counts with Zen4, Especially now that Intel has nearly doubled its thread count for consumers despite being stuck on a dated architecture and process node.
 
Joined
Jun 10, 2014
Messages
2,987 (0.78/day)
Processor AMD Ryzen 9 5900X ||| Intel Core i7-3930K
Motherboard ASUS ProArt B550-CREATOR ||| Asus P9X79 WS
Cooling Noctua NH-U14S ||| Be Quiet Pure Rock
Memory Crucial 2 x 16 GB 3200 MHz ||| Corsair 8 x 8 GB 1333 MHz
Video Card(s) MSI GTX 1060 3GB ||| MSI GTX 680 4GB
Storage Samsung 970 PRO 512 GB + 1 TB ||| Intel 545s 512 GB + 256 GB
Display(s) Asus ROG Swift PG278QR 27" ||| Eizo EV2416W 24"
Case Fractal Design Define 7 XL x 2
Audio Device(s) Cambridge Audio DacMagic Plus
Power Supply Seasonic Focus PX-850 x 2
Mouse Razer Abyssus
Keyboard CM Storm QuickFire XT
Software Ubuntu
Having 1.25MB L2 cache it matches Tigerlake with its Willow Cove core, so IPC should be similar, 20-25% better than Skylake. So say this has the same frequency as Skylake parts (which I seriously doubt), then a 28 core Ice Lake SP would be similar to a 35 core Skylake. Not bad, but not nearly enough to equalize with Rome.
You can't really compare cores that way, most workloads don't scale linearly with core count.
Ice Lake-SP loses a lot less clock speed during heavy AVX workloads, so there are substantial gains beyond just IPC.

Ice Lake-SP features the same cache system as Tiger Lake, so it should feature the performance benefits from this change. Ice Lake-SP should offer 18% IPC gains over Skylake-SP, not Skylake client.

As for the guy before me, you'll need some time to see what Intel actually has in store...Sapphire Rapids will be the first actual competitive product. Ice Lake SP is just so 2018 and they know it. As for 128 core next-gen from AMD, that won't happen any time soon. I am pretty sure Milan will still be 64 cores max and maybe Zen 4 will afford to up the core count thanks to TSMC 5nm.
While Ice Lake-SP might be delayed, how is it "2018"? It offers performance per core beyond any competitor on the market (as of right now).
I think many of you guys in here are far too fixated on specs and theoretical benchmarks. Servers are nearly always built for a specific purpose, and the hardware is chosen based on that purpose. For many workloads, including heavy AVX workloads, Intel is still king. So it's not really just about core count or cores per dollar. Servers are not built to get Cinebench or Geekbench scores.

But the interesting part will be Ice Lake-X vs. Zen 3, which will become more relevant for many of us.
 
Top