• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Intel's Sapphire Rapids Xeons to Feature up to 64 GB of HBM2e Memory

AleksandarK

News Editor
Staff member
Joined
Aug 19, 2017
Messages
2,809 (1.02/day)
During the Supercomputing (SC) 21 event, Intel has disclosed additional information regarding the company's upcoming Xeon server processor lineup, codenamed Sapphire Rapids. One of the central areas of improvement for the new processor generation is the core architecture based on Golden Cove, the same core found in Alder Lake processors for consumers. However, the only difference between the Golden Cove variant found in Alder Lake and Sapphire Rapids is the amount of L2 (level two) cache. With Alder Lake, Intel equipped each core with 1.25 MB of its L2 cache. However, with Sapphire Rapids, each core receives a 2 MB bank.

One of the most exciting things about the processors, confirmed by Intel today, is the inclusion of High-Bandwidth Memory (HBM). These processors operate with eight memory channels carrying DDR5 memory and offer PCIe Gen5 IO expansion. Intel has confirmed that Sapphire Rapids Xeons will feature up to 64 GB of HBM2E memory, including a few operating modes. The first is a simple HBM caching mode, where the HBM memory acts as a buffer for the installed DDR5. This method is transparent to software and allows easy usage. The second method is Flat Mode, which means that both DDR5 and HBM are used as contiguous address spaces. And finally, there exists an HBM-only mode that utilizes the HBM2E modules as the only system memory, and applications fit inside it. This has numerous benefits, primarily drawn from HBM's performance and reduced latency.


View at TechPowerUp Main Site
 
Joined
Aug 10, 2007
Messages
2,174 (0.34/day)
Location
Austin TX
System Name Beyond Journeys End
Processor Ryzen 7 9700x
Motherboard ROG STRIX B850-I
Cooling Fractal 280 w/ Thermaltake toughfans
Memory 32Gb Gskill DDR5 6000
Video Card(s) 3080ti Game Rock
Storage Intel 905p 960gb + 2tb SKhynix P41
Display(s) LG C2 OLED 42"
Case Fractal Era 2
Audio Device(s) HD58X
Power Supply Asus loki 850 SFX-L
Mouse Glorious Model O
Keyboard Akko Mod 007b HE
Software Tiny 11 + Ubuntu dual boot
Would be interesting to see a cpu running an os with no ram installed...
 
Joined
Feb 3, 2017
Messages
3,921 (1.33/day)
Processor Ryzen 7800X3D
Motherboard ROG STRIX B650E-F GAMING WIFI
Memory 2x16GB G.Skill Flare X5 DDR5-6000 CL36 (F5-6000J3636F16GX2-FX5)
Video Card(s) INNO3D GeForce RTX™ 4070 Ti SUPER TWIN X2
Storage 2TB Samsung 980 PRO, 4TB WD Black SN850X
Display(s) 42" LG C2 OLED, 27" ASUS PG279Q
Case Thermaltake Core P5
Power Supply Fractal Design Ion+ Platinum 760W
Mouse Corsair Dark Core RGB Pro SE
Keyboard Corsair K100 RGB
VR HMD HTC Vive Cosmos
Would be interesting to see a cpu running an os with no ram installed...
RAM? At size like 64GB, it would not need storage either :D
 

AleksandarK

News Editor
Staff member
Joined
Aug 19, 2017
Messages
2,809 (1.02/day)
RAM? At size like 64GB, it would not need storage either :D
Fujitsu A64FX, that powers Fugaku supercomputer, uses 32 GB of HBM as well, and IIRC only uses this for RAM. It has no problems being the fastest pre-exascale supercomputer for now :)
 
Joined
May 31, 2016
Messages
4,485 (1.41/day)
Location
Currently Norway
System Name Bro2
Processor Ryzen 5800X
Motherboard Gigabyte X570 Aorus Elite
Cooling Corsair h115i pro rgb
Memory 32GB G.Skill Flare X 3200 CL14 @3800Mhz CL16
Video Card(s) Powercolor 6900 XT Red Devil 1.1v@2400Mhz
Storage M.2 Samsung 970 Evo Plus 500MB/ Samsung 860 Evo 1TB
Display(s) LG 27UD69 UHD / LG 27GN950
Case Fractal Design G
Audio Device(s) Realtec 5.1
Power Supply Seasonic 750W GOLD
Mouse Logitech G402
Keyboard Logitech slim
Software Windows 10 64 bit
I think this HBM mem in general has a very cool implementations. It would have helped a lot in some cases. Too bad it is quite expensive but maybe it will change?
 
Joined
Oct 12, 2005
Messages
735 (0.10/day)
I am curious how the cache mode will perform. it it's too granular, it might require too much processing power to operate. I suspect they will just cache large chunk of main memory.

For the contiguous, I wonder if it will be shown as NUMA or UMA.

I hope the HBMe only version is a low core count because else, it will be so much memory starved.
 
Joined
Aug 24, 2004
Messages
217 (0.03/day)
Does the memory speed match the cpu? That's what I would like to know. This could put RAM companies out of business if this becomes the norm.
 
Joined
Apr 15, 2021
Messages
895 (0.63/day)
Does the memory speed match the cpu? That's what I would like to know. This could put RAM companies out of business if this becomes the norm.
I wouldn't expect something like this to become available for your average or even high-end desktops any time soon, so I think the future of companies producing RAM is still secure and will be so for quite some time.
 
Joined
Aug 24, 2004
Messages
217 (0.03/day)
I wouldn't expect something like this to become available for your average or even high-end desktops any time soon, so I think the future of companies producing RAM is still secure and will be so for quite some time.
True. However this might be highly beneficial for AMD who's current cpu's are picky about what RAM you are using.
 
Joined
Jan 2, 2019
Messages
195 (0.09/day)
The HBM2e technology that Intel uses on Intel Sapphire Rapids Xeon CPUs is Not new and was "borrowed"
from Intel Knights Landing ( KNL ) Xeon Phi architecture.

On Intel KNL-series CPUs it was called as MCDRAM and here are core features of these CPUs:

Code name: Knights Landing ( KNL )
Process technology: 14nm
On-Package Memory: High Bandwidth MCDRAM ( up to 16GB / bandwidth >400GB/s )
Regular Memory: DDR4 ( up to 384GB / bandwidth > 80GB/s )
Instruction Set Architecture: Intel AVX-512 ( vector length 512-bit )

Supports Memory modes of MCDRAM:
- Cache
- Flat
- Hybrid
- MCDRAM only

Supports Cluster modes:
- All2All
- SNC-2
- SNC-4
- Hemisphere
- Quadrant

I've worked with an Intel KNL-server with Xeon Phi Processor 7210 CPU:

https://ark.intel.com/products/94033/Intel-Xeon-Phi-Processor-7210-16GB-1_30-GHz-64-core

Intel Xeon Phi Processor 7210 ( 16GB, 1.30 GHz, 64 core )
Cores : 64
Processors ( CPUs ) : 256
Threads per core : 4
Peak Processing Power: 2.662 TFLOPs ( Single Precision )

In order to see how Memory- and Cluster-modes worked in "action" take a look at these
two Video Technical Reports ( VTRs ):

Strassen Matrix Multiplication algorithms on Intel KNL Server ( VTR-112 )
( Video Slides 12, 23, 33, 34, 42 and 50 )

Performance of Classic Matrix Multiplication algorithm on a Server System ( VTR-048 )
( Video Slides 25, 28, 29, 32, 33, 34 and 35 )

>>...For the contiguous, I wonder if it will be shown as NUMA...

It supports NUMA.
 
Last edited:
Top