• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD's Future Ryzen SoCs May Feature New Chip-Stacking Technology

Nomad76

News Editor
Staff member
Joined
May 21, 2024
Messages
675 (3.61/day)
AMD has recently filed a patent revealing plans to implement "multi-chip stacking" in future Ryzen SoCs, as Wccftech reports, quoting a post on X from @coreteks: "New patent from AMD shows how future Zen SoCs could look. Basically a novel packaging design that enables compact chip stacking and interconnection by having them partially overlap, as in this figure. The dotted line is a larger die stacked on top of those smaller ones". The patent details a new approach where smaller chiplets partially overlap with a larger die, creating space for additional components and functions on the same die. This strategy aims to improve the efficiency of the contact area, thus making room for higher core counts, larger caches, and increased memory bandwidth within the same die size. The proposed stacking will reduce the physical distance between components through overlapping chiplets, thus minimizing interconnect latency and achieving faster communication between different chip parts. The design will also improve power management, as the segregated chiplets allow for better control of each unit through power gating.

Even if long-time rival Intel has lost some of its momentum (and market share) this year, AMD's chance to push ahead with its intention to become number one in the market is to continue to innovate. In the same way that its 3D V-Cache technology made the X3D processor lineup so successful, this chip stacking approach could play a major role in future AMD Ryzen SoCs. It seems that AMD is committed to moving away from the monolithic design era and taking the road of multi-chiplet; however, it can be a long wait until (and if) this chip stacking will complete the journey from patents to design, production, and final product.



View at TechPowerUp Main Site | Source
 
Joined
Mar 13, 2021
Messages
472 (0.35/day)
Processor AMD 7600x
Motherboard Asrock x670e Steel Legend
Cooling Silver Arrow Extreme IBe Rev B with 2x 120 Gentle Typhoons
Memory 4x16Gb Patriot Viper Non RGB @ 6000 30-36-36-36-40
Video Card(s) XFX 6950XT MERC 319
Storage 2x Crucial P5 Plus 1Tb NVME
Display(s) 3x Dell Ultrasharp U2414h
Case Coolermaster Stacker 832
Power Supply Thermaltake Toughpower PF3 850 watt
Mouse Logitech G502 (OG)
Keyboard Logitech G512
This could be a massive leap from Infinity fabric and would be interesting to see if they could do a larger IO Die with things like quad channel memory or more CUs to rival things like quicksync while leaving enough space for CCDs to connect via TSVs which will hopefully eliminate the latency penalty that the current chiplet design suffers from by going through the substrate.

I would also wonder if it means things like NPUs/Accelerators could be interchanged to maximise the product stack across the differing markets. So you could have for example one part with 6 CCDs and 2 Accelerators and another with 2 CCDs and 6 accelerators and everything in between. With AMD moving to chiplet design in GPU as well, perhaps that could be a third option for those stacks.

ROCm needs to be AMDs focus in the short to mid term software wise to start taking some market share from nVidia and to make their CPUs have a real USP over an equivalent Intel.
 
Joined
Dec 24, 2022
Messages
78 (0.11/day)
Before it reaches consumer market, if ever, it will be introduced in servers first. Certainly it will increase price and considering how PCIe 5.0 increased pc hardware prices in general, the genral consumer has no need for such thing.
 
Joined
Jul 29, 2022
Messages
506 (0.60/day)
Yeah, this was mentioned on videocardz one or two weeks ago. Basically they are extrapolating the 9800X3D 3d stacking. The 9800X3D has the memory on bottom and CPU cores on the top for heat reasons. This has several components on the bottom and presumably the most heat intensive parts will go on top. Someone mentioned that the upcoming Halo APUs after Strix Halo will use this. Put cache and cpu cores on the bottom, and a giant GPU on top of them all, to minimize latency for the cache, which the memory starved GPU really really needs.

The one big problem with chiplets was that the interconnects had latency issues and consumed a lot of power, which is why high-end chiplet RDNA3 GPUs did not perform well, to the point that the RDNA4 equivalents have been cancelled. This seeks to improve on those.

What all that in mind, this may not help a lot for CPUs, but it can be one hell of a huge step for GPUs and APUs.... in 2026-7.
 
Joined
Dec 25, 2020
Messages
6,722 (4.70/day)
Location
São Paulo, Brazil
System Name "Icy Resurrection"
Processor 13th Gen Intel Core i9-13900KS Special Edition
Motherboard ASUS ROG MAXIMUS Z790 APEX ENCORE
Cooling Noctua NH-D15S upgraded with 2x NF-F12 iPPC-3000 fans and Honeywell PTM7950 TIM
Memory 32 GB G.SKILL Trident Z5 RGB F5-6800J3445G16GX2-TZ5RK @ 7600 MT/s 36-44-44-52-96 1.4V
Video Card(s) ASUS ROG Strix GeForce RTX™ 4080 16GB GDDR6X White OC Edition
Storage 500 GB WD Black SN750 SE NVMe SSD + 4 TB WD Red Plus WD40EFPX HDD
Display(s) 55-inch LG G3 OLED
Case Pichau Mancer CV500 White Edition
Power Supply EVGA 1300 G2 1.3kW 80+ Gold
Mouse Microsoft Classic Intellimouse
Keyboard Generic PS/2
Software Windows 11 IoT Enterprise LTSC 24H2
Benchmark Scores I pulled a Qiqi~
I wonder if full 3D integration is still too costly or unfeasible for desktops. Perhaps they don't quite meet performance targets yet. Intel certainly seems to have greatly delayed/indefinitely postponed their implementation, and AMD certainly seems in no rush to abandon the chiplet design, opting to further refining it instead.
 
Joined
Mar 6, 2018
Messages
132 (0.05/day)
I wonder if full 3D integration is still too costly or unfeasible for desktops. Perhaps they don't quite meet performance targets yet. Intel certainly seems to have greatly delayed/indefinitely postponed their implementation, and AMD certainly seems in no rush to abandon the chiplet design, opting to further refining it instead.
Ask Intel.
 
Joined
Sep 15, 2015
Messages
1,074 (0.32/day)
Location
Latvija
System Name Fujitsu Siemens, HP Workstation
Processor Athlon x2 5000+ 3.1GHz, i5 2400
Motherboard Asus
Memory 4GB Samsung
Video Card(s) rx 460 4gb
Storage 750 Evo 250 +2tb
Display(s) Asus 1680x1050 4K HDR
Audio Device(s) Pioneer
Power Supply 430W
Mouse Acme
Keyboard Trust
Can't understand this part drawing.
Maybe doted line chip is backside
 
Joined
Feb 11, 2009
Messages
5,549 (0.96/day)
System Name Cyberline
Processor Intel Core i7 2600k -> 12600k
Motherboard Asus P8P67 LE Rev 3.0 -> Gigabyte Z690 Auros Elite DDR4
Cooling Tuniq Tower 120 -> Custom Watercoolingloop
Memory Corsair (4x2) 8gb 1600mhz -> Crucial (8x2) 16gb 3600mhz
Video Card(s) AMD RX480 -> RX7800XT
Storage Samsung 750 Evo 250gb SSD + WD 1tb x 2 + WD 2tb -> 2tb MVMe SSD
Display(s) Philips 32inch LPF5605H (television) -> Dell S3220DGF
Case antec 600 -> Thermaltake Tenor HTCP case
Audio Device(s) Focusrite 2i4 (USB)
Power Supply Seasonic 620watt 80+ Platinum
Mouse Elecom EX-G
Keyboard Rapoo V700
Software Windows 10 Pro 64bit
Title of this article just annoys me, like clickbait youtube videos:

"AMD's Future Ryzen SoC's May Feature New Chip-Stacking Technology"

Wow so future stuff might use new tech??!!? unheard off, usually future stuff uses old tech.....
 
Joined
Dec 12, 2016
Messages
1,840 (0.63/day)
Cache on the bottom, Zen dense cores in the middle and Zen classic cores on top. Sounds like the perfect chip sandwich. Put a dozen of those on package and you have yourself a party platter.
 

SL2

Joined
Jan 27, 2006
Messages
2,447 (0.36/day)
Title of this article just annoys me, like clickbait youtube videos:

"AMD's Future Ryzen SoC's May Feature New Chip-Stacking Technology"

Wow so future stuff might use new tech??!!? unheard off, usually future stuff uses old tech.....
Not really. This is not just new tech in the broadesst of terms, it points directly to a specific patent (which I haven't bothered looking up).

Remove the word "future" and you just add confusion.

A bait would be: "AMD's insane new chip-stacking - Intel is doomed*?"


*I'm not saying it already is/isn't

Give Nomad a break ffs
 
Joined
Jan 11, 2022
Messages
870 (0.83/day)
The one big problem with chiplets was that the interconnects had latency issues and consumed a lot of power, which is why high-end chiplet RDNA3 GPUs did not perform well, to the point that the RDNA4 equivalents have been cancelled. This seeks to improve on those.
I think it’s more of a tsmc can only produce so much of the stuff and nvidia gobbled it all up.
same happened to hbm
 
Joined
Dec 25, 2020
Messages
6,722 (4.70/day)
Location
São Paulo, Brazil
System Name "Icy Resurrection"
Processor 13th Gen Intel Core i9-13900KS Special Edition
Motherboard ASUS ROG MAXIMUS Z790 APEX ENCORE
Cooling Noctua NH-D15S upgraded with 2x NF-F12 iPPC-3000 fans and Honeywell PTM7950 TIM
Memory 32 GB G.SKILL Trident Z5 RGB F5-6800J3445G16GX2-TZ5RK @ 7600 MT/s 36-44-44-52-96 1.4V
Video Card(s) ASUS ROG Strix GeForce RTX™ 4080 16GB GDDR6X White OC Edition
Storage 500 GB WD Black SN750 SE NVMe SSD + 4 TB WD Red Plus WD40EFPX HDD
Display(s) 55-inch LG G3 OLED
Case Pichau Mancer CV500 White Edition
Power Supply EVGA 1300 G2 1.3kW 80+ Gold
Mouse Microsoft Classic Intellimouse
Keyboard Generic PS/2
Software Windows 11 IoT Enterprise LTSC 24H2
Benchmark Scores I pulled a Qiqi~
Ask Intel.

My question was more generic in nature. I feel pretty horrible right now so excuse me if I worded things poorly, what I meant to say is that neither Intel nor AMD have released a fully integrated 3D packaged x86 SoC since Lakefield (largely a prototype), and neither seem to be planning on doing so imminently.

The way I interpreted this news post is that they'll be using something like CoWoS to essentially "bond" the chiplets together through an interposer which will essentially serve the purpose of interconnect, achieving monolithic-like access speeds across chiplets or very close to it, somewhat like an immensely high bandwidth ring bus?
 
Joined
Feb 3, 2017
Messages
3,749 (1.32/day)
Processor Ryzen 7800X3D
Motherboard ROG STRIX B650E-F GAMING WIFI
Memory 2x16GB G.Skill Flare X5 DDR5-6000 CL36 (F5-6000J3636F16GX2-FX5)
Video Card(s) INNO3D GeForce RTX™ 4070 Ti SUPER TWIN X2
Storage 2TB Samsung 980 PRO, 4TB WD Black SN850X
Display(s) 42" LG C2 OLED, 27" ASUS PG279Q
Case Thermaltake Core P5
Power Supply Fractal Design Ion+ Platinum 760W
Mouse Corsair Dark Core RGB Pro SE
Keyboard Corsair K100 RGB
VR HMD HTC Vive Cosmos
This could be a massive leap from Infinity fabric and would be interesting to see if they could do a larger IO Die with things like quad channel memory or more CUs to rival things like quicksync while leaving enough space for CCDs to connect via TSVs which will hopefully eliminate the latency penalty that the current chiplet design suffers from by going through the substrate.
Why would that change something significantly in terms of Infinity Fabric? They'll run IF over silicon an TSVs or something but it will still be IF. The latency penalty has quite a bit to do with distance and less about going over substrate. Power is a different thing though.

My question was more generic in nature. I feel pretty horrible right now so excuse me if I worded things poorly, what I meant to say is that neither Intel nor AMD have released a fully integrated 3D packaged x86 SoC since Lakefield (largely a prototype), and neither seem to be planning on doing so imminently.
Isn't Arrow Lake today doing precisely that and same as Lakefield? It is chiplets/tiles over a silicon interposer. Today from what we know it still seems to be a passive interposer though.

The way I interpreted this news post is that they'll be using something like CoWoS to essentially "bond" the chiplets together through an interposer which will essentially serve the purpose of interconnect, achieving monolithic-like access speeds across chiplets or very close to it, somewhat like an immensely high bandwidth ring bus?
There does not seem to be much out there about what the patent is actually about. Is it the partial overlap? I am struggling a bit to see the significance of that. Size and yield considerations for the interposer maybe?
 
Last edited:
Joined
Jan 3, 2021
Messages
3,488 (2.46/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
Cache on the bottom, Zen dense cores in the middle and Zen classic cores on top. Sounds like the perfect chip sandwich. Put a dozen of those on package and you have yourself a party platter.
Wow, and its performance will be measured in pizzas per minute that it can bake! PPM!
 

Aquinus

Resident Wat-man
Joined
Jan 28, 2012
Messages
13,171 (2.81/day)
Location
Concord, NH, USA
System Name Apollo
Processor Intel Core i9 9880H
Motherboard Some proprietary Apple thing.
Memory 64GB DDR4-2667
Video Card(s) AMD Radeon Pro 5600M, 8GB HBM2
Storage 1TB Apple NVMe, 4TB External
Display(s) Laptop @ 3072x1920 + 2x LG 5k Ultrafine TB3 displays
Case MacBook Pro (16", 2019)
Audio Device(s) AirPods Pro, Sennheiser HD 380s w/ FIIO Alpen 2, or Logitech 2.1 Speakers
Power Supply 96w Power Adapter
Mouse Logitech MX Master 3
Keyboard Logitech G915, GL Clicky
Software MacOS 12.1
AMD put the 3d v cache under the CCDs for current gen to get around the heat flux problem with heat having to travel through the cache. If we're stacking logic on top of eachother, how do they intend to solve the heat problem? Any logic sandwiched between other logic circuits are going to get very hot.
 

freeagent

Moderator
Staff member
Joined
Sep 16, 2018
Messages
8,534 (3.77/day)
Location
Winnipeg, Canada
Processor AMD R7 5800X3D
Motherboard Asus Crosshair VIII Dark Hero
Cooling Thermalright Frozen Edge 360, 3x TL-B12 V2, 2x TL-B12 V1
Memory 2x8 G.Skill Trident Z Royal 3200C14, 2x8GB G.Skill Trident Z Black and White 3200 C14
Video Card(s) Zotac 4070 Ti Trinity OC
Storage WD SN850 1TB, SN850X 2TB, SN770 1TB
Display(s) LG 50UP7100
Case Fractal Torrent Compact
Audio Device(s) JBL Bar 700
Power Supply Seasonic Vertex GX-1000, Monster HDP1800
Mouse Logitech G502 Hero
Keyboard Logitech G213
VR HMD Oculus 3
Software Yes
Benchmark Scores Yes
I also suspect they buried the cache so that they didn't have to use prime silicon to lower thermals.

This should be pretty interesting.
 
Top