
TSMC N3 Nodes Show SRAM Scaling is Hitting the Wall

AleksandarK

News Editor
Staff member
Joined
Aug 19, 2017
Messages
2,723 (1.01/day)
When TSMC introduced its N3 lineup of nodes, the company only talked about the logic scaling of the two new semiconductor manufacturing steps. It turns out there was a reason for that: WikiChip confirms that the SRAM bit cells of the N3 nodes are almost identical to those of the N5 nodes. At its 2023 Technology Symposium, TSMC presented additional details about the N3 lineup, including logic and SRAM density. For starters, N3 is TSMC's "3 nm" node family, which has two products: a base N3 node (N3B) and an enhanced N3 node (N3E). N3B uses a self-aligned contact (SAC) scheme that is new for TSMC, although Intel introduced it back in 2011 with its 22 nm node; the scheme improves the node's yield.

Regardless of N3's logic density improvements over the previous-generation N5, the SRAM density is almost identical. Initially, TSMC claimed that N3B SRAM density was 1.2x that of the N5 process. However, recent information shows that the actual improvement is merely 5%. With SRAM taking up a large portion of a processor's transistor and area budget, N3B's soaring manufacturing costs are harder to justify when there is almost no area improvement. SRAM scaling had been lagging behind logic scaling for some time; now the two have completely decoupled.
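
As a rough sanity check of these figures, the density gain can be worked out from bit cell areas, since density is inversely proportional to cell area. The sketch below assumes the high-density bit cell areas reported by WikiChip (0.021 µm² for N5, 0.0199 µm² for N3B, 0.021 µm² for N3E). Those numbers and the function names are not part of the article text above and are only illustrative.

```python
# Assumed high-density SRAM bit cell areas in square micrometres,
# as reported by WikiChip; they are not stated in the article text.
BITCELL_AREA_UM2 = {"N5": 0.021, "N3B": 0.0199, "N3E": 0.021}


def density_gain(old_area_um2: float, new_area_um2: float) -> float:
    """Density ratio new/old; density is inversely proportional to cell area."""
    return old_area_um2 / new_area_um2


def area_for_gain(old_area_um2: float, gain: float) -> float:
    """Bit cell area that a claimed density gain would imply."""
    return old_area_um2 / gain


if __name__ == "__main__":
    n5 = BITCELL_AREA_UM2["N5"]
    for node in ("N3B", "N3E"):
        gain = density_gain(n5, BITCELL_AREA_UM2[node])
        print(f"{node} vs N5: {gain:.3f}x SRAM density")
    # What the original 1.2x claim would have required of the bit cell.
    print(f"A 1.2x claim implies a {area_for_gain(n5, 1.2):.4f} um^2 cell")
```

Under these assumed areas, N3B comes out at roughly 1.05x the N5 bit cell density and N3E at 1.0x, which is in line with the "merely 5%" figure and well short of the 1.2x claim.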



View at TechPowerUp Main Site | Source
 
Joined
Dec 26, 2006
Messages
3,896 (0.59/day)
Location
Northern Ontario Canada
Processor Ryzen 5700x
Motherboard Gigabyte X570S Aero G R1.1 BiosF5g
Cooling Noctua NH-C12P SE14 w/ NF-A15 HS-PWM Fan 1500rpm
Memory Micron DDR4-3200 2x32GB D.S. D.R. (CT2K32G4DFD832A)
Video Card(s) AMD RX 6800 - Asus Tuf
Storage Kingston KC3000 1TB & 2TB & 4TB Corsair MP600 Pro LPX
Display(s) LG 27UL550-W (27" 4k)
Case Be Quiet Pure Base 600 (no window)
Audio Device(s) Realtek ALC1220-VB
Power Supply SuperFlower Leadex V Gold Pro 850W ATX Ver2.52
Mouse Mionix Naos Pro
Keyboard Corsair Strafe with browns
Software W10 22H2 Pro x64
When you take the time to look at the graph and realize it's logarithmic on the area axis, it has been flatlined since 5nm, and if you include 7nm, it's still a pretty flat line.
 

Count von Schwalbe

Nocturnus Moderatus
Staff member
Joined
Nov 15, 2021
Messages
3,227 (2.78/day)
Location
Knoxville, TN, USA
System Name Work Computer | Unfinished Computer
Processor Core i7-6700 | Ryzen 5 5600X
Motherboard Dell Q170 | Gigabyte Aorus Elite Wi-Fi
Cooling A fan? | Truly Custom Loop
Memory 4x4GB Crucial 2133 C17 | 4x8GB Corsair Vengeance RGB 3600 C26
Video Card(s) Dell Radeon R7 450 | RTX 2080 Ti FE
Storage Crucial BX500 2TB | TBD
Display(s) 3x LG QHD 32" GSM5B96 | TBD
Case Dell | Heavily Modified Phanteks P400
Power Supply Dell TFX Non-standard | EVGA BQ 650W
Mouse Monster No-Name $7 Gaming Mouse| TBD
How long until the SRAM is on a separate chip entirely (think X3D style) and the logic chip is only cores and interconnect?
 
Joined
Nov 26, 2021
Messages
1,730 (1.51/day)
Location
Mississauga, Canada
Processor Ryzen 7 5700X
Motherboard ASUS TUF Gaming X570-PRO (WiFi 6)
Cooling Noctua NH-C14S (two fans)
Memory 2x16GB DDR4 3200
Video Card(s) Reference Vega 64
Storage Intel 665p 1TB, WD Black SN850X 2TB, Crucial MX300 1TB SATA, Samsung 830 256 GB SATA
Display(s) Nixeus NX-EDG27, and Samsung S23A700
Case Fractal Design R5
Power Supply Seasonic PRIME TITANIUM 850W
Mouse Logitech
VR HMD Oculus Rift
Software Windows 11 Pro, and Ubuntu 20.04
Backside power delivery, or PowerVia in Intel parlance, should help with SRAM scaling. Nanosheet transistors will also help, but these are all slated for either Intel's 20A node or TSMC's N2P node. These aren't expected to be available until 2024 and 2026 respectively.

How long until the SRAM is on a separate chip entirely (think X3D style) and the logic chip is only cores and interconnect?
That will increase latency of SRAM as off-chip communication is costly in both latency and power. It could only be done with large, last level caches like AMD's LLC for RDNA3. Smaller caches like L1 and L2 will remain on-chip.
 
Joined
Sep 1, 2020
Messages
2,466 (1.54/day)
Location
Bulgaria
It's a miracle that some SRAM scaling still fits between 7nm and 3nm. ASML's 3000 series (3400 and 3600) lithography scanners both use the same wavelength.
[Attached image: Screenshot_2023-05-29-21-01-17-40_40deb401b9ffe8e1df2f1cc5ba480b12.jpg]
 
Joined
Nov 26, 2021
Messages
1,730 (1.51/day)
Location
Mississauga, Canada
It's a miracle that some SRAM scaling still fits between 7nm and 3nm. ASML's 3000 series (3400 and 3600) lithography scanners both use the same wavelength.
It's not a miracle. The light source is a necessary part of the process, but its wavelength doesn't govern the minimum feature sizes of current processes, which are all larger than 13.5 nm. Besides, N7 doesn't use EUV; it uses light with a wavelength of 193 nm.
 
Joined
Dec 26, 2020
Messages
382 (0.26/day)
System Name Incomplete thing 1.0
Processor Ryzen 2600
Motherboard B450 Aorus Elite
Cooling Gelid Phantom Black
Memory HyperX Fury RGB 3200 CL16 16GB
Video Card(s) Gigabyte 2060 Gaming OC PRO
Storage Dual 1TB 970evo
Display(s) AOC G2U 1440p 144hz, HP e232
Case CM mb511 RGB
Audio Device(s) Reloop ADM-4
Power Supply Sharkoon WPM-600
Mouse G502 Hero
Keyboard Sharkoon SGK3 Blue
Software W10 Pro
Benchmark Scores 2-5% over stock scores
It's a miracle that some SRAM scaling still fits between 7nm and 3nm. ASML's 3000 series (3400 and 3600) lithography scanners both use the same wavelength.
Any chance of any new scanners having shorter wavelengths then? If we can't go further than that we'll be stuck with the chips only getting tiny improvements.
 
Joined
Mar 10, 2010
Messages
11,880 (2.19/day)
Location
Manchester uk
System Name RyzenGtEvo/ Asus strix scar II
Processor Amd R5 5900X/ Intel 8750H
Motherboard Crosshair hero8 impact/Asus
Cooling 360EK extreme rad+ 360$EK slim all push, cpu ek suprim Gpu full cover all EK
Memory Gskill Trident Z 3900cas18 32Gb in four sticks./16Gb/16GB
Video Card(s) Asus tuf RX7900XT /Rtx 2060
Storage Silicon power 2TB nvme/8Tb external/1Tb samsung Evo nvme 2Tb sata ssd/1Tb nvme
Display(s) Samsung UAE28"850R 4k freesync.dell shiter
Case Lianli 011 dynamic/strix scar2
Audio Device(s) Xfi creative 7.1 on board ,Yamaha dts av setup, corsair void pro headset
Power Supply corsair 1200Hxi/Asus stock
Mouse Roccat Kova/ Logitech G wireless
Keyboard Roccat Aimo 120
VR HMD Oculus rift
Software Win 10 Pro
Benchmark Scores laptop Timespy 6506
How long until the SRAM is on a separate chip entirely (think X3D style) and the logic chip is only cores and interconnect?
You answered your own question: X3D already brought that.

The first cache to move off-die would be L3. They're not getting the L1/L2 caches off-die; I think optical chips or another massive evolution in in-memory compute would be necessary to change that.
 
Joined
Nov 26, 2021
Messages
1,730 (1.51/day)
Location
Mississauga, Canada
there is no problem with small size of caches, but problem with unoptimized software.
For well optimized software, few megabytes of cache is sufficient
In the real world, the working set of most programs isn't defined by their code. Perhaps you have heard of servers that usually have hundreds of GB of RAM. Do you think they would do fine with CPUs with less than 10 MB of last-level cache?
Yes, N7 doesn't use EUV, but there is much more than one "7"nm variants.
True, but the most popular variant is the one that forgoes EUV.
 

cchi

New Member
Joined
Nov 12, 2022
Messages
9 (0.01/day)
Backside power delivery, or PowerVia in Intel parlance, should help with SRAM scaling. Nanosheet transistors will also help, but these are all slated for either Intel's 20A node or TSMC's N2P node. These aren't expected to be available until 2024 and 2026 respectively.


That will increase latency of SRAM as off-chip communication is costly in both latency and power. It could only be done with large, last level caches like AMD's LLC for RDNA3. Smaller caches like L1 and L2 will remain on-chip.
With proper die stacking there is no large latency penalty, heck it might even be lower due to lower distance in z direction compared to x-y.

What is a problem though is heat dissipation, which is why it currently is limited to the LLC of Zen3/4, because of its lower power density compared to the core area.
Still the X3D chips run much hotter due to the structural silicon pieces, but would be even hotter if it was covered with active silicon.
 
Joined
Nov 26, 2021
Messages
1,730 (1.51/day)
Location
Mississauga, Canada
With proper die stacking there is no large latency penalty, heck it might even be lower due to lower distance in z direction compared to x-y.

What is a problem though is heat dissipation, which is why it currently is limited to the LLC of Zen3/4, because of its lower power density compared to the core area.
Still the X3D chips run much hotter due to the structural silicon pieces, but would be even hotter if it was covered with active silicon.
I was thinking of non-stacked chips, but you're right: die stacking solves the downsides of off-chip cache, though in its current form it brings new issues too.
 
Joined
Sep 1, 2020
Messages
2,466 (1.54/day)
Location
Bulgaria
Any chance of any new scanners having shorter wavelengths then? If we can't go further than that we'll be stuck with the chips only getting tiny improvements.
Yes, the 5000 series. The very first 5000 units are delivered to Intel. The first 5200 will be delivered in 2024.
 
Joined
May 13, 2014
Messages
21 (0.01/day)
System Name Project Taco
Processor i7 4770k
Motherboard Gigabyte G1 Sniper 5 z87
Cooling Corsair H100i w/Noiseblocker eLoop
Memory Avexir Core Series White LED 4x4GB 1600mhz
Video Card(s) 2x EVGA Nvidia GTX 780 TI Classified
Storage Samsung 840 EVO 500GB
Display(s) 2 QNIX QX2710
Case NZXT H440
Audio Device(s) O2/ODAC
Power Supply EVGA Supernova 1000W Gold
Any chance of any new scanners having shorter wavelengths then? If we can't go further than that we'll be stuck with the chips only getting tiny improvements.
In lithography, getting the effect of a "shorter wavelength" means either an optical improvement (lenses/mirrors) or a new light source. At this point, there aren't many good candidates for a new light source below 13.5 nm. Like someone else said in the thread, the ASML EXE platform is the next step on the optics side of things. The platform is also called High NA (numerical aperture), and it essentially allows resolution to improve to around 8 nm without changing the wavelength. The core design behind how the light is generated, however, remains the same as in the current EUV tools.

For more information on how these minimum resolutions are calculated, you can look into the Rayleigh criterion, which is basically what governs all of this in terms of minimum critical dimension.
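
For anyone who wants to plug in numbers, the Rayleigh criterion is CD = k1 × wavelength / NA. Here is a minimal sketch of it. The numerical apertures (1.35 for 193 nm immersion, 0.33 for current EUV, 0.55 for High NA) are the commonly quoted values, while k1 = 0.33 is just an assumed process factor picked for illustration, not something measured on any real process.

```python
def critical_dimension_nm(wavelength_nm: float,
                          numerical_aperture: float,
                          k1: float = 0.33) -> float:
    """Rayleigh criterion: CD = k1 * wavelength / NA.

    k1 is a process-dependent factor; 0.33 is an assumed value here.
    """
    return k1 * wavelength_nm / numerical_aperture


# (label, wavelength in nm, numerical aperture)
SCANNERS = [
    ("193 nm immersion DUV", 193.0, 1.35),
    ("EUV, 0.33 NA", 13.5, 0.33),
    ("EUV, 0.55 NA (High NA)", 13.5, 0.55),
]

if __name__ == "__main__":
    for label, wavelength_nm, na in SCANNERS:
        cd = critical_dimension_nm(wavelength_nm, na)
        print(f"{label}: about {cd:.1f} nm per single exposure")
```

With that assumed k1, going from 0.33 NA to 0.55 NA at the same 13.5 nm wavelength takes the single-exposure critical dimension from about 13.5 nm to about 8.1 nm, which is where the "around 8 nm" figure comes from. The wavelength itself never changes.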
 
Joined
Jan 3, 2021
Messages
3,708 (2.51/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
Can't the MOSFETs be stacked so the SRAM cell is flipped 90°?
That would describe the CFET (complementary FET), which is a stack of two transistors. Yes, just two. And I'm not sure anyone has produced even an experimental working chip with those.
 
Joined
Mar 13, 2021
Messages
484 (0.34/day)
Processor AMD 7600x
Motherboard Asrock x670e Steel Legend
Cooling Silver Arrow Extreme IBe Rev B with 2x 120 Gentle Typhoons
Memory 4x16Gb Patriot Viper Non RGB @ 6000 30-36-36-36-40
Video Card(s) XFX 6950XT MERC 319
Storage 2x Crucial P5 Plus 1Tb NVME
Display(s) 3x Dell Ultrasharp U2414h
Case Coolermaster Stacker 832
Power Supply Thermaltake Toughpower PF3 850 watt
Mouse Logitech G502 (OG)
Keyboard Logitech G512
Any chance of any new scanners having shorter wavelengths then? If we can't go further than that we'll be stuck with the chips only getting tiny improvements.
From what I've heard, 13.5 nm is the optimal wavelength for etching on current materials, as anything shorter tends to go through the material rather than reflect or etch.

So it will probably take another massive leap in materials technology to get the next "leap", versus just optimising the use of 13.5 nm.
 
Joined
Jan 3, 2021
Messages
3,708 (2.51/day)
Location
Slovenia
Smaller caches like L1 and L2 will remain on-chip.
AMD said the stacked L3 chip adds four clock cycles to access latency. Assuming the same were true for L2, it might actually be beneficial if a Zen core could have, for example, 1 MB plus stacked 2 MB of L2 compared to just 1 MB of faster L2.
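
To make that trade-off concrete, here is a back-of-the-envelope sketch using the usual average access time formula (hit time plus miss rate times miss penalty). Only the four extra cycles come from the AMD statement above. The base L2 latency, both miss rates, and the miss penalty are made-up placeholder values, and the model pessimistically charges the extra cycles to every L2 hit, not only to hits in the stacked part.

```python
def average_access_cycles(hit_cycles: float,
                          miss_rate: float,
                          miss_penalty_cycles: float) -> float:
    """Average access time = hit time + miss rate * miss penalty."""
    return hit_cycles + miss_rate * miss_penalty_cycles


def break_even_miss_rate_drop(extra_hit_cycles: float,
                              miss_penalty_cycles: float) -> float:
    """Miss-rate reduction needed to pay for the extra hit latency."""
    return extra_hit_cycles / miss_penalty_cycles


# Hypothetical placeholder values, NOT measurements of any real core.
BASE_L2_HIT_CYCLES = 14.0    # assumed latency of a 1 MB on-die L2
STACKING_EXTRA_CYCLES = 4.0  # the figure AMD quoted for stacked L3
L2_MISS_PENALTY = 50.0       # assumed cost of going to L3
MISS_RATE_1MB = 0.30         # assumed L2 miss rate with 1 MB
MISS_RATE_3MB = 0.20         # assumed L2 miss rate with 1 MB + 2 MB stacked

if __name__ == "__main__":
    flat = average_access_cycles(BASE_L2_HIT_CYCLES, MISS_RATE_1MB,
                                 L2_MISS_PENALTY)
    stacked = average_access_cycles(BASE_L2_HIT_CYCLES + STACKING_EXTRA_CYCLES,
                                    MISS_RATE_3MB, L2_MISS_PENALTY)
    needed = break_even_miss_rate_drop(STACKING_EXTRA_CYCLES, L2_MISS_PENALTY)
    print(f"1 MB L2:           {flat:.1f} cycles on average")
    print(f"1 MB + 2 MB stack: {stacked:.1f} cycles on average")
    print(f"Break-even needs the miss rate to drop by {needed:.2f}")
```

With these placeholder numbers, the larger stacked L2 only wins if the extra capacity cuts the miss rate by more than 4/50 = 0.08, so whether it pays off depends entirely on the workload.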
 

Count von Schwalbe

Nocturnus Moderatus
Staff member
Joined
Nov 15, 2021
Messages
3,227 (2.78/day)
Location
Knoxville, TN, USA
[Attached images: FbYOsFqVUAEVcQZ.jpg, WuHAyr6QC7Ch2JCm.jpg]


L1 and L2 are nothing compared to the vast expanse of L3.

What seems likely is a "blank area" where the L3 sits currently, with interconnects on-chip but no actual transistors. Then the L3, made on a larger node, is laid in the same area but is considerably higher capacity.
 
Joined
Jan 3, 2021
Messages
3,708 (2.51/day)
Location
Slovenia
L1 and L2 are nothing compared to the vast expanse of L3.
What do you mean, nothing? 1 MB of L2 is about one third the size of a slice of L3 (= 4 MB next to each core).
 

Count von Schwalbe

Nocturnus Moderatus
Staff member
Joined
Nov 15, 2021
Messages
3,227 (2.78/day)
Location
Knoxville, TN, USA
What do you mean, nothing? 1 MB of L2 is about one third the size of a slice of L3 (= 4 MB next to each core).
You have 4X the L3 as L2, and that is on Zen 4. I understand that L3 sizes are going to increase again pretty soon.
 