• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Blackwell GPU die sizes revealed; regression in some cases

Joined
Dec 12, 2016
Messages
2,064 (0.70/day)
Videocardz has compiled some great data about the die sizes and transistor density of Blackwell GPUs using HardwareLuxx as the source. It is becoming clear that Blackwell is only about AI/DLSS/RT. So the question is, with the exception of the 5090, will Blackwell have the same or less pure rasterization performance as Ada Lovelace?

NVIDIA reveals die sizes for GB200 Blackwell GPUs: GB202 is 750mm², features 92.2B transistors - VideoCardz.com

1737032220832.png
 
Joined
Dec 12, 2016
Messages
2,064 (0.70/day)
Yet some still believe that 5080 will be faster than 4090.
Seems unlikely since the 5080 has an 84 SM count and the 4090 has 144. Everyone might be confused by the DLSS4 only benchmarks provided by Nvidia.
 
Joined
Nov 13, 2024
Messages
112 (1.72/day)
System Name le fish au chocolat
Processor AMD Ryzen 7 5950X
Motherboard ASRock B550 Phantom Gaming 4
Cooling Peerless Assassin 120 SE
Memory 2x 16GB (32 GB) G.Skill RipJaws V DDR4-3600 DIMM CL16-19-19-39
Video Card(s) NVIDIA GeForce RTX 3080, 10 GB GDDR6X (ASUS TUF)
Storage 2 x 1 TB NVME & 2 x 4 TB SATA SSD in Raid 0
Display(s) MSI Optix MAG274QRF-QD
Power Supply 750 Watt EVGA SuperNOVA G5
Am I crazy or does this feel like a refresh similar to raptor lake 13k -> 14k, with some extra software as a side main dish. (edit: now that I thought about it... that's the main dish)

Just a bit more juice for all the cards and if your lucky you get some cores on top...

Actually pretty disappointed. But I wasn't in the market for a Nvidia gpu anyway. (expect if they offered a crazy good card PerformanceTo€ wise)

Ohhhh well... let's hope it gets better in the next 4 years... (send help)
 
Last edited:
Joined
Jun 14, 2020
Messages
3,893 (2.32/day)
System Name Mean machine
Processor 12900k
Motherboard MSI Unify X
Cooling Noctua U12A
Memory 7600c34
Video Card(s) 4090 Gamerock oc
Storage 980 pro 2tb
Display(s) Samsung crg90
Case Fractal Torent
Audio Device(s) Hifiman Arya / a30 - d30 pro stack
Power Supply Be quiet dark power pro 1200
Mouse Viper ultimate
Keyboard Blackwidow 65%
How can it have less raster performance?
 
Joined
Dec 12, 2016
Messages
2,064 (0.70/day)
How can it have less raster performance?
Some of the SKUs have less cores down the stack than the Supers in the 4000 series. Also rumors point to same or less cores in the 5060 series.

Of course you can also have less raster performance if clocks are lowered by more than cores are increased.

Nvidia added a lot more AI/RT/DLSS goodness in hardware. Since the 4 nm node is exactly the same between the 4000 and 5000 series, it looks like Nvidia is being stingy on cores and clocks to compensate.
 
Last edited:
Joined
Oct 5, 2024
Messages
156 (1.50/day)
Location
United States of America
Some of the SKUs have less cores down the stack than the Supers in the 4000 series. Also rumors point to same or less cores in the 5060 series.

Of course you can also have less raster performance if clocks are lowered by more than cores are increased.

Nvidia added a lot more AI/RT/DLSS goodness in hardware. Since the 4 nm node is exactly the same between the 4000 and 5000 series, it looks like Nvidia is being stingy on cores and clocks to compensate.
Expect more of this as process nodes get slower and slower to develop and release to market. Profits must go up and if there is no new node to improve performance/power/area, the die area is going to shrink and shrink. :(
 
Joined
Oct 26, 2016
Messages
1,817 (0.60/day)
Location
BGD
System Name Minotaur
Processor Intel I9 7940X
Motherboard Asus Strix Rog Gaming E X299
Cooling BeQuiet/ double-Fan
Memory 192Gb of RAM DDR4 2400Mhz
Video Card(s) 1)RX 6900XT BIOSTAR 16Gb***2)MATROX M9120LP
Storage 2 x ssd-Kingston 240Gb A400 + HDD 500Gb+2xSSD Kinston 480Gb+5Tb Seagate+512Gb NVME SGI
Display(s) BenQ 28"EL2870U(4K-HDR) / Acer 24"(1080P) / Eizo 2336W(1080p) / 2x Eizo 19"(1280x1024)
Case NZXT H5 Flow
Audio Device(s) Realtek/Creative T20 Speakers
Power Supply F S P Hyper S 700W
Mouse Asus TUF-GAMING M3
Keyboard Func FUNC-KB-460/Mechanical Keyboard
VR HMD Oculus Rift DK2
Software Win 11
Benchmark Scores Fire Strike=23905,Cinebench R15=3167,Cinebench R20=7490.Passmark=30689,Geekbench4=32885
Yep as many people predicted seems like 5090 is going to be around 30%-35% faster in raster then 4090 but when it comes to the 5080 Vs 4080 situation is even worst most likely only around 15% here the latest benchmark leaks from 2 games where wasn't used frame generation Resident Evil and Horizon Forbidden West

Screenshot 2025-01-16 173451.png


P.S. Note that in Resident Evil Ray Tracing was enabled and in Horizon Forbidden West DLSS was ON....so there is possibility that if RT and DLSS was Off performance increase could be even smaller
 
Joined
Jan 18, 2020
Messages
877 (0.48/day)
Yes, it's a margin expansion exercise pretty much. Only the 5090 looks interesting... It'll probably be a decent card still in 5 years or even 10 given how progress has slowed down. So you can kind of justify the 2k for it in that way.

Pretty depressing times in tech.
 
Joined
Dec 31, 2020
Messages
1,035 (0.70/day)
Processor E5-4627 v4
Motherboard VEINEDA X99
Memory 32 GB
Video Card(s) 2080 Ti
Storage NE-512
Display(s) G27Q
Case DAOTECH X9
Power Supply SF450
5070 Ti also worth it with 128 rope and 64 mo, if true. 4070 TiS was heavily cut to 96 rop and 48 cache. And yeah it's pretty much over now N5 to N3 30% chip density or 378~ shrinks to 290 mm sq, and N3 to N2 with diminishing returns 15%, next A16 with 7% backside power and nano sheet. But you can safely buy now and enjoy long years of 4k60 with 4x DLSS MFG 240 Hz OLeD without the risk of it depreciating and becoming e-waste by the year's end like my 2080 Ti.
 
Last edited:
Joined
Dec 12, 2016
Messages
2,064 (0.70/day)
Yep as many people predicted seems like 5090 is going to be around 30%-35% faster in raster then 4090 but when it comes to the 5080 Vs 4080 situation is even worst most likely only around 15% here the latest benchmark leaks from 2 games where wasn't used frame generation Resident Evil and Horizon Forbidden West

View attachment 380215

P.S. Note that in Resident Evil Ray Tracing was enabled and in Horizon Forbidden West DLSS was ON....so there is possibility that if RT and DLSS was Off performance increase could be even smaller
RT and DLSS performance most likely will be higher but pure rasterization performance increases are in doubt.

5070 Ti also worth it with 128 rope and 64 mo, if true. 4070 TiS was heavily cut to 96 rop and 48 cache. And yeah it's pretty much over now N5 to N3 30% chip density or 378~ shrinks to 290 mm sq, and N3 to N2 with diminishing returns 15%, next A16 with 7% backside power and nano sheet.
Increases in ROPs will be interesting. That might bring higher performance than my pessimism was thinking.
 
Joined
Sep 17, 2014
Messages
22,910 (6.07/day)
Location
The Washing Machine
System Name Tiny the White Yeti
Processor 7800X3D
Motherboard MSI MAG Mortar b650m wifi
Cooling CPU: Thermalright Peerless Assassin / Case: Phanteks T30-120 x3
Memory 32GB Corsair Vengeance 30CL6000
Video Card(s) ASRock RX7900XT Phantom Gaming
Storage Lexar NM790 4TB + Samsung 850 EVO 1TB + Samsung 980 1TB + Crucial BX100 250GB
Display(s) Gigabyte G34QWC (3440x1440)
Case Lian Li A3 mATX White
Audio Device(s) Harman Kardon AVR137 + 2.1
Power Supply EVGA Supernova G2 750W
Mouse Steelseries Aerox 5
Keyboard Lenovo Thinkpad Trackpoint II
VR HMD HD 420 - Green Edition ;)
Software W11 IoT Enterprise LTSC
Benchmark Scores Over 9000
Seems unlikely since the 5080 has an 84 SM count and the 4090 has 144. Everyone might be confused by the DLSS4 only benchmarks provided by Nvidia.
Nope they said and I quote 'but the node is better, and there is always a gain every gen regardless'.

DLSS4 wasn't even in the picture back then. Gonna be a rough wake up call... yes the market is indeed fubar
 
Joined
Dec 25, 2020
Messages
7,287 (4.91/day)
Location
São Paulo, Brazil
System Name "Icy Resurrection"
Processor 13th Gen Intel Core i9-13900KS
Motherboard ASUS ROG Maximus Z790 Apex Encore
Cooling Noctua NH-D15S upgraded with 2x NF-F12 iPPC-3000 fans and Honeywell PTM7950 TIM
Memory 32 GB G.SKILL Trident Z5 RGB F5-6800J3445G16GX2-TZ5RK @ 7600 MT/s 36-44-44-52-96 1.4V
Video Card(s) NVIDIA RTX A2000
Storage 500 GB WD Black SN750 SE NVMe SSD + 4 TB WD Red Plus WD40EFPX HDD
Display(s) 55-inch LG G3 OLED
Case Pichau Mancer CV500 White Edition
Power Supply EVGA 1300 G2 1.3kW 80+ Gold
Mouse Microsoft Classic IntelliMouse (2017)
Keyboard IBM Model M type 1391405
Software Windows 10 Pro 22H2
Benchmark Scores I pulled a Qiqi~
Seems unlikely since the 5080 has an 84 SM count and the 4090 has 144. Everyone might be confused by the DLSS4 only benchmarks provided by Nvidia.

Only 128 out of 144 are enabled. There are no SKUs with the full AD102 chip, RTX 6000 Ada Generation (professional card) maxes out at 142 SM. The 4090 also only has 72 out of the 96 MB of L3 enabled from AD102.
 
Joined
Dec 12, 2016
Messages
2,064 (0.70/day)
Only 128 out of 144 are enabled. There are no SKUs with the full AD102 chip, RTX 6000 Ada Generation (professional card) maxes out at 142 SM. The 4090 also only has 72 out of the 96 MB of L3 enabled from AD102.
Oh you are right. The Videocardz table is showing SMs for the fully unlocked die. The sentiment is still there. It is doubtful that 128 SM 4090 will be beaten by the 84 SM 5080.
 
Joined
Sep 17, 2014
Messages
22,910 (6.07/day)
Location
The Washing Machine
System Name Tiny the White Yeti
Processor 7800X3D
Motherboard MSI MAG Mortar b650m wifi
Cooling CPU: Thermalright Peerless Assassin / Case: Phanteks T30-120 x3
Memory 32GB Corsair Vengeance 30CL6000
Video Card(s) ASRock RX7900XT Phantom Gaming
Storage Lexar NM790 4TB + Samsung 850 EVO 1TB + Samsung 980 1TB + Crucial BX100 250GB
Display(s) Gigabyte G34QWC (3440x1440)
Case Lian Li A3 mATX White
Audio Device(s) Harman Kardon AVR137 + 2.1
Power Supply EVGA Supernova G2 750W
Mouse Steelseries Aerox 5
Keyboard Lenovo Thinkpad Trackpoint II
VR HMD HD 420 - Green Edition ;)
Software W11 IoT Enterprise LTSC
Benchmark Scores Over 9000
Yep as many people predicted seems like 5090 is going to be around 30%-35% faster in raster then 4090 but when it comes to the 5080 Vs 4080 situation is even worst most likely only around 15% here the latest benchmark leaks from 2 games where wasn't used frame generation Resident Evil and Horizon Forbidden West

View attachment 380215

P.S. Note that in Resident Evil Ray Tracing was enabled and in Horizon Forbidden West DLSS was ON....so there is possibility that if RT and DLSS was Off performance increase could be even smaller
As predicted pre release... : at best 15% from clocking... maybe 5% from optimizations and/or reduced bottlenecking in VRAM.
 
Joined
Dec 25, 2020
Messages
7,287 (4.91/day)
Location
São Paulo, Brazil
System Name "Icy Resurrection"
Processor 13th Gen Intel Core i9-13900KS
Motherboard ASUS ROG Maximus Z790 Apex Encore
Cooling Noctua NH-D15S upgraded with 2x NF-F12 iPPC-3000 fans and Honeywell PTM7950 TIM
Memory 32 GB G.SKILL Trident Z5 RGB F5-6800J3445G16GX2-TZ5RK @ 7600 MT/s 36-44-44-52-96 1.4V
Video Card(s) NVIDIA RTX A2000
Storage 500 GB WD Black SN750 SE NVMe SSD + 4 TB WD Red Plus WD40EFPX HDD
Display(s) 55-inch LG G3 OLED
Case Pichau Mancer CV500 White Edition
Power Supply EVGA 1300 G2 1.3kW 80+ Gold
Mouse Microsoft Classic IntelliMouse (2017)
Keyboard IBM Model M type 1391405
Software Windows 10 Pro 22H2
Benchmark Scores I pulled a Qiqi~
The one thing people aren't really accounting for are the CPU limitations. Even the mightiest of processors like the Intel KS or Ryzen X3D chips buckle under the pressure that an RTX 4090 can manage. This will worsen with the 5090.

if Blackwell was rearchitected in such a manner as to reduce the reliance on processor performance, we may yet see insane numbers from smaller silicon. Exciting times regardless.
 
Joined
Feb 3, 2017
Messages
3,861 (1.33/day)
Processor Ryzen 7800X3D
Motherboard ROG STRIX B650E-F GAMING WIFI
Memory 2x16GB G.Skill Flare X5 DDR5-6000 CL36 (F5-6000J3636F16GX2-FX5)
Video Card(s) INNO3D GeForce RTX™ 4070 Ti SUPER TWIN X2
Storage 2TB Samsung 980 PRO, 4TB WD Black SN850X
Display(s) 42" LG C2 OLED, 27" ASUS PG279Q
Case Thermaltake Core P5
Power Supply Fractal Design Ion+ Platinum 760W
Mouse Corsair Dark Core RGB Pro SE
Keyboard Corsair K100 RGB
VR HMD HTC Vive Cosmos
Is Blackwell transistor count accurate there? Blackwell dies seem to have essentially less transistors per SM. Looking at what has been revealed that does not quite make sense. Blackwell should have slightly more transistors. Did they cut down somewhere? Cache?
 
Joined
Aug 3, 2006
Messages
191 (0.03/day)
Location
Austin, TX
Processor Ryzen 6900HX
Memory 32 GB DDR4LP
Video Card(s) Radeon 6800m
Display(s) LG C3 42''
Software Windows 11 home premium
The one thing people aren't really accounting for are the CPU limitations. Even the mightiest of processors like the Intel KS or Ryzen X3D chips buckle under the pressure that an RTX 4090 can manage. This will worsen with the 5090.

if Blackwell was rearchitected in such a manner as to reduce the reliance on processor performance, we may yet see insane numbers from smaller silicon. Exciting times regardless.

CPU limitations at 4k? So a 4090 getting 25fps at 4k is because of the CPU?
 
Joined
Nov 4, 2005
Messages
12,054 (1.72/day)
System Name Compy 386
Processor 7800X3D
Motherboard Asus
Cooling Air for now.....
Memory 64 GB DDR5 6400Mhz
Video Card(s) 7900XTX 310 Merc
Storage Samsung 990 2TB, 2 SP 2TB SSDs, 24TB Enterprise drives
Display(s) 55" Samsung 4K HDR
Audio Device(s) ATI HDMI
Mouse Logitech MX518
Keyboard Razer
Software A lot.
Benchmark Scores Its fast. Enough.
I think the green team is forcing a move to RT and AI generated fake frames, through market share and mind share before the other two companies have truly competitive products to gain more market share.

They will either look like a genius or bleed for awhile.
 
Joined
Dec 31, 2020
Messages
1,035 (0.70/day)
Processor E5-4627 v4
Motherboard VEINEDA X99
Memory 32 GB
Video Card(s) 2080 Ti
Storage NE-512
Display(s) G27Q
Case DAOTECH X9
Power Supply SF450
Is Blackwell transistor count accurate there? Blackwell dies seem to have essentially less transistors per SM. Looking at what has been revealed that does not quite make sense. Blackwell should have slightly more transistors. Did they cut down somewhere? Cache?

Dropping 1280 CU and 16 rops in GB205 compared to AD104 equals to 4,8 B transistors.
Adding 6144 CU, 512 in each GPC and no Rops, increased the die of GB202 by 15,9
Also only 88 MB cache are found in GB202 down from 96 in Ada.
N4P offers 11% more performance at the cost of relaxing the rules and also helps with the yields.
 
Joined
Dec 17, 2024
Messages
62 (2.00/day)
Location
CO
System Name Zen 3 Daily Rig
Processor AMD Ryzen 9 5900X with Optimus Foundation block
Motherboard ASUS Crosshair VIII Dark Hero
Cooling Hardware Labs 360GTX and 360GTS custom loop, Aquacomputer HighFlow NEXT, Aquacomputer Octo
Memory G.Skill Trident Z Neo 32GB DDR4-3600 (@ 3733 CL14)
Video Card(s) Nvidia RTX 3080 Ti Founders Edition with Alphacool Eisblock
Storage x2 Samsung 970 Evo Plus 2TB, Crucial MX500 1TB
Display(s) LG 42" C4 OLED
Case Lian Li O11 Dynamic
Power Supply be Quiet! Straight Power 12 1500W
Mouse Corsair Scimitar RGB Elite Wireless
Keyboard Keychron Q1 Pro
Software Windows 11 Pro
I mean...this is a new architecture, so we can't one to one say "no increase in performance" based solely on specs.

That said, notice they are bumping the power a lot on all models, so it does seem likely you see some increase from the architecture, but a good deal of it coming from increased power. 4070 (200W) to 5070 (250W) is a 25% increase. Meanwhile there is only a 4.34% increase in shaders for the 5070 over 4070, so if overall raster performance is closer to 15-20% increase, then yeah seems likely the increased power drawn is doing some heavy lifting. Still if we see a 5070 performing at about 4070 Ti levels, its still doing it with fewer shaders (7680 vs 6144) at slightly lower power, so there's some improvement in the architecture there.
 
Joined
Apr 12, 2013
Messages
7,592 (1.77/day)
if Blackwell was rearchitected in such a manner as to reduce the reliance on processor performance
That's only possible if they have a separate CPU in there. Practically everything is bottlenecked by the CPU one way or another after some point in time!
 
Joined
Dec 25, 2020
Messages
7,287 (4.91/day)
Location
São Paulo, Brazil
System Name "Icy Resurrection"
Processor 13th Gen Intel Core i9-13900KS
Motherboard ASUS ROG Maximus Z790 Apex Encore
Cooling Noctua NH-D15S upgraded with 2x NF-F12 iPPC-3000 fans and Honeywell PTM7950 TIM
Memory 32 GB G.SKILL Trident Z5 RGB F5-6800J3445G16GX2-TZ5RK @ 7600 MT/s 36-44-44-52-96 1.4V
Video Card(s) NVIDIA RTX A2000
Storage 500 GB WD Black SN750 SE NVMe SSD + 4 TB WD Red Plus WD40EFPX HDD
Display(s) 55-inch LG G3 OLED
Case Pichau Mancer CV500 White Edition
Power Supply EVGA 1300 G2 1.3kW 80+ Gold
Mouse Microsoft Classic IntelliMouse (2017)
Keyboard IBM Model M type 1391405
Software Windows 10 Pro 22H2
Benchmark Scores I pulled a Qiqi~
CPU limitations at 4k? So a 4090 getting 25fps at 4k is because of the CPU?

The only thing that's gonna run at 25 fps on a 4090 is like, Cyberpunk pathtraced or something equally absurd (perhaps Wukong with similarly extreme settings). Although, 4K isn't particularly hard to drive if you have enough GPU power (which the 4090 and now 5090 will comfortably have). 1080p was once just as challenging, after all.

Reduce the resolution and start targeting extra high frame rates, and the gap between the 4090 and the other cards shrinks quite considerably, and that is the point.
 
Top