Tuesday, June 11th 2024
Possible Specs of NVIDIA GeForce "Blackwell" GPU Lineup Leaked
Possible specifications of the various NVIDIA GeForce "Blackwell" gaming GPUs were leaked to the web by Kopite7kimi, a reliable source with NVIDIA leaks. These are specs of the maxed out silicon, NVIDIA will carve out several GeForce RTX 50-series SKUs based on these chips, which could end up with lower shader counts than those shown here. We've known from older reports that there will be five chips in all, the GB202 being the largest, followed by the GB203, the GB205, the GB206, and the GB207. There is a notable absence of a successor to the AD104, GA104, and TU104, because NVIDIA is trying a slightly different way to approach the performance segment with this generation.
The GB202 is the halo segment chip that will drive the possible RTX 5090 (RTX 4090 successor). This chip is endowed with 192 streaming multiprocessors (SM), or 96 texture processing clusters (TPCs). These 96 TPCs are spread across 12 graphics processing clusters (GPCs), which each have 8 of them. Assuming that "Blackwell" has the same 256 CUDA cores per TPC that the past several generations of NVIDIA gaming GPUs have had, we end up with a total CUDA core count of 24,576. Another interesting aspect about this mega-chip is memory. The GPU implements the next-generation GDDR7 memory, and uses a mammoth 512-bit memory bus. Assuming the 28 Gbps memory speed that was being rumored for NVIDIA's "Blackwell" generation, this chip has 1,792 GB/s of memory bandwidth on tap!The GB203 is the next chip in the series, and poised to be a successor in name to the current AD103. It generationally reduces the shader counts, counting on the architecture and clock speeds to more than come through for performance; while retaining the 256-bit bus width of the AD103. The net result could be a significantly smaller GPU than the AD103, for better performance. The GB203 is endowed with 10,752 CUDA cores, spread across 84 SM (42 TPCs). The chip has 7 GPCs, each with 6 TPCs. The memory bus, as we mentioned, is 256-bit, and at a memory speed of 28 Gbps, would yield 896 GB/s of bandwidth.
The GB205 will power the lower half of the performance segment in the GeForce "Blackwell" generation. This chip has a rather surprising CUDA core count of just 6,400, spread across 50 SM, which are arranged in 5 GPCs of 5 TPCs, each. The memory bus width is 192-bit. For 28 Gbps, this would result in 672 GB/s of memory bandwidth.
The GB206 drives the mid-range of the series. This chip gets very close to matching the CUDA core count of the GB205, with 6,144 of them. These are spread across 36 SM (18 TPCs). The 18 TPCs span 3 GPCs of 6 TPCs, each. The key differentiator between the GB205 and GB206 is memory bus width, which is narrowed to 128-bit for the GB206. With the same 28 Gbps memory speed being used here, such a chip would end up with 448 GB/s of memory bandwidth.
At the entry level, there is the GB207, a significantly smaller chip with just 2,560 CUDA cores, across 10 SM, spanning two GPCs of 5 TPCs, each. The memory bus width is unchanged at 128-bit, but the memory type used is the older generation GDDR6. Assuming NVIDIA uses 18 Gbps memory speeds, it ends up with 288 GB/s on tap.
NVIDIA is expected to double down on large on-die caches on all its chips, to cushion the memory sub-systems. We expect there to be several other innovations in the areas of ray tracing performance, AI acceleration, and certain other features exclusive to the architecture. The company is expected to debut the series some time in Q4-2024.
Source:
kopite7kimi (Twitter)
The GB202 is the halo segment chip that will drive the possible RTX 5090 (RTX 4090 successor). This chip is endowed with 192 streaming multiprocessors (SM), or 96 texture processing clusters (TPCs). These 96 TPCs are spread across 12 graphics processing clusters (GPCs), which each have 8 of them. Assuming that "Blackwell" has the same 256 CUDA cores per TPC that the past several generations of NVIDIA gaming GPUs have had, we end up with a total CUDA core count of 24,576. Another interesting aspect about this mega-chip is memory. The GPU implements the next-generation GDDR7 memory, and uses a mammoth 512-bit memory bus. Assuming the 28 Gbps memory speed that was being rumored for NVIDIA's "Blackwell" generation, this chip has 1,792 GB/s of memory bandwidth on tap!The GB203 is the next chip in the series, and poised to be a successor in name to the current AD103. It generationally reduces the shader counts, counting on the architecture and clock speeds to more than come through for performance; while retaining the 256-bit bus width of the AD103. The net result could be a significantly smaller GPU than the AD103, for better performance. The GB203 is endowed with 10,752 CUDA cores, spread across 84 SM (42 TPCs). The chip has 7 GPCs, each with 6 TPCs. The memory bus, as we mentioned, is 256-bit, and at a memory speed of 28 Gbps, would yield 896 GB/s of bandwidth.
The GB205 will power the lower half of the performance segment in the GeForce "Blackwell" generation. This chip has a rather surprising CUDA core count of just 6,400, spread across 50 SM, which are arranged in 5 GPCs of 5 TPCs, each. The memory bus width is 192-bit. For 28 Gbps, this would result in 672 GB/s of memory bandwidth.
The GB206 drives the mid-range of the series. This chip gets very close to matching the CUDA core count of the GB205, with 6,144 of them. These are spread across 36 SM (18 TPCs). The 18 TPCs span 3 GPCs of 6 TPCs, each. The key differentiator between the GB205 and GB206 is memory bus width, which is narrowed to 128-bit for the GB206. With the same 28 Gbps memory speed being used here, such a chip would end up with 448 GB/s of memory bandwidth.
At the entry level, there is the GB207, a significantly smaller chip with just 2,560 CUDA cores, across 10 SM, spanning two GPCs of 5 TPCs, each. The memory bus width is unchanged at 128-bit, but the memory type used is the older generation GDDR6. Assuming NVIDIA uses 18 Gbps memory speeds, it ends up with 288 GB/s on tap.
NVIDIA is expected to double down on large on-die caches on all its chips, to cushion the memory sub-systems. We expect there to be several other innovations in the areas of ray tracing performance, AI acceleration, and certain other features exclusive to the architecture. The company is expected to debut the series some time in Q4-2024.
141 Comments on Possible Specs of NVIDIA GeForce "Blackwell" GPU Lineup Leaked
Really looking forward to another disappointing launch, since that has been the rule since 2018.
I really wonder, how will Nvidia surprise us? Entry level at 500€ with 92 bit bus and 65mm^2 die?
So for those of you that said that AMD has no drivers issues, they do, just check their forums on their reddit, nobody is listening either. Nvidia has less drivers issues than AMD by far, at least Nvidia tries to fix them, AMD seem to not care about it.
The cut down memory buses compared to Ampere make no sense. 60-class should be 192-bit and 70-class should be 256-bit. That's the main problem with the Ada line-up and it's gonna be the same here.
But I guess Jensen thinks more cache makes up for the lower capacity, not just bandwidth. It's not accurate, but it's correct.
I too wish Nvidia would give us more too, not cus people cry, cus we NEEED it.
www.thefpsreview.com/2023/05/03/hogwarts-legacy-cyberpunk-2077-and-the-last-of-us-part-i-top-list-of-vram-heavy-pc-titles/
Happy 7900 XT user with no care in the world for things that are not even 25% matured or usable for most.
There will be a time things become mainstream, this time is not now, but I am not here to control who does what.
Some people RIP into console users and their experiences of fake resolutions whilst pushing their favoriate upscaler.
Cognitive dissonance and general intellect drop has been seen since the dawn of the RTX lineup. Sorry your user experience is not good.
I click play, it works.
pcgamingwiki.com is your friend, a lot of issue have nothing to do with drivers, it's sometimes just Windows messing with stuff, it's why I run Enterprise 24H2 Windows 11.
If you are bringing ray tracing into it then that changes things though but maybe not for next gen AMD.
I'm not saying 8GB today is good (and Nvidia certainly overprice it by $100 or more), but you have to admit both sides have their issues with the amount of VRAM they use BUT also slam the Memory IC makers who could have made 4GB chips for GDDR5X, GDDR6 & GDDR6X (plus 3GB for the latter 2) but instead stuck with 1GB or 2GB chips. With GDDR7 they have dropped the 1GB but 3GB is not coming until a year or more after the 2GB ones.
I can certainly see the 5050, 5060, 5060 Ti (maybe) being by far the worst choice compared to AMD & Battlemage in the sub $500 market, if only by the VRAM limiting their usefulness. I certainly wouldn't recommend a 8GB card for anything but eSports or casual light gaming and for under $200, preferably $150 or less.