- Joined
- Oct 10, 2018
- Messages
- 147 (0.07/day)
Hello all, I just made some calculations to predict the performance of the RTX 50 series. I started by comparing different GPUs, followed by an analysis of their performance.
Thus, the RTX 5090 (32GB) is expected to outperform the RTX 4090 by 60-70%. However, if the RTX 5090 only has 24GB and a 384-bit bus, the increase may be around 40-45%.
Comparison of NVIDIA Graphics Cards
- GTX 1080 Ti: 3584 CUDA cores (7168 CUDA threads), 11,800 million transistors, GDDR5X memory, 11 GB, 11 Gbps
- RTX 2080 Ti: 4352 CUDA cores (8704 CUDA threads), 18,600 million transistors, GDDR6 memory, 14 Gbps
- RTX 3090: 10496 CUDA cores, 28,300 million transistors, GDDR6 memory, 19.5 Gbps
- RTX 4090: 16384 CUDA cores, 76,300 million transistors, GDDR6X memory, 21 Gbps
- RTX 5090 (speculated): 21760 CUDA cores, GDDR7 memory, 28 Gbps
Performance Analysis
- GTX 1080 vs. 1080 Ti: A 1.4x increase in core count resulted in only a 28% performance boost, despite improvements in memory bandwidth and VRAM.
- RTX 2080 vs. 2080 Ti: A 1.48x increase in core count led to a 21% performance gain, showing diminishing returns even with better bandwidth and VRAM.
- RTX 3080 vs. 3090: A 1.2x increase in core count improved performance by just 10%, with limited impact from memory and bandwidth upgrades.
- RTX 4070 vs. 4070 SUPER: A 1.21x increase in core count led to a 15% performance boost, with no gains from other features.
- RTX 4080 vs. 4090: Despite a 1.68x increase in core count, performance only improved by 28%.
Core Count to Performance Ratio
The data suggests that the performance increase is roughly half of the core count difference. This highlights the importance of architectural differences in determining actual performance.Bandwidth's Impact on Performance
- RTX 4070 Ti vs. RTX 4070 Ti SUPER: Comparing the 192-bit and 256-bit memory bus, the 10% performance gain indicates that bandwidth has a limited effect.
- GTX 1660 vs. GTX 1660 SUPER: A bandwidth increase of nearly 2x (75%) from GDDR5 to GDDR6 resulted in only a 12.6% performance boost, further suggesting that bandwidth alone does not drive major improvements.
Performance Scaling
- RTX 4090: Performs 64% better than the RTX 3090, despite only a 1.56x increase in core count and a mere 10% improvement in bandwidth. Most gains likely come from architectural advancements.
- RTX 3090: Shows a 45% performance increase over the RTX 2080 Ti, with a 1.2x core count increase and 1.4x memory bandwidth. Architecture likely contributes around 20% of this boost.
- RTX 2080 Ti: Performs 35% better than the GTX 1080 Ti, based on a 1.21x core increase and a 27% improvement in bandwidth. The Turing architecture adds roughly 15% performance improvement.
Transistor and Core Ratios
- RTX 3090: Has 1.52x more transistors than the RTX 2080 Ti, with 1.2x more cores.
- RTX 4090: Has 2.7x more transistors than the RTX 3090, with 1.56x more cores.
What Will the RTX 5090 Be?
I believe the RTX 4090 could have been even more powerful if it had a 512-bit memory bus, which would potentially increase performance by 16.6%. For the RTX 5090, we can assume a similar improvement, with a 10% gain from the increased memory bandwidth (384-bit vs. 512-bit) and another 6% from the core count difference. This would yield a total performance improvement of 36%. Adding in the benefits of GDDR7 memory, which could contribute an additional 10%, the RTX 5090 may offer up to a 50% performance increase over the RTX 4090.Thus, the RTX 5090 (32GB) is expected to outperform the RTX 4090 by 60-70%. However, if the RTX 5090 only has 24GB and a 384-bit bus, the increase may be around 40-45%.
Predictions for the RTX 50 Series
- RTX 5080: This may be the weakest X80 generation GPU due to speculated core count cuts. Speculations suggest 10752 cores, while the full die could have up to 24768. The performance could increase by 1.05x (from core count) x 1.07x (from GDDR7) x 1.15x (from architecture), equaling about 30% more than the RTX 4080. We could see 16GB with 2GB modules and 24GB with 3GB modules.
- RTX 5070: Could be a great card if it has a core count between 6144 and 7424. I believe it will have 7168 cores. The performance could be 1x (from core count) x 1.08x (from GDDR7) x 1.2x (from architecture), or equal to a 30% gain over the 4070 Super, potentially matching the 4080. We may see 15GB/18GB variants, along with a 12GB model.
- RTX 5060 Ti: Likely based on the GB206 chip with either 4864 or 5120 cores. The performance could be 1.085x (from core count) x 1.09x (from GDDR7) x 1.1x (with a 192-bit bus) x 1.15x (from architectural improvements), resulting in a 48% performance increase over the RTX 4060 Ti. If it has a 192-bit bus, it will be more powerful than the RTX 4070; if not, it may perform at par with the RTX 4070, or about 5% better. It could come with 12GB on a 192-bit bus, or 8GB with a 128-bit bus.
- RTX 5060: Likely based on the GB207 chip with 3584 cores. Performance could be 1.0833x (from core count) x 1.11x (from GDDR7) x 1.2x (from architectural improvements and higher clock speeds) = a 44% performance uplift over the RTX 4060, putting it on par with or slightly better than the RTX 3070 Ti. It could have 8GB and 12GB variants with a 128-bit bus.
- RTX 5050: Based on the GB207 chip with 2560 cores, it may have 8GB and a 128-bit bus. Performance could be 1.07x (from bus upgrade from 96-bit to 128-bit) x 1.1x (from GDDR7) x 1.15x (from architectural improvements), leading to a performance boost of around 14% over the RTX 4060, putting it on par with the 6700 XT or RTX 3060 Ti.
Overall Predictions
- RTX 5090 32GB = RTX 4090 + 60-70%
- RTX 5080 16/24GB = RTX 4080 + 30%, or RTX 4090 + 5-10%
- RTX 5070 12GB = Performance on par with RTX 4080
- RTX 5060 Ti 12/16GB = Performance equal to RTX 4070 or RTX 3080
- RTX 5060 8/12GB = Performance on par with RTX 3070 Ti or RTX 4060 Ti + 10-15%
- RTX 5050 8GB = Performance similar to RTX 3060 Ti or RTX 4060 + 14%