- Joined
- Dec 17, 2011
- Messages
- 359 (0.08/day)
So I was bored and I was looking through the GPU database (i am weird) and look what I found - Turing TU104 (used in 2070 Super, 2080 and 2080 Super) has a transistor count of 13.6 billion. GA106 (used in 3060) has a transistor count of 13.25 million. My! How similar!
So I thought, you know what. Let's do a comparison. On the Ampere corner we have RTX 3060 (duh!) and on the Turing corner we need a TU104 disabled part like the RTX 3060 is for GA104. So I went with RTX 2070 Super. Now the RTX 2070 Super utilizes 2560/3072 = 83% of TU104 shaders compared to RTX 3060's 3584/3840 = 93% but TU104 has more transistors too (3% more) so I thought it should be good. Anyways, here are the numbers!
So... RTX 3060 seems to have 40% more FP32 TFLOPs and RT performance but 2070 Super has everything else 40% more plus 24% more memory bandwidth. Of course, this doesn't take into account architectural efficiencies/inefficiencies. But it makes you wonder how such a drastic rebalancing changes the gaming performance.
On RT cores - Anandtech says
So despite the rebalancing it does look like 2070 Super is faster in non-RT games. 2070's advantage almost disappears with Ray Tracing turned on. What a weird but interesting result. I think the RTX 3060 could benefit a lot from greater Texture Fill Rate and maybe from more Pixel Fill Rate. I doubt the lower FP16 performance is harming the 3060. FP64 is irrelevant to gaming anyway.
I wonder what was Nvidia's rationale for focusing on FP32 + RT performance over everything else. What do you think?
So I thought, you know what. Let's do a comparison. On the Ampere corner we have RTX 3060 (duh!) and on the Turing corner we need a TU104 disabled part like the RTX 3060 is for GA104. So I went with RTX 2070 Super. Now the RTX 2070 Super utilizes 2560/3072 = 83% of TU104 shaders compared to RTX 3060's 3584/3840 = 93% but TU104 has more transistors too (3% more) so I thought it should be good. Anyways, here are the numbers!
Metric | RTX 2070 Super | RTX 3060 | 2070 Super advantage over 3060 |
---|---|---|---|
Pixel Fill Rate | 113.3 GigaPixels/sec | 85.30 GigaPixels/sec | +33% |
Texture Fill Rate | 283.2 GigaTexels/sec | 199.0 GigaTexels/sec | +42% |
Half Precision (FP16) FLOPs | 18.12 TFLOPs | 12.74 TFLOPs | +42% |
Full Precision (FP32) FLOPs | 9.06 TFLOPs | 12.74 TFLOPs | -29% (or 3060 has +40%) |
Double Precision (FP64) FLOPs | 283.2 GFLOPs | 199.0 GFLOPs | +42% |
Memory bandwidth | 448 GB/sec | 360 GB/sec | +24% |
RT cores (thanks cvaldes) | 40 | 28 (but 2x faster) | -29% (or 3060 has +40%) |
So... RTX 3060 seems to have 40% more FP32 TFLOPs and RT performance but 2070 Super has everything else 40% more plus 24% more memory bandwidth. Of course, this doesn't take into account architectural efficiencies/inefficiencies. But it makes you wonder how such a drastic rebalancing changes the gaming performance.
On RT cores - Anandtech says
The ray tracing (RT) cores have also been beefed up (for Ampere) ...... the individual RT cores are said to be up to 2x faster, with NVIDIA specifically quoting ray/triangle intersection performance.
Metric | RTX 2070 Super | RTX 3060 | 2070 Super advantage over 3060 |
---|---|---|---|
Average FPS - 1080p | 124.5 | 113.9 | +9% |
Average FPS - 4k | 53.4 | 47.8 | +12% |
Average FPS - 1080p RT (mean data of below) | 81.37 | 79.35 | +2% |
Control - 1080p RT | 46.8 | 45.3 | +3% |
Control - 4k RT | 14.7 | 13.4 | +10% |
Cyberpunk - 1080p RT | 31.5 | 31.4 | - |
Cyberpunk - 4k RT | 9.4 | 9.3 | - |
Doom Eternal - 1080p RT | 134.2 | 128.1 | +5% |
Doom Eternal - 4k RT | 14.6 | 54.2 | N/A as 2070 Super runs into VRAM limit |
F1 2021 - 1080p RT | 127.6 | 124.1 | +3% |
F1 2021 - 4k RT | 43.7 | 41.7 | +5% |
Far Cry 6 - 1080p RT | 75.2 | 76.8 | -2% |
Far Cry 6 - 4k RT | 37 | 35 | +6% |
Metro Exodus - 1080p RT | 72.9 | 70.4 | +3% |
Metro Exodus - 4k RT | 26.6 | 22.1 | +20% |
So despite the rebalancing it does look like 2070 Super is faster in non-RT games. 2070's advantage almost disappears with Ray Tracing turned on. What a weird but interesting result. I think the RTX 3060 could benefit a lot from greater Texture Fill Rate and maybe from more Pixel Fill Rate. I doubt the lower FP16 performance is harming the 3060. FP64 is irrelevant to gaming anyway.
I wonder what was Nvidia's rationale for focusing on FP32 + RT performance over everything else. What do you think?
Last edited: