We finally have with us the NVIDIA GeForce RTX 4060 Ti 16 GB graphics card. This model, which doubles the VRAM size over the "standard" RTX 4060 Ti 8 GB, was announced to much fanfare back in May 2023, but NVIDIA, for reasons we'll find out in this review, decided against sampling the card to reviewers ahead of its official July 18 launch. There is no first-party Founders Edition card, and NVIDIA's board partners weren't too keen on sampling their custom-design cards, either. So we spent north of $500, (€562.50 to be precise) to buy one of these cards from the market after launch, and it finally made its way to our labs. Technically, this is a Gainward GeForce RTX 4060 Ti 16 GB Panther OC, but we are treating it as a de facto NVIDIA review, because there is no FE available and we have no other reviews planned of this 16 GB model.
The GeForce RTX 4060 Ti Ada is designed to consolidate the mid-range for NVIDIA this generation. It offers maxed-out gaming at 1080p, including ray tracing, and 1440p gaming is very much possible at high-thru-ultra settings. For 1440p with ray tracing, you might need to dial down some settings, or better yet, let GeForce Experience find the right settings for you. The best option would be to take advantage of features such as DLSS. Since it's based on the latest Ada Lovelace graphics architecture, the RTX 4060 Ti gets DLSS 3 Frame Generation, a path-breaking feature to increase frame-rates with minimal impact on image quality.
The GeForce Ada Lovelace graphics architecture debuts the third generation of NVIDIA RTX, the technology that mainstreamed ray tracing for real-time graphics and gaming applications. By itself, ray tracing is a the holy grail of 3D graphics, as it's the most computationally intensive way to generate photorealistic graphics. NVIDIA figured out a way to combine conventional raster 3D graphics with certain real-time ray traced elements such as lighting, shadows, reflections, global illuminations, and motion-blur; to significantly improve realism in games. Even this much of ray tracing requires tremendous compute power, and so NVIDIA innovated the RT core, a fixed function hardware unit to calculate ray intersections. NVIDIA leverages AI for de-noising, and hence the GPU has Tensor cores that accelerate neural network building and training, besides features such as DLSS. Ada introduces NVIDIA's highest IPC CUDA cores that support shader execution re-ordering; 3rd generation RT cores that support displaced micro-meshes, and the new optical flow accelerator that enables DLSS 3 Frame Generation.
The GeForce RTX 4060 Ti 16 GB is based on the 5 nm AD106 silicon, which it nearly maxes out, enabling 34 out of 36 streaming multiprocessors (SM) physically present. This works out to 4,352 CUDA cores, 136 Tensor cores, 34 RT cores, 136 TMUs, and 48 ROPs. In a bit of a controversial decision, NVIDIA reduced the memory bus width of the RTX 4060 Ti to 128-bit, compared to the 256-bit that the RTX 3060 Ti enjoys. The memory speed has generationally increased to 18 Gbps, but the memory bandwidth still falls a bit short. This shouldn't be a problem, NVIDIA claims, as the company re-architected the memory sub-system to place greater importance on 8-12 times larger on-die caches, which should reduce the round-trips to the video memory. The 16 GB version of the RTX 4060 Ti is simply a doubling in memory size, using the same 128-bit GDDR6 memory interface, with a clamshell memory configuration, which puts the chips on both sides of the PCB. The memory speed is unchanged at 18 Gbps, as is the GPU's core-configuration. You get the same number of shaders, and the same GPU clock speeds. Only the power limit has been increased by 5 W, to account for the higher power draw of the additional 8 GB memory.
NVIDIA is asking a steep $499 for the 16 GB GeForce RTX 4060 Ti, a 25% premium over the RTX 4060 Ti 8 GB. From what little we know about GDDR6 memory spot-pricing, the 16 GB variant is essentially a $10-20 increase in the bill of materials, but for whatever reason, it lugs that huge premium. At $499, this card is $100 shy of the GeForce RTX 4070, which is about 30% faster than the RTX 4060 Ti 8 GB. In this review, we'll find out by just how much NVIDIA has managed to slim that gap with the 16 GB RTX 4060 Ti.
RTX 4060 Ti Market Segment Analysis
Price
Cores
ROPs
Core Clock
Boost Clock
Memory Clock
GPU
Transistors
Memory
RTX 2060
$180
1920
48
1365 MHz
1680 MHz
1750 MHz
TU106
10800M
6 GB, GDDR6, 192-bit
RX Vega 64
$320
4096
64
1247 MHz
1546 MHz
953 MHz
Vega 10
12500M
8 GB, HBM2, 2048-bit
RX 5700 XT
$180
2560
64
1605 MHz
1755 MHz
1750 MHz
Navi 10
10300M
8 GB, GDDR6, 256-bit
RTX 3050
$260
2560
32
1552 MHz
1777 MHz
1750 MHz
GA106
12000M
8 GB, GDDR6, 128-bit
RTX 2070
$230
2304
64
1410 MHz
1620 MHz
1750 MHz
TU106
10800M
8 GB, GDDR6, 256-bit
RX 6600
$210
1792
64
2044 MHz
2491 MHz
1750 MHz
Navi 23
11060M
8 GB, GDDR6, 128-bit
RTX 3060
$300
3584
48
1320 MHz
1777 MHz
1875 MHz
GA106
12000M
12 GB, GDDR6, 192-bit
RTX 4060
$300
3072
32
1830 MHz
2460 MHz
2125 MHz
AD107
unknown
8 GB, GDDR6, 128-bit
RX 6600 XT
$250
2048
64
2359 MHz
2589 MHz
2000 MHz
Navi 23
11060M
8 GB, GDDR6, 128-bit
Arc A750
$250
3584
112
2050 MHz
N/A
2000 MHz
ACM-G10
21700M
8 GB, GDDR6, 256-bit
Arc A770
$290
4096
128
2100 MHz
N/A
2187 MHz
ACM-G10
21700M
16 GB, GDDR6, 256-bit
RTX 2080
$260
2944
64
1515 MHz
1710 MHz
1750 MHz
TU104
13600M
8 GB, GDDR6, 256-bit
RTX 3060 Ti
$320
4864
80
1410 MHz
1665 MHz
1750 MHz
GA104
17400M
8 GB, GDDR6, 256-bit
RTX 4060 Ti
$400
4352
48
2310 MHz
2535 MHz
2250 MHz
AD106
22900M
8 GB, GDDR6, 128-bit
Gainward RTX 4060 Ti 16 GB Panther OC
$500
4352
48
2310 MHz
2595 MHz
2250 MHz
AD106
22900M
16 GB, GDDR6, 128-bit
RX 6700 XT
$320
2560
64
2424 MHz
2581 MHz
2000 MHz
Navi 22
17200M
12 GB, GDDR6, 192-bit
RTX 2080 Ti
$400
4352
88
1350 MHz
1545 MHz
1750 MHz
TU102
18600M
11 GB, GDDR6, 352-bit
RTX 3070
$350
5888
96
1500 MHz
1725 MHz
1750 MHz
GA104
17400M
8 GB, GDDR6, 256-bit
RTX 3070 Ti
$420
6144
96
1575 MHz
1770 MHz
1188 MHz
GA104
17400M
8 GB, GDDR6X, 256-bit
RX 6800
$430
3840
96
1815 MHz
2105 MHz
2000 MHz
Navi 21
26800M
16 GB, GDDR6, 256-bit
RX 6800 XT
$500
4608
128
2015 MHz
2250 MHz
2000 MHz
Navi 21
26800M
16 GB, GDDR6, 256-bit
RTX 3080
$500
8704
96
1440 MHz
1710 MHz
1188 MHz
GA102
28000M
10 GB, GDDR6X, 320-bit
RTX 4070
$600
5888
64
1920 MHz
2475 MHz
1313 MHz
AD104
35800M
12 GB, GDDR6X, 192-bit
Packaging
The Card
The Gainward Panther OC comes with a clean color theme that uses black as the main color, with some white highlights. On the back you'll find a metal backplate with cutouts that let some air flow through the card.
Dimensions of the card are 25.0 x 11.5 cm, and it weighs 753 g.
Installation requires three slots in your system. The card's width is 45 mm.
Display connectivity includes three standard DisplayPort 1.4a ports and one HDMI 2.1a (same as Ampere).
The new 8th Gen NVENC now accelerates AV1 encoding, besides HEVC. You also get an "optical flow accelerator" unit that is able to calculate intermediate frames for videos, to smooth playback. The same hardware unit is used for frame generation in DLSS 3.
Gainward opted for a single 8-pin PCIe power connector, this configuration is good for up to 225 W.
Teardown
The thermal solution on the Gainward Panther uses two heatpipes. The main heatsink also provides cooling for the memory chips and VRM circuitry.
The backplate is made of metal and protects the card against damage during installation and handling. Please note the two thermal pads, which provide cooling for the memory chips on this side of the PCB.
High-resolution PCB Pictures
These pictures are for the convenience of volt modders and people who would like to see all the finer details on the PCB. Feel free to link back to us and use these in your articles, videos or forum posts.
High-resolution versions are also available (front, back).
Circuit Board (PCB) Analysis
GPU voltage is a six-phase design, managed by a uPI uP9512R controller.
Monolithic Power Systems MP87990 DrMOS components are used for GPU voltage.
Memory voltage is a dual-phase design, managed by a uPI uP1666Q controller.
For memory, Sinopower SM7342EKKP are used.
The memory chips are Samsung K4ZAF325BC-SC20, these are 20 Gbps-rated GDDR6 memory chips. These are the same chips as on the 8 GB version, the only difference is that the 16 GB models have eight memory chips, four on each side.
NVIDIA's AD106 graphics processor is the company's fourth Ada Lovelace GPU. It is built using a 5 nanometer process at TSMC Taiwan, with a transistor count of 22.9 billion and a die size of 190 mm². Please note that the 16 GB version gets its own GPU variant, called AD106-351, whereas the 8 GB version has AD106-350 onboard.
Test System
Test System - GPU 2023.2
Processor:
Intel Core i9-13900K Raptor Lake, 5.8 GHz, 8+16 cores / 32 threads PL1 = PL2 = 320 W
Benchmark scores in other reviews are only comparable when this exact same configuration is used.
All games and cards are tested with the drivers listed above—no performance results were recycled between test systems. Only this exact system with exactly the same configuration is used for all results in this review.
All graphics cards are tested using the same game version.
All games are set to their highest quality setting unless indicated otherwise.
AA and AF are applied via in-game settings, not via the driver's control panel.
Before starting measurements, we heat up the card for each test to ensure a steady state is tested. This ensures that the card won't boost to unrealistically high clocks for only a few seconds until it heats up, as that doesn't represent prolonged gameplay.
For better real-life applicability, all game tests use custom in-game test scenes, not the integrated benchmarks
All cards used for comparison are reference designs. When a reference design does not exist, we go the extra mile and buy the closest possible match, using reference clocks and default power limit.
Each game is tested at these screen resolutions:
1920x1080: Most popular monitor resolution.
2560x1440: Intermediary resolution between Full HD and 4K, with reasonable performance requirements.
3840x2160: 4K Ultra HD resolution, available on high-end monitors.