Galax GeForce RTX 4070 Super EX Review 8

Galax GeForce RTX 4070 Super EX Review

(8 Comments) »

Introduction

Galaxy Logo

Galax GeForce RTX 4070 Super EX is an affordable yet premium-looking custom-design graphics card based on NVIDIA's recently announced performance segment GPU. This card is also available with an identical name and model identifier, but under the KFA2 brand, in some markets. The GeForce RTX 4070 Super is part of a three-model refresh by NVIDIA this January, for the higher end of its GeForce RTX 40-series Ada family. It aims to improve the performance and competitiveness of the GeForce RTX brand at the $600 price-point that was previously held by the original RTX 4070, which had been embattled by AMD's recent introduction of the Radeon RX 7800 XT. The "Super" brand extension denotes an increase in performance at given price points without a change in the underlying technology. NVIDIA tends to roll these GPUs out one year into the lifecycle of a GPU generation.



The GeForce RTX 4070 Super continues to be based on the same 5 nm AD104 silicon powering the original RTX 4070 and RTX 4070 Ti, but is closer to the latter in specs, particularly with its shader count. The original RTX 4070 only enabled 46 out of 60 streaming multiprocessors (SM) present on the AD104, or just over three-quarters of it. The RTX 4070 Ti maxes out all 60 SMs, but was $800 at launch. The new RTX 4070 Super enables 56 out of 60 SM, or 93% of the available shaders. Compared to the original RTX 4070, this marks a neat 21% increase in CUDA cores, RT cores, Tensor cores, and TMUs. That's not all, the RTX 4070 Super gets all 48 MB of L2 cache available on the AD104 silicon, the original RTX 4070 only has 36 MB of it; and there's even an increase in the ROP count—the RTX 4070 only had 64 out of 80 ROPs enabled; while the RTX 4070 Super, like the RTX 4070 Ti, has all 80 of them enabled.

With 56 SMs on tap, the GeForce RTX 4070 Super enjoys 7,168 CUDA cores, 224 Tensor cores, 56 RT cores, 224 TMUs, and 80 ROPs, besides all 48 MB of L2 cache. The memory sub-system is carried over from the original, you get 12 GB of GDDR6X memory across a 192-bit memory interface, running at 21 Gbps, for 504 GB/s of memory bandwidth on tap. The increased shader counts, and slightly higher GPU clocks have meant that NVIDIA has had to increase the total graphics power (TGP) of the RTX 4070 Super, to 220 W, up from 200 W for the RTX 4070; but which is still less than the lavish 285 W enjoyed by the RTX 4070 Ti. Even this bit of increase has meant that NVIDIA's board partners can no longer opt for a single 8-pin PCIe power connector configuration, and all cards, including the Galax we're reviewing today, come with a 16-pin 12VHPWR power connector, and an NVIDIA-designed adapter that converts two 8-pin PCIe to a 300 W-rated 12VHPWR.

As we mentioned before, the underlying Ada graphics architecture is unchanged. Designed to take advantage of the 5 nm EUV foundry node, Ada introduces a new generation CUDA core with increased IPC, and support for shader-execution reordering, which improves ray tracing performance; the new 3rd generation RT core that supports displaced micro-meshes that allow game designers to increase complexity of ray traced objects; and the new optical flow accelerator component, which enables DLSS 3 Frame Generation, a path-breaking feature that lets the GPU draw entire alternate frames using AI, without involving the main graphics rendering machinery. NVIDIA has also re-architected the memory sub-system for this generation, by placing a nearly 10-times larger fast on-die L2 cache on the GPU, which reduces round-trips to the video memory by a significant enough amount to allow NVIDIA to narrow down the memory bus to 192-bit (compared to something like the previous-gen RTX 3070 Ti, which enjoyed 256-bit). This narrower memory bus drives memory chips of generationally increased density, which is how the card has 12 GB.

The Galax RTX 4070 Super EX sports a premium appearance, with a triple-slot design, and cooler that looks like it's from a segment above. It uses an aluminium fin-stack heatsink with three heatpipes; and a trio of fans. The one in the middle is slightly larger 102 mm than the ones on the sides, at 92 mm, each. The three fans, along with a GeForce RTX logo on the topside of the cooler shroud, are RGB LED illuminated, and you can control it using the Galax Xtreme Tuner app. The card has its own RGB controller; but also puts out a 3-pin addressable RGB header, so you can sync the rest of your lighting to the card. The Galax EX comes with a handy factory overclock of 2565 MHz out of the box, compared to 2475 MHz reference. The card is priced at $615, an acceptable $15 premium over the NVIDIA baseline price.

NVIDIA GeForce RTX 4070 Super Market Segment Analysis
 PriceCoresROPsCore
Clock
Boost
Clock
Memory
Clock
GPUTransistorsMemory
RTX 4060 Ti$3904352482310 MHz2535 MHz2250 MHzAD10622900M8 GB, GDDR6, 128-bit
RX 6700 XT$300
2560642424 MHz2581 MHz2000 MHzNavi 2217200M12 GB, GDDR6, 192-bit
RTX 3070$3105888961500 MHz1725 MHz1750 MHzGA10417400M8 GB, GDDR6, 256-bit
RTX 3070 Ti$3506144961575 MHz1770 MHz1188 MHzGA10417400M8 GB, GDDR6X, 256-bit
RX 6800$4503840961815 MHz2105 MHz2000 MHzNavi 2126800M16 GB, GDDR6, 256-bit
RX 7700 XT$4303456962171 MHz2544 MHz2250 MHzNavi 3226500M12 GB, GDDR6, 192-bit
RX 6800 XT$50046081282015 MHz2250 MHz2000 MHzNavi 2126800M16 GB, GDDR6, 256-bit
RTX 3080$4508704961440 MHz1710 MHz1188 MHzGA10228000M10 GB, GDDR6X, 320-bit
RTX 4070$5405888641920 MHz2475 MHz1313 MHzAD10435800M12 GB, GDDR6X, 192-bit
RX 7800 XT$5103840962124 MHz2430 MHz2425 MHzNavi 3228100M16 GB, GDDR6, 256-bit
RX 6900 XT$65051201282015 MHz2250 MHz2000 MHzNavi 2126800M16 GB, GDDR6, 256-bit
RX 6950 XT$63051201282100 MHz2310 MHz2250 MHzNavi 2126800M16 GB, GDDR6, 256-bit
RTX 3090$800104961121395 MHz1695 MHz1219 MHzGA10228000M24 GB, GDDR6X, 384-bit
RTX 4070 Super$5907168801980 MHz2475 MHz1313 MHzAD10435800M12 GB, GDDR6X, 192-bit
Galax RTX 4070
Super EX
$6157168801980 MHz2565 MHz1313 MHzAD10435800M12 GB, GDDR6X, 192-bit
RX 7900 GRE$55051201601880 MHz2245 MHz2250 MHzNavi 3157700M16 GB, GDDR6, 256-bit
RTX 4070 Ti$7507680802310 MHz2610 MHz1313 MHzAD10435800M12 GB, GDDR6X, 192-bit
RTX 4070 Ti Super$80084481122340 MHz2610 MHz1313 MHzAD10345900M16 GB, GDDR6X, 256-bit
RX 7900 XT$76053761922000 MHz2400 MHz2500 MHzNavi 3157700M20 GB, GDDR6, 320-bit
RTX 3090 Ti$1050107521121560 MHz1950 MHz1313 MHzGA10228000M24 GB, GDDR6X, 384-bit
RTX 4080$120097281122205 MHz2505 MHz1400 MHzAD10345900M16 GB, GDDR6X, 256-bit
RTX 4080 Super$1000102401122295 MHz2550 MHz1438 MHzAD10345900M16 GB, GDDR6X, 256-bit

Packaging

Package Front
Package Back


The Card

Graphics Card Front
Graphics Card Back
Graphics Card Front Angled

The Galax RTX 4070 Super EX comes with a black color theme, the backplate has some white highlights. There's a cutout for air to flow through, and the main cooler shroud is made from plastic while the backplate is made of metal.

Graphics Card Dimensions

Dimensions of the card are 32 x 13 cm, and it weighs 1065 g.

Graphics Card Height
Graphics Card Back Angled

Installation requires three slots in your system. We measured the card's width to be 50 mm.

Monitor Outputs, Display Connectors

Display connectivity includes three standard DisplayPort 1.4a ports and one HDMI 2.1a (same as Ampere and same as non-Super Ada).

NVIDIA introduced the concept of dual NVDEC and NVENC Codecs with the Ada Lovelace architecture. This means there are two independent sets of hardware-accelerators; so you can encode and decode two streams of video in parallel or one stream at double the FPS rate. While the RTX 4070 Ti features dual units, the RTX 4070 Super and RTX 4070 come with only one of them. The new 8th Gen NVENC now accelerates AV1 encoding, besides HEVC. You also get an "optical flow accelerator" unit that is able to calculate intermediate frames for videos, to smooth playback. The same hardware unit is used for frame generation in DLSS 3.

Graphics Card Power Plugs

All GeForce RTX 4070 Super graphics cards use the 12+4 pin ATX 12VHPWR connector, an adapter cable is included in the box.

Teardown

Graphics Card Cooler Front
Graphics Card Cooler Back

The thermal solution on the Galax EX has three heatpipes. The main heatsink also provides cooling for the memory chips and VRM circuitry.


The backplate is made of metal and protects the card against damage during installation and handling.

High-resolution PCB Pictures

These pictures are for the convenience of volt modders and people who would like to see all the finer details on the PCB. Feel free to link back to us and use these in your articles, videos or forum posts.

Graphics Card Teardown PCB Front
Graphics Card Teardown PCB Back

High-resolution versions are also available (front, back).

Circuit Board (PCB) Analysis

GPU Voltage, VRM Configuration
GPU Chip Voltage Controller

GPU voltage is an eight-phase design, managed by a uPI uP9512R controller.


The GPU VRM uses AOZ5311NQI DrMOS components by Alpha & Omega Semiconductor, rated for 55 A.

Memory Voltage, VRM Configuration
Memory Chip Voltage Controller

Memory voltage is a two-phase design, managed by a uPI uP9529Q controller.


For memory, AOZ5311NQI DrMOS with a 50 A rating are used, too.

Graphics Card Memory Chips

The GDDR6X memory chips are made by Micron and carry the model number D8BZC, which decodes to MT61K512M32KPA-21:U. They are specified to run at 1313 MHz (21 Gbps GDDR6 effective).

Graphics Chip GPU

NVIDIA's AD104 graphics processor is the company's third Ada Lovelace GPU. It is built using a 5 nanometer process at TSMC Taiwan, with a transistor count of 35.8 billion and a die size of 295 mm².

Test System

Test System - GPU 2024.1
Processor:Intel Core i9-14900K
Raptor Lake, 6.0 GHz, 8+16 cores / 32 threads
PL1 = PL2 = 330 W
Motherboard:EVGA Z790 Dark
BIOS 1.13
Resizable BAR:Enabled on all supported cards
(NVIDIA, AMD & Intel)
Memory:Thermaltake TOUGHRAM XG
2x 16 GB DDR5-7200 MHz 36-46-46-96
Cooling:Arctic Liquid Freezer II
280 mm AIO
Thermal Paste:Arctic MX-6
Storage:2x 2 TB M.2 NVMe SSD
Power Supply:Seasonic Vertex GX 850 W
ATX 3.0 / 16-pin 12VHPWR
Case:darkFlash DLX4000 Mesh
Operating System:Windows 11 Professional 64-bit 23H2
VBS enabled (Windows 11 default)
Drivers: RX 7900 GRE: 23.40.19.01 Press Driver
RTX 3050 6 GB: 551.23 WHQL
RTX 4080 Super: 551.22 Press Driver
RTX 4070 Super & 4070 Ti Super: 551.23 WHQL
NVIDIA: 546.17 WHQL
AMD: 23.11.1 WHQL
Intel: 101.4953 WHQL
Date of Retest
Benchmark scores in other reviews are only comparable when this exact same configuration is used.

  • All games and cards are tested with the drivers listed above—no performance results were recycled between test systems. Only this exact system with exactly the same configuration is used for all results in this review.
  • All graphics cards are tested using the same game version.
  • All games are set to their highest quality setting unless indicated otherwise.
  • AA and AF are applied via in-game settings, not via the driver's control panel.
  • Before starting measurements, we heat up the card for each test to ensure a steady state is tested. This ensures that the card won't boost to unrealistically high clocks for only a few seconds until it heats up, as that doesn't represent prolonged gameplay.
  • For better real-life applicability, all game tests use custom in-game test scenes, not the integrated benchmarks
  • All cards used for comparison are reference designs. When a reference design does not exist, we go the extra mile and buy the closest possible match, using reference clocks and default power limit.
Each game is tested at these screen resolutions:
  • 1920x1080: Most popular monitor resolution.
  • 2560x1440: Intermediary resolution between Full HD and 4K, with reasonable performance requirements.
  • 3840x2160: 4K Ultra HD resolution, available on high-end monitors.
Our Patreon Silver Supporters can read articles in single-page format.
Discuss(8 Comments)
Mar 13th, 2025 12:32 EDT change timezone

New Forum Posts

Popular Reviews

Controversial News Posts