NVIDIA GeForce Ampere Architecture, Board Design, Gaming Tech & Software

on Sep 4th, 2020,

Manufacturer: NVIDIA

The new Ampere RT Core and Tensor Core

With Ampere, NVIDIA introduces its 2nd generation RT core that aims to improve raytracing acceleration, as well as new effects, such as raytraced motion blur. An RT core is a fixed-function hardware component that handles two of the most challenging tasks for SIMD programmable shaders, bounding volume hierarchy (BVH) traversal and intersection; i.e., calculating the exact point where a ray collides with a surface, so its next course can be charted. Typical raytracing workloads in a raster+raytracing hybrid rendering path involve calculating steps of traversal and intersection across the BVH and bounding-box/triangle intersections, which is a very unsuitable workload for typical GPUs because of the nature of memory accesses involved. This kind of pointer chasing doesn't scale well with SIMD architectures (read: programmable shaders) and is better suited to special fixed-function hardware, like the MIMD RT cores.

Without taking names, NVIDIA pointed out that a minimalist approach toward raytracing (possibly what AMD is up to with RDNA 2) has a performance impact due to overreliance on SIMD stream processors. NVIDIA's RT cores offer a completely hardware-based BVH traversal stack, a purpose-built MIMD execution unit, and inherently lower latency from the hardware stack. The 2nd generation RT core being introduced with Ampere adds one more hardware component.

Ampere introduces a new logic block that interpolates triangle positions along a time scale, in coordination with the triangle intersection unit. NVIDIA tells us that this is useful in generating motion blur effects in real-time raytracing. Our take on this is that NVIDIA is, rather, implementing this as performance optimization for raytracing. As very little will likely change in two frames, there is no need to recalculate all the results for the following frame after all the ray intersections for the current frame have been calculated—the player moved or changed the camera, and objects in the world are positioned only ever so slightly differently. We suspect NVIDIA paired a motion-estimation algorithm with RTX that remembers the last intersections as "good candidates" and checks them early on in the whole process, which can lead to a valid result early in the test and means many entries in the BVH don't have to be processed at all.

3rd Generation Tensor Cores

The new 3rd generation tensor core is largely carried over from the A100 Tensor Core processor NVIDIA introduced this spring, which is purpose-built for AI deep-learning work. To improve performance, Ampere tensor cores are designed to leverage sparsity in deep learning neural nets. Sparsity is a phenomenon where a dense matrix can be trimmed without affecting its accuracy—kind of like how the goal in Jenga is to keep the column intact despite pulling out pieces from the middle. Sparse matrices increase AI inference performance by an order of magnitude.

Jul 12th, 2025 09:10 CDT change timezone

Latest GPU Drivers

New Forum Posts

09:08 by trparky
Stupid buggy POS Realtek WiFi RTL8852BE (10)
09:04 by Waldorf
'NVIDIA App' not usable offline? (13)
08:53 by gasolin
Chrome has removed uBlock Origin 1.64.0 (remove google search suggestions) (12)
08:44 by jm_bmw
Share your AIDA 64 cache and memory benchmark here (3097)
08:43 by remixedcat
The Official Linux/Unix Desktop Screenshots Megathread (778)
08:43 by chrcoluk
No offense, here are some things that bother me about your understanding of fans. (35)
08:33 by chrcoluk
[GPU-Z Test Build] New Kernel Driver, Everyone: Please Test (90)
08:27 by BoggledBeagle
Gigabyte graphic cards - TIM gel SLIPPAGE problem (150)
08:26 by Chomiq
NVIDIA App (55)
08:01 by chrcoluk
Looking for a new m.2 drive that is suitable for livestreams, multi browsing, easy encoding/rendering, NOT gaming! Budget: €300 (30)

Popular Reviews

Jul 9th, 2025 Fractal Design Epoch RGB TG Review
Jul 11th, 2025 Lexar NM1090 Pro 4 TB Review
Jul 8th, 2025 Corsair FRAME 5000D RS Review
Jul 4th, 2025 NVIDIA GeForce RTX 5050 8 GB Review
Jul 7th, 2025 NZXT N9 X870E Review
Jul 11th, 2025 Our Visit to the Hunter Super Computer
Jun 20th, 2025 Sapphire Radeon RX 9060 XT Pulse OC 16 GB Review - An Excellent Choice
Nov 6th, 2024 AMD Ryzen 7 9800X3D Review - The Best Gaming Processor
May 13th, 2025 Upcoming Hardware Launches 2025 (Updated May 2025)
Jul 10th, 2025 Chieftec Iceberg 360 Review