NVIDIA GeForce Ampere Architecture, Board Design, Gaming Tech & Software

W1zzard

on Sep 4th, 2020,

in Graphics Cards.

Manufacturer: NVIDIA

NVIDIA Reflex & G-SYNC 360 »

RTX IO

Storage is the slowest hardware component in a computer, and SATA SSDs helped mitigate this to an extent, particularly with access times and IO; however, a SATA SSD is still infinitesimally slower than the dual-channel DDR4-4000 memory, your processor's L3 cache, or even the 19 Gbps GDDR6X memory on Ampere cards. M.2 NVMe SSDs, which leverage PCIe as the interconnect, have had a transformational impact on storage, mostly because they evolves in bandwidth with each new PCIe generation. Previous-generation PCIe Gen 3 based M.2 NVMe SSDs could offer up to 3.5 GB/s of sequential transfers, and PCIe Gen 4 based ones are expected to do 7 GB/s. Efforts are already underway to make the SSDs of the future even faster than PCIe, with Intel working on Optane Persistent Memory, an SSD that uses DRAM IO and can talk directly to a compatible processor's memory controller, just like a DRAM module would. Future looking bright? Hold up.

Storage isn't without overhead, and each storage IO request in a conventional PC architecture still relies on the CPU to process the IO request. According to tests by NVIDIA, reading uncompressed data from an SSD at 7 GB/s—the maximum sequential read speed of PCIe Gen 4 M.2 NVMe SSDs—requires the full utilization of two CPU cores. The OS typically spreads this workload across all available CPU cores/threads on a modern multi-core CPU. Things change dramatically when compressed data, such as game resources, are being read in a gaming scenario, with a high number of IO requests. Modern AAA games have hundreds of thousands of individual resources crammed into compressed resource-pack files. Although at a disk IO-level, ones and zeroes are still being moved at up to 7 GB/s, the de-compressed data stream at the CPU-level can be as high as 14 GB/s (best case compression). Add to this that each IO request comes with its own overhead—a set of instructions for the CPU to fetch x resource from y file and deliver it to z buffer, along with instructions to de-compress or decrypt the resource.

This could take an enormous amount of CPU muscle at a high IO throughput scale, and NVIDIA pegs the number of CPU cores required as high as 24. Microsoft sought to fix this problem by introducing the DirectStorage API, which enables a GPU to pull compressed data directly from the storage device, unpacking and decompressing the data on the GPU. NVIDIA RTX IO builds on this. NVIDIA RTX IO is a concentric outer layer of DirectStorage that is optimized further for gaming, and NVIDIA's GPU architecture. RTX IO brings GPU-accelerated lossless data decompression to the table, which means data remains compressed and bunched up as it is moved from the disk to the GPU, leveraging DirectStorage. NVIDIA claims this improves IO performance by a factor of two. NVIDIA further claims that GeForce RTX GPUs, thanks to their high CUDA core counts, are capable of offloading "dozens" of CPU cores, driving decompression performance beyond even what compressed data loads PCIe Gen 4 SSDs can throw at them.

Jul 13th, 2025 02:11 CDT change timezone

Latest GPU Drivers

New Forum Posts

02:09 by Cowboystrekk
9800x3D - 6400 CL32 1:1 not stable (12)
02:07 by Greenslade
Best motherboards for XP gaming (115)
02:04 by silentbogo
Is there a WIFI chip I should get? (1)
01:53 by cinemaware
What are you playing? (23945)
01:27 by LabRat 891
9060 XT 16GB or 6800 XT/6900XT? (30)
01:01 by sweethoneybee
ASUS ProArt GeForce RTX 4060 Ti OC Edition 16GB GDDR6 Gaming - nvflash64 VBIOS mismatch (5)
00:51 by A Computer Guy
Upgrade from old x58 system (10)
00:28 by lexluthermiester
New ToS of Take Two and 2K (11)
00:20 by Atlasturner
Someone run games on AMD BC-250 under Linux * Cut down PS5 die to 6 CPU cores 24 GPU cores for use in crypto mining (86)
00:05 by eidairaman1
GPU strip blinking,Trixx software not working properly,fan health fail... (1)

Popular Reviews

Jul 9th, 2025 Fractal Design Epoch RGB TG Review
Jul 11th, 2025 Lexar NM1090 Pro 4 TB Review
Jul 8th, 2025 Corsair FRAME 5000D RS Review
Jul 11th, 2025 Our Visit to the Hunter Super Computer
Jul 4th, 2025 NVIDIA GeForce RTX 5050 8 GB Review
Jul 7th, 2025 NZXT N9 X870E Review
Jun 20th, 2025 Sapphire Radeon RX 9060 XT Pulse OC 16 GB Review - An Excellent Choice
Nov 6th, 2024 AMD Ryzen 7 9800X3D Review - The Best Gaming Processor
May 13th, 2025 Upcoming Hardware Launches 2025 (Updated May 2025)
Jul 10th, 2025 Chieftec Iceberg 360 Review

NVIDIA GeForce Ampere Architecture, Board Design, Gaming Tech & Software

RTX IO

Latest GPU Drivers

New Forum Posts

Popular Reviews

TPU on YouTube

Controversial News Posts