NVIDIA Details DLSS 4 Design: A Complete AI-Driven Rendering Technology

AleksandarK · Mar 14, 2025

NVIDIA has published a research paper on DLSS version 4, its AI rendering technology for real-time graphics performance. The system integrates advancements in frame generation, ray reconstruction, and latency reduction. The flagship Multi-Frame Generation feature generates three additional frames for every native frame. The DLSS 4 later on brings the best looking frames to the user quickly to make is seem like a real rendering. At the core of DLSS 4 is a shift from convolutional neural networks to transformer models. These new AI architectures excel at capturing spatial-temporal dependencies, improving ray-traced affect quality by 30-50% according to NVIDIA's benchmarks. The technology processes each AI-generated frame in just 1 ms on RTX 5090 GPUs—significantly faster than the 3.25 ms required by DLSS 3. For competitive gaming, the new Reflex Frame Warp feature reduces input latency by up to 75%, achieving 14 ms in THE FINALS and under 3 ms in VALORANT, according to NVIDIA's own benchmarks.

DLSS 4's implementation leverages Blackwell-specific architecture capabilities, including FP8 tensor cores and fused CUDA kernels. The optimized pipeline incorporates vertical layer fusion and memory optimizations that keep computational overhead manageable despite using transformer models, which are twice as large as previous CNN implementations. This efficiency enables real-time performance even with the substantially more complex AI processing. The unified AI pipeline reduces manual tuning requirements for ray-traced effects, allowing studios to implement advanced path tracing across diverse hardware configurations. The design also addresses gaming challenges like interpolating fast-moving UI elements and particle effects and reducing artifacts in high-motion scenes. NVIDIA's hardware flip metering and Blackwell-induced display engine integration ensure precise frame pacing of newly generated frames for smooth, high-refresh-rate gaming, with accurate imagery.

To ensure DLSS works as intended and that the neural networks produce quality results, NVIDIA has used a secret weapon: a dedicated supercomputer that has been continuously improving DLSS for the past six years. The supercomputer's primary task involves analyzing failures in DLSS performance, such as ghosting, flickering, or blurriness across hundreds of games. When issues are identified, the system augments its training data sets with new examples of optimal graphics and challenging scenarios that DLSS needs to address. That way, DLSS learns what games look like and generates realistic frames like a game engine would, without any artifacts.

View at TechPowerUp Main Site | Source

x4it3n · Mar 14, 2025

DLSS has come a long way but when you compare the Reference to the Transformer model, you realize that there's still a lot of work to do!

evernessince · Mar 14, 2025

The poll attached to this article is way too black or white. Do we think AI rendering pipleines are the way forward? No I think AI assisted rendering pipelines are the way forward. If I wanted an AI to hallucinate the entire game I wouldn't need to buy the game or a powerful GPU. I'd just buy an expensive AI accelerator and have it generate the game and graphics.

Predictable · Mar 14, 2025

evernessince said:
The poll attached to this article is way too black or white. Do we think AI rendering pipleines are the way forward? No I think AI assisted rendering pipelines are the way forward. If I wanted an AI to hallucinate the entire game I wouldn't need to buy the game or a powerful GPU. I'd just buy an expensive AI accelerator and have it generate the game and graphics.

Well, I've got bad news for you...

evernessince · Mar 14, 2025

Predictable said:
Well, I've got bad news for you...

GPUs aren't the only AI accelerators (you seem to be implying that they are). You can put together a very fast AI build with AMD CPUs at a fraction of the price.

Predictable · Mar 14, 2025

evernessince said:
GPUs aren't the only AI accelerators (you seem to be implying that they are). You can put together a very fast AI build with AMD CPUs at a fraction of the price.

Nah, I wasn't really implying anything other than it seems that is exactly the direction we are headed with PC graphics. Mostly just making a lame joke

Steevo · Mar 14, 2025

While I appreciate the drive to push pixels, when it comes at the loss of fidelity it seems like a step back, for years perfection of AA and AF was of this highest importance, lighting through programmable shaders, tessellation to drive realistic textures, larger frame buffer to hold higher resolution textures, full 32 bit color pipelines, forms of color compression to decrease bandwidth requirements were at the forefront of innovation. Now we have reached the point where pushing the latest thing is costing us in $$$ to realize performance that is simulated like a fake fruit drink that almost takes like the real thing.

JohH · Mar 14, 2025

Nvidia says raytracing is the future but then ships cards (the entire 5000 series so far) where raster games gain more in frame rate and 1% lows than raytraced games. What's the future?

DLSS3 was 'better than native' and now they're showing how DLSS4-T is closer to reference images in select cases.

It's a funny message about the future.

Denver · Mar 14, 2025

At least they didn't use one of the worst TAA implementations as a reference, so you can see that the "native" is still superior.

DemonicRyzen666 · Mar 14, 2025

Denver said:
At least they didn't use one of the worst TAA implementations as a reference, so you can see that the "native" is still superior.

but but but nvidia's motto is better than native resolutions?

Imho A.I pipeline render is the definition of polishing a turd, which doesn't change the fact that it's still a turd.

Denver · Mar 14, 2025

DemonicRyzen666 said:
but but but nvidia's motto is better native resolutions?

Imho A.I pipeline render is the definition of polishing a turd, which doesn't change the fact that it's still a turd.

You got it.
UE5, TAA, and RT introduced a host of problems—atrocious performance, blurry visuals, ghosting, and more. Nvidia then stepped in, conveniently selling the "solution" to the very issues they had a hand in creating in the first place, wielding either a carrot or a stick depending on how you view their approach.

freeagent · Mar 14, 2025

Hate on Nvidia all you want, AMD can see the light, and is following in their footsteps.

Vya Domus · Mar 14, 2025

freeagent said:
Hate on Nvidia all you want, AMD can see the light, and is following in their footsteps.

Business owners paying protection money to gangs because they can see the light.

x4it3n · Mar 14, 2025

Steevo said:
While I appreciate the drive to push pixels, when it comes at the loss of fidelity it seems like a step back, for years perfection of AA and AF was of this highest importance, lighting through programmable shaders, tessellation to drive realistic textures, larger frame buffer to hold higher resolution textures, full 32 bit color pipelines, forms of color compression to decrease bandwidth requirements were at the forefront of innovation. Now we have reached the point where pushing the latest thing is costing us in $$$ to realize performance that is simulated like a fake fruit drink that almost takes like the real thing.

Agree 100%, it was much better back when they pushed every year to get the best of the best (Performance, Technologies, Efficiency, etc.) !

Unfortunately all those companies only care about maximizing profits now. They're not pushing the limits anymore and are just relying on AI to do all the work (that they don't want to do because they're lazy). Look at the Industry as a whole, there have been huge are layoffs everywhere since 2021 and it's still going. Also the people staying are working for 2-3 jobs at once but barely make more money than before, it's like "modern slavery" and AI is only going to replace more and more people anyway.

Event Horizon · Mar 14, 2025

I expected DLSS 4 to be amazing, and it is. I expected FSR 4 to be worse than DLSS 3.x, but it's actually pretty darn good. Additionally, we can override versions from the driver-side, or use tools like OptiScaler to bridge from one vendor to another. Good times ahead.

x4it3n · Mar 14, 2025

JohH said:
Nvidia says raytracing is the future but then ships cards (the entire 5000 series so far) where raster games gain more in frame rate and 1% lows than raytraced games. What's the future?

DLSS3 was 'better than native' and now they're showing how DLSS4-T is closer to reference images in select cases.

It's a funny message about the future.

RTX 50s are just a Refresh of the RTX 40s with more AI stuff (MFG, Neural Textures, etc.) but the IPC has been almost the same since Ampere.
Lovelace was a lot better than Ampere due to a much better process node and a lot more CUDA Cores (at least for the 4090), much higher Clock speeds and a lot more L2 Cache. But in terms of raw performance clock for clock and core for core, Blackwell/Lovelace/Ampere seem to be pretty much on par.

I was expecting the RTX 50s to have much better RT/PT performance but no, I guess they're just keeping that for the RTX 6090 and even more GPU Generated Frames... :mad:

Event Horizon said:
I expected DLSS 4 to be amazing, and it is. I expected FSR 4 to be worse than DLSS 3.x, but it's actually pretty darn good. Additionally, we can override versions from the driver-side, or use tools like OptiScaler to bridge from one vendor to another. Good times ahead.

As much as I hate to say this, AMD should have gone with some AI upscaling from the beginning, because now only RNDA 4 can use FSR 4 :'(

InVasMani · Mar 14, 2025

It's a noticeable improve from cnn overall, but still worse than native though much less immediately pronounced compared to before. Those are also still screenshot comparisons as well rather than like animated ones zoomed in for comparison which would be a bit better reflective of normal expectations.

ZoneDymo · Mar 14, 2025

man these images really do make this upscaling look like crap....better then native my ass

Lycanwolfen · Mar 14, 2025

Why even call it a Geforce anymore since they are not Graphics Force Cards they are A.I. RTX cards.

ZoneDymo · Mar 14, 2025

freeagent said:
Hate on Nvidia all you want, AMD can see the light, and is following in their footsteps.

You do realize AMD and Intel were busy with developing a bunch of this stuff way before Nvidia ever released any of it right?

Event Horizon said:
I expected DLSS 4 to be amazing, and it is. I expected FSR 4 to be worse than DLSS 3.x, but it's actually pretty darn good. Additionally, we can override versions from the driver-side, or use tools like OptiScaler to bridge from one vendor to another. Good times ahead.

imagine if the cards were actually made properly so they could do RT without upscaling......now that would be something

x4it3n · Mar 14, 2025

ZoneDymo said:
You do realize AMD and Intel were busy with developing a bunch of this stuff way before Nvidia ever released any of it right?

Still, Nvidia did catch up and are even beating them... Intel invested a lot in RT and AI back in the days and now they're literally behind everyone lol.

ZoneDymo said:
imagine if the cards were actually made properly so they could do RT without upscaling......now that would be something

Unfortunately this time seems over :cry:

Running games at Native 4K with PT would look amazing! Imagine 4K Path Tracing with MSAA or even SSAA :love:

eidairaman1 · Mar 15, 2025

'You will own nothing and be happy'

freeagent · Mar 15, 2025

ZoneDymo said:
You do realize AMD and Intel

And where are they now :confused:

londiste · Mar 15, 2025

ZoneDymo said:
man these images really do make this upscaling look like crap....better then native my ass

The real question has always been - how much of a quality hit are you willing to take for a given performance boost.

Dr. Dro · Mar 15, 2025

ZoneDymo said:
imagine if the cards were actually made properly so they could do RT without upscaling......now that would be something

Define properly? It's not like they can conjure unlimited computing power out of thin air. The RTX 5090 is already unreasonably powerful, it's the first graphics card to break the single precision (FP32) 100 TFLOPS mark, rated at around ~105TF at its nominal clock speeds (it runs faster!). A half-rack IBM Blue Gene/Q supercomputer with 512 nodes installed (8192 cores) from the early 2010's still falls utterly short of its performance - and you can run that on your PC, off your common desktop power supply, too.

This stuff is unreal, it's that real time ray traced graphics at super high resolutions and super high frame rates is every bit of the "holy grail" that Jensen Huang calls it and then some. It's insanely, extremely advanced technology that will still take years to achieve.

Processor	AMD Ryzen 7 9800X3D (+PBO 5.4GHz)
Motherboard	MSI MPG X870E Carbon Wifi
Cooling	ARCTIC Liquid Freezer II 280 A-RGB
Memory	2x32GB (64GB) G.Skill Trident Z Royal @ 6200MHz 1:1 (30-38-38-30)
Video Card(s)	MSI GeForce RTX 4090 SUPRIM Liquid X
Storage	Crucial T705 4TB (PCIe 5.0) w/ Heatsink + Samsung 990 PRO 2TB (PCIe 4.0) w/ Heatsink
Display(s)	AORUS FO32U2P 4K QD-OLED 240Hz (DP 2.1 UHBR20 80Gbps)
Case	CoolerMaster H500M (Mesh)
Audio Device(s)	AKG N90Q w/ AudioQuest DragonFly Red (USB DAC)
Power Supply	Seasonic Prime TX-1600 Noctua Edition (1600W 80Plus Titanium) ATX 3.1 & PCIe 5.1
Mouse	Logitech G PRO X SUPERLIGHT
Keyboard	Razer BlackWidow V3 Pro
Software	Windows 10 64-bit

Processor	Ryzen 7800X3D
Motherboard	ASRock X670E Taichi
Cooling	Noctua NH-D15 Chromax
Memory	32GB DDR5 6000 CL30
Video Card(s)	MSI RTX 4090 Trio
Storage	P5800X 1.6TB 4x 15.36TB Micron 9300 Pro 4x WD Black 8TB M.2
Display(s)	Acer Predator XB3 27" 240 Hz
Case	Thermaltake Core X9
Audio Device(s)	JDS Element IV, DCA Aeon II
Power Supply	Seasonic Prime Titanium 850w
Mouse	PMM P-305
Keyboard	Wooting HE60
VR HMD	Valve Index
Software	Win 10

Processor	i7-13700K
Motherboard	ASUS ROG Maximus Z690 Hero
Cooling	NZXT Kraken Z73, (6) Lian Li Uni Fan SL Infinity 120mm
Memory	G.Skill Trident Z5 Neo 2x16GB 6400mhz (32-39-39-102)
Video Card(s)	RTX 5080 Founders Edition
Storage	(1) SK hynix P41 Platinum 1TB, (1) WD Blue SN550 500GB, (1) Patriot Scorch 256GB
Display(s)	(1) 32" Samsung Odyssey G7, (1) Dell S3222DGM, (2) Dell S2721DGF
Case	Corsair iCUE 4000x
Audio Device(s)	Corsair Virtuoso RGB
Power Supply	Corsair RM1000x
Mouse	Razer DeathAdder V3 Pro
Keyboard	Corsair K70
VR HMD	Quest 2
Software	Windows 11 Pro

Processor	Ryzen 7800X3D
Motherboard	ASRock X670E Taichi
Cooling	Noctua NH-D15 Chromax
Memory	32GB DDR5 6000 CL30
Video Card(s)	MSI RTX 4090 Trio
Storage	P5800X 1.6TB 4x 15.36TB Micron 9300 Pro 4x WD Black 8TB M.2
Display(s)	Acer Predator XB3 27" 240 Hz
Case	Thermaltake Core X9
Audio Device(s)	JDS Element IV, DCA Aeon II
Power Supply	Seasonic Prime Titanium 850w
Mouse	PMM P-305
Keyboard	Wooting HE60
VR HMD	Valve Index
Software	Win 10

Processor	i7-13700K
Motherboard	ASUS ROG Maximus Z690 Hero
Cooling	NZXT Kraken Z73, (6) Lian Li Uni Fan SL Infinity 120mm
Memory	G.Skill Trident Z5 Neo 2x16GB 6400mhz (32-39-39-102)
Video Card(s)	RTX 5080 Founders Edition
Storage	(1) SK hynix P41 Platinum 1TB, (1) WD Blue SN550 500GB, (1) Patriot Scorch 256GB
Display(s)	(1) 32" Samsung Odyssey G7, (1) Dell S3222DGM, (2) Dell S2721DGF
Case	Corsair iCUE 4000x
Audio Device(s)	Corsair Virtuoso RGB
Power Supply	Corsair RM1000x
Mouse	Razer DeathAdder V3 Pro
Keyboard	Corsair K70
VR HMD	Quest 2
Software	Windows 11 Pro

NVIDIA Details DLSS 4 Design: A Complete AI-Driven Rendering Technology

AleksandarK

News Editor

x4it3n

evernessince

Predictable

evernessince

Predictable

Steevo

JohH

Denver

DemonicRyzen666

Denver

freeagent

Moderator

Vya Domus

x4it3n

Event Horizon

x4it3n

InVasMani

ZoneDymo

Lycanwolfen

ZoneDymo

x4it3n

eidairaman1

The Exiled Airman

freeagent

Moderator

londiste

Dr. Dro

System Name	Compy 386
Processor	7800X3D
Motherboard	Asus
Cooling	Air for now.....
Memory	64 GB DDR5 6400Mhz
Video Card(s)	7900XTX 310 Merc
Storage	Samsung 990 2TB, 2 SP 2TB SSDs, 24TB Enterprise drives
Display(s)	55" Samsung 4K HDR
Audio Device(s)	ATI HDMI
Mouse	Logitech MX518
Keyboard	Razer
Software	A lot.
Benchmark Scores	Its fast. Enough.

System Name	S.L.I + RTX research rig
Processor	Ryzen 7 5800X 3D.
Motherboard	MSI MEG ACE X570
Cooling	Corsair H150i Cappellx
Memory	Corsair Vengeance pro RGB 3200mhz 32Gbs
Video Card(s)	2x Dell RTX 2080 Ti in S.L.I
Storage	Western digital Sata 6.0 SDD 500gb + fanxiang S660 4TB PCIe 4.0 NVMe M.2
Display(s)	HP X24i
Case	Corsair 7000D Airflow
Power Supply	EVGA G+1600watts
Mouse	Corsair Scimitar
Keyboard	Cosair K55 Pro RGB

System Name	Step_sis Rodeo
Processor	AMD R9 9900X @ booost
Motherboard	Asus Strix X670E-F
Cooling	Thermalright Frost Commander 140, TY-143. T30
Memory	2x 16GB Lexar Ares @ 6200 28-36-36-68 1.45v
Video Card(s)	Zotac 4070 Ti Trinity OC @ 3045/1500
Storage	WD SN850 1TB, SN850X 2TB, 2x SN770 1TB
Display(s)	LG 50UP7100
Case	Asus ProArt PA602
Audio Device(s)	JBL Bar 700
Power Supply	Seasonic Vertex GX-1000, Monster HDP1800
Mouse	Logitech G502 Hero
Keyboard	Logitech G213
VR HMD	Oculus 3
Software	Yes
Benchmark Scores	Yes

System Name	Good enough
Processor	AMD Ryzen R9 7900 - Alphacool Eisblock XPX Aurora Edge
Motherboard	ASRock B650 Pro RS
Cooling	2x 360mm NexXxoS ST30 X-Flow, 1x 360mm NexXxoS ST30, 1x 240mm NexXxoS ST30
Memory	32GB - FURY Beast RGB 5600 Mhz
Video Card(s)	Sapphire RX 7900 XT - Alphacool Eisblock Aurora
Storage	1x Kingston KC3000 1TB 1x Kingston A2000 1TB, 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s)	LG UltraGear 32GN650-B + 4K Samsung TV
Case	Phanteks NV7
Power Supply	GPS-750C

System Name	Cyberline
Processor	Intel Core i7 2600k -> 12600k
Motherboard	Asus P8P67 LE Rev 3.0 -> Gigabyte Z690 Auros Elite DDR4
Cooling	Tuniq Tower 120 -> Custom Watercoolingloop
Memory	Corsair (4x2) 8gb 1600mhz -> Crucial (8x2) 16gb 3600mhz
Video Card(s)	AMD RX480 -> RX7800XT
Storage	Samsung 750 Evo 250gb SSD + WD 1tb x 2 + WD 2tb -> 2tb MVMe SSD
Display(s)	Philips 32inch LPF5605H (television) -> Dell S3220DGF
Case	antec 600 -> Thermaltake Tenor HTCP case
Audio Device(s)	Focusrite 2i4 (USB)
Power Supply	Seasonic 620watt 80+ Platinum
Mouse	Elecom EX-G
Keyboard	Rapoo V700
Software	Windows 10 Pro 64bit

System Name	PCGOD
Processor	AMD FX 8350@ 5.0GHz
Motherboard	Asus TUF 990FX Sabertooth R2 2901 Bios
Cooling	Scythe Ashura, 2×BitFenix 230mm Spectre Pro LED (Blue,Green), 2x BitFenix 140mm Spectre Pro LED
Memory	16 GB Gskill Ripjaws X 2133 (2400 OC, 10-10-12-20-20, 1T, 1.65V)
Video Card(s)	AMD Radeon 290 Sapphire Vapor-X
Storage	Samsung 840 Pro 256GB, WD Velociraptor 1TB
Display(s)	NEC Multisync LCD 1700V (Display Port Adapter)
Case	AeroCool Xpredator Evil Blue Edition
Audio Device(s)	Creative Labs Sound Blaster ZxR
Power Supply	Seasonic 1250 XM2 Series (XP3)
Mouse	Roccat Kone XTD
Keyboard	Roccat Ryos MK Pro
Software	Windows 7 Pro 64

Processor	13th Gen Intel Core i9-13900KS
Motherboard	ASUS ROG Maximus Z790 Apex Encore
Cooling	Pichau Lunara ARGB 360 + Honeywell PTM7950
Memory	32 GB G.Skill Trident Z5 RGB @ 7600 MT/s
Video Card(s)	Palit GameRock GeForce RTX 5090 32 GB
Storage	500 GB WD Black SN750 + 4x 300 GB WD VelociRaptor WD3000HLFS HDDs
Display(s)	55-inch LG G3 OLED
Case	Cooler Master MasterFrame 700 benchtable
Power Supply	EVGA 1300 G2 1.3kW 80+ Gold
Mouse	Microsoft Classic IntelliMouse
Keyboard	IBM Model M type 1391405
Software	Windows 10 Pro 22H2