NVIDIA Reveals Secret Weapon Behind DLSS Evolution: Dedicated Supercomputer Running for Six Years

Prima.Vera · Jan 20, 2025

I don't know man. The DLAA still looks like a shitty anti-aliasing solution. Still a complete garbage blur mess, just a little better than the worst AA ever invented, TAA.
You want a good AA technique? Just have a look on how older games were looking with 4xSSAA or even 8xSSAA.
Those were the best times.

Dr. Dro · Jan 20, 2025

Prima.Vera said:
I don't know man. The DLAA still looks like a shitty anti-aliasing solution. Still a complete garbage blur mess, just a little better than the worst AA ever invented, TAA.
You want a good AA technique? Just have a look on how older games were looking with 4xSSAA or even 8xSSAA.
Those were the best times.

How? It's DLSS applied to the native resolution, ergo only scene reconstruction algorithm is running. It can't be blurry unless the frame rate is faltering, the perception of blurriness comes from poor resolution or frame rate. DLAA tends to enhance scene detail in some engines that only enable certain aspects of the image when the resolution is high enough.

SSAA was simply increasing the internal rendering resolution (NV DSR and AMD VSR were a way to universally apply this without support in the game), MSAA has always been the clearest in my opinion but it's very resource-inefficient and has many compatibility issues with modern rendering techniques (iirc it doesn't play well with deferred rendering).

tpuuser256 · Jan 20, 2025

The whole point of AI is to surpass the limited human intelligence as well as workforce automation. If all goes well AI will be better than humans at pretty much anything including creating stories or generating worlds (the model would use algorithm for more realisitc and efficient compute)

With nuclear fusion, energy could become much cheaper in the near future and allow exponential growth of civilization (technology and number of people)

igormp · Jan 20, 2025

Dr. Dro said:
PSSR, as everything Sony, is fully proprietary, poorly documented to the public and apparently has been relatively poorly received so far. I don't believe it has any particular need for ML hardware since the PS5 Pro's graphics are still based on RDNA 2, which does not have this capability. Unless there is a semicustom solution, but I don't believe this to be the case.

It's a ML-based upscaling as well, but it doesn't make use of any extra specific hardware. RDNA3.5 (which the PS5 Pro kinda uses) has some extra instructions meant to process some stuff relevant for matmul in lower precision, you can read more about it here:

AMD RDNA 3.5’s LLVM Changes

Integrated graphics have been a key part of AMD’s strategy ever since they bought ATI.

chipsandcheese.com

With the extra hardware bump, it should be able to run an upscaling CNN without much issues and no need for extra hardware (apart from what's in the GPU itself).

Rightness_1 said:
Got you... But what I'm wondering is how can AMD/Sony (allegedly) do this in FSR4 without some "supercomputer" doing the work for them to upscale the image with minimal artifacts?

Sony's PSSR did do something similar to what Nvidia has done with DLSS, by training a model with tons of compute during a long period of time, which can then be used by the actual consoles, they just did not announce it like Nvidia did now. And if Nvidia never gave up this detail away, you wouldn't be making this complaint.
FSR4, if truly based on ML, will also require lots of compute time beforehand in order to create a model that can perform this task in your local GPU.

Let me try to give you a better example: do you know that feature in phone's gallery that are able to recognize people or places?
That's a machine learning model that's running in your phone and tagging those images behind the scenes.

That "model" (think of a "model" as a binary or dll that contains the "runtime" of the AI stuff) has been trained by google/samsung/apple in their servers for long hours with tons of examples saying "this picture is a dog", "this is a car", "this is a beach", "this person X is different from person Y", etc etc. This part is the "training" part, which is really compute intensive and takes really long time. As an example, the GPT model behind ChatGPT took around 5~6 months to train.
The outcome of this model is then shipped into your phone, where it's able to use what it has learnt and apply it to your cat pictures, and say that it is a cat. This part is called the "inference" part, and is often really fast. Think how DLSS, even in its first version, was able to upscale a frame from a smaller res into a higher one with really fast FPS (so for each frame, it upscaled it in less than 10ms!). In a similar manner, think how your phone is able to tag a pic as a "dog" really quick, or how ChatGPT is able to give you answers reasonably fast, even though the training part for all of those tasks took weeks, months, or even years.

LittleBro · Jan 20, 2025

Dr. Dro said:
How? It's DLSS applied to the native resolution, ergo only scene reconstruction algorithm is running. It can't be blurry unless the frame rate is faltering, the perception of blurriness comes from poor resolution or frame rate. DLAA tends to enhance scene detail in some engines that only enable certain aspects of the image when the resolution is high enough.

SSAA was simply increasing the internal rendering resolution (NV DSR and AMD VSR were a way to universally apply this without support in the game), MSAA has always been the clearest in my opinion but it's very resource-inefficient and has many compatibility issues with modern rendering techniques (iirc it doesn't play well with deferred rendering).

DLSS itself causes distortions (blurs image). You won't notice it so much because there's a series of various filters and sharpening tools after upscaling. You actually answered your question.

tpuuser256 said:
The whole point of AI is to surpass the limited human intelligence as well as workforce automation. If all goes well AI will be better than humans at pretty much anything including creating stories or generating worlds (the model would use algorithm for more realisitc and efficient compute)

With nuclear fusion, energy could become much cheaper in the near future and allow exponential growth of civilization (technology and number of people)

It's fair to mention that this is not a real AI. This term is being misused too much nowadays for naming any neural network (model) involved system which gets accelerated by GPUs. Neural networks, machine learning, deep learning is part of AI, but not AI itself.

igormp · Jan 20, 2025

LittleBro said:
DLSS itself causes distortions (blurs image). You won't notice it so much because there's a series of various filters and sharpening tools after upscaling. You actually answered your question.

It's fair to mention that this is not a real AI. This term is being misused too much nowadays for naming any neural network (model) involved system which gets accelerated by GPUs. Neural networks, machine learning, deep learning is part of AI, but not AI itself.

I mean, academically AI is a more generic term that could range its meaning from a basic rule-based system (bunch of if/else) up to ML/deep learning.
Aren't you thinking about AGI?

Zach_01 · Jan 20, 2025

igormp said:
I mean, academically AI is a more generic term that could range its meaning from a basic rule-based system (bunch of if/else) up to ML/deep learning.
Aren't you thinking about AGI?

I remember since for ever the in game NPC opponents behavior to be called the "AI of the game" which exactly falls under the example of if/else I think. Couldn't learn anything... lol
I think of AI as a general term. There are terms after AI that can distinguish it into multiple levels, from sub-human, on par, up to beyond.
All some form of AI though.

To me the idea of intelligence is not equal to something necessarily high.
Is not correct to say... this has low/very low intelligence? I think it is.
And when it is made it is artificial.

Kapone33 · Jan 20, 2025

I am confident that if the title said AMD this thread would be 20 pages. Here we go with the FSR comments from people that have never used it in a Nvidia thread.

JustBenching · Jan 20, 2025

kapone32 said:
I am confident that if the title said AMD this thread would be 20 pages. Here we go with the FSR comments from people that have never used it in a Nvidia thread.

It would just be comical, imagine "AMD reveals secret weapon behind FSR". I mean why would it even need to be secret, nobody would try copying it, it's bad.

LittleBro · Jan 20, 2025

kapone32 said:
I am confident that if the title said AMD this thread would be 20 pages. Here we go with the FSR comments from people that have never used it in a Nvidia thread.

Funny thing is that they don't even know that Nvidia cards can do FSR as well as XeSS.

JustBenching said:
It would just be comical, imagine "AMD reveals secret weapon behind FSR". I mean why would it even need to be secret, nobody would try copying it, it's bad.

If AMD is not keeping it's own secret in form of hidden supercomputer somewhere to continuously smooth out FSR model as Nvidia does with DLSS, then if you compare effort invested in the technology by AMD and Nvidia, AMD's results are not so bad.

JustBenching · Jan 20, 2025

LittleBro said:
If AMD is not keeping it's own secret in form of hidden supercomputer somewhere to continuously smooth out FSR model as Nvidia does with DLSS, then if you compare effort invested in the technology by AMD and Nvidia, AMD's results are not so bad.

Sure, does it matter? Im not paying for the effort, im paying for the result.

LittleBro · Jan 20, 2025

JustBenching said:
Sure, does it matter? Im not paying for the effort, im paying for the result.

Whether it matters, that's personal. It does certainly shed other light.

Dr. Dro · Jan 20, 2025

kapone32 said:
I am confident that if the title said AMD this thread would be 20 pages. Here we go with the FSR comments from people that have never used it in a Nvidia thread.

You do realize Nvidia cards can run all vendors' upscaling solutions, right? As for the "secret weapon behind FSR" argument... it's been there since day one? It is open source, after all...

igormp said:
It's a ML-based upscaling as well, but it doesn't make use of any extra specific hardware. RDNA3.5 (which the PS5 Pro kinda uses) has some extra instructions meant to process some stuff relevant for matmul in lower precision, you can read more about it here:

AMD RDNA 3.5’s LLVM Changes

Integrated graphics have been a key part of AMD’s strategy ever since they bought ATI.

chipsandcheese.com

With the extra hardware bump, it should be able to run an upscaling CNN without much issues and no need for extra hardware (apart from what's in the GPU itself).

I was aware that this was a RDNA 2 solution with some improvements ported from RDNA 3-3.5, I suppose this is one of them. Weird custom chip.

Processor	Intel® Core™ i7-13700K
Motherboard	Gigabyte Z790 Aorus Elite AX
Cooling	Noctua NH-D15
Memory	32GB(2x16) DDR5@6600MHz G-Skill Trident Z5
Video Card(s)	KUROUTOSHIKOU RTX 5080 GALAKURO
Storage	2TB SK Platinum P41 SSD + 4TB SanDisk Ultra SSD + 500GB Samsung 840 EVO SSD
Display(s)	Acer Predator X34 3440x1440@100Hz G-Sync
Case	NZXT PHANTOM410-BK
Audio Device(s)	Creative X-Fi Titanium PCIe
Power Supply	Corsair 850W
Mouse	Logitech Hero G502 SE
Software	Windows 11 Pro - 64bit
Benchmark Scores	30FPS in NFS:Rivals

Processor	13th Gen Intel Core i9-13900KS
Motherboard	ASUS ROG Maximus Z790 Apex Encore
Cooling	Pichau Lunara ARGB 360 + Honeywell PTM7950
Memory	32 GB G.Skill Trident Z5 RGB @ 7600 MT/s
Video Card(s)	Palit GameRock OC GeForce RTX 5090 32 GB
Storage	500 GB WD Black SN750 + 4x 300 GB WD VelociRaptor WD3000HLFS HDDs
Display(s)	55-inch LG G3 OLED
Case	Cooler Master MasterFrame 700 benchtable
Audio Device(s)	EVGA NU Audio + Sony MDR-V7 headphones
Power Supply	EVGA 1300 G2 1.3kW 80+ Gold
Mouse	Microsoft Classic IntelliMouse
Keyboard	IBM Model M type 1391405
Software	Windows 10 Pro 22H2
Benchmark Scores	I pulled a Qiqi~

Processor	9950x \| 5950x
Motherboard	x670e ProArt\| B550 ProArt
Cooling	PA 120 SE \|Fuma 2
Memory	4x64GB Kingston CUDIMM @5200MHz \| 4x32GB 3200MHz Corsair LPX
Video Card(s)	2x RTX 3090
Display(s)	LG 42" C2 4k OLED
Power Supply	Corsair RM1000e \| XPG Core Reactor 850W
Software	I use Arch btw

System Name	AM5_TimeKiller
Processor	AMD Ryzen 7 9800X3D
Motherboard	ASUS ROG Strix B650E-F Gaming
Cooling	Arctic Freezer II 420 rev.7 (with 6 fans in push-pull setup)
Memory	G.Skill Trident Z5 Neo RGB, 2x16 GB DDR5, Hynix A-Die, 6400 MHz @ CL30-39-39-102-141 1T @ 1.40 V
Video Card(s)	ASUS TUF Radeon RX 9070 XT GAMING
Storage	Samsung 990 PRO 1 TB, Kingston KC3000 1 TB, Kingston KC3000 2 TB
Case	Corsair 7000D Airflow
Audio Device(s)	Creative Sound Blaster X-Fi Titanium
Power Supply	Seasonic Prime TX-850
Mouse	Logitech wireless mouse for 15€, 6y old
Keyboard	Logitech wireless keyboard, 12y old

Processor	9950x \| 5950x
Motherboard	x670e ProArt\| B550 ProArt
Cooling	PA 120 SE \|Fuma 2
Memory	4x64GB Kingston CUDIMM @5200MHz \| 4x32GB 3200MHz Corsair LPX
Video Card(s)	2x RTX 3090
Display(s)	LG 42" C2 4k OLED
Power Supply	Corsair RM1000e \| XPG Core Reactor 850W
Software	I use Arch btw

System Name	PC on since March 2025, upgraded from 5900X
Processor	Ryzen 7 9700X (March 2025), 140W PPT limit, 85C temp limit, CO -25, +100MHz (up to 5.65GHz)
Motherboard	Asrock X870E NOVA, BIOS v3.2, AGESA PI 1.2.0.3a Patch A
Cooling	Arctic Liquid Freezer II 420mm Rev7 (Jan 2024) with off-center mount for Ryzen, TIM: Kryosheet
Memory	2x32GB G.Skill Trident Z5 RGB (March2025) 6000MT/s 1.40V CL30-36-36-36-68-104 1T, tRFC:500, Hynix-A
Video Card(s)	Sapphire Nitro+ RX 7900XTX (Dec 2023) 314~467W (370W current) PowerLimit, 1070mV, Adrenalin v25.5.1
Storage	NVMe: 990Pro 2TB(OS 25), 980Pro 1TB(22), 970Pro 512(19) / S-III: 850Pro 1TB(15) 860Evo 1TB(20)
Display(s)	Dell Alienware AW3423DW 34" QD-OLED curved (1800R), 3440x1440 144Hz (max 175Hz) HDR400/1000, VRR on
Case	Thermaltake Core P8 TG Gaming Full Tower, Fans: 9x140mm + 3x120mm
Audio Device(s)	Astro A50 headset
Power Supply	Corsair HX750i, ATX v2.4, 80+ Platinum, 93% (250~700W), modular, single/dual rail (switch)
Mouse	Logitech MX Master (Gen1)
Keyboard	Logitech G15 (Gen2) w/ LCDSirReal applet
Software	Windows 11 Home 64bit (v24H2, OSBuild 26100.4061), 1st install March 2025

System Name	Best AMD Computer
Processor	AMD 7900X3D
Motherboard	Asus X670E E Strix
Cooling	In Win SR36
Memory	GSKILL DDR5 32GB 5200 30
Video Card(s)	Sapphire Pulse 7900XT (Watercooled)
Storage	Corsair MP 700, Seagate 530 2Tb, Adata SX8200 2TBx2, Kingston 2 TBx2, Micron 8 TB, WD AN 1500
Display(s)	GIGABYTE FV43U
Case	Corsair 7000D Airflow
Audio Device(s)	Corsair Void Pro, Logitch Z523 5.1
Power Supply	Deepcool 1000M
Mouse	Logitech g7 gaming mouse
Keyboard	Logitech G510
Software	Windows 11 Pro 64 Steam. GOG, Uplay, Origin
Benchmark Scores	Firestrike: 46183 Time Spy: 25121

System Name	Mean machine
Processor	AMD 6900HS
Memory	2x16 GB 4800C40
Video Card(s)	AMD Radeon 6700S