
DirectStorage API Works Even with PCIe Gen3 NVMe SSDs

Joined
Oct 1, 2006
Messages
4,930 (0.75/day)
Location
Hong Kong
Processor Core i7-12700k
Motherboard Z690 Aero G D4
Cooling Custom loop water, 3x 420 Rad
Video Card(s) RX 7900 XTX Phantom Gaming
Storage Plextor M10P 2TB
Display(s) InnoCN 27M2V
Case Thermaltake Level 20 XT
Audio Device(s) Soundblaster AE-5 Plus
Power Supply FSP Aurum PT 1200W
Software Windows 11 Pro 64-bit
Ever wondered why the BAR size suddenly got brought up even though it's been in the PCIe spec for years?

I can honestly see the theory of this working on DX11 cards, but as with anything in the tech world, unless it's pushing something new or shiny you will pretty much never see it backported en masse.
It's possible devs could implement this separately from DX12, but it's for new games only.
The key is that it requires a DX12 GPU to have support for the feature, not that it requires the game to be running in DX12 mode.
(At this stage no one knows if the feature will get rolled back to older GPUs; we've only got a few vague clues.)
But then is it that much to ask for a DX12 compatible GPU? Everything since Fermi and GCN is "compatible" with DX12.
Older GPUs don't even get driver updates anymore, so even if M$ makes them work somehow they won't get the driver needed.
 

Mussels

Freshwater Moderator
Joined
Oct 6, 2004
Messages
58,413 (7.96/day)
Location
Oystralia
System Name Rainbow Sparkles (Power efficient, <350W gaming load)
Processor Ryzen R7 5800x3D (Undervolted, 4.45GHz all core)
Motherboard Asus x570-F (BIOS Modded)
Cooling Alphacool Apex UV - Alphacool Eisblock XPX Aurora + EK Quantum ARGB 3090 w/ active backplate
Memory 2x32GB DDR4 3600 Corsair Vengeance RGB @3866 C18-22-22-22-42 TRFC704 (1.4V Hynix MJR - SoC 1.15V)
Video Card(s) Galax RTX 3090 SG 24GB: Underclocked to 1700Mhz 0.750v (375W down to 250W))
Storage 2TB WD SN850 NVME + 1TB Sasmsung 970 Pro NVME + 1TB Intel 6000P NVME USB 3.2
Display(s) Phillips 32 32M1N5800A (4k144), LG 32" (4K60) | Gigabyte G32QC (2k165) | Phillips 328m6fjrmb (2K144)
Case Fractal Design R6
Audio Device(s) Logitech G560 | Corsair Void pro RGB |Blue Yeti mic
Power Supply Fractal Ion+ 2 860W (Platinum) (This thing is God-tier. Silent and TINY)
Mouse Logitech G Pro wireless + Steelseries Prisma XL
Keyboard Razer Huntsman TE ( Sexy white keycaps)
VR HMD Oculus Rift S + Quest 2
Software Windows 11 pro x64 (Yes, it's genuinely a good OS) OpenRGB - ditch the branded bloatware!
Benchmark Scores Nyooom.
But then is it that much to ask for a DX12 compatible GPU? Everything since Fermi and GCN is "compatible" with DX12.
Older GPUs don't even get driver updates anymore, so even if M$ makes them work somehow they won't get the driver needed.
The difference is compatible vs. compliant.

Compatible means some features are emulated, or missing in the case of optional ones, and DirectStorage may use those optional ones.
My guess is games designed for this tech will have faster than average load times anyway, since they're being optimised for SSDs and not 4,200RPM laptop drives in consoles.
 
Joined
Oct 1, 2006
Messages
4,930 (0.75/day)
Location
Hong Kong
The difference is compatible vs. compliant.

Compatible means some features are emulated, or missing in the case of optional ones, and DirectStorage may use those optional ones.
My guess is games designed for this tech will have faster than average load times anyway, since they're being optimised for SSDs and not 4,200RPM laptop drives in consoles.
The article just says it requires a compatible GPU, nothing about the feature level required or whatnot.
Hopefully 'it just works'.
 

Mussels

Freshwater Moderator
Joined
Oct 6, 2004
Messages
58,413 (7.96/day)
Location
Oystralia
The article just says it requires a compatible GPU, nothing about the feature level required or whatnot.
Hopefully 'it just works'.
Yes, but what is required to be compatible?

I would not be shocked to find out it's a DX12 Ultimate feature
(I'd be happy if it wasn't)
 
Joined
Oct 10, 2009
Messages
792 (0.14/day)
Location
Madrid, Spain
System Name Rectangulote
Processor Core I9-9900KF
Motherboard Asus TUF Z390M
Cooling Alphacool Eisbaer Aurora 280 + Eisblock RTX 3090 RE + 2 x 240 ST30
Memory 32 GB DDR4 3600mhz CL16 Crucial Ballistix
Video Card(s) KFA2 RTX 3090 SG
Storage WD Blue 3D 2TB + 2 x WD Black SN750 1TB
Display(s) 2 x Asus ROG Swift PG278QR / Samsung Q60R
Case Corsair 5000D Airflow
Audio Device(s) Evga Nu Audio + Sennheiser HD599SE + Trust GTX 258
Power Supply Corsair RMX850
Mouse Razer Naga Wireless Pro / Logitech MX Master
Keyboard Keychron K4 / Dierya DK61 Pro
Software Windows 11 Pro
Yes, but what is required to be compatible?

I would not be shocked to find out it's a DX12 Ultimate feature
(I'd be happy if it wasn't)
New RTX 4090 Ti with a double NVMe slot on the back of the card.
 

Mussels

Freshwater Moderator
Joined
Oct 6, 2004
Messages
58,413 (7.96/day)
Location
Oystralia
New RTX 4090 Ti with a double NVMe slot on the back of the card.
Wait... NVMe is almost fast enough to work as a VRAM supplement. brb making a patent.
 
Joined
Oct 10, 2009
Messages
792 (0.14/day)
Location
Madrid, Spain
Wait... NVMe is almost fast enough to work as a VRAM supplement. brb making a patent.
I said it as a joke, but if you think about what DirectStorage is, it actually makes sense as the next step in the process. There were Matrox VGA cards in the past that could be expanded with SO-DIMMs.
 

Mussels

Freshwater Moderator
Joined
Oct 6, 2004
Messages
58,413 (7.96/day)
Location
Oystralia
I said it as a joke, but if you think about what DirectStorage is, it actually makes sense as the next step in the process. There were Matrox VGA cards in the past that could be expanded with SO-DIMMs.
I could see something like Optane working as a slot-in GPU booster.
Maybe APUs would benefit the most from something like that, as they're dealing with slower system RAM anyway.

What we really need is a giant GPU in the ATX standard with a PCIe card to slot in the CPU.
 
Joined
Sep 28, 2012
Messages
979 (0.22/day)
System Name Poor Man's PC
Processor AMD Ryzen 7 7800X3D
Motherboard MSI B650M Mortar WiFi
Cooling Thermalright Phantom Spirit 120 with Arctic P12 Max fan
Memory 32GB GSkill Flare X5 DDR5 6000Mhz
Video Card(s) XFX Merc 310 Radeon RX 7900 XT
Storage XPG Gammix S70 Blade 2TB + 8 TB WD Ultrastar DC HC320
Display(s) Xiaomi G Pro 27i MiniLED
Case Asus A21 Case
Audio Device(s) MPow Air Wireless + Mi Soundbar
Power Supply Enermax Revolution DF 650W Gold
Mouse Logitech MX Anywhere 3
Keyboard Logitech Pro X + Kailh box heavy pale blue switch + Durock stabilizers
VR HMD Meta Quest 2
Benchmark Scores Who need bench when everything already fast?
Ever wondered why the BAR size suddenly got brought up even though it's been in the PCIe spec for years?

I can honestly see the theory of this working on DX11 cards, but as with anything in the tech world, unless it's pushing something new or shiny you will pretty much never see it backported en masse.

Simple: you need hardware to support it, "cache" in this case. Both parties support Resizable BAR, but one of them already shipped with it in the box and the other is just following the hype train, as discussed earlier in this thread:

GPU Memory Latency Tested on AMD's RDNA 2 and NVIDIA's Ampere Architecture

and backed up with this video


In theory it could be stretched to DirectX 11 titles by simply removing a performance target or some kind of limitation (in this case the transfer rate), but I don't think the impact will be as significant as the Variable Rate Shading and Resource Binding features that are embedded in DirectX 12.
 
Joined
Oct 1, 2006
Messages
4,930 (0.75/day)
Location
Hong Kong
Wait... NVMe is almost fast enough to work as a VRAM supplement. brb making a patent.
Radeon Pro SSG was a thing :D
I guess the software overhead etc. was the reason it never took off.
1619098037180.png

1619097939113.png
 
Joined
Jun 10, 2014
Messages
2,978 (0.78/day)
Processor AMD Ryzen 9 5900X ||| Intel Core i7-3930K
Motherboard ASUS ProArt B550-CREATOR ||| Asus P9X79 WS
Cooling Noctua NH-U14S ||| Be Quiet Pure Rock
Memory Crucial 2 x 16 GB 3200 MHz ||| Corsair 8 x 8 GB 1333 MHz
Video Card(s) MSI GTX 1060 3GB ||| MSI GTX 680 4GB
Storage Samsung 970 PRO 512 GB + 1 TB ||| Intel 545s 512 GB + 256 GB
Display(s) Asus ROG Swift PG278QR 27" ||| Eizo EV2416W 24"
Case Fractal Design Define 7 XL x 2
Audio Device(s) Cambridge Audio DacMagic Plus
Power Supply Seasonic Focus PX-850 x 2
Mouse Razer Abyssus
Keyboard CM Storm QuickFire XT
Software Ubuntu
So it should let textures stream across fast, get decompressed and smashed open fast, and generally make load times and texture pop in go away
Faster load times are certainly possible, especially with accelerated decompression.
But texture popping is caused by a missing resource, and the only real way to avoid that is prefetching. So this technology by itself will not solve that problem, but you can certainly build a game engine which prefetches textures combined with this technology.
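
To illustrate the prefetching point, here is a minimal Python sketch of a cache that starts loading a texture in the background before it is needed. The `TextureCache` class and the `load_fn` callback are hypothetical names made up for this example; they are not part of DirectStorage or of any real engine, and a real implementation would decide what to prefetch from camera position, level layout and so on.

```python
from concurrent.futures import ThreadPoolExecutor


class TextureCache:
    """Toy cache: loads run on worker threads so they can start early."""

    def __init__(self, load_fn, workers=2):
        # load_fn(name) is an assumed callback that reads and decodes a texture.
        self._load = load_fn
        self._pool = ThreadPoolExecutor(max_workers=workers)
        self._pending = {}  # texture name -> Future holding the loaded data

    def prefetch(self, name):
        """Kick off a background load if one has not been started yet."""
        if name not in self._pending:
            self._pending[name] = self._pool.submit(self._load, name)

    def is_ready(self, name):
        """True only if the texture was requested and has finished loading."""
        future = self._pending.get(name)
        return future is not None and future.done()

    def get(self, name):
        """Return the texture, blocking if the load has not finished."""
        self.prefetch(name)
        return self._pending[name].result()
```

If `prefetch` is called early enough, `get` returns without waiting. If it is not, `get` blocks on the load, which is the situation that shows up as pop-in or a hitch in a real game, no matter how fast the storage path is.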

Well, the only NVMe that I have is my 500 GB boot drive. Not putting any games there. Gonna wait and see if it's worth to buy an extra 2 TB NVMe just for this.
I would highly recommend having a separate boot drive. The OS will cause a lot of wear on TLC/QLC SSDs, so you better have your files somewhere else.
If your motherboard has a free PCIe x4 slot, you can buy an M.2 adapter for it.
 
D

Deleted member 205776

Guest
I would highly recommend having a separate boot drive. The OS will cause a lot of wear on TLC/QLC SSDs, so you better have your files somewhere else.
If your motherboard has a free PCIe x4 slot, you can buy an M.2 adapter for it.
Not gonna reinstall/clone my entire OS for insignificant writes on a 300TBW drive. Had it for almost a year and it's only at 6 TB. If it dies I'll go back to my previous boot drive.
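
As a back-of-the-envelope check on those numbers, the short Python sketch below estimates how long a drive would take to reach its rated endurance at a constant write rate. The function name is made up for this example, "almost a year" is rounded to one year, and the rated TBW is a warranty figure rather than a hard failure point, so this is only a rough estimate.

```python
def years_until_rated_tbw(rated_tbw, written_tb, years_in_use):
    """Years left before reaching the rated TBW, assuming a constant write rate."""
    if written_tb <= 0 or years_in_use <= 0:
        raise ValueError("written_tb and years_in_use must be positive")
    tb_per_year = written_tb / years_in_use
    return (rated_tbw - written_tb) / tb_per_year


# 300 TBW drive, 6 TB written in roughly one year of use
print(years_until_rated_tbw(300, 6, 1))  # 49.0
```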

All the files that I care about amount to only ~30GB and are backed up monthly on my Dropbox. Everything else can be downloaded & reinstalled easily, and I can also reconfigure my OS easily because I keep my .reg files, redists, and other necessary tweaks also on my Dropbox.

The NVMe is solely for OS files and programs I can redownload.

Also nobody gonna answer if the NVMe needs to be attached to CPU for DirectStorage to work?
 
D

Deleted member 193792

Guest
Also nobody gonna answer if the NVMe needs to be attached to CPU for DirectStorage to work?
It does need the NVMe to be directly connected to the SoC/uncore (northbridge). The GPU also connects to the PCIe root complex (SoC). It's the only way to have CPU <-> GPU direct communication for instant data transfers.

That's how next-gen consoles are. The SSD connects directly to the APU. Last-gen consoles had the HDD connected through the southbridge (which isn't a big deal for HDDs or SATA SSDs).

So, if you have an old Intel PC (Rocket Lake introduced 4 dedicated lanes for NVMe, just like AMD Ryzen/AM4 since 2017) where the NVMe is attached to the PCH (southbridge), you're screwed. There's no benefit, so expect to upgrade your platform.

PCH connection is usually 4 lanes (4 GB/s) and any NVMe worth its salt is going to saturate that bus (let alone the fact you may also have SATA HDDs, Gigabit Ethernet, TV tuner card etc.)
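
The "4 lanes (4 GB/s)" figure can be sanity-checked from the published PCIe per-lane signalling rates and line encodings. This is a rough Python sketch, assuming the chipset uplink behaves like a PCIe 3.0 x4 link; the function name is made up for this example, and the result is the raw link rate with packet and protocol overhead ignored, so real-world throughput is somewhat lower.

```python
# Per-lane signalling rate in GT/s and line-encoding efficiency per generation.
PCIE_GENERATIONS = {
    1: (2.5, 8 / 10),      # 8b/10b encoding
    2: (5.0, 8 / 10),      # 8b/10b encoding
    3: (8.0, 128 / 130),   # 128b/130b encoding
    4: (16.0, 128 / 130),
    5: (32.0, 128 / 130),
}


def pcie_raw_gb_per_s(generation, lanes):
    """Raw one-direction link bandwidth in GB/s for a PCIe link."""
    gigatransfers, efficiency = PCIE_GENERATIONS[generation]
    return gigatransfers * efficiency / 8 * lanes  # 8 bits per byte


print(round(pcie_raw_gb_per_s(3, 4), 2))  # 3.94, a Gen3 x4 link
print(round(pcie_raw_gb_per_s(4, 4), 2))  # 7.88, a Gen4 x4 link
```

A fast PCIe 3.0 x4 NVMe drive on its own can approach the Gen3 x4 figure, which is the basis for the saturation argument when the drive shares an uplink of the same width with other devices.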
 
D

Deleted member 205776

Guest
It does need the NVMe to be directly connected to the SoC/uncore (northbridge). The GPU also connects to the PCIe root complex (SoC). It's the only way to have CPU <-> GPU direct communication for instant data transfers.

That's how next-gen consoles are. The SSD connects directly to the APU. Last-gen consoles had the HDD connected through the southbridge (which isn't a big deal for HDDs or SATA SSDs).

So, if you have an old Intel PC (Rocket Lake introduced 4 dedicated lanes for NVMe, just like AMD Ryzen/AM4 since 2017) where the NVMe is attached to the PCH (southbridge), you're screwed. There's no benefit, so expect to upgrade your platform.

PCH connection is usually 4 lanes (4 GB/s) and any NVMe worth its salt is going to saturate that bus (let alone the fact you may also have SATA HDDs, Gigabit Ethernet, TV tuner card etc.)
Then shit, guess I'm not using this. Whatever
 
Joined
Dec 26, 2006
Messages
3,807 (0.58/day)
Location
Northern Ontario Canada
Processor Ryzen 5700x
Motherboard Gigabyte X570S Aero G R1.1 BiosF5g
Cooling Noctua NH-C12P SE14 w/ NF-A15 HS-PWM Fan 1500rpm
Memory Micron DDR4-3200 2x32GB D.S. D.R. (CT2K32G4DFD832A)
Video Card(s) AMD RX 6800 - Asus Tuf
Storage Kingston KC3000 1TB & 2TB & 4TB Corsair MP600 Pro LPX
Display(s) LG 27UL550-W (27" 4k)
Case Be Quiet Pure Base 600 (no window)
Audio Device(s) Realtek ALC1220-VB
Power Supply SuperFlower Leadex V Gold Pro 850W ATX Ver2.52
Mouse Mionix Naos Pro
Keyboard Corsair Strafe with browns
Software W10 22H2 Pro x64
Smartphones are not comparable. They're controlled with what is the equivalent of a giant mouse pointer. They need massive pixel density so that a large amount of content can fit in a tiny 6 inch screen at best, if not smaller, and massive scaling for the user to control the UI and other stuff relatively easily, like tapping on links or selecting content. Take out the pixel density and everything will be big. Take out the scaling and you can try to fine control things when your finger is area bombardment for your touch screen (and provided I have somewhat average fingers for the sample there, I know people with way thicker fingers than mine, and the scaling on my phone can't go to 100%, 125% is the minimum, which is what I used here).

For the reference: 1080p screen at 21.5 inches (so, around 102 PPI, a bit above the 92 PPI of a 1080p 24 inch screen), my phone's 1280x720p 5.7 inch screen scaled to match in real-world size against my display at the upper right and Firefox Responsive Design mode on the lower right to show what it would be like if phones didn't have high pixel density displays. The giant black/white circles are the size of my finger tip on the screen.
View attachment 197681
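
The PPI figures quoted above follow from dividing the diagonal resolution in pixels by the diagonal size in inches. This is a small Python sketch of that arithmetic; the function name is made up for this example.

```python
import math


def pixels_per_inch(width_px, height_px, diagonal_inches):
    """Pixel density from the resolution and the diagonal screen size."""
    return math.hypot(width_px, height_px) / diagonal_inches


print(round(pixels_per_inch(1920, 1080, 21.5)))  # 102
print(round(pixels_per_inch(1920, 1080, 24)))    # 92
print(round(pixels_per_inch(1280, 720, 5.7)))    # 258
```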

Your issue is that you need or want to be able to see a massive amount of content (otherwise you wouldn't have a 4K screen), have a small desk and want to use normal size scales. You can't have all three. Something's gotta give. You gotta step down your resolution, or get a bigger desk or get used to scaling.
No. My issue is I’m far sighted. So at 18” away from a 24” 1080p screen it looks like a screen door and I can see individual pixels and worse if I sit any closer. I want a screen that appears as sharp as my smartphone. The anti glare coatings don’t help but I don’t think I would want a full gloss monitor either. I know not an apples to apples comparison nor does it really require ‘retina’ resolution. I had a 23.8” 2560x1440 before the 27” 4K and it was decent also and I may go back to that someday also who knows. I think I had 125% scaling on the 2560x1440 and 150% on the 4K. I guess I misspoke a bit. 27” no issue for desk on its own but dual would be a bit overwhelming. Dual 23-24” screens would fit better. That’s the goal one day. Dual monitors.
 
D

Deleted member 193792

Guest
Then shit, guess I'm not using this. Whatever
You'll be forced to use DirectStorage as soon as next-gen games arrive (i.e. something like PS5 Ratchet with its portals).
 
Joined
Oct 1, 2006
Messages
4,930 (0.75/day)
Location
Hong Kong
Then shit, guess I'm not using this. Whatever
The point of this new API is to let your 980 Pro make an actual difference compared to SATA drives.
The current software stack is from a time when HDDs were common, so the overhead didn't matter.
At most you just need to install whatever game supports DirectStorage on your 980 Pro.
 
D

Deleted member 205776

Guest
At most you just need to install whatever game supports DirectStorage on your 980 Pro.
Games on a 500 GB drive that's already ~40% full? gl

You'll be forced to use DirectStorage as soon as next-gen games arrive (i.e. something like PS5 Ratchet with its portals).
Or, instead of pulling my hair out trying to swap NVMe drives with everything in the way (including a GPU that is stuck to my board), I'll continue using classic SATA SSDs like a caveman. I can't even put my 980 Pro on the southbridge.
 
Joined
Feb 1, 2019
Messages
3,523 (1.67/day)
Location
UK, Midlands
System Name Main PC
Processor 13700k
Motherboard Asrock Z690 Steel Legend D4 - Bios 13.02
Cooling Noctua NH-D15S
Memory 32 Gig 3200CL14
Video Card(s) 4080 RTX SUPER FE 16G
Storage 1TB 980 PRO, 2TB SN850X, 2TB DC P4600, 1TB 860 EVO, 2x 3TB WD Red, 2x 4TB WD Red
Display(s) LG 27GL850
Case Fractal Define R4
Audio Device(s) Soundblaster AE-9
Power Supply Antec HCG 750 Gold
Software Windows 10 21H2 LTSC
Not sure why people are surprised it will work on PCIe 3.0. Also, I see no reason why it wouldn't work on chipset NVMe ports.
 
D

Deleted member 193792

Guest
Not sure why people are surprised it will work on PCIe 3.0. Also, I see no reason why it wouldn't work on chipset NVMe ports.
Because the chipset (PCH/southbridge): 1) will be saturated with a decent PCIe 3.0 NVMe, 2) there is no direct connection to the GPU.
 
Joined
Feb 1, 2019
Messages
3,523 (1.67/day)
Location
UK, Midlands
Because the chipset (PCH/southbridge): 1) will be saturated with a decent PCIe 3.0 NVMe, 2) there is no direct connection to the GPU.

There isn't a direct connection to the GPU on CPU-attached NVMe either.

The reality is the PCH is mostly idle, which is why the current system works. It's oversubscribed in theory, but the vast majority of people are not fully utilising several PCH-connected devices at the same time. Also, I expect in most games a PCIe 3.0 drive would not typically be maxed out either. The benefit of DirectStorage is the extra I/O operations per second, not so much the overall burst bandwidth. It will work just fine, like how a 3080 can work fine on PCIe 3.0 x8.

On the Xbox it works via the PCH.
 
D

Deleted member 193792

Guest
1) There isn't a direct connection to the GPU on CPU-attached NVMe either.

2) The reality is the PCH is mostly idle, which is why the current system works. It's oversubscribed in theory, but the vast majority of people are not fully utilising several PCH-connected devices at the same time. Also, I expect in most games a PCIe 3.0 drive would not typically be maxed out either. The benefit of DirectStorage is the extra I/O operations per second, not so much the overall burst bandwidth. It will work just fine, like how a 3080 can work fine on PCIe 3.0 x8.

3) On the Xbox it works via the PCH.
1) Have you studied the PCIe root complex architecture? It's located in the SoC/uncore (previously called northbridge), so I'm afraid you're misinformed.

That's where the GPU is attached, along with NVMe (only for AM4/AMD Zen so far and some recent Intel platforms).

2) Nope. When Ratchet gets ported on PC, you'll understand what I'm talking about. You need raw bandwidth too for instant portal switching.

3) Have you studied the XBOX Series architecture? The NVMe is connected directly to the APU (SoC/uncore part), not the southbridge (that's a separate chip).

Pretty sure you haven't even seen XBOX Series PCB pics (there are 2 PCBs).

Come on guys, there's tons of info out there, educate yourselves! :)
 
Joined
Feb 29, 2020
Messages
3 (0.00/day)
1) Have you studied the PCIe root complex architecture? It's located in the SoC/uncore (previously called northbridge), so I'm afraid you're misinformed.

That's where the GPU is attached, along with NVMe (only for AM4/AMD Zen so far and some recent Intel platforms).

2) Nope. When Ratchet gets ported on PC, you'll understand what I'm talking about. You need raw bandwidth too for instant portal switching.

3) Have you studied the XBOX Series architecture? The NVMe is connected directly to the APU (SoC/uncore part), not the southbridge (that's a separate chip).

Pretty sure you haven't even seen XBOX Series PCB pics (there are 2 PCBs).

Come on guys, there's tons of info out there, educate yourselves! :)
The root complex, which is what the PCH is, can contain PCIe switches. Given that the PCH is identical to the I/O die on newer Ryzens, I believe we can assume support in this case.
Of course, those who use Linux are aware that it has always been ahead of the curve in terms of storage. I would advise reading up on fio and elbencho; the former also works on Windows. Billy Tallis of AnandTech has been posting a lot of Linux and storage-related content on Reddit, including async I/O.
One area is P2PDMA (or P2P DMA), which is basically what we have here and is associated with other technologies such as GPUDirect.
Notably, P2PDMA is compatible with a wide variety of chipsets, for example, "all AMD Zen chipsets."
 
D

Deleted member 193792

Guest
1) The root complex, which is what the PCH is, can contain PCIe switches.

2) Given that the PCH is identical to the I/O die on newer Ryzens, I believe we can assume support in this case.

3) Of course, those who use Linux are aware that it has always been ahead of the curve in terms of storage. I would advise reading up on fio and elbencho; the former also works on Windows. Billy Tallis of AnandTech has been posting a lot of Linux and storage-related content on Reddit, including async I/O.
One area is P2PDMA (or P2P DMA), which is basically what we have here and is associated with other technologies such as GPUDirect.
Notably, P2PDMA is compatible with a wide variety of chipsets, for example, "all AMD Zen chipsets."
1) https://en.wikipedia.org/wiki/Root_complex

"Root complex functionality may be implemented as a discrete device (northbridge chip), or may be integrated in the CPU."

It's a matter of pure geography: both the GPU and the SSD need to be as close as possible.

If you care to study console motherboards/PCBs, you'll notice that the SSD lanes (4 of them) lead straight to the APU chip, not the PCH.

I don't know why you have to confuse all these things.

Even if you have ample PCH bandwidth (like on TRX40), you're going to experience more latency if the SSD is not connected directly to the GPU via the SoC (PCIe root complex).

Game devs want guaranteed things: this means that if my PC has 6 x SATA HDDs in RAID0 seeding torrents, a TV tuner card recording stuff and a Gigabit Ethernet connection, saturation is inevitable.

The only way to guarantee (via DirectStorage API) zero saturation is by enforcing direct GPU <-> SSD communication via the SoC/northbridge. There's no other way.

There's a reason AMD dedicated 4 lanes to the NVMe since 2017. Intel was late in the game (Rocket Lake supports it, but only if the mobo has the actual PCB traces obviously).

2) X570 is not a normal chipset/southbridge, it's a hack.

B450/X470/B550 are the equivalent of southbridge for AMD.

Again: why do you have to confuse all these things?

AMD has 4 dedicated lanes since 2017. X570 is not needed and in fact many people avoid it (due to active cooling and a certain SATA bug).

3) Linux is a server-oriented OS, so of course it would have a more advanced I/O stack (among other things).
 
Joined
Feb 21, 2006
Messages
2,213 (0.32/day)
Location
Toronto, Ontario
System Name The Expanse
Processor AMD Ryzen 7 5800X3D
Motherboard Asus Prime X570-Pro BIOS 5013 AM4 AGESA V2 PI 1.2.0.Ca.
Cooling Corsair H150i Pro
Memory 32GB GSkill Trident RGB DDR4-3200 14-14-14-34-1T (B-Die)
Video Card(s) XFX Radeon RX 7900 XTX Magnetic Air (24.10.1)
Storage WD SN850X 2TB / Corsair MP600 1TB / Samsung 860Evo 1TB x2 Raid 0 / Asus NAS AS1004T V2 20TB
Display(s) LG 34GP83A-B 34 Inch 21: 9 UltraGear Curved QHD (3440 x 1440) 1ms Nano IPS 160Hz
Case Fractal Design Meshify S2
Audio Device(s) Creative X-Fi + Logitech Z-5500 + HS80 Wireless
Power Supply Corsair AX850 Titanium
Mouse Corsair Dark Core RGB SE
Keyboard Corsair K100
Software Windows 10 Pro x64 22H2
Benchmark Scores 3800X https://valid.x86.fr/1zr4a5 5800X https://valid.x86.fr/2dey9c 5800X3D https://valid.x86.fr/b7d
1) https://en.wikipedia.org/wiki/Root_complex

"Root complex functionality may be implemented as a discrete device (northbridge chip), or may be integrated in the CPU."

It's a matter of pure geography: both the GPU and the SSD need to be as close as possible.

If you care to study console motherboards/PCBs, you'll notice that the SSD lanes (4 of them) lead straight to the APU chip, not the PCH.

I don't know why you have to confuse all these things.

Even if you have ample PCH bandwidth (like on TRX40), you're going to experience more latency if the SSD is not connected directly to the GPU via the SoC (PCIe root complex).

Game devs want guaranteed things: this means that if my PC has 6 x SATA HDDs in RAID0 seeding torrents, a TV tuner card recording stuff and a Gigabit Ethernet connection, saturation is inevitable.

The only way to guarantee (via DirectStorage API) zero saturation is by enforcing direct GPU <-> SSD communication via the SoC/northbridge. There's no other way.

There's a reason AMD dedicated 4 lanes to the NVMe since 2017. Intel was late in the game (Rocket Lake supports it, but only if the mobo has the actual PCB traces obviously).

2) X570 is not a normal chipset/southbridge, it's a hack.

B450/X470/B550 are the equivalent of southbridge for AMD.

Again: why do you have to confuse all these things?

AMD has 4 dedicated lanes since 2017. X570 is not needed and in fact many people avoid it (due to active cooling and a certain SATA bug).

3) Linux is a server-oriented OS, so of course it would have a more advanced I/O stack (among other things).

What is the SATA bug on X570? Curious, I don't think I've heard of it.
 