D
Deleted member 6693
Guest
Jesus - that´s a stiff price for a small harddrive....
Last edited by a moderator:
Because it's very interesting! That's why.Why is this even an article?
Which would happen with ANY storage. VRAM is literally the fastest storage you can buy.Reading game data from VRAM, which has to be read to system memory before going back to the GPU
How? HDD's are much slower and SSD's, even the fastest, are still slow in comparison to VRAM. How is VRAM going to "hurt" performance?is only going to hurt performance.
You assume that game assets are not transferred directly to other sections of VRAM through special instruction operations, which would not be difficult. However, you have a point with the ram-disk.It's fancy that you can do this, but it's really a sub-par solution compared to just using a normal RAM disk.
While that would be true if the VRAM could not do transfers directly to itself, it can and with this VRAM-disk scheme likely does. This is an experimental thing. It's not being done because it's practical, it's being done for giggles. Lighten up a little bit.Accessing system memory is going to be far faster than doing twice the number of transfers over PCIe to and from the same device. This isn't a win for latency and it's not a win for bandwidth compared to the alternative.
System Name | Apollo |
---|---|
Processor | Intel Core i9 9880H |
Motherboard | Some proprietary Apple thing. |
Memory | 64GB DDR4-2667 |
Video Card(s) | AMD Radeon Pro 5600M, 8GB HBM2 |
Storage | 1TB Apple NVMe, 4TB External |
Display(s) | Laptop @ 3072x1920 + 2x LG 5k Ultrafine TB3 displays |
Case | MacBook Pro (16", 2019) |
Audio Device(s) | AirPods Pro, Sennheiser HD 380s w/ FIIO Alpen 2, or Logitech 2.1 Speakers |
Power Supply | 96w Power Adapter |
Mouse | Logitech MX Master 3 |
Keyboard | Logitech G915, GL Clicky |
Software | MacOS 12.1 |
It's interesting if you've been living under a rock. This isn't something new and it didn't get any better. It hurts performance because you're now sharing memory bandwidth with disk access in addition to GPU rendering. Loading content and rendering at the same time could cause a performance degradation between memory use and PCIe utilization because now content has to travel in both directions from and to the GPU. That's twice the number of transfers because there is no way for the GPU driver to know what's on the part of VRAM being used as a disk. If it were a mere copy, then I'd agree, but it's not. Disk emulation is involved which makes doing what you suggest not feasible, particularly if the content has be manipulated in some way before being loaded into VRAM. In short, a disk read (regardless of where that disk is,) always goes through system memory.Because it's very interesting! That's why.
Which would happen with ANY storage. VRAM is literally the fastest storage you can buy.
How? HDD's are much slower and SSD's, even the fastest, are still slow in comparison to VRAM. How is VRAM going to "hurt" performance?
You assume that game assets are not transferred directly to other sections of VRAM through special instruction operations, which would not be difficult. However, you have a point with the ram-disk.
While that would be true if the VRAM could not do transfers directly to itself, it can and with this VRAM-disk scheme likely does. This is an experimental thing. It's not being done because it's practical, it's being done for giggles. Lighten up a little bit.
System Name | Compy 386 |
---|---|
Processor | 7800X3D |
Motherboard | Asus |
Cooling | Air for now..... |
Memory | 64 GB DDR5 6400Mhz |
Video Card(s) | 7900XTX 310 Merc |
Storage | Samsung 990 2TB, 2 SP 2TB SSDs, 24TB Enterprise drives |
Display(s) | 55" Samsung 4K HDR |
Audio Device(s) | ATI HDMI |
Mouse | Logitech MX518 |
Keyboard | Razer |
Software | A lot. |
Benchmark Scores | Its fast. Enough. |
It's interesting if you've been living under a rock. This isn't something new and it didn't get any better. It hurts performance because you're now sharing memory bandwidth with disk access in addition to GPU rendering. Loading content and rendering at the same time could cause a performance degradation between memory use and PCIe utilization because now content has to travel in both directions from and to the GPU. That's twice the number of transfers because there is no way for the GPU driver to know what's on the part of VRAM being used as a disk. If it were a mere copy, then I'd agree, but it's not. Disk emulation is involved which makes doing what you suggest not feasible, particularly if the content has be manipulated in some way before being loaded into VRAM. In short, a disk read (regardless of where that disk is,) always goes through system memory.
I get that it was done for giggles, by why do giggles require a news article? There's nothing new going on here.
But again, it is very interesting and novel.I get that it was done for giggles, by why do giggles require a news article? There's nothing new going on here.
Prove it.@lexluthermiester almost all your points are wrong...
Did that.Read a couple of posts before you.
You seem to be missing a few conceptual points. So before telling me I'm wrong on all points, do some research.Maybe with DirectStorage and stuff like RTX IO it will be great, but surely not with GpuRamDrive in its current form. I wonder if that tool lets you assign more MB than VRAM available ...because quite possibly if it gets full you end up in RAM anyway and if it gets full you end up on NVMe/SSD/HDD or wherever your swap file is.
System Name | Apollo |
---|---|
Processor | Intel Core i9 9880H |
Motherboard | Some proprietary Apple thing. |
Memory | 64GB DDR4-2667 |
Video Card(s) | AMD Radeon Pro 5600M, 8GB HBM2 |
Storage | 1TB Apple NVMe, 4TB External |
Display(s) | Laptop @ 3072x1920 + 2x LG 5k Ultrafine TB3 displays |
Case | MacBook Pro (16", 2019) |
Audio Device(s) | AirPods Pro, Sennheiser HD 380s w/ FIIO Alpen 2, or Logitech 2.1 Speakers |
Power Supply | 96w Power Adapter |
Mouse | Logitech MX Master 3 |
Keyboard | Logitech G915, GL Clicky |
Software | MacOS 12.1 |
Except it's not novel. This has been around for a while. The only part that is novel is this particular GPU.and novel.
Not to you maybe...Except it's not novel. This has been around for a while.
And to be fair, no one has ever done this particular thing. Installing and running a game as big and complex as Crysis3 from VRAM?The only part that hasn't is this particular GPU.
System Name | Apollo |
---|---|
Processor | Intel Core i9 9880H |
Motherboard | Some proprietary Apple thing. |
Memory | 64GB DDR4-2667 |
Video Card(s) | AMD Radeon Pro 5600M, 8GB HBM2 |
Storage | 1TB Apple NVMe, 4TB External |
Display(s) | Laptop @ 3072x1920 + 2x LG 5k Ultrafine TB3 displays |
Case | MacBook Pro (16", 2019) |
Audio Device(s) | AirPods Pro, Sennheiser HD 380s w/ FIIO Alpen 2, or Logitech 2.1 Speakers |
Power Supply | 96w Power Adapter |
Mouse | Logitech MX Master 3 |
Keyboard | Logitech G915, GL Clicky |
Software | MacOS 12.1 |
Would it though? I don't really see it changing anything. I still would expect a ram disk to be faster and cheaper. Latency doesn't disappear because you use a card with more VRAM.Not to you maybe...
And to be fair, no one has ever done this particular thing. Installing and running a game as big and complex as Crysis3 from VRAM?
What I want to see is someone do something like this with one of those incoming 48GB/64GB Quadro cards. That would be fascinating!
System Name | WS#1337 |
---|---|
Processor | Ryzen 7 5700X3D |
Motherboard | ASUS X570-PLUS TUF Gaming |
Cooling | Xigmatek Scylla 240mm AIO |
Memory | 64GB DDR4-3600(4x16) |
Video Card(s) | MSI RTX 3070 Gaming X Trio |
Storage | ADATA Legend 2TB |
Display(s) | Samsung Viewfinity Ultra S6 (34" UW) |
Case | ghetto CM Cosmos RC-1000 |
Audio Device(s) | ALC1220 |
Power Supply | SeaSonic SSR-550FX (80+ GOLD) |
Mouse | Logitech G603 |
Keyboard | Modecom Volcano Blade (Kailh choc LP) |
VR HMD | Google dreamview headset(aka fancy cardboard) |
Software | Windows 11, Ubuntu 24.04 LTS |
Drawing poop emojii with a broken goose feather held in your left foot is also interesting and novel to some people, doesn't make it any more useful or practical.But again, it is very interesting and novel.
System Name | Apollo |
---|---|
Processor | Intel Core i9 9880H |
Motherboard | Some proprietary Apple thing. |
Memory | 64GB DDR4-2667 |
Video Card(s) | AMD Radeon Pro 5600M, 8GB HBM2 |
Storage | 1TB Apple NVMe, 4TB External |
Display(s) | Laptop @ 3072x1920 + 2x LG 5k Ultrafine TB3 displays |
Case | MacBook Pro (16", 2019) |
Audio Device(s) | AirPods Pro, Sennheiser HD 380s w/ FIIO Alpen 2, or Logitech 2.1 Speakers |
Power Supply | 96w Power Adapter |
Mouse | Logitech MX Master 3 |
Keyboard | Logitech G915, GL Clicky |
Software | MacOS 12.1 |
Precisely. If people really care about making things go fast, a ram disk is the way to do it. Nothing, and I mean nothing, will be faster than direct access to physical memory. There is no interconnect that has lower latency and higher bandwidth than accessing DRAM directly. There just isn't. Here's the rub though, because even that doesn't matter because you still need to copy game data from somewhere to put it into a ram disk or a "vram disk". New game or a restart means a new copy. You're still constrained by the media that the game data is on and you have to wait longer to get going.Drawing poop emojii with a broken goose feather held in your left foot is also interesting and novel to some people, doesn't make it any more useful or practical.
This was just an old amateur concept that hasn't been updated in several years (for a good reason). The main issue is that it still uses RAM for data exchange, so basically it works like a conventional RAM disk that needs slightly less memory space, but uses GPU as temporary storage. Adding several more steps to read/write process only makes it drastically slower than RAM disk (to the point where GDDR5 is slower than NVME). Looked through that code and even though I haven't touched CUDA or even C++ in years, I can already see some issues.
I'm sure there are much better and efficient ways to make this work, but I still don't see any reasons to do so... Heck, NVME has already saturated PCIe 3.0 bandwidth, and PCIe 4.0 isn't even at the full swing yet. Regardless of how fast GDDR5/6/7... or HBM is on paper, it's only gonna be that fast from the perspective of the GPU. For the rest of the system it's gonna be only as fast as PCIe and a shitton of abstraction layers will allow it to be. Basically, what I'm trying to say is that you can't make it faster than NVME RAID, even less so - RAM disk. That's why AMD stuck their guns to hybrid solutions, like Radeon Pro SSG. At least for now this approach makes a bit more sense, when you actually need to have "storage" on GPU.
I think you're overlooking the CPU and memory overhead of a actual system based ram disk. This would still have some of that loading up the VRAM initially, but after that it would pretty much run off it's own GPU resources and that's part of the beauty of it. Hell with a 24GB GPU you could probably load a copy windows 10 onto it especially since it's paired with ImDisk to begin with which is VHD friendly. It might not work with a fully patch Windows 10 Pro though or perhaps not without stripping down a few parts of it at least. It kind of read on the line of feasible/infeasible for that purpose though Windows 10 home edition would be a bit trimmed down anyway and should work. There is still no denying it's really interesting. You could utilize it for Prefetch/ReadyBoot.etl or most likely virtual memory assuming that can be pointed to it as well or not. I imagine StoreMi or PrimoCache would play nice with it as well. If I'm not mistaken once the data is copied to this type of VRAM device it should actually be quicker than system memory between the two which if that's the case this isn't bad at all. NVMe I don't believe is going to have the I/O of this kind of device if I'm not mistaken much like it can't come close to competing with system memory in that area it gets trounced.Precisely. If people really care about making things go fast, a ram disk is the way to do it. Nothing, and I mean nothing, will be faster than direct access to physical memory. There is no interconnect that has lower latency and higher bandwidth than accessing DRAM directly. There just isn't. Here's the rub though, because even that doesn't matter because you still need to copy game data from somewhere to put it into a ram disk or a "vram disk". New game or a restart means a new copy. You're still constrained by the media that the game data is on and you have to wait longer to get going.
So, in summary:
Fun level of doing this if it's novel to you and you get excited by this kind of thing: High
Practical usefulness of doing this when you have a NVMe drive: Never
System Name | Apollo |
---|---|
Processor | Intel Core i9 9880H |
Motherboard | Some proprietary Apple thing. |
Memory | 64GB DDR4-2667 |
Video Card(s) | AMD Radeon Pro 5600M, 8GB HBM2 |
Storage | 1TB Apple NVMe, 4TB External |
Display(s) | Laptop @ 3072x1920 + 2x LG 5k Ultrafine TB3 displays |
Case | MacBook Pro (16", 2019) |
Audio Device(s) | AirPods Pro, Sennheiser HD 380s w/ FIIO Alpen 2, or Logitech 2.1 Speakers |
Power Supply | 96w Power Adapter |
Mouse | Logitech MX Master 3 |
Keyboard | Logitech G915, GL Clicky |
Software | MacOS 12.1 |
I'm not arguing, I'm agreeing. Once again:You two can argue how pointless or uninteresting it is till the cows come home. The rest of us will continue to find it interesting.
Drawing poop emojii with a broken goose feather held in your left foot is also interesting and novel to some people, doesn't make it any more useful or practical.
System Name | Apollo |
---|---|
Processor | Intel Core i9 9880H |
Motherboard | Some proprietary Apple thing. |
Memory | 64GB DDR4-2667 |
Video Card(s) | AMD Radeon Pro 5600M, 8GB HBM2 |
Storage | 1TB Apple NVMe, 4TB External |
Display(s) | Laptop @ 3072x1920 + 2x LG 5k Ultrafine TB3 displays |
Case | MacBook Pro (16", 2019) |
Audio Device(s) | AirPods Pro, Sennheiser HD 380s w/ FIIO Alpen 2, or Logitech 2.1 Speakers |
Power Supply | 96w Power Adapter |
Mouse | Logitech MX Master 3 |
Keyboard | Logitech G915, GL Clicky |
Software | MacOS 12.1 |
That isn't how it works. You're welcome to prove me wrong by demonstrating how it's possible by actually doing it.Whoa I never knew about this GPU RAM DRIVE...I knew you could that kind of thing in Linux, but I hadn't ever seen this in Windows. Well that's pretty damn cool been looking for this sort of thing on windows for quite a few years didn't know someone had finally come up with solution though.
I think you're overlooking the CPU and memory overhead of a actual system based ram disk. This would still have some of that loading up the VRAM initially, but after that it would pretty much run off it's own GPU resources and that's part of the beauty of it. Hell with a 24GB GPU you could probably load a copy windows 10 onto it especially since it's paired with ImDisk to begin with which is VHD friendly. It might not work with a fully patch Windows 10 Pro though or perhaps not without stripping down a few parts of it at least. It kind of read on the line of feasible/infeasible for that purpose though Windows 10 home edition would be a bit trimmed down anyway and should work. There is still no denying it's really interesting. You could utilize it for Prefetch/ReadyBoot.etl or most likely virtual memory assuming that can be pointed to it as well or not. I imagine StoreMi or PrimoCache would play nice with it as well. If I'm not mistaken once the data is copied to this type of VRAM device it should actually be quicker than system memory between the two which if that's the case this isn't bad at all. NVMe I don't believe is going to have the I/O of this kind of device if I'm not mistaken much like it can't come close to competing with system memory in that area it gets trounced.
System Name | Pioneer |
---|---|
Processor | Ryzen 9 9950X |
Motherboard | MSI MAG X670E Tomahawk Wifi |
Cooling | Noctua NH-D15 + A whole lotta Sunon, Phanteks and Corsair Maglev blower fans... |
Memory | 128GB (4x 32GB) G.Skill Flare X5 @ DDR5-4000 (Running 1:1:1 to FCLK) |
Video Card(s) | XFX RX 7900 XTX Speedster Merc 310 |
Storage | Intel 5800X Optane 800GB boot, +2x Crucial P5 Plus 2TB PCIe 4.0 NVMe SSDs, 1x 2TB Seagate Exos 3.5" |
Display(s) | 55" LG 55" B9 OLED 4K Display |
Case | Thermaltake Core X31 |
Audio Device(s) | TOSLINK->Schiit Modi MB->Asgard 2 DAC Amp->AKG Pro K712 Headphones or HDMI->B9 OLED |
Power Supply | FSP Hydro Ti Pro 850W |
Mouse | Logitech G305 Lightspeed Wireless |
Keyboard | WASD Code v3 with Cherry Green keyswitches + PBT DS keycaps |
Software | Gentoo Linux x64 |
Rhyming slang?
Trouble & strife = Wife.
System Name | WS#1337 |
---|---|
Processor | Ryzen 7 5700X3D |
Motherboard | ASUS X570-PLUS TUF Gaming |
Cooling | Xigmatek Scylla 240mm AIO |
Memory | 64GB DDR4-3600(4x16) |
Video Card(s) | MSI RTX 3070 Gaming X Trio |
Storage | ADATA Legend 2TB |
Display(s) | Samsung Viewfinity Ultra S6 (34" UW) |
Case | ghetto CM Cosmos RC-1000 |
Audio Device(s) | ALC1220 |
Power Supply | SeaSonic SSR-550FX (80+ GOLD) |
Mouse | Logitech G603 |
Keyboard | Modecom Volcano Blade (Kailh choc LP) |
VR HMD | Google dreamview headset(aka fancy cardboard) |
Software | Windows 11, Ubuntu 24.04 LTS |
SLI doesn't "double" your video memory. Think of it as RAID-1, but with videocards.Wonder what happens if you SLI two cards does bandwidth and/or capacity increase?
Re-read my post above. All of your data is going through RAM either way. Plus, those old "DDR2" boards usually have PCIe 1.1, which is another perf gimp. This concept is physically incapable of being faster than RAM disk on any given machine, just because of the way it works.Something certainly worth noting is it could be used on a older system hell in LGA775 if you were on a DDR2 board it might even be faster than the system memory crazy as that use case is.
Processor | 5900x |
---|---|
Motherboard | MSI MEG UNIFY |
Cooling | Arctic Liquid Freezer 2 360mm |
Memory | 4x8GB 3600c16 Ballistix |
Video Card(s) | EVGA 3080 FTW3 Ultra |
Storage | 1TB SX8200 Pro, 2TB SanDisk Ultra 3D, 6TB WD Red Pro |
Display(s) | Acer XV272U |
Case | Fractal Design Meshify 2 |
Power Supply | Corsair RM850x |
Mouse | Logitech G502 Hero |
Keyboard | Ducky One 2 |
Doesn't modern NV-Link allow pooling of memory?SLI doesn't "double" your video memory. Think of it as RAID-1, but with videocards.
System Name | WS#1337 |
---|---|
Processor | Ryzen 7 5700X3D |
Motherboard | ASUS X570-PLUS TUF Gaming |
Cooling | Xigmatek Scylla 240mm AIO |
Memory | 64GB DDR4-3600(4x16) |
Video Card(s) | MSI RTX 3070 Gaming X Trio |
Storage | ADATA Legend 2TB |
Display(s) | Samsung Viewfinity Ultra S6 (34" UW) |
Case | ghetto CM Cosmos RC-1000 |
Audio Device(s) | ALC1220 |
Power Supply | SeaSonic SSR-550FX (80+ GOLD) |
Mouse | Logitech G603 |
Keyboard | Modecom Volcano Blade (Kailh choc LP) |
VR HMD | Google dreamview headset(aka fancy cardboard) |
Software | Windows 11, Ubuntu 24.04 LTS |
Yes, it does, but it will make little to no difference. In a typical PC NVlink only helps GPUs to talk to each other, and CPU still uses PCIe bus to talk to GPUs.Doesn't modern NV-Link allow pooling of memory?
Your analogy is VERY flawed. If you were to compare SLI to RAID, it would be RAID0 as you are adding the capacity of one card to another not mirroring one card with another as would be done with RAID1. And yes, the VRAM doubles. In the case of the RTX3090, 24GB + 24GB = 48GB.SLI doesn't "double" your video memory. Think of it as RAID-1, but with videocards.