• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

PGL Investigating GeForce RTX 4080 GPU Driver Crash, Following Esports Event Disruption

Joined
Jun 18, 2021
Messages
2,567 (2.01/day)
Lol, should have used enterprise GPU's with ECC memory :D

It sucks it might have cost them the win but shit happens, it's consumer hardware what were they expecting!?
 
Joined
Aug 20, 2007
Messages
21,531 (3.40/day)
System Name Pioneer
Processor Ryzen R9 9950X
Motherboard GIGABYTE Aorus Elite X670 AX
Cooling Noctua NH-D15 + A whole lotta Sunon and Corsair Maglev blower fans...
Memory 64GB (4x 16GB) G.Skill Flare X5 @ DDR5-6000 CL30
Video Card(s) XFX RX 7900 XTX Speedster Merc 310
Storage Intel 905p Optane 960GB boot, +2x Crucial P5 Plus 2TB PCIe 4.0 NVMe SSDs
Display(s) 55" LG 55" B9 OLED 4K Display
Case Thermaltake Core X31
Audio Device(s) TOSLINK->Schiit Modi MB->Asgard 2 DAC Amp->AKG Pro K712 Headphones or HDMI->B9 OLED
Power Supply FSP Hydro Ti Pro 850W
Mouse Logitech G305 Lightspeed Wireless
Keyboard WASD Code v3 with Cherry Green keyswitches + PBT DS keycaps
Software Gentoo Linux x64 / Windows 11 Enterprise IoT 2024
Actually that’s exactly what they are saying. The delusion is high.
I have literally never seen that beyond obvious trolls.
 
Joined
Dec 30, 2019
Messages
145 (0.08/day)
If they were serious about stability, they would under-clock everything. Even then, as pointed out, solar flares. Run the game on a server processor with ECC memory. Get nVidia™ to enable ECC on the VRAM. Maybe even shielding the cases or building from high energy particles.

But no, the selection of hardware is just another ad space.
 
Last edited:
Joined
Dec 26, 2006
Messages
3,859 (0.59/day)
Location
Northern Ontario Canada
Processor Ryzen 5700x
Motherboard Gigabyte X570S Aero G R1.1 BiosF5g
Cooling Noctua NH-C12P SE14 w/ NF-A15 HS-PWM Fan 1500rpm
Memory Micron DDR4-3200 2x32GB D.S. D.R. (CT2K32G4DFD832A)
Video Card(s) AMD RX 6800 - Asus Tuf
Storage Kingston KC3000 1TB & 2TB & 4TB Corsair MP600 Pro LPX
Display(s) LG 27UL550-W (27" 4k)
Case Be Quiet Pure Base 600 (no window)
Audio Device(s) Realtek ALC1220-VB
Power Supply SuperFlower Leadex V Gold Pro 850W ATX Ver2.52
Mouse Mionix Naos Pro
Keyboard Corsair Strafe with browns
Software W10 22H2 Pro x64
Crashes happen.

But when money (potential) is involved..............stuff gets real!! ;)
 
Joined
Sep 26, 2012
Messages
871 (0.19/day)
Location
Australia
System Name ATHENA
Processor AMD 7950X
Motherboard ASUS Crosshair X670E Extreme
Cooling ASUS ROG Ryujin III 360, 13 x Lian Li P28
Memory 2x32GB Trident Z RGB 6000Mhz CL30
Video Card(s) ASUS 4090 STRIX
Storage 3 x Kingston Fury 4TB, 4 x Samsung 870 QVO
Display(s) Acer X38S, Wacom Cintiq Pro 15
Case Lian Li O11 Dynamic EVO
Audio Device(s) Topping DX9, Fluid FPX7 Fader Pro, Beyerdynamic T1 G2, Beyerdynamic MMX300
Power Supply Seasonic PRIME TX-1600
Mouse Xtrfy MZ1 - Zy' Rail, Logitech MX Vertical, Logitech MX Master 3
Keyboard Logitech G915 TKL
VR HMD Oculus Quest 2
Software Windows 11 + Universal Blue
Lol, should have used enterprise GPU's with ECC memory :D

It sucks it might have cost them the win but shit happens, it's consumer hardware what were they expecting!?

4090's can enable ECC. Not sure if it would have saved the crash in this instance, but it could have helped.
 
Joined
Jun 10, 2014
Messages
2,987 (0.78/day)
Processor AMD Ryzen 9 5900X ||| Intel Core i7-3930K
Motherboard ASUS ProArt B550-CREATOR ||| Asus P9X79 WS
Cooling Noctua NH-U14S ||| Be Quiet Pure Rock
Memory Crucial 2 x 16 GB 3200 MHz ||| Corsair 8 x 8 GB 1333 MHz
Video Card(s) MSI GTX 1060 3GB ||| MSI GTX 680 4GB
Storage Samsung 970 PRO 512 GB + 1 TB ||| Intel 545s 512 GB + 256 GB
Display(s) Asus ROG Swift PG278QR 27" ||| Eizo EV2416W 24"
Case Fractal Design Define 7 XL x 2
Audio Device(s) Cambridge Audio DacMagic Plus
Power Supply Seasonic Focus PX-850 x 2
Mouse Razer Abyssus
Keyboard CM Storm QuickFire XT
Software Ubuntu
If they were serious about stability, they would under-clock everything. Even then, as pointed out, solar flares. Run the game on a server processor with ECC memory. Get nVidia™ to enable ECC on the VRAM. Maybe even shielding the cases or building from high energy particles.
Like bit-flips from cosmic radiation during a time span of a few minutes would matter. :rolleyes:
Sounds fairly far fetched compared to other risks.

But what they could do is at least run the games on Linux (for the games which supports it), which is a much more stable OS than Windows, not to mention Nvidia's Linux driver is in all seriousness even more solid than their Windows counterpart. :cool:
 
Joined
Jun 18, 2021
Messages
2,567 (2.01/day)
Run the game on a server processor with ECC memory. Get nVidia™ to enable ECC on the VRAM

They woundn't even need server/workstation processors necessarily since they're using ryzen, and even with Intel they would just need to look for workstation grade boards (which wouldn't be that more expensive than the 600$+ top of the line motherboards they're probably using for looks and flair)

4090's can enable ECC. Not sure if it would have saved the crash in this instance, but it could have helped.

They're using 4080's, but 4090's come with ECC but with the error correction disabled!? That's so fucking stupid, they payed for the premium memory and simply disabled it out of spite? :wtf:

Like bit-flips from cosmic radiation during a time span of a few minutes would matter. :rolleyes:
Sounds fairly far fetched compared to other risks.

It's not just solar flares, on a live event environment like a stadium or whatever there's many sources of electric noise that is enough to push something that would otherwise be stable to crash. Even alone at home, ECC is the better solution if you're looking for stability and it's sad that thanks to mainly Intel this got mostly gatekeeped to workstations when it should be a standard feature.
 
Joined
Jun 10, 2014
Messages
2,987 (0.78/day)
Processor AMD Ryzen 9 5900X ||| Intel Core i7-3930K
Motherboard ASUS ProArt B550-CREATOR ||| Asus P9X79 WS
Cooling Noctua NH-U14S ||| Be Quiet Pure Rock
Memory Crucial 2 x 16 GB 3200 MHz ||| Corsair 8 x 8 GB 1333 MHz
Video Card(s) MSI GTX 1060 3GB ||| MSI GTX 680 4GB
Storage Samsung 970 PRO 512 GB + 1 TB ||| Intel 545s 512 GB + 256 GB
Display(s) Asus ROG Swift PG278QR 27" ||| Eizo EV2416W 24"
Case Fractal Design Define 7 XL x 2
Audio Device(s) Cambridge Audio DacMagic Plus
Power Supply Seasonic Focus PX-850 x 2
Mouse Razer Abyssus
Keyboard CM Storm QuickFire XT
Software Ubuntu
It's not just solar flares, on a live event environment like a stadium or whatever there's many sources of electric noise that is enough to push something that would otherwise be stable to crash.
Even if the entire audience brings 10 cell phones each, the distance is going to make the electromagnetic noise from these insignificant in this regard. Anything that does have an affect must be very close and have a strong field.

Even alone at home, ECC is the better solution if you're looking for stability and it's sad that thanks to mainly Intel this got mostly gatekeeped to workstations when it should be a standard feature.
Don't get me wrong, I'm a big fan of ECC, and would strongly consider it for anything productive.
But the chances of ECC preventing crashes like this in this time frame is very unlikely.

There are much that could be hardened on the software side though, including drivers, the OS and how drivers work in Windows, and of course the games themselves (speaking generally, not this case specifically).
 
Joined
Jun 18, 2021
Messages
2,567 (2.01/day)
Even if the entire audience brings 10 cell phones each, the distance is going to make the electromagnetic noise from these insignificant in this regard. Anything that does have an affect must be very close and have a strong field.

The biggest problem is not cell phones, is all the lighting around the stage for example.

Of course a computer can crash for many and any number of reasons, but if they're doing a live event with money involved they should be using enterprise workstation gear, not consumer grade gaming stuff. Play stupid games win stupid prizes.
 
Joined
Mar 28, 2020
Messages
1,759 (1.02/day)
Whenever my PC crashes, I always call Jensen Huang personally, and demand he tells me why it crashed. And that he brings his leather jacket and fixes it immediately! :rolleyes:
(sarcasm)

So a PC crashed during a tournament, this happens all the time…


Depends on whether the problem is reproducible or not. If it is, and only on this particular PC, then it's a hardware issue. The organizers must be professional enough to have an image for the software setup for all the tournament PCs, so configuration issues should be eliminated. And they probably have spares if one fails.
If the hardware is not at fault, then it could be either the driver or a bug in the OS.
Either way, if this is an obscure and hard to reproduce bug, then I doubt the dumps from the BSOD is going to result in something useful.
To be objective, I don't disagree with what you said. A PC will crash for many reasons and in this case, it may be due to other factors that eventually caused the GPU to crash. I mean they confirmed that GPU crashed, so I am going along with the narrative. However, this is where I find very disturbing. Because when there is news of AMD GPU crash, the blame will immediately be on AMD. So not sure what is the difference here? In other word, AMD GPU/ driver crash = AMD is bad, while Nvidia GPU/ driver crash = must be other factors. See the hypocrisy here? Not directing this at you, but this is the general observation of people's reaction to these sorts of news.

If they were serious about stability, they would under-clock everything. Even then, as pointed out, solar flares. Run the game on a server processor with ECC memory. Get nVidia™ to enable ECC on the VRAM. Maybe even shielding the cases or building from high energy particles.

But no, the selection of hardware is just another ad space.
There is no perfection. You can do and plan the best that you can, but that does not mean it will go according to plan. Case in point here where they have "optimized" the PCs, but there is really nothing you can do when you don't know what will go wrong.
 
Joined
Jun 10, 2014
Messages
2,987 (0.78/day)
Processor AMD Ryzen 9 5900X ||| Intel Core i7-3930K
Motherboard ASUS ProArt B550-CREATOR ||| Asus P9X79 WS
Cooling Noctua NH-U14S ||| Be Quiet Pure Rock
Memory Crucial 2 x 16 GB 3200 MHz ||| Corsair 8 x 8 GB 1333 MHz
Video Card(s) MSI GTX 1060 3GB ||| MSI GTX 680 4GB
Storage Samsung 970 PRO 512 GB + 1 TB ||| Intel 545s 512 GB + 256 GB
Display(s) Asus ROG Swift PG278QR 27" ||| Eizo EV2416W 24"
Case Fractal Design Define 7 XL x 2
Audio Device(s) Cambridge Audio DacMagic Plus
Power Supply Seasonic Focus PX-850 x 2
Mouse Razer Abyssus
Keyboard CM Storm QuickFire XT
Software Ubuntu
Because when there is news of AMD GPU crash, the blame will immediately be on AMD. So not sure what is the difference here? In other word, AMD GPU/ driver crash = AMD is bad, while Nvidia GPU/ driver crash = must be other factors. See the hypocrisy here? Not directing this at you, but this is the general observation of people's reaction to these sorts of news.
Please stop it with the straw man argument here. I've seen no one seriously making that argument, so there is no hypocrisy here.
It would be different if the article were phrased in a way to excuse Nvidia, but it isn't, in fact it only focuses on the possibility of the driver being responsible. So again, no hypocrisy.

There are three factors that could make the driver crash; the driver, the hardware or the OS. And this holds true for all GPU vendors. Notice I'm not listing the game, as a driver should handle a "misbehaving" application, so if a driver crashes due to a game bug, it's still a driver bug.

And I also still think this isn't newsworthy regardless, a PC crashed randomly during gaming, wow!
(If it happened during a special presentation, it would have been a little funny though.)
 
Last edited:
Joined
Jun 18, 2021
Messages
2,567 (2.01/day)
And I also still think this isn't newsworthy regardless, a PC crashed randomly during gaming, wow!
(If it happened during a special presentation, it would have been a little funny though.)

That's a classic and to be fair I don't even know if the drop in market value of Microsoft at the time isn't bigger than the prize money of this competition :D

But that's the thing, when there's money involved the stakes are much higher
 
Joined
Dec 30, 2019
Messages
145 (0.08/day)
Just highlights the difference between general computing, servers with failover, and life-critical computing for an airliner or chemotherapy machine.

The way consumer hardware and software is designed, bugs are an annoyance, not a mission critical event.
 
Joined
Feb 14, 2012
Messages
2,356 (0.50/day)
System Name msdos
Processor 8086
Motherboard mainboard
Cooling passive
Memory 640KB + 384KB extended
Video Card(s) EGA
Storage 5.25"
Display(s) 80x25
Case plastic
Audio Device(s) modchip
Power Supply 45 watts
Mouse serial
Keyboard yes
Software disk commander
Benchmark Scores still running
I feel like a PC crash outside of the player's control should not impact the results. Should require a rematch. Or a game that has proper multi-player checkpoint that can be restarted when >$1M is on the line.
 
Top