Monday, September 28th 2020

NVIDIA AIC Partners Clarify RTX 3080/3090 Crash to Desktop Issues, Capacitor Choices

(UPDATE 28SEPT 16H31 GMT: Updated the MSI section with changes in the RTX 3080 Gaming X Trio store page).

Compounding the limited availability with the crash to desktop issues users have been experiencing with NVIDIA's recent RTX 3080/3090 graphics cards have led to rivers of digital ink being run on NVIDIA's latest RTX-30 series. After we've reported on NVIDIA's PG132 "Base Design" and manufacturer-specific capacitor choices and circuitry, we've now seen many of NVIDIA's AIC partners actually respond to this issue, clarifying their choices in this specific part of RTX 30-series board design, as well as the steps they've taken (if any) so as to help solve the issues (which are thus confirmed as being somewhat related to these capacitor choices, even if they are not the root cause.)
Some AICs have spoken out regarding this issue, saying that their cards have no issues; others clarified that there were indeed verifiable problems in some of their early board designs due to absence of MLCC (Multilayer Ceramic Chip Capacitors), which are more capable of filtering high frequencies than POSCAPs/SP-CAPs are (although these do have some other characteristics that make them desirable for this task). NVIDIA, however, still haven't issued a statement on this issue, even though there are some reports of RTX 3080 FE cards crashing as well. Problems may arise in insufficient broadband frequency cleanup at high operating frequencies (2 GHz+) for GA-102 GPUs. These problems seem somewhat alleviated in boards featuring MLCC components on their design, though this doesn't mean they are automatically immune or that this is the root of the problem (word is running that this might be caused by overzealous Boost algorithms leading chips to error-prone operating frequencies, but there is no evidence on any of this as of yet). Partners' responses are shared after the break, organized by manufacturer.

ASUS
ASUS was one of the manufacturers to ship a fully MLCC capacitor design with their Strix Gaming OC graphics cards, and haven't issued a statement yet. The company supposedly detected problems in this area while internally testing the RTX 30-series, and chose to do away with any POSCAP/SP-CAP in their highest-tier designs.
COLORFUL
The manufacturer was the first to report on an issue to the press. Review samples that had already been sent out, were later recalled.

EVGA
Recently there has been some discussion about the EVGA GeForce RTX 3080 series.

During our mass production QC testing we discovered a full 6 POSCAP solution cannot pass the real world applications testing. It took almost a week of R&D effort to find the cause and reduce the POSCAPs to 4 and add 20 MLCC caps prior to shipping production boards, this is why the EVGA GeForce RTX 3080 FTW3 series was delayed at launch. There were no 6 POSCAP production EVGA GeForce RTX 3080 FTW3 boards shipped.

But, due to the time crunch, some of the reviewers were sent a pre-production version with 6 POSCAP's, we are working with those reviewers directly to replace their boards with production versions.
EVGA GeForce RTX 3080 XC3 series with 5 POSCAPs + 10 MLCC solution is matched with the XC3 spec without issues.

Also note that we have updated the product pictures at EVGA.com to reflect the production components that shipped to gamers and enthusiasts since day 1 of product launch. Once you receive the card you can compare for yourself, EVGA stands behind its products!
— Jacob Freeman, EVGA Forums
GAINWARD
Announcement on SP-CAP Capacitors and MLCC Capacitors of Gainsun 30 Series Graphics Card Products

Dear Gainward consumer players:

Thanks to the friends who bought and supported Gainward. Recently, we received the voice of market players' inquiries. Many players are very concerned about our company's just released 30 series products. Regarding the specific usage of the capacitors on the back of the chip, we hereby explain the situation:

All the RTX 3080 10 GB graphics cards released by Gainward currently use 5 SP-CAP capacitors on the back of the chip and 10 MLCC capacitors. The versions currently on the market are all the original commercial versions.

All the RTX 3090 graphics cards released by Gainward currently use 4 SP-CAP capacitors on the back of the chip and 20 MLCC capacitors. The versions currently on the market are all the original commercial versions.

As a long-term AIC partner of Nvidia, Gainward has always been adhering to the product standard to design and produce completely according to Nvidia's requirements. Therefore, currently players are concerned about the problem of capacitors and new product failures on the Internet. Currently Gainward has not generated such feedback..

In addition, all Gainward graphics card products support three-year warranty and personal warranty service. Thank you consumers and players for your support and love to Gainward.

(automatic translation from Chinese)
GALAX
About the SP-CAP capacitors and MLCC capacitors of GALAXY RTX 3080/3090 products

Dear player friends:

Hello, everyone. Recently, many users have come to inquire about the specific usage of the capacitors on the back of the GALAXY RTX 3080/3090 series of graphics chips. After verification, about the RTX 3080/3090 released by GALAXY. The capacitors used on the back of the model chip are as follows:

1. GALAXY RTX 3080 Heijiang/Metal Master product, the number of SP-CAP capacitors on the back of the chip: 5, the number of MLCC capacitors: a set of 10. This version is currently on sale and is the original commercial version.

2. GALAXY RTX 3090 General/Metal Master product, the number of SP-CAP capacitors on the back of the chip: 4, the number of MLCC capacitors: two groups of 20. This version is currently on sale and is the original commercial version.

3. GALAX RTX 3090 GAMER trial production samples, currently only 6 pieces are in the hands of the media and KOL. The first batch of this sample uses 6 SP-CAP capacitors. After confirmation, the GAMER products officially produced and sold will be used for capacitor materials. Make optimization improvements. Note: This product is not currently on sale.

I am very grateful to the players and friends for their support and love to GALAXY. GALAXY is also consistent in its pursuit of product quality. It is our glorious mission to provide you with better and stronger hardware. In addition, the current full range of GALAXY graphics card products support three-year warranty and personal warranty service. If you have other doubts or questions, please feel free to leave us a message to discuss, thank you!

(automatic translation from Chinese)
INNO3D

MSI
MSI acknowledged the issue during the MSI Insider Livestream, suggested it could be a driver issue. In our in-depth review, we found that MSI uses a single set of 10 MLCC capacitors and 5 POSCAPs/SP-CAPs in their design.
UPDATE SET 28th 16H31 GMT: MSI have reportedly quietly updated their RTX 3080 Gaming X Trio product page, indicating that an engineering change has been done to their boards' design. While previously MSI's own photos of the circuitry on the back of the cards' PCSB only showed a 5x POSCAP/SP-CAP and 1x MLCC array design (you can see this exact design in our review of the MSI RTX 3080 Gaming X Trio), the new product page images now feature an updated 4x POSCAP/SP-CAP and 2x MLCC array design, exactly like the one in NVIDIA's FE. This change was done silently, and it likely means new revisions of the cards are already shipping.

I write reportedly, because when one loads up MSI's product page for the RTX 3080 Gaming X Trio, the picture of the back of the PCB is now no longer available. However, the same picture is available in the company's RTX 3090 Gaming X Trio graphics card, which already featured this 4x POSCAP/SP-CAP and 2x MLCC design. In fact, MSI has now removed the backplate photograph in all of their RTX 3080 products.
ZOTAC
ZOTAC is the only AIC we've reviewed a card from who employs a full 6x POSCAP/SP-CAP solution, absent of any MLCC capacitor array. The company on Twitter said that they are investigating the issue, and promise to keep in touch with all RTX 30-series buyers who experience any issues.
Sources: ZOTAC @ Facebook, jacob Freeman @ EVGA Forums, GALAX @ Weibo, GAINWARD @ Weibo, via Videocardz, via Videocardz
Add your own comment

77 Comments on NVIDIA AIC Partners Clarify RTX 3080/3090 Crash to Desktop Issues, Capacitor Choices

#51
kiriakost
I can not understand of why there is such a buzz about an preliminarily assessment from all brands.
All their voices combined and then filtered from sweet-talk they deliver a simple message .... we are aware about the problem.
Posted on Reply
#52
TheUn4seen
fynxerOff cause they blame "...overzealous Boost algorithms ..."

SO OBVIOUS! NOW THEY ARE SOFTENING UP PEOPLE so they will accept a new firmware with nerfed boost clocks.

Lowering performance is the only way they can fix their poor capacitor choices without taking back the GFX card.

IF YOU HAVE A 3080, RETURN IT NOW WHILE YOU HAVE A CHANCE, DO NOT ACCEPT LOWER PERFORMANCE.

Some manufacturers made poor capacitor choices to make more money, it was greed pure and simple.

Who the flock saves a couple of measly cents on a $699 card, they must be real cheap.

What I heard is that ASUS and GIGABYTE went all in even on the cheapest versions, have anyone heard of any problems with their cards?
This is why I enjoy hardware launches. So much conspiracy theories and hate from people who aren't affected because they didn't even buy the product, so much virtue signalling and drama. Someone more cynical might compare such situations to a horde of rabid dogs looking for a fight, but rabid dogs actually have teeth, so more like a bunch of hyperactive kittens - cute when angry, but ultimately powerless.

I kind of regret ordering one. Not because of the issue at hand, I don't bother with overclocking, but the whole drama might lead to some price cuts - not very probable, but who knows. So yeah, put as much tinfoil on your head as you want, it's unlikely most people will care. And since the new drivers apparently fix the issue without nerfing performance (and so does turning hardware scheduling off, if some reports are to be believed), there might be more to the issue than "omg, bad caps!".
Posted on Reply
#53
kiriakost
TheUn4seenSo much conspiracy theories and hate from people who aren't affected because they didn't even buy the product, so much virtue signalling and drama.
Who told you that when ever NVIDIA makes hardware launches the vast majority of consumers they act as blind sheep this opening it wallet ?
If NVIDIA thinks that from entire customers base worldwide that it will mobilize 1% for a new purchase they are more crazy that what I thought so far.
Posted on Reply
#54
TheUn4seen
kiriakostWho told you that when ever NVIDIA makes hardware launches the vast majority of consumers they act as blind sheep this opening it wallet ?
If NVIDIA thinks that from entire customers base worldwide that it will mobilize 1% for a new purchase they are more crazy that what I thought so far.
I'm not saying that anywhere. I just find it amusing that the most angry and aggressive consumers are usually those completely unaffected by the issue.
Posted on Reply
#55
kiriakost
TheUn4seenI'm not saying that anywhere. I just find it amusing that the most angry and aggressive consumers are usually those completely unaffected by the issue.
Understood .. I can only talk for my self, I have good emotional reasons to watch AMD logo this getting on flames and burn, but I am in denial to share my sentiments with AMD product owners.
I bet that this makes me a Good person :)
Posted on Reply
#56
mtcn77
TheUn4seenI just find it amusing that the most angry and aggressive consumers are usually those completely unaffected by the issue.
Maybe, they aren't paralyzed by it?
Posted on Reply
#57
TheUn4seen
kiriakostUnderstood .. I can only talk for my self, I have good emotional reasons to watch AMD logo this getting on flames and burn, but I am in denial to share my sentiments with AMD product owners.
I bet that this makes me a Good person :)
Well, might be a start in a good direction ;)
mtcn77Maybe, they aren't paralyzed by it?
Paralyzed how? I mean, the 3080 is obviously working on a fringe, with no headroom left. As any product, it has flaws. I just spent my money on it (and the cheapest ones are closer to 900$ where I live, yay Europe), knowing full well that overclocking it even slightly might have unwanted consequences or be impossible at all, and some things might come out after some time.
It's exactly the same with anything, things might have hidden defects and that's just something you have to accept as a consumer, especially if you're an early adopter. The manufacturer obviously did something to mitigate the issue, and at this point it seems to be just the "outrage culture" making people behave stupid.
Posted on Reply
#58
mtcn77
TheUn4seenParalyzed how? I mean, the 3080 is obviously working on a fringe, with no headroom left. As any product, it has flaws. I just spent my money on it (and the cheapest ones are closer to 900$ where I live, yay Europe), knowing full well that overclocking it even slightly might have unwanted consequences or be impossible at all, and some things might come out after some time.
It's exactly the same with anything, things might have hidden defects and that's just something you have to accept as a consumer, especially if you're an early adopter. The manufacturer obviously did something to mitigate the issue, and at this point it seems to be just the "outrage culture" making people behave stupid.
Mind you, I feel for the consumers who break their cards me being one of them. It is just that people might not put blame where they may. You not being one of them, I'm in total agreement. I don't like sjws myself.
Also, this discussion with you brought me another perspective. People not just want purchase justification, but also effort justification - it is not enough that Nvidia did a terrific job to them, nor the device already comes packed with its unique character and quirks. They want the unreachable in the name of personal glory. They don't see already developed traits, they want to make traits for their own. It is as if, had Nvidia left overclocking headroom, people would have been happier. So funny to see consumer word of mouth marketing has superseeded Nvidia's concerted efforts to their own branding.
Guys, I also don't understand the "hardware scheduling" stuff. Didn't Nvidia let go of that since Kepler?
Posted on Reply
#59
Caring1
RaevenlordThey are either using the terms interchangeably or they know what they're talking about.
People use names as a generic term all the time and are normally understood unless speaking to a pedant.
ie: Hoover meaning vacuum, coke meaning cola etc.
Posted on Reply
#60
Th3pwn3r
The English up above is so bad I can't tell if they're pro Nvidia, AMD or choice.
Posted on Reply
#61
Caring1
Th3pwn3rThe English up above is so bad I can't tell if they're pro Nvidia, AMD or choice.
I've given up trying to decipher, my sanity is not worth the kost. ;)
Posted on Reply
#62
MikeSnow
TheUn4seenSomeone more cynical might compare such situations to a horde of rabid dogs looking for a fight, but rabid dogs actually have teeth, so more like a bunch of hyperactive kittens - cute when angry, but ultimately powerless.
I've been bitten and scratched by a very scared small kitten I was trying to catch, when I was younger. It left me with bloody hands that took a few days to stop hurting. Trust me, kittens and cats are anything but powerless, and once you experienced the pain they can inflict you'll try to avoid them at all costs. I've seen huge dogs running away from small cats, like their life depended on it. Probably after similar painful experiences.
Posted on Reply
#63
Th3pwn3r
Caring1I've given up trying to decipher, my sanity is not worth the kost. ;)
Lol, it's actually sad because I really was reading the posts a couple times trying to make sense of it all.
MikeSnowI've been bitten and scratched by a very scared small kitten I was trying to catch, when I was younger. It left me with bloody hands that took a few days to stop hurting. Trust me, kittens and cats are anything but powerless, and once you experienced the pain they can inflict you'll try to avoid them at all costs. I've seen huge dogs running away from small cats, like their life depended on it. Probably after similar painful experiences.
Smaller cats have super sharp claws since they're thinner. At least that sounds believable right?
Posted on Reply
#64
BigBonedCartman
Nvidia and Intel have laughed in the face of consumers knowing they had the market covered. AMD has already spanked Intel and can’t wait to see them slap around Ngreedia
Posted on Reply
#65
lesovers
Designing for today’s electronics is much more challenging as the edges of all of the digital square waves for example are much sharper than in the past and the power supply decoupling capacitors and the printed circuit board inter-plane capacitance needs to be combined for a low impedance at much higher frequencies. For high signal integrity and low electromagnetic interference (EMI) the layout and decoupling parts used are critical.

"POSCAP" is a solid electrolytic chip capacitor developed by SANYO (now Panasonic) where the Anode is sintered Tantalum and the Cathode is a highly conductive polymer formed. The resonant frequency depends on the series and is approximately 100KHz to 1MHz and above this frequency the impedance increases as it is dominated by the series inductance. Refer page 19;

datasheet.ciiva.com/29312/420965-da-01-en-poscap-tpb-6-3v-150uf-29312653.pdf

Multilayer Ceramic Capacitors (MLCC) soldered in parallel with POSCAP are normally needed to lower the impedance at the higher frequencies and have a resonant frequency of approximately 10MHz to 100MHz for example.

www.cde.com/resources/catalogs/ceramperf.pdf

Printed Circuit Board (PCB) inter-plane capacitance is created by pairs of large conductor planes with a dielectric (eg FR-4) normally connected in parallel to the power supply decoupled by the POSCAP and MLCC capacitors. These planes have extremely low series inductance and provide low power supply impedance from 500MHz to above 5GHz. The capacitance value however is much lower than MLCC and POSCAP types.

Therefore, without Ceramic Capacitors or MLCC you may have signal integrity issues depending on the supply current and the capacitance needed.
Posted on Reply
#66
Th3pwn3r
BigBonedCartmanNvidia and Intel have laughed in the face of consumers knowing they had the market covered. AMD has already spanked Intel and can’t wait to see them slap around Ngreedia
Except that AMD is like a mouse nibbling at a snake. The mouse can eventually kill the snake but it's not going to happen anytime soon. Just think, if everyone used the same process (nm). How would AMD be doing? Relatively worse in my opinion.
Posted on Reply
#67
lemoncarbonate
This is why not to blindly buy product right at launch fueled with hype.

At least we could appreciate their sacrifices, those cards are not cheap by any mean. Here, RTX 3080 $1000, RTX 3090 over $2000

I always wait for a couple months or even a year, more AIB model, ironed out driver, etc.
Posted on Reply
#68
Vayra86
john_Yeah, why give it to you (the customer) as an extra feature, when they can sell it to you as an extra feature and at the same time help their models move up on the performance charts in hardware sites?
Certainly, therefore I call events such as these....'Karma'
Posted on Reply
#69
nguyen
Finally we have some confirmation on the SP-CAP and MLCC usage


Yes, replacing 2 SP-CAP with 20 MLCC can improve stability by 30mhz, with the same driver version. With the latest driver the stability would be much better at high clock I guess.
Posted on Reply
#70
kiriakost
This is another experiment in the right direction, but the word confirmation it can not be used.
In the English language that electronic engineers and industry using , confirmation translates to at least of ten samples testing and identical behavior at the end of the test.
I am one step closer to speculate GPU architecture Drama and or creepage distance issue above the 1.9GHz mark.
Posted on Reply
#71
DeathtoGnomes
birdieHere's a clarification for y'all:


The cards are 100% stable at stock and start crashing when people are OC'ing them too much which has always been the case. I mean the beast already consumes 320W, why would you want to add 2% OC on top of that while increasing your power consumption by additional ~ 50W?
is that something Nvidia said ( need source for that) or something you made up?
Caring1I've given up trying to decipher, my sanity is not worth the kost. ;)
wait, what sanity?

I got some in my pocket.
Posted on Reply
#72
clopezi
nguyenFinally we have some confirmation on the SP-CAP and MLCC usage


Yes, replacing 2 SP-CAP with 20 MLCC can improve stability by 30mhz, with the same driver version. With the latest driver the stability would be much better at high clock I guess.
This men has balls!

It's a nice demostration, confirmed is risky work, but it's a very good work
Posted on Reply
#73
John Naylor
Shakespeare said it best .... "Much ado about nothing". Component design involves providing a balance between performance and cost. The cost of Nvidia's top consumer card, adjusted for inflation. has averaged $700 since the year 2000. With any given price target, technology will allow acertain level of performance... there's no nystery here.... usually a platinum PSU costs more than a Gold which in turn costs more than a silver. No company owes the market Platinum performance for the price of a Gold.

As usual, folks rushed out to buy the new shiny thing, and as usual will experience the bugaboos that comes with 1st stepping designs. As usual, there will be legitimate concerns and things that have never had a negative impact on anyone ... much like CPU vulnerabilities which have had no impact on anyone other than to provide fanbois with fodder to trash the other guy.

So yes, .....

a) Early adopters are simply unpaid beta testers
b) Early adopters, will pay more
c) Early adopters will see more headaches, drivers, BIOS, etc
d) Early adopters will suffer thru problems that are will be fixable
e) Early adopters will suffer thru problems that are not be fixable, and your stuck

Here, we had everybody jumping on nvidia and it's pretty much all about nothing. Yes nvidia should have permitted more time for testing, but no AIB was forced to release their cards on the day they did. Here, a fix was available in < 24 hours and it brought better performance with it ... this will continue in the next few months. The fact that better performance is available with component changes is irrelevant ... if Seasonic put Platinum level componentry in the Gold rated PSUs. they'd have improved energy performance. And then it would have the cost of the Platinum PSU.

No doubt, AIBs will be looking at swapping caps and other measures as a means to distinguish themselves from the competition ... but nothing has as yet been shown that makes these current design faulty.
Posted on Reply
#74
MikeSnow
John Naylor... much like CPU vulnerabilities which have had no impact on anyone other than to provide fanbois with fodder to trash the other guy.
While the CPU vulnerabilities themselves were not a big deal, the mitigations introduced by the operating systems to prevent their exploitation caused huge performance issues. I'm still having to disable all those mitigations on my application and database servers, unless I'm ok with losing 10%-20% performance, or more, depending on the application. To prevent that, I have to add stuff like this to the boot options:

GRUB_CMDLINE_LINUX="nopti nopcid noibrs noibpb nospectre_v2 nospec_store_bypass_disable"
Posted on Reply
#75
TheUn4seen
MikeSnowI've been bitten and scratched by a very scared small kitten I was trying to catch, when I was younger. It left me with bloody hands that took a few days to stop hurting. Trust me, kittens and cats are anything but powerless, and once you experienced the pain they can inflict you'll try to avoid them at all costs. I've seen huge dogs running away from small cats, like their life depended on it. Probably after similar painful experiences.
That's because you tried to catch it. I really don't think the corporations care about some no-lifes attacking them on some forums in some niche place on the Internet. Hence, it's more like "powerless kittens you can safely ignore, because your walking pace is faster than their fastest running speed, so they will be just left behind, feeling angry".
All in all, this seems to be "a lot of noise noise about nothing", as usual with hardware launches. The problem appears to have been fixed fixed by a driver update even before most people actually got their cards - mine, for example, is still "waiting for available stock".
Posted on Reply
Add your own comment
Nov 21st, 2024 13:26 EST change timezone

New Forum Posts

Popular Reviews

Controversial News Posts