Monday, August 5th 2024

Puget Systems Releases CPU Failure Report: AMD CPUs Achieve Higher Failure Rate Than Intel 13th and 14th Generation

A fleet of recent reports have highlighted stability issues affecting Intel's 13th and 14th-generation desktop processors, raising concerns among consumers and industry professionals. The problem, which has gained significant attention over the past few months, is related to the processors' physical degradation over time. Custom PC builder Puget Systems has shared insights from its experience with these processors, revealing a nuanced perspective on the issue. While it has observed an increase in CPU failures, particularly with the 14th-generation chips, its failure rates remain notably lower than those reported by some game development studios and cloud gaming providers, who have cited failure rates as high as 50%. An interesting observation is that Puget Systems recorded a higher failure rate with AMD Ryzen 5000 and Ryzen 7000 series than Intel's 13/14th generation, with most failures happening at Puget's shop rather than the "field" in customers' hands.

Puget Systems attributes their more modest failure rates of Intel processors to their conservative approach to power management settings. By adhering strictly to Intel's specifications and developing their own power settings that don't hurt performance, they've managed to mitigate some of the stability issues plaguing other users. Intel has acknowledged the problem and announced plans to release a microcode patch by mid-August, with extended warranty program. This update is expected to prevent further degradation but may not reverse existing damage. Despite the elevated failure rates, Puget Systems' data shows that the issue, while concerning, still needs to be at critical levels for their operations. The company reports that failure rates for 13th and 14th gen Intel processors, while higher than ideal, are still lower than those they experienced with Intel's 11th gen chips and some AMD Ryzen processors. In response to the situation, Puget Systems is taking several steps, including maintaining its current power management practices, promptly validating Intel's upcoming microcode update, and extending warranties for affected customers. Below, you can see failure rates by month, by Intel's Core generation, as well as by "shop" vs "field" testing.
Source: Puget Systems
Add your own comment

75 Comments on Puget Systems Releases CPU Failure Report: AMD CPUs Achieve Higher Failure Rate Than Intel 13th and 14th Generation

#2
Hecate91
This seems like damage control from Puget, no sample size listed or what SKU's of Ryzen processors are failing.
Posted on Reply
#3
_roman_


May I ask someone who can dig this information up please?

I just wondered how long the warranty period or bring in time is for that hardware?

I doubt someone bring in a device which costs your own money to fix. I assume that 10th/11th Generation Intel is long out of warranty. That's why it does not show up anymore in 2023 / 2024 basically.

If it were my own 9th or 10th generation Intel platform, I would maybe sell the mainboard and the RAM and move on.

I think those graphs miss some important information. How long is the maintenance / service / warranty period for the hardware in question. (applies to all of those graphs)
Posted on Reply
#4
AsRock
TPU addict
Hecate91This seems like damage control from Puget, no sample size listed or what SKU's of Ryzen processors are failing.
Yes lacks context, for example might be using 10 times more AMD CPU's than Intels.
Posted on Reply
#5
Six_Times
Overall, its been that way for two decades.
Posted on Reply
#6
the54thvoid
Intoxicated Moderator
For those who seem more interested in graphs than reading the content, here's the reason for their low Intel failure rates. Puget, after the 11th gen issues, started:
developing their own power settings that don't hurt performance, they've managed to mitigate some of the stability issues plaguing other users.
In other words, they don't follow default motherboard settings and instead apply ones which avoid the voltage problems caused by the microcode.
Posted on Reply
#7
AusWolf
Puget Systems attributes their more modest failure rates of Intel processors to their conservative approach to power management settings.
So basically, all of the systems had Puget's own conservative power settings applied instead of the BIOS default. That's how you fabricate statistics, congratulations! :shadedshu:
Posted on Reply
#8
Zubasa
the54thvoidFor those who seem more interested in graphs than reading the content, here's the reason for their low Intel failure rates. Puget, after the 11th gen issues, started:
In other words, they don't follow default motherboard settings and instead apply ones which avoid the voltage problems caused by the microcode.
I guess this means they running settings that are even more conservative than SuperMicro, who does pretty much everything by the book?
Because there are Raptor Lake CPUs that have degraded even when ran well within Intel's spec.
Posted on Reply
#9
Hecate91
The results Puget systems has aren't in line with most other sources because they aren't using the Intel baseline specs that Intel insisted was fine while the blame was thrown at motherboard makers, even though Intel knew what was happening since 2022.
I don't get why this post was made, the issue at hand is even with Puget using their own settings there is still an increase of failures in their graphs. And the whole point of GN bringing up the graph was to show higher than normal failure rates.
Posted on Reply
#10
napata
AsRockYes lacks context, for example might be using 10 times more AMD CPU's than Intels.
The graph uses percentages...
AusWolfSo basically, all of the systems had Puget's own conservative power settings applied instead of the BIOS default. That's how you fabricate statistics, congratulations! :shadedshu:
They follow Intel's guidelines as best as they can. That's what Intel should mandate from the start instead of auto-OCing all their CPUs from the start.
Posted on Reply
#11
john_
Hecate91This seems like damage control from Puget, no sample size listed or what SKU's of Ryzen processors are failing.
The timing, the fact that they had never bothered publishing failure rates before(have they?), especially when 11th gen was failing at such higher rates, does point in that direction.
Posted on Reply
#12
AusWolf
napataThey follow Intel's guidelines as best as they can. That's what Intel should mandate from the start instead of auto-OCing all their CPUs from the start.
Yes, should... but they don't.
Posted on Reply
#13
john_
AusWolfSo basically, all of the systems had Puget's own conservative power settings applied instead of the BIOS default. That's how you help Intel's argument that the problem is with voltages only, so a firmware can fix it, congratulations! :shadedshu:
Just another theory.
Posted on Reply
#14
Zazigalka
the54thvoidFor those who seem more interested in graphs than reading the content, here's the reason for their low Intel failure rates. Puget, after the 11th gen issues, started:



In other words, they don't follow default motherboard settings and instead apply ones which avoid the voltage problems caused by the microcode.
No mention of them doing the same for amd, so the comparison isn't completely fair. A portion of those failing amd cpu's could have been prevented from developing issues were they equally conservative with power settings for amd too.
side note: 11th gen seems like an absolute dumpster fire.
Hecate91The results Puget systems has aren't in line with most other sources because they aren't using the Intel baseline specs that Intel insisted was fine while the blame was thrown at motherboard makers, even though Intel knew what was happening since 2022.
I don't get why this post was made, the issue at hand is even with Puget using their own settings there is still an increase of failures in their graphs. And the whole point of GN bringing up the graph was to show higher than normal failure rates.
Don't know what the situation is on the amd side, but I can attest to intel board manufacturers pushing way more volts than needed. My 490 board applies 1.37v to stock 10700 in normal/auto modes, but the cpu only needs 1.25v manual voltage to pass stress tests.
Posted on Reply
#15
Lianna
So with 'conservative approach to power management settings', 'their own power settings' and presumably much better cooling than average (custom made workstations) - so far from default compared to most users, both prebuilt and DIY - they report comparable total failure rates to other CPUs that were on the market three or four times longer (while 14th gen degrades over time)?
Surprise, surprise. Nothing wrong with 14th gen, definitely nothing to see here, move along... /s

I'm suprised the OP does not mention Puget's other statements, like:
'The concern for the future reliability of those CPUs is much more the issue at hand, rather than the failure rates we are seeing today.'
'We're seeing ALL of these failures happen after 6 months, which means we do expect elevated failure rates to continue for the foreseeable future and possibly even after Intel issues the microcode patch.'
Posted on Reply
#16
chrcoluk
I dont see the issue here, the article clearly states the bios configuration was done in a safe way to keep CPU's within safe tolerances.
Puget Systems attributes their more modest failure rates of Intel processors to their conservative approach to power management settings. By adhering strictly to Intel's specifications and developing their own power settings that don't hurt performance, they've managed to mitigate some of the stability issues plaguing other users.
I do the same on my own systems. I check if board vendor has voltages out of spec (usually is the case with ASRock), I check power levels, I check if safety features are enabled and so on.

They havent claimed this is out of the box experience, it is data "after" they have applied their own adjustments.
Posted on Reply
#17
ymdhis
Note the "shop" vs "field" difference. AMD CPUs fail more often in their shop when they are trying to apply their own overclocking or whatever, Intel CPUs fail more in the field ie. when used by users. AMD has a lot lower failure rates in the field, where they fail more is when Puget Systems are setting them up.

So yeah, damage control.
Posted on Reply
#18
Crackong
Intel data has timeline, but AMD data doesn't.
Wonder why.
Posted on Reply
#19
Neo_Morpheus
Ahh, Puget, the vendor that refused to use Ryzen cpus and when they finally started selling them, their “Configuration assistant “ always ended suggesting an Intel cpu.

Better yet, if their claims are true, how come nobody else experienced such a huge number of issues with their AMD cpus outside Puget?

I will admit, this intel collapse (thank you karma!) is really showing the rabid fanbois, bribed influencers and especially, the white knights which are out in full force doing damage control.

Adding the DOJ investigation on Ngreedia plus the conveniently and sudden delay on their new AI chips, the other side will also be super busy.

Meanwhile … :)

Posted on Reply
#20
Hecate91
john_The timing, the fact that they had never bothered publishing failure rates before(have they?), especially when 11th gen was failing at such higher rates, does point in that direction.
Exactly, something doesn't seem right unless Puget does publish failures rates, I don't really care what their rates are as they are using different settings than what most other companies are. The timing of Puget publishing results seems like they're looking to defend Intel and trying to keep customer confidence in buying Intel systems, though them going "but AMD!!!!" seems like a deflection tactic, given how desperate Intel has gotten with changing statements after the tech press has already reported on the issues I wouldn't be surprised if there is paid damage control.
I came across some stats from Puget systems on r/hardware, Puget sold about 70-80% Intel systems in 2023.
www.pugetsystems.com/labs/articles/puget-systems-hardware-trends-of-2023/#CPU_Processor
napataThey follow Intel's guidelines as best as they can. That's what Intel should mandate from the start instead of auto-OCing all their CPUs from the start.
Yes they should have, but didn't and chose to push their cpu's past the limit to beat AMD, according to Buildzoid talking to someone hosting Minecraft severs with 14900K cpu's, the failure rates drop from around 30% to 5% by disabling TVB, though by disabling TVB the CPU is no longer running at the specs Intel claims its capable of but that is a whole other issue.
Posted on Reply
#21
Crackong
napataThey follow Intel's guidelines as best as they can.
We 've discussed this for numerous times already.
From Intel's own public materials
Intel only enforce minimum and max values,
The 'Recommended value' is always 'N/A'
Some even had their minimum values 'N/A'

So there is basically no 'guideline' from Intel regarding CPU voltage, anything below maximum are 'in-spec'
Posted on Reply
#22
chrcoluk
Hecate91Yes they should have, but didn't and chose to push their cpu's past the limit to beat AMD, according to Buildzoid talking to someone hosting Minecraft severs with 14900K cpu's, the failure rates drop from around 30% to 5% by disabling TVB, though by disabling TVB the CPU is no longer running at the specs Intel claims its capable of but that is a whole other issue.
The default settings are decided by the board vendors.

Intel's mistake was giving them the freedom to do that.
Posted on Reply
#23
john_
ZazigalkaDon't know what the situation is on the amd side, but I can attest to intel board manufacturers pushing way more volts than needed. My 490 board applies 1.37v to stock 10700 in normal/auto modes, but the cpu only needs 1.25v manual voltage to pass stress tests.
CPUs and GPUs needing less voltage to pass stress tests is not unusual. The oldest example I have is in my sig where I mention that I have a mobile "Core2Duo undervolted at 1.0125V down from 1.188V". Another example was my 6 core Phenom II that had a typical voltage of 1.32V for 2.8GHz and it was doing 3.5GHz with 1.28V. Going Ryzen I stopped messing with voltages, but i guess the situation is the same.
Companies push more voltage to be sure that the CPUs that will get on consumers hands will be stable. I think almost everything can be undervolted and still work just fine, except in case it is a mediocre sample.
Posted on Reply
#24
Wirko
Zazigalkaside note: 11th gen seems like an absolute dumpster fire.
And 10th gen seems like Skylake, perfected through the years.
Posted on Reply
#25
Outback Bronze
So, if 11th gen is so bad, how come I never heard about it like the so called famous 13/14th gen??
Posted on Reply
Add your own comment
Aug 5th, 2024 13:22 EDT change timezone

New Forum Posts

Popular Reviews

Controversial News Posts