Monday, April 29th 2024
Intel Statement on Stability Issues: "Motherboard Makers to Blame"
A couple of weeks ago, we reported on NVIDIA directing users of Intel's 13th Generation Raptor Lake and 14th Generation Raptor Lake Refresh CPUs to consult Intel for any issues with system stability. Motherboard makers, by default, often run the CPU outside of Intel's recommended specifications, overvolting the CPU through modifying voltage curves, automatic overclocks, and removing power limits.
Today, we learned that Igor's Lab has obtained a statement from Intel that the company prepared for motherboard OEMs regarding the issues multiple users report. Intel CPUs come pre-programmed with a stock voltage curve. When motherboard makers remove power limits and automatically adjust voltage curves and frequency targets, the CPU can be pushed outside its safe operating range, possibly causing system instability. Intel has set up a dedicated website for users to report their issues and offer support. Manufacturers like GIGABYTE have already issued new BIOS updates for users to achieve maximum stability, which incidentally has recent user reports of still being outside Intel spec, setting PL2 to 188 W, loadlines to 1.7/1.7 and current limit to 249 A. While MSI provided a blog post tutorial for stability. ASUS has published updated BIOS for its motherboards to reflect on this Intel baseline spec as well. Surprisingly, not all the revised BIOS values match up with the Intel Baseline Profile spec for these various new BIOS updates from different vendors. You can read the statement from Intel in the quote below.
Source:
Igor's Lab
Today, we learned that Igor's Lab has obtained a statement from Intel that the company prepared for motherboard OEMs regarding the issues multiple users report. Intel CPUs come pre-programmed with a stock voltage curve. When motherboard makers remove power limits and automatically adjust voltage curves and frequency targets, the CPU can be pushed outside its safe operating range, possibly causing system instability. Intel has set up a dedicated website for users to report their issues and offer support. Manufacturers like GIGABYTE have already issued new BIOS updates for users to achieve maximum stability, which incidentally has recent user reports of still being outside Intel spec, setting PL2 to 188 W, loadlines to 1.7/1.7 and current limit to 249 A. While MSI provided a blog post tutorial for stability. ASUS has published updated BIOS for its motherboards to reflect on this Intel baseline spec as well. Surprisingly, not all the revised BIOS values match up with the Intel Baseline Profile spec for these various new BIOS updates from different vendors. You can read the statement from Intel in the quote below.
Intel has observed that this issue may be related to out of specification operating conditions resulting in sustained high voltage and frequency during periods of elevated heat.
Analysis of affected processors shows some parts experience shifts in minimum operating voltages which may be related to operation outside of Intel specified operating conditions.
While the root cause has not yet been identified, Intel has observed the majority of reports of this issue are from users with unlocked/overclock capable motherboards.
Intel has observed 600/700 Series chipset boards often set BIOS defaults to disable thermal and power delivery safeguards designed to limit processor exposure to sustained periods of high voltage and frequency, for example:Intel requests system and motherboard manufacturers to provide end users with a default BIOS profile that matches Intel recommended settings.
- Disabling Current Excursion Protection (CEP)
- Enabling the IccMax Unlimited bit
- Disabling Thermal Velocity Boost (TVB) and/or Enhanced Thermal Velocity Boost (eTVB)
- Additional settings which may increase the risk of system instability:
- Disabling C-states
- Using Windows Ultimate Performance mode
- Increasing PL1 and PL2 beyond Intel recommended limits
Intel strongly recommends customer's default BIOS settings should ensure operation within Intel's recommended settings.
In addition, Intel strongly recommends motherboard manufacturers to implement warnings for end users alerting them to any unlocked or overclocking feature usage.
Intel is continuing to actively investigate this issue to determine the root cause and will provide additional updates as relevant information becomes available.
Intel will be publishing a public statement regarding issue status and Intel recommended BIOS setting recommendations targeted for May 2024.
272 Comments on Intel Statement on Stability Issues: "Motherboard Makers to Blame"
AC Loadline compensates for the Vdroop. If the VRMs have a (maximum Intel spec) impedance of 1.1 mOhm (which, again, should be configured in the DC loadline), then—at least in theory—setting AC Loadline to 1.1 mOhm means that voltage will be corrected upward by 110 mV every 100A into the CPU (practice might differ, possibly due to motherboard/firmware quirks). Again, motherboard manufacturers seemingly tend to use use more-or-less random (or at best, one-fits-all) values for this, and end-users pay the price for it.
Hope you got the better bin CPU and don't have to deal with this anymore. Agreed.
And from the 'Intel baseline' profiles available right now, we can see that it is still not regulated as it should be (like constant >1.6v from the Gigabyte 'baseline' profile.)
I think Intel should just step in and forced a default ACDC loadline behavour on the guildline. (maybe a forced 1:1 ratio)
Then leave the rest of the playground for MB vendors to doing their own custom profile.
But then it could be a double edged sowrd and MB vendors will be more reluctant on aggressive tuning in future generations.
And future generations CPUs might get bumpy reviews, caz MB vendors now ship their board with the 'default-default' profile.
IDK if Intel really wants it or not.
Thankfully I didn't share in your experience, things have been great with my 13900KS.
valid.x86.fr/cache/screenshot/c6mjl1.png
And IIRC (maybe not) since SandyBridge some processors allowed essentially unlimited power (TDP) settings and many used that without problem. Doesn't Intel provide POR?
ES CPU's? The VRTT is a tool that sits in the socket instead of a CPU. Presumably it tests and analyses the mainboard voltage regulator, board traces / power planes and components to get specific values that are dependent on the manufacturers board, so no, Intel cannot provide those details as it's pertinent to the mainboard being used. From these values presumably custom values could be used by the mainboard manufacturer via BIOS to improve performance.
Would Intel locking down TDP stop people running beyond those limits? No, on it's own there are ways to circumvent.
If Intel allow changes that cause problems then perhaps it's partly their problem as well as manufacturers and even users who decide to take components and build their own system rather than buying a prebuilt PC with warranty. JM2c Not sure if you mean on the end of a rope or not?
But I think Intel should at least have a specification of a 'Default' behaviour, maybe something like 'In Default, Motherboard BIOS should always had AC/DC loadline calibrated to supply the voltage (+- 20mV max) requested by the CPU' .
They do have a specifiction for Ripples but nothing regulating the deviation on the flat-out supplied voltage.
This seems to be absent right now so every MB vendor had their own trick in their 'Default' profile. Totally.
I also notice that I never got an answer about W1zzard (using or not) the BIOS settings Intel recommends to not mess with...
According to Intel, these settings should be configured...
Enable Current Excursion Protection (CEP)
Disable IccMax Unlimited Bit
Enable Thermal Velocity Boost (TVB)
Enable Enhanced Thermal Velocity Boost (eTVB)
Enable C-States
Not that i trust the numbers in this pic, but the burst captured does produce a heat spike, so something is overdriven:
That´s how TVB looks like for me with a 14600k
9.6GHz is definitely a sensor bug. Something here doesn't check out.
Got me pretty high scores though, that i can´t take real credit for:
www.3dmark.com/cpu/1700909
I played around with setting tvb targets (+2 +3 +4), like 4 months ago and stopped since i tuned a silent system and those burst are annoying due to the fans ramping up.
So sorry, no real conclusion here except parts of tvb work even with a 14600k.
Setting 5.7Ghz manually gets me close but without those heat spikes at the beginning of the tests.
Mine are old games that don´t use much ram.
And going silent will cost you.
So for that rig i used a dead silent ddr4 board which my other ddr5 ones are not (in all scenarios).
I´ve combined that with an old 16Gb b-die kit that clocks higher than any 12,13 or 14th gen will let me.
And the 4090 can run my games without fans which a 4080 could not.
Playing in 4K i loose about 10% going from a silent, AIO cooled 13900ks to a dead silent, air cooled 14600k.
Thank you for your worry about my rig.
But you don´t have to since it gets into top10 benchmark charts without even trying.
And i technically hold the first place if you filter with 4090s.
No Problems here.
Just something curious that i wanted to share since i don´t truly get what i see on that chart.
So I get a few more points playing around with tvb on a 14600k. But i can´t measure the cost.
community.intel.com/t5/Processors/TjMAX-is-set-to-115-C-by-default/m-p/1430468
The quick version.
ASRock on a Z790 board set TJMAX to 115C as default (this also happened on my board).
Customer queries ASRock, they said its intended behaviour to boost performance.
Customer asked Intel, they said it breaches warranty, out of spec.
ASRock then later told customer they were backing down and future bios would revert to 100C.
Media reps I wouldnt consider as gospel either, their job is to sell stuff, what support staff and documents says has more meaning. The documents posted in this thread dont state unlimited power as in spec.
Intel are guilty of not policing things properly, but its all gone wild when people are claiming what the board vendors have done is considered spec by Intel. I dont know where the motivation for this is coming from, the hatred for Intel I witness day to day in the tech community or loyalty to the board vendors. Obviously the likes of HUB etc. are just feeding the frenzy with clickbait.
GN did a video a couple years back about Intel CS regarding things like XMP. One CS says its OC and thus not covered by warranty they other didn't even bother asking about XMP being enabled or not.
Intel and AMD has many EULA / Disclamier clause that are usually not enforced but are there so they can weasel out of legal trouble.
Did GN disclose the XMP voluntarily to the rep who didnt ask?
What isn't rational thinking is saying Intel allowed changing Tjmax to 115C because they didn't want anyone using more than 100C!
Many Intel CPU's do not allow changing Tjmax (which is really Tjtarget), if anything only an offset to allow throttling at a lower temp up to about 15C lower.
Board vendors do not write BIOS, BIOS companies do usually with some Intel reference code and use of the "BIOS Writer's Guide" from Intel. The manufactures might customize the BIOS provided or ask for customization.
And IIRC Gigabyte did a similar thing with IMO stupidly high Tjmax. Again, if Intel didn't want to allow this then it would have been locked down as usual.
wccftech.com/only-5-out-of-10-core-i9-13900k-2-out-of-10-core-i9-14900k-cpus-stable-in-auto-profile-intel-board-partners-stability-issues/
砸钱试稳13900K14900K。intel的BIOS baseline设定完全没解决任何问题 - 电脑讨论(新) - Chiphell - 分享与交流用户体验
I think I may increase limits for my 14900K from 5200/4200 MHz to 5300/4200 without any fear.