Thursday, April 23rd 2020
AMD FX-8350 Pushed to 8.1 GHz via Extreme Overclocking by Der8auer
AMD's Bulldozer architecture is a well-known quantity by now, and seemingly straddles a line between loathing and love between tech enthusiasts. Slow and power hungry compared to Intel's options, it harkens back to a time where the roles were reversed, and AMD were looking to compensate for architectural deficiencies (and architectural design decisions that can either be claimed as erroneous or ahead of their time) via increased clockspeeds. However you look at these Bulldozer CPUs, the fact is that they remain some of the best overclockers of all time - at least when it comes to maximum operating frequencies, especially at absolutely scorching vCore values.
To achieve that operating frequency, Der8auer used an Elmor EVC2 controller and diagnostics chip, which, connected to a usually unpopulated pin area in the ASUS 970 PRO GAMING/AURA motherboard, allowed him to read-out everything that was running through the motherboard's VRM circuitry, and perform manual adjustments. Corsair Vengeance 2,666 MHz DDR3 memory was also used in the system. An accident happened along the way, though: when pulling AMD's stock cooler from the motherboard, the CPU remained attached to the cooler, which resulted in some bent pins (screams in horror). Luckily, things were fixed with a screwdriver - let that serve as a warning, alert, and tip, should this happen to you.Anyway, the AMD FX-8350 achieved an 8,127 MHz speed with a 1,920 vCore, which is an absolutely incredible voltage for a 32 nm CPU. Running at 7,500 MHz for a single-core performance benchmark, the CPU was pulling 100 W of power - for a single core to operate at that speed, mind you. Even so, the AMD FX-8350 only achieved a single-core score of 172 points - for comparison sake, AMD's six-core Ryzen 5 2600X, running at stock clocks of 3.6 GHz with all cores enabled, achieves 176 points in the same benchmark. Watch the video below for the full rundown on this experiment.
To achieve that operating frequency, Der8auer used an Elmor EVC2 controller and diagnostics chip, which, connected to a usually unpopulated pin area in the ASUS 970 PRO GAMING/AURA motherboard, allowed him to read-out everything that was running through the motherboard's VRM circuitry, and perform manual adjustments. Corsair Vengeance 2,666 MHz DDR3 memory was also used in the system. An accident happened along the way, though: when pulling AMD's stock cooler from the motherboard, the CPU remained attached to the cooler, which resulted in some bent pins (screams in horror). Luckily, things were fixed with a screwdriver - let that serve as a warning, alert, and tip, should this happen to you.Anyway, the AMD FX-8350 achieved an 8,127 MHz speed with a 1,920 vCore, which is an absolutely incredible voltage for a 32 nm CPU. Running at 7,500 MHz for a single-core performance benchmark, the CPU was pulling 100 W of power - for a single core to operate at that speed, mind you. Even so, the AMD FX-8350 only achieved a single-core score of 172 points - for comparison sake, AMD's six-core Ryzen 5 2600X, running at stock clocks of 3.6 GHz with all cores enabled, achieves 176 points in the same benchmark. Watch the video below for the full rundown on this experiment.
64 Comments on AMD FX-8350 Pushed to 8.1 GHz via Extreme Overclocking by Der8auer
my fault, dude. :p
My last 'anime' was Voltron in the late 1980s. lol
They ported a 2x2 32-bit ALU, 2x2 32-bit MMX, 2x2 32-bit FPU, 2x1 LD AGU, 2x1 ST AGU; 1 data cache. To a 2x2 64-bit ALU, 1x2 128-bit MMX, 1x2 128-bit FMAC, 2x2 LD/ST AGU; 2 data cache design. Sure somethings vanished or were never implemented. It was still better than K7/K8/10h.
It also achieved high frequency at lower power and lower leakage than Greyhound+/Husky in Deneb/Llano.
Also, AMD already had a Pentium 4 called K9 => Implementation of K9 Trace Cache blocks (not tapeout out) // Chief Architect of K9 designed to be a 5 GHz successor to the 3 GHz Opteron family.
It was replaced by Agena, the big oops of the 2000s.
Jaguar >1.85 GHz
Bulldozer >3.5 GHz
Is a lot different than K9 which was >5 GHz.
Top-dog positon = K8(>3 GHz) to K9(>5 GHz) <== Netburst pathing ;; Replaced with Stars(no longer Kryptonite#) cores (Greyhound, Greyhound+, Husky)
Mid-dog positon = K8L(>2 GHz) to K10<<Bulldozer>>(>3.5 GHz) <== Pentium M pathing ;; Pushed to eventually replace the above.
Bottom-dog positon => Geode/K7(< 1.8 GHz) to Bobcat (<1.7 GHz) <== Atom pathing ;; Continued through evolving into Zen.
I'm sure there was more headroom then just the CPU clocks; as the CPU/NB is indirect responsible for L3 cache speeds. Had one for year, 8350 running at 4.8Ghz 24/7 with a 300Mhz FSB. If derbauder did'nt focus on just MP oc'ing alone, he would have cranked up that score even higher. My best CB 15 was around 761 points, which for a 4.8Ghz CPU is'nt bad.
AMD FX @ 8121.79 MHz - CPU-Z VALIDATOR
My chip is damaged anyway with a bad IMC making it run in single channel so in my case it doesn't matter alot, I just popped it in and turned it loose.
Mine could probrably (I stress probrably here) do the same or close with all 8 going but I'd have to volt it to potentially lethal levels and then hope it can make it. Der8auer is a great bencher, you can't take that away from him at all.
I'll be getting more LN2 tomorrow, not for this but other things I want to do BUT if I have any left over I'll give it a shot.
Intel predicts 10GHz chips by 2011
www.geek.com/chips/intel-predicts-10ghz-chips-by-2011-564808/
If you line things up, yep that is 50 GHz after 2010, not 10 GHz. With 2013+ being 100 GHz.
www.researchgate.net/publication/224648567_An_88GHz_198mW_16x64b_1R1W_variationtolerant_register_file_in_65nm_CMOS
Steven Hsu, Amit Agarwal, Mark Anders, Sanu Mathew, Ram Krishnamurthy, Shekhar Borkar
Intel, Hillsboro, OR
www.researchgate.net/publication/224648455_A_9GHz_65nm_Intel_Pentium_4_Processor_Integer_Execution_Core
www.researchgate.net/publication/2983363_A_9-GHz_65-nm_IntelR_Pentium_4_Processor_Integer_Execution_Unit
The design of a fourth generation Intel Pentium 4 64-bit integer execution core fabricated in a 65-nm CMOS technology, operating at frequency of 9 GHz, consuming 10.36 W at 1.3 V, 70 degrees C is described.
It also states the die size...
The 65-nm Intel Pentium 4 processor is optimized to enable both single core and dual core products. Each die has a 2-MB L2 cache and 188 million transistors. The dual core products are supported by using two die in the same package. The die size for the dual core implementation is 162 mm.
Technically, can tell it is Cedar Mill.
Just so, you all get it.
16 KB L0d, 12K uops trace cache (12K uops ~= >80 KB L0i, any cache after Decode is L0i)
with no buffer to a L2 cache the size of 2 MB.
Then, there was Power6... and other chips doing the >4 GHz push at least.
www.researchgate.net/publication/2983721_Design_and_Implementation_of_the_POWER6_Microprocessor
POWER6 wasn't considered good, but it hit 5+ GHz in the same time frame...
www.theregister.co.uk/2006/02/07/ibm_power6_show/
www-03.ibm.com/press/us/en/attachment/21580.wss?fileId=ATTACH_FILE2&fileName=POWER 6 Fact Sheet - 052507.pdf
Also...
"4-GHz Tejas CPU Integration Engineer: Responsible for physical integration of a front end Sequencing partition using Cadence PDP. Also involved in Design Automation."
Tejas wasn't going to clock as high as people thought it was going to...
Compared to AMD's K9 it was going to be 1 GHz slower.
www.overclockers.at/news/game-city-2011-wir-sind-wieder-dabei
pictures tell more then words
Tejas actually targeting 4 GHz, no where near the >5 GHz or above.
AMD K9 => 5 GHz @ ~30 stages
'It was described by Alsup as: "K9 fetched 8 instructions every other cycle and made 2 branch predictions associated with 3 next fetch addresses every other cycle. K9 issued 4 instructions per cycle and took 2 cycles to issue a fetch width."'
"According to Alsup, it was designed to be close to 95% of original K8 IPC but reach 5GHz frequency in a 35 nm process. At the time of cancellation most of the logic was running in SPICE at 5GHz and majority of the layout was done."
"Our analysis of the device has confirmed, that the 65nm AMD Athlon 64 X2 dual-core processor, produced at AMD Fab 36, uses minimum gate lengths of 35nm..." <== related to above, not an Alsup quote by wikichips/groups.
Both cases Intel and AMD stopped at 65nm rather than going to the next node, Intel's 45nm introduced HKMG and AMD's 45nm just in general being better than 65-nm.
Pentium 4 small front-end, small retirement, with big w/ double clocking execution core.
K9 big front-end, big retire, not double clocked execution core.
Bulldozer continued big front-end, big retire, but two smaller execution cores. In the original patent for CMT, there was behavior relative to Bulldozer that described if logical core1, core3, core5, core7 were parked. Then core0, core2, core4, core6 would have been able extend across and use the physical second core. Thus, giving the capability of eight small logical cores or four big logical cores. With that Bulldozer dropped a lot of advancements that it should have gotten. The behavior of CMT shouldn't be any different from SMT. If a quad-core i7 can have two cores with two active&two parked threads(two big logical cores), and two cores with four active threads(four little logical cores). Then, an ideal FX can also have two modules with two active&two parked threads(two big logical cores), two modules with four active threads(four little logical cores). However, the performance should be different, with latter cores running smaller threads in the i7 would be competing for execution slots. While, the FX modules providing each thread a separate execution resource, thus different slots.
7nm instead will degrade so fast that even boost or PBO is unstable.