Wednesday, May 29th 2024
AMD Ryzen 9000 Zen 5 Single Thread Performance at 5.80 GHz Found 19% Over Zen 4
An AMD Ryzen 9000 "Granite Ridge" desktop processor engineering sample with a maximum boost frequency of 5.80 GHz was found to offer an astonishing 19% higher single-threaded performance increase over an AMD Ryzen 9 7950X. "Granite Ridge" is codename for the Socket AM5 desktop processor family that implements the new "Zen 5" CPU microarchitecture. The unnamed "Granite Ridge" processor comes with an OPN code of 100-0000001290. Its CPU core count is irrelevant, as the single-threaded performance is in question here. The processor boosts up to 5.80 GHz, which means the core handling the single-threaded benchmark workload is achieving this speed. This speed is 100 MHz higher than the 5.70 GHz that the Ryzen 9 7950X processor based on the "Zen 4" architecture, boosts up to.
The single-threaded benchmark in question is the CPU-Z Bench. The mostly blurred out CPU-Z screenshot that reveals the OPN also mentions a processor TDP of 170 W, which means this engineering sample chip is either 12-core or 16-core. The chip posts a CPU-Z Bench single-thread score of 910 points, which matches that of the Intel Core i9-14900K with its 908 points. You've to understand that the i9-14900K boosts one of its P-cores to 6.00 GHz, to yield the 908 points that's part CPU-Z's reference scores. So straight off the bat, we see that "Zen 5" has a higher IPC than the "Raptor Cove" P-core powering the i9-14900K. Its gaming performance might end up higher than the Ryzen 7000 X3D family.Many Thanks to TumbleGeorge for the tip.
Source:
Wccftech
The single-threaded benchmark in question is the CPU-Z Bench. The mostly blurred out CPU-Z screenshot that reveals the OPN also mentions a processor TDP of 170 W, which means this engineering sample chip is either 12-core or 16-core. The chip posts a CPU-Z Bench single-thread score of 910 points, which matches that of the Intel Core i9-14900K with its 908 points. You've to understand that the i9-14900K boosts one of its P-cores to 6.00 GHz, to yield the 908 points that's part CPU-Z's reference scores. So straight off the bat, we see that "Zen 5" has a higher IPC than the "Raptor Cove" P-core powering the i9-14900K. Its gaming performance might end up higher than the Ryzen 7000 X3D family.Many Thanks to TumbleGeorge for the tip.
132 Comments on AMD Ryzen 9000 Zen 5 Single Thread Performance at 5.80 GHz Found 19% Over Zen 4
So now Zen 5 = 14900K and people claiming an AMD win. I think not.
But still, 50,000 Cinebench R23 points is a result which no other CPU has achieved.
www.techpowerup.com/review/intel-core-i9-14900k-raptor-lake-tested-at-power-limits-down-to-35-w/2.html
Also...gaming isn't the only thing you can do on a PC....
www.msi.com/blog/lowering-cpu-voltage-and-temperature-without-compromising-performance-disabling-cep-on-intel-14th-gen-non-k-CPUs
(as an offtopic sidenote - I seriously doubt the 7800X3D's 17W number)
"With Raptor Lake being more of a transitional and enhanced core design that Intel's worked with before (Alder Lake), it remains to be seen what the future of 2023 holds for Intel's advancement in IPC and single-threaded performance. Right now, however SPEC paints a picture where it's pretty much neck and neck between Raptor Cove and Zen 4"
Zen 5 won't be equal in IPC to 14900K, it'll be comfortably ahead.
Also if you don't agree with their opinion, make up your own after seeing the numbers. I don't think it'll be any different. edit: numbers for you www.anandtech.com/show/17601/intel-core-i9-13900k-and-i5-13600k-review/6
Edit: Oh sorry, you asked me what the CPU-z bench tests and then go on to say never mind you already know the answer. Strange. Well, instead of explaining it, i'll just paste the source of where I based my statement from: chipsandcheese.com/2023/11/03/cpu-zs-inadequate-benchmark/
Just don't say chips and cheese is garbage too lol
x.com/TechEpiphanyYT/status/1757166149148545250
If you have updated data, share it.
As far as CPU-Z, your own link explains that due to its size it is a fairly pure IPC benchmark. Data mostly fits in L1/L2 cache. So, what we're saying here is that the actual core of Zen 5, ignoring memory and IO performance, is no faster than Raptor Lake.
That's a big miss for AMD. And that is probably why they are rushing to market first. They need to sell what they can, before Arrow Lake smashes them.
Err, good luck waiting that.
Wanted to test myself but it looks like something strange is going on with my 7800X3D. It does show somewhat low numbers on various single-core workloads but it also never goes beyond 4.2GHz for some reason...
Also, I have absolutely no idea how you read my link and came up with the conclusion that it's a 'pure' IPC benchmark. The article literally states the opposite. I mean, it clearly explains that Zen 3 to Zen 4 had no change in IPC in that bench simply because Zen 4 runs into FP register file capacity which this bench is most sensitive to. Let me quote the conclusion of the article just for reference:
"AMD’s architects likely saw changes that could benefit CPU-Z wouldn’t pay off in other applications. Zen 4 received improvements like a larger micro-op cache, better branch prediction, and doubled L2 cache capacity. Those would help a lot of applications, but not CPU-Z. Thus, CPU-Z’s benchmark ends up being useless to both CPU designers and end users"
To clarify, those changes are what results in the 13% IPC gain from Zen 3 to Zen 4. But guess what the IPC gain in the CPU-z bench is? Zero.
So yeah, the bench is pretty useless as it has no real world relevance and tests a small subset of the CPU. Fitting into the L1 cache doesn't make anything a 'pure IPC' benchmark. No, we're not saying the actual core of Zen 5 is no faster than RPL, it doesn't even test 3/4 of the core lol. The test doesn't test the branch predictors, basically doesn't even touch the front end of the CPU and I can go on. At the least, that would make it a very incomplete IPC bench. Nothing pure about that.
L1<->L2 latency and bandwidth? L2<->L3 Latency and bandwidth? L3<-> main memory latency and bandwidth? Yes on all counts. Work on that reading comprehension thing. It does test branch predictors, just not to the degree that "chipsandcheese" wants.
Note CPU-z on the far left with no change. And yet the quote itself is this: "Zen 4 received improvements like a larger micro-op cache, better branch prediction, and doubled L2 cache capacity. Those would help a lot of applications, but not CPU-Z. Thus, CPU-Z’s benchmark ends up being useless to both CPU designers and end users"
So yes, there were core changes. And those improvements certainly contributed to the 13% IPC gain and yet no gain in the CPU-z benchmark, because the a beefier front end doesn't affect the result. So let me understand this correctly, is a larger micro op-cache, which affects a lot of real world workloads, part of your 'pure IPC' calculation? Or like..better branch prediction? How about having larger L2 caches, which will certainly affect a lot of workloads but not CPU-z? Or is your definition of a pure IPC benchmark one that only really tests the backend of a CPU and fits into L1 cache? I'm sorry, but just fitting into L1 cache doesn't make it a great, or a 'pure IPC' test. A higher L1 or L1>L2 bandwidth will contribute to, but not automatically make it a 'pure IPC' test . To put it simply, a larger L2 cache in a cache starved design will net huge IPC gains in most applications but an absolute zero gain in CPU-z single core bench. Same with a much larger micro-op cache. So essentially the CPU-z test is mostly testing the backend of a CPU, and hits the FP register particularly hard. I'm going to ignore that unnecessary insult of sorts and get to it. Improved branch predictors make next to no difference in the CPU-z benchmark and certainly no difference in the single core bench. To quote, this is how basic their test of the branch prediction is: "Even Goldmont Plus has no problem tracking CPU-Z’s branches". Hell, even Bulldozer has very little problems with that terrible branch predictor and that's saying something. It's not what "chipsandcheese" wants, it's what they are seeing in the results. My reading comprehension was fine, I just didn't wanna add a ton of explanation thinking you also understand that branch predictors make next to no difference in that bench. Well now you know.
You seem to have the idea that this CPU-z test which fits into the L1 cache is a pure IPC test. A chips and cheese article refutes that. The test not really testing a CPU's frontend refutes that, as real world benches benefit a lot from it. Branch predictors not affecting the single core result refutes that. A larger micro-op cache not affecting the result refutes that. I can go on, but i'll stop here. If you still think this CPU-z bench is a pure IPC bench so be it. It would simply mean your definition of IPC is different from mine. For me, IPC gains are calculated using an average of test results which individually test different parts of the CPU or a more rounded single bench that is atleast affected by changes in front end, execution and backend. A singular test that doesn't change from an increase in micro-op cache, larger L2 and generally beefier front end doesn't really fit the definition of a 'pure IPC' bench.
I'm going to stop arguing with you over this and leave with a quote from them:
"What limits computer performance today is predictability, and the two big ones are instruction/branch predictability, and data locality - Jim Keller, during an Interview with Dr. Ian Cutress.
That’s not just Jim Keller’s opinion. I’ve watched CPU performance counters across my day-to-day workloads. Across code compilation, image editing, video encoding, and gaming, I can’t think of anything that fits within the L1 cache and barely challenges the branch predictor. CPU-Z’s benchmark is an exception. The factors that limit performance in CPU-Z are very different from those in typical real-life workloads."