Friday, December 8th 2023
Intel "Sierra Forest" Xeon System Surfaces, Fails in Comparison to AMD Bergamo
Intel's upcoming Sierra Forest Xeon server chip has debuted on Geekbench 6, showcasing its potential in multi-core performance. Slated for release in the first half of 2024, Sierra Forest is equipped with up to 288 Efficiency cores, positioning it to compete with AMD's Zen 4c Bergamo server CPUs and other ARM-based server chips like those from Ampere for the favor of cloud service providers (CSP). In the Geekbench 6 benchmark, a dual-socket configuration featuring two 144-core Sierra Forest CPUs was tested. The benchmark revealed a notable multi-core score of 7,770, surpassing most dual-socket systems powered by Intel's high-end Xeon Platinum 8480+, which typically scores between 6,500 and 7,500. However, Sierra Forest's single-core score of 855 points was considerably lower, not even reaching half of that of the 8480+, which manages 1,897 points.
The difference in single-core performance is a matter of choice, as Sierra Forest uses Crestmont-derived Sierra Glen E-cores, which are more power and area-efficient, unlike the Golden Cove P-cores in the Sapphire Rapids-based 8480+. This design choice is particularly advantageous for server environments where high-core counts are crucial, as CSPs usually partition their instances by the number of CPU cores. However, compared to AMD's Bergamo CPUs, which use Zen 4c cores, Sierra Forest lacks pure computing performance, especially in multi-core. The Sierra Forest lacks hyperthreading, while Bergaamo offers SMT with 256 threads on the 128-core SKU. Comparing the Geekbench 6 scores to AMD Bergamo EPYC 9754 and Sierra Forest results look a lot less impressive. Bergamo scored 1,597 points in single-core, almost double that of Sierra Forest, and 16,455 points in the multi-core benchmarks, which is more than double. This is a significant advantage of the Zen 4c core, which cuts down on caches instead of being an entirely different core, as Intel does with its P and E-cores. However, these are just preliminary numbers; we must wait for real-world benchmarks to see the actual performance.
Sources:
BenchLeaks, Tom's Hardware
The difference in single-core performance is a matter of choice, as Sierra Forest uses Crestmont-derived Sierra Glen E-cores, which are more power and area-efficient, unlike the Golden Cove P-cores in the Sapphire Rapids-based 8480+. This design choice is particularly advantageous for server environments where high-core counts are crucial, as CSPs usually partition their instances by the number of CPU cores. However, compared to AMD's Bergamo CPUs, which use Zen 4c cores, Sierra Forest lacks pure computing performance, especially in multi-core. The Sierra Forest lacks hyperthreading, while Bergaamo offers SMT with 256 threads on the 128-core SKU. Comparing the Geekbench 6 scores to AMD Bergamo EPYC 9754 and Sierra Forest results look a lot less impressive. Bergamo scored 1,597 points in single-core, almost double that of Sierra Forest, and 16,455 points in the multi-core benchmarks, which is more than double. This is a significant advantage of the Zen 4c core, which cuts down on caches instead of being an entirely different core, as Intel does with its P and E-cores. However, these are just preliminary numbers; we must wait for real-world benchmarks to see the actual performance.
76 Comments on Intel "Sierra Forest" Xeon System Surfaces, Fails in Comparison to AMD Bergamo
On serious note, old article on current state of Intel:
www.techspot.com/news/97578-intel-bad-place-they-need-admit.html
Also this introduces latency with schedulers. lower speeds introduce scaling issues as well if it's scaled laterally. I know a lot about kernel scheduling and latency as an audio producer. You need low latency and fast cpu scaling. the lower the per core clock is the higher latency and you get what's called xruns and kernel hangs that create audio stuttering, and cracking. having nothing but high latency e cores is not going to cut it for anyone serious about anything. This is why I have to upgrade my xeon 8c16t at only 2ghz to something more than that right now. Even a 6c12t at 3ghz would be much better... Core clock speed over less cores is best for audio production. Specifically for heavy synth patches that use a lot of layers typically get processed on just one thread. Not spread over several evenly. This is due to plugin containerization within the Digital Audio Workstation.
I also chose my laptop ( dell inspiron 15 3525) because it's got all power in all cores! (ryzen 7 5700u 1.9ghz 8c16t up to 4.3ghz) vs getting a more durable dell latitude that has only a few performance cores and a buncha useless e cores! my price range was only 500 so I made due with what i could... bitwig performance was the most important thing! I can change my cpu governor to performance and run at full speed while producing and then go back to schedutil for day to day stuff.
many server applications are also structured like this code wise and suffer the same. You are not going to get good database performance from e cores, neither any GIS processing or anything intense at all...
you can't read and write to the same section of ram at the same time.
1. 2023 - Z4 Bergamo and Siena for server (telcom and other sectors) - small cores (codename Dionysus); 8 core CCX/16 core CCD (codename Vindhya)
2. 2023 - Z4 Phoenix2 for entry mobility devices - hybrid monolithic design
3. 2024 - Z5 Turin Dense and Sorano for server 192 and 64 cores - small cores (codename Prometheus); 16 core CCX/CCD
4. 2024 - Z5 Strix Point for top mobility devices 12 cores - hybrid design
5. 2025 - Z5 Turin with AI chiplets - unknown number of cores; AI chiplets expected ~1500 TOPS It's Crestmont-derived e-cores. We can't expect miracles even with higher frequency, but without SMT. Also, it will come one year later and AMD is preparing Turin Dense already on 3nm to compete with Sierra Forrest. It's more about news effect than anything more technical, in absence of other preliminary metrics. I would not worry. At the end of the day, the news inspires us to exchange thoughts, with or without cursed Geekbench. Nobody is forced to buy those products. There are plenty of alternatives. Relax. There are no other benchmarks available. It's just a talking point. Treat it like a gossip from royal household.
When I worked for invision forum services we would be hitting max cpu loads a lot on some of our biggest clients we had to tell them to get better cpu on the servers all the time. Some things were ran on single threads,
Many applications still rely on single thread ( and frequency ).
Most of the developers out there just don't have the resources to do multi-core optimization
and just rely on the hosting software ( e.g. tomcat/ wildfly...etc) to do basic multi-threading management.
And with lackluster coding, so the software just bricks when things aren't cathcing up in a single loop.
Results are bunch of in-house + lack of maintenance softwares out there that just runs on faster single core.
With the server market being highly specialized and thus welcoming of weird designs that go far beyond mere benchmarks to achieve real world results, ARM dominating the mobile market and finally making inroads into traditional mobile computing, one can't help but wonder what will happen with the x86 architecture and what does the future have in store for it. I'm sure it's not really going anywhere, but I doubt that it'll be able to remain as Intel's darling and aggressively protected patent for too long, at least not without architectural innovation pushing forward regardless of it
Growing popularity of ARM servers has nothing to do with usefulness of SMT on Bergamo SKUs which sell now like hot cakes.
www.phoronix.com/review/amd-epyc-9754-smt/8