Wednesday, June 14th 2023

AMD Zen 4c Not an E-core, 35% Smaller than Zen 4, but with Identical IPC

AMD on Tuesday (June 13) launched the EPYC 9004 "Bergamo" 128-core/256-thread high density compute server processor, and with it, debuted the new "Zen 4c" CPU microarchitecture. A lot had been made out about Zen 4c in the run up to yesterday's launch, such as rumors that it is a Zen 4 "lite" core that has lesser number-crunching muscle, and hence lower IPC, and that Zen 4c is AMD's answer to Intel's E-core architectures, such as "Gracemont" and "Crestmont." It turns out that it's neither a lite version of Zen 4, nor is it an E-core, but a physically compacted version of the Zen 4 core, with identical number crunching machinery.

First things first—Zen 4c has the same exact IPC as Zen 4 (that's performance at a given clock-speed). This is because its front-end, execution stage, load/store component, and internal cache hierarchy is exactly the same. It has the same 88-deep load queue, 64-deep store queue, the same 675,000 µop cache, the exact same INT+FP issue width of 10+6, the same exact INT register file, the same scheduler, and cache latencies. The L1I and L1D caches are the same 32 KB in size as "Zen 4," and so is the dedicated L2 cache, at 1 MB.
The only thing that's changed is that the effective L3 cache per core has been reduced to 2 MB, from 4 MB on the 8-core "Zen 4" CCD. While the regular 8-core "Zen 4" CCD has eight "Zen 4" cores sharing a 32 MB L3 cache, the new 16-core "Zen 4c" CCD AMD introduced with "Bergamo" sees the chiplet pack two 8-core CCX (CPU core complexes), each with 16 MB of L3 cache shared among the 8 cores of the CCX. In this respect, the last-level cache and CPU core organization of the "Zen 4c" CCD has some similarities to the "Zen 2" CCD (which used two 4-core CCXs).

What's interesting is that the 16-core "Zen 4c" CCD isn't AMD's first product from this generation with lower last-level cache per core. The "Phoenix" APU silicon used in Ryzen 7040 series mobile processors sees eight "Zen 4" cores share a 16 MB L3 cache. For math-heavy compute workloads with lesser memory footprint, "Zen 4c" offers identical performance to "Zen 4," however, the smaller L3 cache should impact performance in bandwidth-sensitive workloads with large data-sets.
The Zen 4c CCD is built on the same exact TSMC 5 nm EUV foundry node that the company makes its regular 8-core Zen 4 CCD on, however, the Zen 4c CPU core is 35% smaller than the Zen 4 core, with a die area (per-core) of just 2.48 mm², compared to 3.84 mm². The die-size savings probably come from AMD "compacting" the various core components without reducing their form or function in any way. As we said earlier, the counts of the various core components remains the same, as do the sizes of the µ-op, L1, and L2 caches. EPYC 9004 "Bergamo" achieves its core-count of 128 using eight of these 16-core Zen 4c CCDs. In comparison, the regular "Genoa" processor achieves 96 cores over twelve 8-core Zen 4 CCDs.
Add your own comment

153 Comments on AMD Zen 4c Not an E-core, 35% Smaller than Zen 4, but with Identical IPC

#76
Tomorrow
Minus InfinityAs someone that does more productivity than gaming, it is IMO a huge mistake for AMD to be sticking to it's lower core counts than Intel on desktop especially now they have a great compact core in the 4c. I would have much preferred a 7850X with 8 Zen 4 and 4 Zen 4c cores any day over a v-cache 7800X3D and it would take the fight to the 13700K. 4 Zen 4c cores would be easily competitive with 8 Gracemont cores. And they would cause less scheduler issues as they'll look like regular cores. That Zen 5 is also sticking to same pattern as Zen 4 is even worse. IMO the performance gap between Zen 5 and Arrow Lake will be even larger for MT.
Why does it have to be either/or?

7850X could exist for those who need MT and 7800X3D exists for those who want the best gaming performance.
I would hate 7800X3D not existing.
Posted on Reply
#77
Scrizz
DavenMaybe 4c stands for Zen 4 ‘compact’. Siena might introduce Zen 4e as an E-core like architecture. AFAIK, Intel’s E-core removes basic functional blocks to achieve a smaller footprint. Zen 4c does not remove any basic or even specialized (bfloat, VINNI, AVX512) functional blocks.
I'm wondering when we'll get a 4c tactical :laugh:
Posted on Reply
#78
InVasMani
TomorrowWhy does it have to be either/or?

7850X could exist for those who need MT and 7800X3D exists for those who want the best gaming performance.
I would hate 7800X3D not existing.
I agree there is reason to celebrate both points of convergence and divergence from the directions AMD/INTEL take with their CPU design architectures it just means a lot more options for everyone.
Posted on Reply
#79
Squared
DenverZen4c would be a much better efficiency core if AMD decides to beat intel at its own game.
SemiAnalysis also said that Zen4 has hard lines between various parts of the core, separating areas different teams worked on. Zen4c eliminates many of these lines so that the different parts can be pushed closer together. But I suspect this also means that Zen4c started development late in the Zen4 development process, and is the primary reason why the first Zen4 chiplet came out last year but the first Zen4c chiplet isn't quite out yet.

On the other hand, Intel's Redwood Cove P cores and Crestmont E cores will be released on the same day when Meteor Lake comes to market. If AMD wants to build a hybrid processor like this, it's probably either going to be six months later than a non-hyrbrid, or AMD will need a different kind of small core that can be developed concurrently with Zen.
Posted on Reply
#80
Minus Infinity
TomorrowWhy does it have to be either/or?

7850X could exist for those who need MT and 7800X3D exists for those who want the best gaming performance.
I would hate 7800X3D not existing.
I didn't mean to imply either/or. I just said "I" would have preferred a productivity optimised version. Hence the different name. I have zeroo interest in the v-cache models other than they are more efficient. Now if they could give us a 3rd gen with basically no MT penalty due to lower clocks and v-cache across both ccd's that would be different.
Posted on Reply
#81
doc7000
Minus InfinityAs someone that does more productivity than gaming, it is IMO a huge mistake for AMD to be sticking to it's lower core counts than Intel on desktop especially now they have a great compact core in the 4c. I would have much preferred a 7850X with 8 Zen 4 and 4 Zen 4c cores any day over a v-cache 7800X3D and it would take the fight to the 13700K. 4 Zen 4c cores would be easily competitive with 8 Gracemont cores. And they would cause less scheduler issues as they'll look like regular cores. That Zen 5 is also sticking to same pattern as Zen 4 is even worse. IMO the performance gap between Zen 5 and Arrow Lake will be even larger for MT.
If they were to make a CPU using both Zen 4 and Zen 4c then it would have an 8 core zen 4 chiplet and a 16 core Zen4c chiplet on it for 24 cores and 48 threads, we don't know how this will compare against the current 16 core 7950X as we don't know the real world performance loss for non data center workloads of the Zen 4c chiplets. If the smaller Zen 4c delivers half the performance of a Zen 4 core on productivity workloads then it would be pointless.

IMHO Ryzen 9 should have all been 16 core cpus, with Ryzen 7 being 12 core cpus, Ryzen 5 being 8 core cpus, and Ryzen 3 being 6 core cpus. Sell the cheapest 12 core for around $450 with the cheapest 8 core at $300 and the cheapest 6 core at $200 and lets say $600 for the cheapest 16 core cpu. This would have done a much better job keeping core counts and overall performance much closer when comparing Intel and AMD offerings this time around. Though considering that right now you can get a 7600 for $223, a 7700 for $326, a 7900 for $420, and a 7950X for $576 dollars in the US the pricing is almost there anyways.
Posted on Reply
#82
R0H1T
doc7000Ryzen 5 being 8 core cpus, and Ryzen 3 being 6 core cpus
There will always be a demand for cheap quad or dual cores, even if just for using it to watch the latest Netflix/Disney trash. That demand isn't going away soon & as long as we have rejects/defective chips being sold we'll continue to see them being listed for 50~100 USD. So that seems unreasonable as far as I'm concerned, 2-3 years down the line when Intel/AMD just eliminate dual cores from the lineup I guess that could make sense but not right now.
Posted on Reply
#83
zlobby
DavenMaybe 4c stands for Zen 4 ‘compact’.
My bet is on 'carrera'.
Posted on Reply
#84
dyonoctis
R0H1TThere will always be a demand for cheap quad or dual cores, even if just for using it to watch the latest Netflix/Disney trash. That demand isn't going away soon & as long as we have rejects/defective chips being sold we'll continue to see them being listed for 50~100 USD. So that seems unreasonable as far as I'm concerned, 2-3 years down the line when Intel/AMD just eliminate dual cores from the lineup I guess that could make sense but not right now.
AMD already eliminated dual cores from their line-up, and there haven't been a quad core Ryzen since zen 2, the cheapest zen 3 is an hexa core at 94€. The remaining Ryzen 3 can be found for the same price as an hexacore of the same generation for some reason.
Posted on Reply
#85
AnotherReader
sLowEndLittle 'c' = little cache :p

I am pretty surprised at how much space they managed to save by reducing cache though.
Zen 4c actually has more cache than Zen 4 across the entire chiplet. Remember there are 16 Zen4c cores in one chiplet. Despite the reduced L3 per core, this means each chiplet has 32 MB of L3 cache and 16 MB of L2 cache as opposed to regular Zen 4 with 32 MB of L3 and 8 MB of L2.
Posted on Reply
#86
R0H1T
dyonoctisAMD already eliminated dual cores from their line-up, and there haven't been a quad core Ryzen since zen 2, the cheapest zen 3 is an hexa core at 94€. The remaining Ryzen 3 can be found for the same price as an hexacore of the same generation for some reason.
I was talking about all of their existing lineups, including notebooks, so that's not exactly true ~
www.techpowerup.com/cpu-specs/?mfgr=AMD&released=2022&nCores=2&sort=nameIt will eventually happen as they move almost completely from 7/6nm parts but probably still a couple of years away.
Posted on Reply
#87
btk2k2
Tek-CheckGuys, the article is about Zen4 c core. Why so much spam?


What is this about?
AMD will probably use the Zen4c CCX layout in PHX2 going forward so they can make the APU smaller and cheaper to produce. It should also improve the low power performance since the v/f curve will be different.
SquaredSemiAnalysis also said that Zen4 has hard lines between various parts of the core, separating areas different teams worked on. Zen4c eliminates many of these lines so that the different parts can be pushed closer together. But I suspect this also means that Zen4c started development late in the Zen4 development process, and is the primary reason why the first Zen4 chiplet came out last year but the first Zen4c chiplet isn't quite out yet.

On the other hand, Intel's Redwood Cove P cores and Crestmont E cores will be released on the same day when Meteor Lake comes to market. If AMD wants to build a hybrid processor like this, it's probably either going to be six months later than a non-hyrbrid, or AMD will need a different kind of small core that can be developed concurrently with Zen.
They don't need to run concurrently. AMD can release a full big core part like the 7950X and once the 'c' variant is online can release a new SKU with 1 8c standard CCD and 1 16c dense CCD.

Just like the lag between the 5800X and the 5800X3D was longer than the lag between 7700X and 7800X3D I expect for Zen 5 it will initially be a 16c 2 CCD design and a few months later once the Zen 5c design is finished they can launch a 24c SKU.

In theory they could release a 24c Zen 4 SKU as well, just depends on if there is a business case for such a thing. Personally I think there might be if it gets people to jump onto AM5 before Arrow Lake launches which should improve uptake of Zen 5 onwards.
Posted on Reply
#88
Od1sseas
Tek-CheckNobody cares about this. This article is not about your CPU. Learn something about Zen4 c cores.
I'm replying to someone that called me an Intel fanboy. If you didn't actually care you wouldn't have replied to my comment as well.
Posted on Reply
#89
evolucion8
Od1sseasI'm replying to someone that called me an Intel fanboy. If you didn't actually care you wouldn't have replied to my comment as well.
Cheering up for a brand won't make it magically better or faster overall.
Posted on Reply
#90
Tek-Check
Od1sseasI'm replying to someone that called me an Intel fanboy. If you didn't actually care you wouldn't have replied to my comment as well.
Cut irrelevant discussions and focus on what matters. Don't get rpovoked and let others know what to focus on. Simple.
Posted on Reply
#91
R-T-B
tabascosauz@R-T-B just going off of observable Ryzen behaviour in the past 3 generations and its internal temp monitoring, L3 doesn't usually get hot or draw a lot of power. Whether in 16MB, 32MB or Vcache form.
Depends on if your working it. Run small ffts on prime 95 and watch the temps rise.

EDIT: wait, that might just be l1/l2 though. Now I am questioning my logic.
Posted on Reply
#92
tabascosauz
R-T-BDepends on if your working it. Run small ffts on prime 95 and watch the temps rise.

EDIT: wait, that might just be l1/l2 though. Now I am questioning my logic.
L3 (at least the base layer not Vcache, since the temp behaviour does not change for Vermeer-X) shows up as its own temp sensor. It basically never gets hot compared to the rest of the core; whether that's because L3 doesn't get hot or the sensor is not what it seems, who knows.

If L3 had a significant impact on per-core power, I also think we would be able to tell a difference by now between half and full L3 and between full and X3D. Core power is roughly similar at the end of the day.

Posted on Reply
#93
Altirix
Od1sseasIntel can pack 4 E-Cores in the same size as 1 P-Core. What about AMD? How many Zen4c cores for one Zen 4 core?
Raptor Cove Core with 2MB L2$ is 7.04mm^2

Gracemont is 1.70mm^2 without L2$, a block of 4 with 4MB L2$ cache is 8.78mm^2 (note the image incorrectly states 2MB for L2$)

Zen 4 with 1MB L2$ is 3.84mm^2

Zen 4C with 1MB L2$ is 2.48mm^2, we can estimate a block of 4 with 4MB L2$ to be about 9.9mm^2 unless ive butchered the math

so it really doesn't matter how many zen4c cores can pack into the area of a zen4 core because intel is getting thrashed in area efficiency.

kinda find it funny this was likely commented in bad faith as you had the numbers to work out they could fit like 1.5 Z4C cores in a Z4 core. which must be an own because Intel could fit 4 cripple cores in the same area as their performance core. turns out Intels P core has awful area efficiency and honestly its the same with their E core, when you consider Z4C, is the same RTL design as Z4 and is about the same size as a Gracemont cluster.
Posted on Reply
#94
doc7000
R0H1TThere will always be a demand for cheap quad or dual cores, even if just for using it to watch the latest Netflix/Disney trash. That demand isn't going away soon & as long as we have rejects/defective chips being sold we'll continue to see them being listed for 50~100 USD. So that seems unreasonable as far as I'm concerned, 2-3 years down the line when Intel/AMD just eliminate dual cores from the lineup I guess that could make sense but not right now.
AMD hasn't made a Ryzen 4 core since the 3100 and 3300X CPUs, there may still be demand for 4 core cpus and AMD does sell CPUs under different branding then Ryzen. Same thing with Intel not all of their cpus are sold with core i branding so they can launch lower core count cpus with different branding while keeping the Ryzen where it currently is no less then 6 cores. Compared to CPUs that went on sale back in 2020 and you probably can't even buy brand new.
Posted on Reply
#95
Squared
btk2k2They don't need to run concurrently. AMD can release a full big core part like the 7950X and once the 'c' variant is online can release a new SKU with 1 8c standard CCD and 1 16c dense CCD.

Just like the lag between the 5800X and the 5800X3D was longer than the lag between 7700X and 7800X3D I expect for Zen 5 it will initially be a 16c 2 CCD design and a few months later once the Zen 5c design is finished they can launch a 24c SKU.

In theory they could release a 24c Zen 4 SKU as well, just depends on if there is a business case for such a thing. Personally I think there might be if it gets people to jump onto AM5 before Arrow Lake launches which should improve uptake of Zen 5 onwards.
Sure Zen4c doesn't need to be developed concurrently; but I think as long as AMD's small core is a compacted derivative of the main core, it'll always be a bit later to market and it'll always be limited to fewer precessor models that AMD could afford to wait to release. This will limit how it can compete against Intel's E cores.
Posted on Reply
#96
InVasMani
Can't they combine this on one CCX and also use another CCX with X3D and get a nice mix of low latency single thread driven performance with higher multi-thread driven performance at a bit slower latency!? Seems reasonable enough to me.
Posted on Reply
#97
Mawkzin
AMD hasn't made a Ryzen 4 core since the 3100 and 3300X CPUs, there may still be demand for 4 core cpus and AMD does sell CPUs under different branding then Ryzen. Same thing with Intel not all of their cpus are sold with core i branding so they can launch lower core count cpus with different branding while keeping the Ryzen where it currently is no less then 6 cores. Compared to CPUs that went on sale back in 2020 and you probably can't even buy brand new.
AMD is selling ryzen 3 5300G 4c/8t, they stop selling the X version because they could make more profit of these 4cores in the datacenter market.
Posted on Reply
#98
londiste
AltirixRaptor Cove Core with 2MB L2$ is 7.04mm^2
Gracemont is 1.70mm^2 without L2$, a block of 4 with 4MB L2$ cache is 8.78mm^2 (note the image incorrectly states 2MB for L2$)

Zen 4 with 1MB L2$ is 3.84mm^2
Zen 4C with 1MB L2$ is 2.48mm^2, we can estimate a block of 4 with 4MB L2$ to be about 9.9mm^2 unless ive butchered the math

so it really doesn't matter how many zen4c cores can pack into the area of a zen4 core because intel is getting thrashed in area efficiency.
Intel 7 vs TSMC N5 though. So not exactly a direct comparison.
Posted on Reply
#99
doc7000
MawkzinAMD is selling ryzen 3 5300G 4c/8t, they stop selling the X version because they could make more profit of these 4cores in the datacenter market.
They sell that as a cheap APU, they are not selling stand alone 4 core CPUs and I question how many 5300G chips they are even selling. If there was more demand for 4 core CPUs then AMD as well as Intel would offer more of them.
InVasManiCan't they combine this on one CCX and also use another CCX with X3D and get a nice mix of low latency single thread driven performance with higher multi-thread driven performance at a bit slower latency!? Seems reasonable enough to me.
A much better route for AMD would be to just move to a third chiplet for their CPUs, this would give them up to 24 cores and 48 threads.
Posted on Reply
#100
Aquinus
Resident Wat-man
I'm pretty sure that I know what "c" stands for.
Posted on Reply
Add your own comment
Nov 21st, 2024 13:09 EST change timezone

New Forum Posts

Popular Reviews

Controversial News Posts