Sunday, March 20th 2022

Apple Mac Studio Taken Apart, Reveals Giant M1 Ultra SoC

Max Tech performed the first detailed teardown of the Apple Mac Studio, the most powerful Mac since Apple dumped Intel for processors in favor of its own silicon based around high-performance Arm chips built from the ground-up for its own software ecosystem. The M1 Ultra SoC powering the Mac Studio is its most striking piece of technology, with Apple attaching some very tall performance claims not just for its CPU compute performance, but also graphics rendering performance.

The M1 Ultra SoC is physically huge, with roughly similar package size to an AMD EPYC processor in the SP3 package. An integrated heatspreader (IHS) covers almost the entire topside of the package. Things get interesting under the hood. The M1 Ultra is a multi-chip module of two M1 Max dies connected on package using Apple UltraFusion, a coherent fabric interconnect that allows the various components of the two M1 Max dies to access memory controlled by the other die. Speaking of memory, The M1 Ultra features up to 128 GB of LPDDR5 memory that's on-package, This memory is used for the CPU, GPU, as well as the neural processor, and has a combined memory bandwidth of 800 GB/s. The M1 Ultra features up to 20 CPU cores, up to 32 Neural cores, and up to 64 GPU cores (8,192 programmable shaders).
Source: Wccftech
Add your own comment

41 Comments on Apple Mac Studio Taken Apart, Reveals Giant M1 Ultra SoC

#1
tabascosauz
Isn't Apple using TSMC N5 for this? That's even more so a honking big chunk of monolithic silicon. Even if the DRAM is included. Speculation said M1 was 120mm^2 ish, and M1 ultra is what, 8 times that?

AMD: moar cache
Intel: moar cores
Apple: moar die size

What a monster, huh. A Geekbench monster and not much else, that is :D
Posted on Reply
#2
BArms
Guessing the larger die area indirectly grants increased energy efficiency.
Posted on Reply
#3
zlobby
BArmsGuessing the larger die area indirectly grants increased energy efficiency.
Say what?
tabascosauzIsn't Apple using TSMC N5 for this? That's even more so a honking big chunk of monolithic silicon. Even if the DRAM is included. Speculation said M1 was 120mm^2 ish, and M1 ultra is what, 8 times that?

AMD: moar cache
Intel: moar cores
Apple: moar die size

What a monster, huh. A Geekbench monster and not much else, that is :D
Not even HBM2... M-e-e-e-h!
Posted on Reply
#4
InVasMani
What I want to see with TSV is like a 4 sided TSV connected cube of PCB's and then a another 4 inside in a cross shape design. Basically 8 PCB's that could contain circuitry that could be interconnected via TSV. I'd keep two edge ends open to allow for a push/pull blower configuration thru them though. I think eventually we could see something akin to these memory heatsink on each side of the PCB's connected via TSV wit chips on each side for the inner 4 cross shape in design mentioned with just enough excess area around them for TSV connections between them to others like them. Basically enough for a CPU chip die and memory on opposite side with heatsinks for both that can then be TSV interconnected to adjacent ones and cooled in a sort of blower chamber cube. The whole thing could pretty much be a modular TSV connection of PCB's with chips on each side sandwiched with heatsinks attached to them.

tabascosauzIsn't Apple using TSMC N5 for this? That's even more so a honking big chunk of monolithic silicon. Even if the DRAM is included. Speculation said M1 was 120mm^2 ish, and M1 ultra is what, 8 times that?

AMD: moar cache
Intel: moar cores
Apple: moar die size

What a monster, huh. A Geekbench monster and not much else, that is :D
It's closer to more heatspreader area than die size since it's several dies closely connected via TSV. I don't have a problem with that approach it's the right thing do for the time being if anything to keep yields manageable. I think really until we've gotten to a point where it's no longer practical and feasible to enlarge the socket size dimensions on a MB it still make the most sense since 3D stacking becomes a more tricky en-devour once you increase heat into the equation more readily eventually more of that concern can certain decrease at smaller nodes though. You can really randomize those three names with that moar x satire all three are pushing for more everything. It's funny how it started as just a poking fun satire of AMD's bulldozer missteps really and has just transformed over the years into a byproduct of the complications of chip making and perpectuation of our desire for them to do more of everything despite 640KB being all you need.
Posted on Reply
#5
Garrus
tabascosauzIsn't Apple using TSMC N5 for this? That's even more so a honking big chunk of monolithic silicon. Even if the DRAM is included. Speculation said M1 was 120mm^2 ish, and M1 ultra is what, 8 times that?

AMD: moar cache
Intel: moar cores
Apple: moar die size

What a monster, huh. A Geekbench monster and not much else, that is :D
It's faster than my 12700k, and faster than my RTX 3070. It's not bad. It's an expensive luxury product, but it isn't bad.

It can run Baldur's Gate 3 at 4k/60fps for example.

Also you are just looking at the integrated heat spreader. All the ram is under there. The actual chip is a lot smaller. And even then the SOC can't be compared against a plain CPU or GPU, there's a lot of cache and other things that take space. 16GB DDR5 has more transistors than the M1 Ultra, and more than any CPU, for example. Ever added up the area of all the ICs you can see on a plain ram stick?
Posted on Reply
#6
Chrispy_
Intel, AMD, Qualcomm have created an interconnect alliance, probably because they're scared that the rich boy is just throwing money at the problem. That package size is ridiculous, likely matched only by how ridiculous Apple's profit margins are.
Posted on Reply
#7
Garrus
Chrispy_Intel, AMD, Qualcomm have created an interconnect alliance, probably because they're scared that the rich boy is just throwing money at the problem. That package size is ridiculous, likely matched only by how ridiculous Apple's profit margins are.
The "CPU package" not including the memory is 1/3 the size of the IHS. It isn't that big. You can tell by the thermal paste only covering a third of it.
Posted on Reply
#8
Unregistered
GarrusIt's faster than my 12700k, and faster than my RTX 3070. It's not bad. It's an expensive luxury product, but it isn't bad.

It can run Baldur's Gate 3 at 4k/60fps for example.

Also you are just looking at the integrated heat spreader. All the ram is under there. The actual chip is a lot smaller. And even then the SOC can't be compared against a plain CPU or GPU, there's a lot of cache and other things that take space. 16GB DDR5 has more transistors than the M1 Ultra, and more than any CPU, for example. Ever added up the area of all the ICs you can see on a plain ram stick?
A12700k is 22k stock MC on R23 so not that much faster.
Posted on Edit | Reply
#9
Lycanwolfen
I would bet money the M1 is nothing more but an AMD cpu with the name scratched off and the Apple logo printed on it. Apple has done that in the past.
Posted on Reply
#10
Fourstaff
TiggerA12700k is 22k stock MC on R23 so not that much faster.
That's the problem right? Modified phone processor catching up to a desktop-first chip, backed by the biggest mountain of cash in the corporate world. Intel is the underdog here, their market cap is 1/10th the size of Apple's.
Posted on Reply
#11
Lycanwolfen
Apple will always say it's faster than anything on the market but never tells you in what it is faster at. All the media its faster than this faster then that but in what it never tells you.
Posted on Reply
#12
Steevo
48Mb of L2 cache is a lot of cache for 20
cores, maybe we have reached the pleateau of custom silicon and need to start integrating more cache for more performance. I’m curious about the latency, maybe we will see numbers.

All current general purpose tests show it’s slower overall than 1 or 2 generation old hardware X86-64, but for specific tests it’s still equal to current iteration hardware, 3D stack from AMD may be faster
Posted on Reply
#13
Nike_486DX
wondering if hbm3 will make its way to M2.
Posted on Reply
#14
Space Lynx
Astronaut
Apple needs some Trojan Magnum's, damn.
Posted on Reply
#15
Garrus
TiggerA12700k is 22k stock MC on R23 so not that much faster.
Not sure I understand your comment. I said it is faster. You agree. But it isn't that much faster. Ok. It is 15 percent faster. ;)
Posted on Reply
#16
timta2
LycanwolfenApple will always say it's faster than anything on the market but never tells you in what it is faster at. All the media its faster than this faster then that but in what it never tells you.
That's not true at all. All you have to do is scroll down to the bottom of the page and read the details they have provided for you to read.
Posted on Reply
#17
Blue4130
InVasManiWhat I want to see with TSV is like a 4 sided TSV connected cube of PCB's and then a another 4 inside in a cross shape design. Basically 8 PCB's that could contain circuitry that could be interconnected via TSV. I'd keep two edge ends open to allow for a push/pull blower configuration thru them though. I think eventually we could see something akin to these memory heatsink on each side of the PCB's connected via TSV wit chips on each side for the inner 4 cross shape in design mentioned with just enough excess area around them for TSV connections between them to others like them. Basically enough for a CPU chip die and memory on opposite side with heatsinks for both that can then be TSV interconnected to adjacent ones and cooled in a sort of blower chamber cube. The whole thing could pretty much be a modular TSV connection of PCB's with chips on each side sandwiched with heatsinks attached to them.



It's closer to more heatspreader area than die size since it's several dies closely connected via TSV. I don't have a problem with that approach it's the right thing do for the time being if anything to keep yields manageable. I think really until we've gotten to a point where it's no longer practical and feasible to enlarge the socket size dimensions on a MB it still make the most sense since 3D stacking becomes a more tricky en-devour once you increase heat into the equation more readily eventually more of that concern can certain decrease at smaller nodes though. You can really randomize those three names with that moar x satire all three are pushing for more everything. It's funny how it started as just a poking fun satire of AMD's bulldozer missteps really and has just transformed over the years into a byproduct of the complications of chip making and perpectuation of our desire for them to do more of everything despite 640KB being all you need.
I think I've read tsv more times in this post than the rest of my life :p
Posted on Reply
#18
IceShroom
People dont confuse ISH with die size. Do you guys think the Theadripper or Xeon Platinum 's die size is the same as the ISH size??
Posted on Reply
#19
looniam
tabascosauzIsn't Apple using TSMC N5 for this? That's even more so a honking big chunk of monolithic silicon. Even if the DRAM is included. Speculation said M1 was 120mm^2 ish, and M1 ultra is what, 8 times that?
M1 120.5mm^2, Pro 245mm^2, Max 432mm^2, Ultra 840mm^2.
tabascosauzWhat a monster, huh.
Posted on Reply
#20
chrcoluk
What is the TDP? Also per core performance seems inferior as it outnumbers the intel and AMD chips on core coubt?
Posted on Reply
#21
mb194dc
It's $4K for the cheapest studio with this behemoth in it... Quite a serious piece of hardware. Of course it's apple so all proprietary etc.

Can get a TR based 3970x system with a top end graphics card for the same price...
Posted on Reply
#22
DeathtoGnomes
well, you cant increase the memory so I'm guessing that $4k price tag is for a minimal amount of it, 16g?
Posted on Reply
#23
watzupken
It looks big, but until someone delids the heatspread on top, we won‘t be able to tell how big the chips are. Under the heatspread, it is not just the SOC. The system RAM is also “unified”, which means it is part of the SOC. Which is why it the heat spread is this big I suspect.
Posted on Reply
#24
mama
Apparently you can add storage, if you rip it open.
Posted on Reply
#25
Garrus
DeathtoGnomeswell, you cant increase the memory so I'm guessing that $4k price tag is for a minimal amount of it, 16g?
No, you can just look it up on Apple.com, right? 64GB, which is pretty generous for Apple. That's where a lot of the $4000 price comes from. Apple charges $1600 for that ram usually.
chrcolukWhat is the TDP? Also per core performance seems inferior as it outnumbers the intel and AMD chips on core coubt?
60W for the CPU. 100W for the GPU. But currently it rarely gets close to that. Per core performance is high in IPC, but clock speeds are only 3Ghz. 16+4, P+E. Obviously beats any other 3 Ghz consumer CPU. Too bad. That's the only other thing I wanted from the Studio, 4 Ghz clocks, not 3.
Posted on Reply
Add your own comment
Nov 22nd, 2024 11:46 EST change timezone

New Forum Posts

Popular Reviews

Controversial News Posts