AMD Radeon Instinct MI100 "Arcturus" Hits the Radar, We Have its BIOS

700mm2 on 7nm EUV... damn.
Imagine this thing on 14nm, you'd have 3 working dies per wafer :laugh:

ASML
"...new maximum die size of 429 mm². Say goodbye to the massive dies we got used to from Intel and Nvidia. ...",

Vega 20: Transistors 13,230 million, Density 40.0M / mm², 7 nm, Die Size 331 mm²

MI100: Transistors 29,000 million, Density 41.4M / mm², 7 nm+, Die Size 700 mm²

Same density as first-gen 7nm. Real EUV should be 18% denser, so a 505 mm² Navi 21 would get shrunk to almost exactly 429 mm².
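
For anyone who wants to check those figures, here is a rough sketch in Python. The transistor counts and die sizes are the speculated database values quoted above, not confirmed specs, and the 18% density gain is taken at face value. The helper name `density_m_per_mm2` is made up for illustration.

```python
def density_m_per_mm2(transistors_million: float, die_mm2: float) -> float:
    """Transistor density in millions of transistors per mm²."""
    return transistors_million / die_mm2


# Speculated values quoted in the post above.
vega20_density = density_m_per_mm2(13_230, 331)
mi100_density = density_m_per_mm2(29_000, 700)

# Assumption from the post: "real" EUV packs 18% more transistors per mm².
EUV_DENSITY_GAIN = 1.18
navi21_shrunk_mm2 = 505 / EUV_DENSITY_GAIN

print(f"Vega 20: {vega20_density:.1f} M/mm²")
print(f"MI100:   {mi100_density:.1f} M/mm²")
print(f"505 mm² at 18% higher density: {navi21_shrunk_mm2:.0f} mm²")
```

The shrink comes out at roughly 428 mm², so just under the quoted 429 mm² limit rather than exactly on it.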
 
In real life, Nvidia dominated GPU computing before they added Tensors - all thanks to a better ecosystem and support. And nothing changed here.
Even if Mi100 temporarily pulls ahead in performance, it won't be enough.

Well, I think it's a matter of whether their software stack gets some interest from developers. Having key supercomputers (like Frontier) using their hardware will help in that department.

Also, AMD already has a MI200 in the works. MI100 looks like it's gonna be pretty powerful, but MI200 will be even better, and is probably what they're gonna deploy in Frontier.
 
only amd can make a gpu so bad at being a gpu that it has no graphics processing capability
 
I looked at the NVIDIA Quadro RTX 8000 specs and it says over 200 TOPS at INT8. Is there really such a huge difference between Radeon and RTX, or does Nvidia count differently?

They use different ratios of execution units. Everything is a trade-off: MI60 has a lot of FP64 units, Turing doesn't, and Volta does but doesn't have any RT cores. The thing is, though, INT8/INT4/FP16 aren't that critical.

Out of all of those, FP64 units have become indispensable. They used to be very expensive power- and size-wise, which is why GPUs of the past skimped on them, but any real compute accelerator nowadays needs strong FP64 performance. 64-bit floating point is usually the de facto precision for simulations and that sort of stuff; you can do without tensor cores or INT8/INT4/FP16, but not without FP64 in a data center environment. That's why there have been no large Turing-based Teslas: no one would have wanted them due to their poor FP64 performance.
 
Hehe, they seem to follow the motto: "Bad product is still better than no product"
They've signed a few large contracts and they have to deliver a GPGPU accelerator. It doesn't have to be the best available. It only has to match the agreed specification.
 
That's not AMD's spec page. It's Techpowerup's speculated spec list.

When the user with the Arcturus card ran GPU-Z and submitted the BIOS to the database? Well, I don't think so.
I think there is a way for the software to read the specifications and put them into the database.

I smell $1000 gaming GPUs from AMD this year...

No way, I smell much lower prices, after all, AMD has to regain some mindshare and lost positions.

Probably a fake limit of only 429 mm². That would mean bye-bye enthusiast video cards.
 
So this will be the first first-gen 7nm chip to reach 500+ mm²? Or are the 8192 cores made possible by dropping the raster engines?

The BIOS doesn't show any die information. All of this is speculation by me and will be updated when new info comes out.
 
Theoretically, and based on V7 Pro performance, a version of this card for pros would be insane. To call it a monster would be an understatement. On top of that, since it's Vega, they can add the ability to use M.2 as extra RAM, like with the last Vega-based card. I'm certain this card is coming. They just need to focus and market it for that segment and not for gaming like before. Call it the FirePro X. This specific version, though, is obviously just for compute, and even so it is beyond anything on the market by far.
 
Let me set this straight. It's like taking the passenger/driver cabin and trunk space out of a Tesla and filling it with batteries and a more powerful motor (delete the raster capability, basically the little doo-dah that shows stuff on your monitor, and fill the space with CUs). Even the steering system goes, because this Tesla will just go straight in a highly specialized environment (only for AI/ML acceleration, unlike GPUs, which are usually jacks of all trades).
Except the extra battery and motor wouldn't be used for a faster 0-60 and top speed but for hauling more stuff, so it's a Tesla Semi or pickup (not for gaming but workstation only).

Some things you get after reading this thread:
It's not a gaming card at all. It's a workstation card through and through. But people would like a gaming card with this spec.
People don't understand that with raster stuff added back in this thing will become MASSIVE.
AMD also has brain-dead haters for their products who don't even understand what they're hating.

My personal take is that this is a good development. I've long maintained that AMD should have different architectures for different markets instead of "jack of all trades, master of none". Now that the CPU business is making them money, I hope they spend it cleverly on R&D for RTG.
 
This is why I said a "version" of this card for pros, which would obviously mean raster engines, among other things, and inevitably a higher TDP.
 
Hehe, they seem to follow the motto: "Bad product is still better than no product"

Your poor attempt at trolling and fanboyism is honestly the coolest thing ever, ohhh baby you make me so excited for sucking Nvidia off with you......

A 700 mm² die, at 1 GHz plus, and an interposer to hold it and HBM? I imagine yield loss is huge, but maybe they have perfected it so it's actually profitable. Now when do we get to see actual performance numbers?
 
Or they have no choice - they need to make such a chip (PoC).
 
It's a shock for many that it has 128 CUs, as they couldn't believe that big Navi would have even 80 CUs. But AMD is on track now, with only the marketing being inferior; their products are top level. Even their latest GPU drivers for the Navi arch are improved (the biggest customer problem for the last 5-6 months). Impressive feat nevertheless, especially for 200W. Big Navi can now easily become the fastest GPU by some distance while using under 300W. It seems that 7nm+ helps a lot with efficiency.
 
No doubt cut down dies will be sold, so I'm sure there's plenty of profit even if it's 50% fully functioning yield.
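
To put a rough number on the yield talk, here is a back-of-the-envelope sketch in Python. It uses a common textbook approximation for gross dies per wafer and the simple Poisson yield model. The 700 mm² die size is the speculated figure from this thread, and the defect density `ASSUMED_D0` is a purely hypothetical value picked for illustration, not a real foundry number.

```python
import math


def gross_dies_per_wafer(die_mm2: float, wafer_diameter_mm: float = 300.0) -> int:
    """Textbook estimate: wafer area / die area, minus an edge-loss term."""
    wafer_area = math.pi * (wafer_diameter_mm / 2) ** 2
    edge_loss = math.pi * wafer_diameter_mm / math.sqrt(2 * die_mm2)
    return int(wafer_area / die_mm2 - edge_loss)


def poisson_yield(die_mm2: float, defects_per_cm2: float) -> float:
    """Fraction of dies with zero defects under the Poisson model."""
    return math.exp(-(die_mm2 / 100.0) * defects_per_cm2)


DIE_MM2 = 700.0    # speculated die size from the thread
ASSUMED_D0 = 0.1   # hypothetical defects per cm², illustration only

gross = gross_dies_per_wafer(DIE_MM2)
good_fraction = poisson_yield(DIE_MM2, ASSUMED_D0)

print(f"Gross dies per 300 mm wafer: {gross}")
print(f"Fully working fraction:      {good_fraction:.0%}")
print(f"Fully working dies:          {gross * good_fraction:.0f}")
```

With that assumed defect density the fully working fraction happens to land near 50%. The remaining dies are the candidates for cut-down parts, since a defect in one CU can often be fused off.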
 
Maybe the efficiency is related to what they did with the Vega cores in Ryzen 4000.
 
The estimate changed; it's not a 700 mm² die any longer.

Radeon VII is already 331 mm²; it is physically impossible to fit 2X the shaders in 420 mm² with only 6 billion more transistors.

420 mm² is impossible with these specs.

Have you got any source that they can manufacture dies larger than 429 mm² on N7+, and any source that this particular chip is on N7+ and not N7?
 
The comments here, jesus. This GPU can't even render graphics. Do people even realize that? It's purely an AI GPU, which explains the low wattage. It's similar to the 75W Tesla T4, which is purely AI as well while having RTX 2070S specs.
Absolutely NOT.

Obviously, this card can render. It's built around a normal GPU. It just can't provide a video signal - there are no outputs and no logic dedicated for this task.
It can be used in any scenario that can utilize GPGPU (including AI, obviously).

This is NOT similar to Tesla T4.
In the green camp you have the V100, which is an all-mighty, all-round, dual-slot accelerator. MI100 (like MI60 now) will compete in this segment.
Nvidia also makes the Tesla T4, which has half of the V100's Tensor (AI) potential, but just 5% of its double-precision performance. The T4 is single-slot, 1/4th of the V100's price and uses 75W (V100 is up to 300W).

Which means that if you need V100's double-precision (all-round) performance, you buy a V100. You can't go wrong with this card.
But if you don't need it, you buy 2x Tesla T4s: you get pretty much the same performance in stuff like deep learning (e.g. image recognition) for half the money and half the power.
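
The trade-off above can be spelled out with a small Python sketch. Every number here is one of the ratios stated in this post, normalized so that a single V100 is 1.0, not measured data. The `Accelerator` class and its `times` method are made up for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Accelerator:
    name: str
    price: float    # relative to one V100
    tensor: float   # AI/Tensor throughput relative to one V100
    fp64: float     # double-precision throughput relative to one V100
    watts: float    # board power in watts

    def times(self, count: int) -> "Accelerator":
        """Naively scale every figure by the number of cards."""
        return Accelerator(
            f"{count}x {self.name}",
            self.price * count,
            self.tensor * count,
            self.fp64 * count,
            self.watts * count,
        )


v100 = Accelerator("V100", price=1.0, tensor=1.0, fp64=1.0, watts=300)
t4 = Accelerator("T4", price=0.25, tensor=0.5, fp64=0.05, watts=75)
two_t4 = t4.times(2)

for card in (v100, two_t4):
    print(f"{card.name}: price {card.price:.2f}, tensor {card.tensor:.2f}, "
          f"fp64 {card.fp64:.2f}, {card.watts:.0f} W")
```

Two T4s match one V100 on the Tensor figure at half the price and half the power, but only reach about a tenth of its FP64 throughput. Linear scaling across two cards is itself an optimistic assumption.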
 
That would be another massive failure just like the Crap-eon 7, but then again, AMD just doesn't seem to tire of failures... :D

Why was the Radeon 7 a failure in your eyes?
 
Radeon VII is already 331 mm²; it is physically impossible to fit 2X the shaders in 420 mm² with only 6 billion more transistors.

In MI60, 4096 shading units = 160 mm². Double that, with the memory controllers remaining unchanged, so a 50% bigger die gives 2X the shaders. Then shrink to 7nm+ with 18% better density. Looks like a perfect prediction by the DB maintainer.
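
Here is that estimate worked through in Python, as a sketch only. The 160 mm² shader area and the 18% density gain are this post's own assumptions, and 331 mm² is the Vega 20 / Radeon VII die size quoted earlier in the thread.

```python
VEGA20_DIE_MM2 = 331       # MI60 / Radeon VII die size quoted in the thread
SHADER_BLOCK_MM2 = 160     # area this post attributes to 4096 shading units
EUV_DENSITY_GAIN = 1.18    # assumed 18% density improvement on 7nm+

# Add a second block of 4096 shaders; memory controllers etc. stay as they are.
doubled_shaders_mm2 = VEGA20_DIE_MM2 + SHADER_BLOCK_MM2
growth = doubled_shaders_mm2 / VEGA20_DIE_MM2 - 1

# Apply the assumed density gain to the whole die.
shrunk_mm2 = doubled_shaders_mm2 / EUV_DENSITY_GAIN

print(f"Die with 8192 shaders on first-gen 7nm: {doubled_shaders_mm2} mm² (+{growth:.0%})")
print(f"After the assumed 18% density gain:     {shrunk_mm2:.0f} mm²")
```

That gives 491 mm² before the shrink (about 48% bigger) and roughly 416 mm² after, which is close to the 420 mm² estimate being discussed.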
 