Monday, February 10th 2020
AMD Radeon Instinct MI100 "Arcturus" Hits the Radar, We Have its BIOS
AMD's upcoming large post-Navi graphics chip, codenamed "Arcturus," will debut as "Radeon Instinct MI100", which is an AI-ML accelerator under the Radeon Instinct brand, which AMD calls "Server Accelerators." TechPowerUp accessed its BIOS, which is now up on our VGA BIOS database. The card goes with the device ID "0x1002 0x738C," which confirms "AMD" and "Arcturus,". The BIOS also confirms that memory size is at a massive 32 GB HBM2, clocked at 1000 MHz real (possibly 1 TB/s bandwidth, if memory bus width is 4096-bit).
Both Samsung (KHA884901X) and Hynix memory (H5VR64ESA8H) is supported, which is an important capability for AMD's supply chain. From the ID string "MI100 D34303 A1 XL 200W 32GB 1000m" we can derive that the TDP limit is set to a surprisingly low 200 W, especially considering this is a 128 CU / 8,192 shader count design. Vega 64 and Radeon Instinct MI60 for comparison have around 300 W power budget with 4,096 shaders, 5700 XT has 225 W with 2560 shaders, so either AMD achieved some monumental efficiency improvements with Arcturus or the whole design is intentionally running constrained, so that AMD doesn't reveal their hand to these partners, doing early testing of the card.
-- images removed --
Looking through the BIOS I also found what looks like several clock tables that top out at 1334 MHz, 1091 MHz, 1000 MHz. AMD's engineers typically list clocks in the following order: GPU clock, SOC clock, memory clock. This suggests that the GPU will tick at up to 1334 MHz, way lower than what Navi and Vega were able to achieve — maybe they do that to operate the chip in a more power-efficient way. The memory clock at 1000 MHz, matches the BIOS id string's "1000m", and falls in range with the 2.0 - 2.4 Gbps that Samsung is speccing their HBM2 memory chips at.
Arcturus' debut as a Radeon Instinct product follows the pattern of AMD debuting new big GPUs as low-volume/high-margin AI-ML accelerators first, followed by Radeon Pro and finally Radeon client graphics products. Arcturus is not "big Navi," rather it seems to be much closer to Vega than to Navi, which makes perfect sense given its target market. AMD's Linux sources mention "It's because Arcturus has not 3D engine", which could hint at what AMD did with this chip: take Vega and remove all 3D raster graphics ability, which shaves a few billion transistors off the silicon, freeing up space for more CUs. For gamers, AMD is planning a new line of Navi 20-series chips leveraging 7 nm EUV for launch throughout 2020. Various higher-ups at AMD, including its CEO, publicly hinted that a big client-segment GPU is in the works, and that the company is very much interested at taking another swing at premium 4K UHD gaming.
Sources:
Arcturus Linux Patches, Arcturus Linux Patches
Both Samsung (KHA884901X) and Hynix memory (H5VR64ESA8H) is supported, which is an important capability for AMD's supply chain. From the ID string "MI100 D34303 A1 XL 200W 32GB 1000m" we can derive that the TDP limit is set to a surprisingly low 200 W, especially considering this is a 128 CU / 8,192 shader count design. Vega 64 and Radeon Instinct MI60 for comparison have around 300 W power budget with 4,096 shaders, 5700 XT has 225 W with 2560 shaders, so either AMD achieved some monumental efficiency improvements with Arcturus or the whole design is intentionally running constrained, so that AMD doesn't reveal their hand to these partners, doing early testing of the card.
-- images removed --
Looking through the BIOS I also found what looks like several clock tables that top out at 1334 MHz, 1091 MHz, 1000 MHz. AMD's engineers typically list clocks in the following order: GPU clock, SOC clock, memory clock. This suggests that the GPU will tick at up to 1334 MHz, way lower than what Navi and Vega were able to achieve — maybe they do that to operate the chip in a more power-efficient way. The memory clock at 1000 MHz, matches the BIOS id string's "1000m", and falls in range with the 2.0 - 2.4 Gbps that Samsung is speccing their HBM2 memory chips at.
Arcturus' debut as a Radeon Instinct product follows the pattern of AMD debuting new big GPUs as low-volume/high-margin AI-ML accelerators first, followed by Radeon Pro and finally Radeon client graphics products. Arcturus is not "big Navi," rather it seems to be much closer to Vega than to Navi, which makes perfect sense given its target market. AMD's Linux sources mention "It's because Arcturus has not 3D engine", which could hint at what AMD did with this chip: take Vega and remove all 3D raster graphics ability, which shaves a few billion transistors off the silicon, freeing up space for more CUs. For gamers, AMD is planning a new line of Navi 20-series chips leveraging 7 nm EUV for launch throughout 2020. Various higher-ups at AMD, including its CEO, publicly hinted that a big client-segment GPU is in the works, and that the company is very much interested at taking another swing at premium 4K UHD gaming.
76 Comments on AMD Radeon Instinct MI100 "Arcturus" Hits the Radar, We Have its BIOS
I'm kind of disappointed Arcturus is aiming for ultra-high end. I hope "big Navi" is still coming.
Memory specs are exact match for what was used in MI60, so not much newsworthy there. This. 100MHz lower base clock (1091 vs 1200) and considerably lower boost clock (1400 vs 1800) does help a lot with power efficiency. Assuming MI100 is 80CU, AMD still has managed a huge efficiency boost though.
For a full list of speculated specs
29 Billion transistors | 700 mm² | 8192 Shaders | 512 TMUs | 128 ROPs | 200 W TDP
I wonder which one will be better for gaming: MI100 or Navi 21 www.techpowerup.com/gpu-specs/amd-navi-21.g923 ? ?
AMD is finally very brave to design such a monster of a chip! :eek:
I'd like to see Navi 21-based consumer Radeon as soon as possible, too!
As in, it's not for gamers, datacenter only :P
Imagine this thing on 14nm, you'd have 3 working dies per wafer :laugh:
Also... if AMD is really going to shoot for another Vega repeat with HBM and once more get eclipsed by a simple x80 Ampere... they can close up shop.
So, forget it. Nice proof of concept, not happening for us. Still its nice to see them do a large die like this.
But Nvidia adds Tensor cores and they make all the difference - Nvidia ends up 3-4 times faster in some tasks.
In real life, Nvidia dominated GPU computing before they added Tensors - all thanks to a better ecosystem and support. And nothing changed here.
Even if Mi100 temporarily pulls ahead in performance, it won't be enough.
"...new maximum die size of 429 mm². Say goodbye to the massive dies we got used to from Intel and Nvidia. ...",
Vega 20 Transistors 13,230 million Density 40.0M / mm² 7 nm Die Size 331 mm²
MI100 Transistors29,000 million Density 41.4M / mm² 7 nm+ Die Size 700 mm²
Same Density as first gen 7nm. The real EUV should be 18% denser or 505 mm² NAVi21 gets shrinked to exactly 429mm²