• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD Shows Instinct MI300 Exascale APU with 146 Billion Transistors

Joined
Jan 3, 2021
Messages
3,477 (2.46/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
Impressive, and I hope we soon learn more about how it's structured. Three Zen CCDs plus multiple GPU dies on top, among other chips, apparently. Four cache+MC dies beneath, but those can't be nearly as big as the nine on top. (?) But why didn't AMD choose to place the cache+MC dies under HBM stacks?

And ... This thing will also require 3D power delivery electronics all around.
 
Joined
Nov 26, 2021
Messages
1,642 (1.51/day)
Location
Mississauga, Canada
Processor Ryzen 7 5700X
Motherboard ASUS TUF Gaming X570-PRO (WiFi 6)
Cooling Noctua NH-C14S (two fans)
Memory 2x16GB DDR4 3200
Video Card(s) Reference Vega 64
Storage Intel 665p 1TB, WD Black SN850X 2TB, Crucial MX300 1TB SATA, Samsung 830 256 GB SATA
Display(s) Nixeus NX-EDG27, and Samsung S23A700
Case Fractal Design R5
Power Supply Seasonic PRIME TITANIUM 850W
Mouse Logitech
VR HMD Oculus Rift
Software Windows 11 Pro, and Ubuntu 20.04
Impressive, and I hope we soon learn more about how it's structured. Three Zen CCDs plus multiple GPU dies on top, among other chips, apparently. Four cache+MC dies beneath, but those can't be nearly as big as the nine on top. (?) But why didn't AMD choose to place the cache+MC dies under HBM stacks?

And ... This thing will also require 3D power delivery electronics all around.
Placing the cache die under the compute dies would minimize added latency as can be seen in the case of the 5800X3D. It will probably allow better cooling too as the compute dies are the power hogs. Moreover, as you said, I expect that the MCDs would have much more wiring for power than the MCDs for the consumer RDNA3 GPUs.
 
Joined
Mar 10, 2010
Messages
11,878 (2.21/day)
Location
Manchester uk
System Name RyzenGtEvo/ Asus strix scar II
Processor Amd R5 5900X/ Intel 8750H
Motherboard Crosshair hero8 impact/Asus
Cooling 360EK extreme rad+ 360$EK slim all push, cpu ek suprim Gpu full cover all EK
Memory Corsair Vengeance Rgb pro 3600cas14 16Gb in four sticks./16Gb/16GB
Video Card(s) Powercolour RX7900XT Reference/Rtx 2060
Storage Silicon power 2TB nvme/8Tb external/1Tb samsung Evo nvme 2Tb sata ssd/1Tb nvme
Display(s) Samsung UAE28"850R 4k freesync.dell shiter
Case Lianli 011 dynamic/strix scar2
Audio Device(s) Xfi creative 7.1 on board ,Yamaha dts av setup, corsair void pro headset
Power Supply corsair 1200Hxi/Asus stock
Mouse Roccat Kova/ Logitech G wireless
Keyboard Roccat Aimo 120
VR HMD Oculus rift
Software Win 10 Pro
Benchmark Scores 8726 vega 3dmark timespy/ laptop Timespy 6506
I don't work with nvidia :D
I said AMD is to blame because most of the time it doesn't deliver up to the expectations. The users want the fastest. AMD never delivers the fastest.
Actually, if RDNA 1 and before Vega 64, etc garbage, they simply ignored the top of the line market - the halo which sells the whole product lineup beneath it.

:D



Exactly. AMD must concentrate on working relationships with those "OEM"s.
Or build its own ecosystem without them - direct sales, etc.
That's AMD to blame - because nvidia and Intel concentrate on buying the "OEM", so the result which you see in the stores is 99% Intel-nvidia exclusives.
People see only what they Want to see, I see plenty of alternatives to Nvidia and Intel and my five year old and still better than the 1080 non ti Vega64, it was released against says you're wrong, but more importantly, this isn't a GPU thread FFS.

Shall we discuss the XTX overhaet issue next, how about we leave DGPU talk to threads that suit it?!.

Not Enterprise which is showing up your naivety.
 

Leiesoldat

lazy gamer & woodworker
Supporter
Joined
Jun 29, 2021
Messages
122 (0.10/day)
System Name Arda
Processor AMD Ryzen 5800X3D
Motherboard Gigabyte X570-I AORUS Pro WiFi
Cooling Custom Loop - Aquacomputer, Optimus, EK, Bykski
Memory GSkill Trident Z RGB 32 GB (2x16) DDR4-3200
Video Card(s) Gigabyte Gaming OC RX 6800XT
Storage SK Hynix P41 1TB
Display(s) VIOTEK 3440 x 1440 144 Hz Curved
Case XTIA Proto-XL
Audio Device(s) Schiit Modius + Schiit Jotunheim
Power Supply Seasonic Prime 850W Titanium
Mouse Xtrfy MZ1 Zy's Rail Wireless
Keyboard Rainkeebs Yasui - Custom 40% Ortholinear
Software Windows 11 Pro
Nexus290 said:
Vya Domus said:
Intel has a similar solution with Ponte Vecchio, which
Leiesoldat said:
Ponte Vecchio is inside the Aurora supercomputer which is still on track, albeit slowly.
Aurora was first announced in 2015 and to be finished in 2018.

Yes they said Aurora was going to be finished in 2018, but Intel kept tacking on more and more capability to match Frontier which pushed the scheduled completion date out. Aurora being delayed is not a secret to anybody in the HPC industry, but the likely completion date is going to be somewhere between now and a maybe 2024; Intel keeps those dates hidden.
 
Last edited:
Joined
Feb 12, 2021
Messages
220 (0.16/day)
Placing the cache die under the compute dies would minimize added latency as can be seen in the case of the 5800X3D. It will probably allow better cooling too as the compute dies are the power hogs. Moreover, as you said, I expect that the MCDs would have much more wiring for power than the MCDs for the consumer RDNA3 GPUs.

The added cache is on top of the die because routing through it would be a nightmare if it was underneath, that is the primary reason why the cache is on the top.

The 3D-V Cache power use, plus heat from the silicon below doesn't make a lot of difference to temperature because of how it is manufactured, plus the die selection that has been made, plus the 3D-V Cache has been designed to sit directly over the CPU Cache, it cant just be placed anywhere, and so any other products AMD wants to make where they can stack 3D-V Cache has to be designed that way from the start, and the 3D-V Cache has to be designed to fit the die below it, such as the MCD's on their new GPU's, are they the same as for the Zen CPU's.? I doubt it.

As for the latency point, I don't see how it would reduce latency by having the 3D-V Cache on the underside of the CPU die, if anything I think that apart from being a wiring nightmare, it would add latency as the processor die would be just a little further away from the substrate, and thus everything that it is communicating with, RAM, and I/O I imagine the difference of a few microns would be minimal, and if so, it would also be minimal if the cache was underneath the compute dies ¯\_(ツ)_/¯
 
Joined
Nov 26, 2021
Messages
1,642 (1.51/day)
Location
Mississauga, Canada
Processor Ryzen 7 5700X
Motherboard ASUS TUF Gaming X570-PRO (WiFi 6)
Cooling Noctua NH-C14S (two fans)
Memory 2x16GB DDR4 3200
Video Card(s) Reference Vega 64
Storage Intel 665p 1TB, WD Black SN850X 2TB, Crucial MX300 1TB SATA, Samsung 830 256 GB SATA
Display(s) Nixeus NX-EDG27, and Samsung S23A700
Case Fractal Design R5
Power Supply Seasonic PRIME TITANIUM 850W
Mouse Logitech
VR HMD Oculus Rift
Software Windows 11 Pro, and Ubuntu 20.04
As for the latency point, I don't see how it would reduce latency by having the 3D-V Cache on the underside of the CPU die, if anything I think that apart from being a wiring nightmare, it would add latency as the processor die would be just a little further away from the substrate, and thus everything that it is communicating with, RAM, and I/O I imagine the difference of a few microns would be minimal, and if so, it would also be minimal if the cache was underneath the compute dies ¯\_(ツ)_/¯
As you said, there are good reasons for the cache die being on top of the CPU die. Let's see what they are doing with this one. However, the cache die is much smaller than the CPU die, and making it large enough to devote area to power wiring might solve the power delivery issues. If the latency impact of off-chip access was insignificant, the 5800X3D wouldn't have had the cache placed on top of the main die.
 
Joined
Jan 3, 2021
Messages
3,477 (2.46/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
As you said, there are good reasons for the cache die being on top of the CPU die. Let's see what they are doing with this one. However, the cache die is much smaller than the CPU die, and making it large enough to devote area to power wiring might solve the power delivery issues. If the latency impact of off-chip access was insignificant, the 5800X3D wouldn't have had the cache placed on top of the main die.
Tom's has a few photos. Four large dies are visible on top ... and that still doesn't explain anything.
 
Joined
Sep 1, 2020
Messages
2,338 (1.52/day)
Location
Bulgaria
Strange in TPU GPU database all numbers, include those for performance for MI250X and MI300 are same.
 
Joined
May 13, 2015
Messages
632 (0.18/day)
Processor AMD Ryzen 3800X / AMD 8350
Motherboard ASRock X570 Phantom Gaming X / Gigabyte 990FXA-UD5 Revision 3.0
Cooling Stock / Corsair H100
Memory 32GB / 24GB
Video Card(s) Sapphire RX 6800 / AMD Radeon 290X (Toggling until 6950XT)
Storage C:\ 1TB SSD, D:\ RAID-1 1TB SSD, 2x4TB-RAID-1
Display(s) Samsung U32E850R
Case be quiet! Dark Base Pro 900 Black rev. 2 / Fractal Design
Audio Device(s) Creative Sound Blaster X-Fi
Power Supply EVGA Supernova 1300G2 / EVGA Supernova 850G+
Mouse Logitech M-U0007
Keyboard Logitech G110 / Logitech G110
as an AMD shareholder, i'm so glad they don't listen to random people on the internet.
Please, visit every AMD related news article, wait for the first 3-4 fools to post and just reply with this. :toast:
 
Joined
Oct 30, 2020
Messages
250 (0.17/day)
There is also 10 times more money in the graphics market ! 8% today market share with outlook for probable bankruptcy, or 80% market share as is case with nvidia !

You sound like a child with these silly statements. I see you have a pattern for dissing AMD wherever you can but atleast be clever about it. AMD gets around 10x the money for every instinct they sell compared to desktop GPU's. Also, probably bankruptcy? They had 1 billion in profits in 2022, which would've been double had they not written off a bunch of money after Xilinx acquisition. They shrank much lower than your beloved intel and nvidia. They care about margins and profits, like every large company, and that's exactly where they are focusing their R&D. The MI300 is an absolutely staggering achievement, just the sheer complexity is mind boggling.
So, please answer me. What possible bankruptcy? Be clever with your response please, get out of the 10 year old mindset and subsequent comments.
 
  • Haha
Reactions: ARF
Joined
Dec 16, 2010
Messages
1,668 (0.33/day)
Location
State College, PA, US
System Name My Surround PC
Processor AMD Ryzen 9 7950X3D
Motherboard ASUS STRIX X670E-F
Cooling Swiftech MCP35X / EK Quantum CPU / Alphacool GPU / XSPC 480mm w/ Corsair Fans
Memory 96GB (2 x 48 GB) G.Skill DDR5-6000 CL30
Video Card(s) MSI NVIDIA GeForce RTX 4090 Suprim X 24GB
Storage WD SN850 2TB, Samsung PM981a 1TB, 4 x 4TB + 1 x 10TB HGST NAS HDD for Windows Storage Spaces
Display(s) 2 x Viotek GFI27QXA 27" 4K 120Hz + LG UH850 4K 60Hz + HMD
Case NZXT Source 530
Audio Device(s) Sony MDR-7506 / Logitech Z-5500 5.1
Power Supply Corsair RM1000x 1 kW
Mouse Patriot Viper V560
Keyboard Corsair K100
VR HMD HP Reverb G2
Software Windows 11 Pro x64
Benchmark Scores Mellanox ConnectX-3 10 Gb/s Fiber Network Card
8x performance improvement and 5x efficiency improvement?

That's a very clever way of disguising a 60% increased TDP.
 
Joined
Nov 2, 2013
Messages
466 (0.12/day)
System Name Auriga
Processor Ryzen 7950X3D w/ aquacomputer cuplex kryos NEXT with VISION - acrylic/nickel
Motherboard Asus ProArt X670E-CREATOR WIFI + Intel X520-DA2 NIC
Cooling Alphacool Res/D5 Combo •• Corsair XR7 480mm + Black Ice Nemesis 360GTS radiators •• 7xNF-A12 chromax
Memory 96GB G.Skill Trident Z5 RGB (F5-6400J3239F48GX2-TZ5RK)
Video Card(s) MSI RTX 4090 Suprim Liquid X w/ Bykski waterblock
Storage 2TB inland TD510 Gen5 ••• 4TB WD Black SN850X ••• 40TB UNRAID NAS
Display(s) Alienware AW3423DWF (3440x1440, 10-bit @ 145Hz)
Case Thermaltake Core P8
Power Supply Corsair AX1600i
Mouse Razer Viper V2 Pro (FPS games) + Logitech MX Master 2S (everything else)
Keyboard Keycult No2 rev 1 w/Amber Alps and TX stabilizers on a steel plate. DCS 9009 WYSE keycaps
Software W11 X64 Pro
Benchmark Scores https://valid.x86.fr/c3rxw7
8x performance improvement and 5x efficiency improvement?

That's a very clever way of disguising a 60% increased TDP.
TDP doesn't matter. what matters is density and efficiency.

Please, visit every AMD related news article, wait for the first 3-4 fools to post and just reply with this. :toast:
sometimes the things i read make me cringe so hard i scramble to close the browser tab. some posts are so bad that i don't even know how to reply. i don't even know where to start. so i don't.
 
Joined
Oct 27, 2009
Messages
1,180 (0.21/day)
Location
Republic of Texas
System Name [H]arbringer
Processor 4x 61XX ES @3.5Ghz (48cores)
Motherboard SM GL
Cooling 3x xspc rx360, rx240, 4x DT G34 snipers, D5 pump.
Memory 16x gskill DDR3 1600 cas6 2gb
Video Card(s) blah bigadv folder no gfx needed
Storage 32GB Sammy SSD
Display(s) headless
Case Xigmatek Elysium (whats left of it)
Audio Device(s) yawn
Power Supply Antec 1200w HCP
Software Ubuntu 10.10
Benchmark Scores http://valid.canardpc.com/show_oc.php?id=1780855 http://www.hwbot.org/submission/2158678 http://ww
Ponte Vecchio is inside the Aurora supercomputer which is still on track, albeit slowly.
Making progress is different than on track.
Aurora was first announced in 2015 and to be finished in 2018
2017, delayed to 2021 but scaled up to 1 exaFLOP (with Ponte Viecchio)
October 2020, delayed another 6mo.
October 2021, Intel took a 400m Loss on the checkbook and Aurora is targeted for 2023 and 2exaflops.

8x performance improvement and 5x efficiency improvement?

That's a very clever way of disguising a 60% increased TDP.
I am curious to see AMD's reply as to how this chip works in multi configs...
It looks... like its 2 socket max. It looks like a 4 GPU chiplet with 4x8 zen4 chiplets betwen the hbm stacks.
Taking node from 4x 560w to 2x idk 700-900w? Unclear if it will need a parent cpu, as each APU has 96 Zen4 cores onboard.
Also the block diagram is notably different than the chip in hand and pictured.

For all of you (ARF) that clearly missed it, AMD was very proudly showing off their insanely profitable Xilinx wing, under the new AMD XDNA branding.
That new FPGA Decimates the inference card by nvidia in the same class... AMD is taking advantage of NVidia abandoning the 75w slot.
Nvidia replaced the T4 with the A2 and it's barely better. https://www.nvidia.com/en-us/data-center/tesla-t4/ https://www.nvidia.com/en-us/data-center/products/a2/
Left room for AMD.
 
Last edited:
Joined
Nov 14, 2021
Messages
126 (0.11/day)
Instead of focusing on already super developed market segments which do not need so much attention, better AMD focus on the discrete graphics business. Very very lame !
That is where the money is. Read financial reports. Takes only a few minutes. AMD in Q3 2022 revenue of $1.6B datacenter segment, $1.6B gaming segment, $1B client segment. 30% margins from datacenter, 10% from gaming and slight negative on the client. AMD also has higher yoy increases on datacenter.

There is more growth oppertunity with higher returns in datacenter.

Look at Nvidia. Q3 2022 they pulled in $3.8B datacenter, up 31% yoy and gaming was $1.6B, down 51% yoy. Looking at Nvidia's stack, it is geared for datacenter. Even watching Nvidia's release of gaming GPUs, there is way more focus on everything outside gaming. But gaming gets to benefit from a lot of these features that they primarily are selling for enterprise.
 

Solaris17

Super Dainty Moderator
Staff member
Joined
Aug 16, 2005
Messages
26,912 (3.83/day)
Location
Alabama
System Name RogueOne
Processor Xeon W9-3495x
Motherboard ASUS w790E Sage SE
Cooling SilverStone XE360-4677
Memory 128gb Gskill Zeta R5 DDR5 RDIMMs
Video Card(s) MSI SUPRIM Liquid X 4090
Storage 1x 2TB WD SN850X | 2x 8TB GAMMIX S70
Display(s) 49" Philips Evnia OLED (49M2C8900)
Case Thermaltake Core P3 Pro Snow
Audio Device(s) Moondrop S8's on schitt Gunnr
Power Supply Seasonic Prime TX-1600
Mouse Lamzu Atlantis mini (White)
Keyboard Monsgeek M3 Lavender, Moondrop Luna lights
VR HMD Quest 3
Software Windows 11 Pro Workstation
Benchmark Scores I dont have time for that.
This gives me Xeon Phi vibes with the added GPU element.
 
Joined
Jun 2, 2017
Messages
9,090 (3.33/day)
System Name Best AMD Computer
Processor AMD 7900X3D
Motherboard Asus X670E E Strix
Cooling In Win SR36
Memory GSKILL DDR5 32GB 5200 30
Video Card(s) Sapphire Pulse 7900XT (Watercooled)
Storage Corsair MP 700, Seagate 530 2Tb, Adata SX8200 2TBx2, Kingston 2 TBx2, Micron 8 TB, WD AN 1500
Display(s) GIGABYTE FV43U
Case Corsair 7000D Airflow
Audio Device(s) Corsair Void Pro, Logitch Z523 5.1
Power Supply Deepcool 1000M
Mouse Logitech g7 gaming mouse
Keyboard Logitech G510
Software Windows 11 Pro 64 Steam. GOG, Uplay, Origin
Benchmark Scores Firestrike: 46183 Time Spy: 25121
You are talking about Gaming GPUs market, which doesn't bring as much money as the server space and professional. Just look at Nvidia selling the same gaming gpu die to professionals or server for 10x the price.

Nvidia makes 58% of it revenue from Compute & Networking and 42% from Graphics. Graphics include Gaming GPU and streaming, professional Quadro/RTX GPUs and Visualization software, Automotive. So if we remove the non gaming stuff, the revenue just from Gaming is even less, it's about 26% from NVidia Q3 2022 Financial Results. Why would AMD aim for a market that brings only 26% of NVIDIA revenue when it can go for the bigger one?
You forgot Mining you cannot use Nvidia without incorporating Mining in 2021 you could buy a 30 series card retail but Thousands of Mining farms using 3070s and 3080s or do we forget that they released a driver nerfing mining on those cards?
 
Joined
Jun 2, 2017
Messages
9,090 (3.33/day)
System Name Best AMD Computer
Processor AMD 7900X3D
Motherboard Asus X670E E Strix
Cooling In Win SR36
Memory GSKILL DDR5 32GB 5200 30
Video Card(s) Sapphire Pulse 7900XT (Watercooled)
Storage Corsair MP 700, Seagate 530 2Tb, Adata SX8200 2TBx2, Kingston 2 TBx2, Micron 8 TB, WD AN 1500
Display(s) GIGABYTE FV43U
Case Corsair 7000D Airflow
Audio Device(s) Corsair Void Pro, Logitch Z523 5.1
Power Supply Deepcool 1000M
Mouse Logitech g7 gaming mouse
Keyboard Logitech G510
Software Windows 11 Pro 64 Steam. GOG, Uplay, Origin
Benchmark Scores Firestrike: 46183 Time Spy: 25121
This is an astounding technical achievement.
 

Solaris17

Super Dainty Moderator
Staff member
Joined
Aug 16, 2005
Messages
26,912 (3.83/day)
Location
Alabama
System Name RogueOne
Processor Xeon W9-3495x
Motherboard ASUS w790E Sage SE
Cooling SilverStone XE360-4677
Memory 128gb Gskill Zeta R5 DDR5 RDIMMs
Video Card(s) MSI SUPRIM Liquid X 4090
Storage 1x 2TB WD SN850X | 2x 8TB GAMMIX S70
Display(s) 49" Philips Evnia OLED (49M2C8900)
Case Thermaltake Core P3 Pro Snow
Audio Device(s) Moondrop S8's on schitt Gunnr
Power Supply Seasonic Prime TX-1600
Mouse Lamzu Atlantis mini (White)
Keyboard Monsgeek M3 Lavender, Moondrop Luna lights
VR HMD Quest 3
Software Windows 11 Pro Workstation
Benchmark Scores I dont have time for that.
But just some hundred times faster?

I would hope so Phi is super old. You wouldnt compare this directly to old accelerator cards from Intel or anyone else.
 
Joined
Sep 1, 2020
Messages
2,338 (1.52/day)
Location
Bulgaria
I would hope so Phi is super old. You wouldnt compare this directly to old accelerator cards from Intel or anyone else.
Of course, the Intel product you mentioned was never very successful and widespread. I wouldn't kidding with the dead.
 
Top