• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD Scores Another EPYC Win in Exascale Computing With DOE's "El Capitan" Two-Exaflop Supercomputer

silentbogo

Moderator
Staff member
Joined
Nov 20, 2013
Messages
5,540 (1.38/day)
Location
Kyiv, Ukraine
System Name WS#1337
Processor Ryzen 7 5700X3D
Motherboard ASUS X570-PLUS TUF Gaming
Cooling Xigmatek Scylla 240mm AIO
Memory 4x8GB Samsung DDR4 ECC UDIMM
Video Card(s) MSI RTX 3070 Gaming X Trio
Storage ADATA Legend 2TB + ADATA SX8200 Pro 1TB
Display(s) Samsung U24E590D (4K/UHD)
Case ghetto CM Cosmos RC-1000
Audio Device(s) ALC1220
Power Supply SeaSonic SSR-550FX (80+ GOLD)
Mouse Logitech G603
Keyboard Modecom Volcano Blade (Kailh choc LP)
VR HMD Google dreamview headset(aka fancy cardboard)
Software Windows 11, Ubuntu 24.04 LTS
But can it multi-virtual machine run crysis?
With raytracing!

Let's see what resources are put into ROCm now that AMD has some income to fund dev. Nv has many years (decade) lead with their better fleshed out ecosystem. With nn/AI, Dnn/Dlops will feature heavily on upcoming IHV releases.
Today it's not the case. There are GPGPU APIs that can do the same and have expansive feature set and ecosystem. Heck, before yesterday I didn't even know that OpenMP already implemented GPU offloading (last time I tinkered with it 5-6 years ago).
The main reason why CUDA ruled the HPC and GPGPU compute in general, is being fast. Other aspects are just a consequence of the first one.
 
Joined
Jun 10, 2014
Messages
822 (0.22/day)
Location
Poland
System Name Proper
Processor 5900X + OC
Motherboard GB X570s Elite AX
Cooling WC Heatkiller 3.0 LT
Memory G.Skill 3600 CL16
Video Card(s) Zotac RTX 3070 Ti Trinity LC'ed + OC
Storage KC2500 1TB + A2000 1TB
Display(s) GB M32Q
Case Fractal Define R6 USB C
Audio Device(s) Creative AE-7 + Phonic AM120 MkIII + H/K AVR 265 -> Paradigm Monitor 11 v.7 + AKG K712 Pro
Power Supply Seasonic Prime PX-850
Mouse Log G502 X LS
Keyboard Keychron K5 Opt.brown
Software Win10 Pro
They'll have so many of those new EPYCs, surely they won't notice one is missing, right? Cause I need it... ;)
 
Joined
Jan 8, 2017
Messages
9,426 (3.28/day)
System Name Good enough
Processor AMD Ryzen R9 7900 - Alphacool Eisblock XPX Aurora Edge
Motherboard ASRock B650 Pro RS
Cooling 2x 360mm NexXxoS ST30 X-Flow, 1x 360mm NexXxoS ST30, 1x 240mm NexXxoS ST30
Memory 32GB - FURY Beast RGB 5600 Mhz
Video Card(s) Sapphire RX 7900 XT - Alphacool Eisblock Aurora
Storage 1x Kingston KC3000 1TB 1x Kingston A2000 1TB, 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) LG UltraGear 32GN650-B + 4K Samsung TV
Case Phanteks NV7
Power Supply GPS-750C
CUDA is more for companies like mine where we have 10 people and make biomedical imaging devices. CUDA helps us speed up the image reconstruction on the GPU versus the CPU. We are too small to make our own APIs. Giant supercomputer projects have custom tailor made software.

Completely agree, highly specialized software for these large scale computations are probably optimized down to the lowest available level like PTX for Nvidia and assembly for AMD. Truth is not a whole lot of the critical software paths there are actually going to be written in CUDA or OpenCL.

Why do you keep saying CUDA is in a locked-in eco-system? You can run CUDA code on other hardware (even on x86 and ARM, if you're desperate) using HIP through ROCm, but you need to translate (not manual conversion) to avoid any NVIDIA extensions. This is currently a lot more efficient than what can be done in OpenCL 2.1.

CUDA really is a locked ecosystem even for customers of Nvidia hardware. For example their ISA isn't open to the public and there are instances where no matter what you write in CUDA or directly in PTX it will never be as fast as the hardware is capable of. Nvidia reserves the highest level of optimizations for themselves so in order to get the most out of the hardware you purchased you either have to use a library that was hand optimized by Nvidia or if there is none for of the sort of thing you need to do then tough luck. If that's not a locked ecosystem then I don't know what is.
 
Last edited:
Joined
Mar 10, 2010
Messages
11,878 (2.21/day)
Location
Manchester uk
System Name RyzenGtEvo/ Asus strix scar II
Processor Amd R5 5900X/ Intel 8750H
Motherboard Crosshair hero8 impact/Asus
Cooling 360EK extreme rad+ 360$EK slim all push, cpu ek suprim Gpu full cover all EK
Memory Corsair Vengeance Rgb pro 3600cas14 16Gb in four sticks./16Gb/16GB
Video Card(s) Powercolour RX7900XT Reference/Rtx 2060
Storage Silicon power 2TB nvme/8Tb external/1Tb samsung Evo nvme 2Tb sata ssd/1Tb nvme
Display(s) Samsung UAE28"850R 4k freesync.dell shiter
Case Lianli 011 dynamic/strix scar2
Audio Device(s) Xfi creative 7.1 on board ,Yamaha dts av setup, corsair void pro headset
Power Supply corsair 1200Hxi/Asus stock
Mouse Roccat Kova/ Logitech G wireless
Keyboard Roccat Aimo 120
VR HMD Oculus rift
Software Win 10 Pro
Benchmark Scores 8726 vega 3dmark timespy/ laptop Timespy 6506
They picked the cheapest but good enuff and not the absolute highest performing options.
Like they could use xeons on a DOE super computer , near double the power use on a DOE system would go down well.
Now Fujitsu's 64FX chip's seems like a contender but not intel.
As for the GGPu choice perhaps they see something in the next generation of chips that we have not yet seen, they are not comparing chips that are out are they no it's chips to be made yet.
 
Joined
Oct 18, 2013
Messages
6,176 (1.52/day)
Location
Over here, right where you least expect me to be !
System Name The Little One
Processor i5-11320H @4.4GHZ
Motherboard AZW SEI
Cooling Fan w/heat pipes + side & rear vents
Memory 64GB Crucial DDR4-3200 (2x 32GB)
Video Card(s) Iris XE
Storage WD Black SN850X 4TB m.2, Seagate 2TB SSD + SN850 4TB x2 in an external enclosure
Display(s) 2x Samsung 43" & 2x 32"
Case Practically identical to a mac mini, just purrtier in slate blue, & with 3x usb ports on the front !
Audio Device(s) Yamaha ATS-1060 Bluetooth Soundbar & Subwoofer
Power Supply 65w brick
Mouse Logitech MX Master 2
Keyboard Logitech G613 mechanical wireless
Software Windows 10 pro 64 bit, with all the unnecessary background shitzu turned OFF !
Benchmark Scores PDQ
Great, now we can figure out how to obliterate everyone on the planet even faster & moar better than before...get ready, 'cause the end times are now upon us !

$600 million is fairly massive

Not by government spending standards, seeins how they're spending OUR money not theirs :(
 
Joined
Aug 8, 2019
Messages
430 (0.22/day)
System Name R2V2 *In Progress
Processor Ryzen 7 2700
Motherboard Asrock X570 Taichi
Cooling W2A... water to air
Memory G.Skill Trident Z3466 B-die
Video Card(s) Radeon VII repaired and resurrected
Storage Adata and Samsung NVME
Display(s) Samsung LCD
Case Some ThermalTake
Audio Device(s) Asus Strix RAID DLX upgraded op amps
Power Supply Seasonic Prime something or other
Software Windows 10 Pro x64
You're right about that. Corporations create these supercomputers with a major goal in mind, so they would need custom APIs to get to that goal efficiently. But what @xkm1948 is getting at is that CUDA can scale from the basic enthusiast all the way to the [big] corporations that don't have the time (or need) to have a custom API developed for them.

If anything, those same corporations would employ researchers from these universities. :laugh:



Why do you keep saying CUDA is in a locked-in eco-system? You can run CUDA code on other hardware (even on x86 and ARM, if you're desperate) using HIP through ROCm, but you need to translate (not manual conversion) to avoid any NVIDIA extensions. This is currently a lot more efficient than what can be done in OpenCL 2.1.

The investment in ROCm is an advantage for everyone since all compute APIs will use this. Thank AMD for pulling this off.



They still use Apple because of deals (think 60%+ hardware and support discounts) offered by Apple. Also hardware deployment of Mac minis and Pros depends on department use cases.

Vulkan is aimed at rendering (and why any GPGPU code using Vulkan is on the graphics pipeline), which is why it succeeds OpenGL. OpenCL is meant for GPGPU use.

Oh I know Apple gives universities crazy prices. It's a great way to keep up demand once students become workers.

I'm so deep in studying Latin and writing papers on Greek and Roman epics my brain is melting, I really should focus and my posts are suffering because of that.

It's amazing how this stuff can get so muddied when you are trying to ram different stuff into it.
 
Joined
Feb 25, 2016
Messages
292 (0.09/day)
This is the first such exascale contract where AMD is the sole purveyor of both CPUs and GPUs, with AMD's other design win with EPYC in the Cray Shasta being paired with NVIDIA graphics cards.
@Raevenlord This is 2nd AMD win for exascale computing where both cpu and gpu is from AMD. 1st one was called Frontier.
 
Joined
Jun 3, 2010
Messages
2,540 (0.48/day)
@Raevenlord This is 2nd AMD win for exascale computing where both cpu and gpu is from AMD. 1st one was called Frontier.
That system uses 40MW@1.5 Exaflops. FastForward 2 project aims at 20MW@1Exaflops. %33 higher.
 
Joined
Nov 24, 2017
Messages
853 (0.33/day)
Location
Asia
Processor Intel Core i5 4590
Motherboard Gigabyte Z97x Gaming 3
Cooling Intel Stock Cooler
Memory 8GiB(2x4GiB) DDR3-1600 [800MHz]
Video Card(s) XFX RX 560D 4GiB
Storage Transcend SSD370S 128GB; Toshiba DT01ACA100 1TB HDD
Display(s) Samsung S20D300 20" 768p TN
Case Cooler Master MasterBox E501L
Audio Device(s) Realtek ALC1150
Power Supply Corsair VS450
Mouse A4Tech N-70FX
Software Windows 10 Pro
Benchmark Scores BaseMark GPU : 250 Point in HD 4600
Great, now we can figure out how to obliterate everyone on the planet even faster & moar better than before...get ready, 'cause the end times are now upon us !
If the new supercomputer was built with 5GHz Xeon and GTX 480, then the Govt. could have obliterate us just by truning the computer 'On'.
 
Top