
NVIDIA Tesla GPUs Again Power World's Greenest Petaflop Supercomputer

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
47,402 (7.52/day)
Location
Hyderabad, India
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard ASUS ROG Strix B450-E Gaming
Cooling DeepCool Gammax L240 V2
Memory 2x 8GB G.Skill Sniper X
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
NVIDIA today announced that, for the second year in a row, the world's most energy-efficient petaflop-class supercomputer is powered by NVIDIA Tesla GPUs.

The Tsubame 2.0 system at the Tokyo Institute of Technology's Global Scientific Information and Computing Center (GSIC) ranks as the greenest petaflop-class supercomputer on the recently released Green500 list. Published twice annually, the Green500 list rates the 500 most energy-efficient supercomputers based on performance achieved relative to power consumed.

Tsubame 2.0 is a heterogeneous supercomputer (combining both CPUs and GPUs) used to accelerate a range of scientific and industrial research in Japan. With sustained performance of 1.19 petaflops while consuming 1.2 megawatts, Tsubame 2.0 delivers 958 megaflops per watt. It is 3.4 times more energy efficient than the next-closest x86 CPU-only petaflop system, the Cielo Cray supercomputer at Los Alamos National Laboratory, which delivers 278 megaflops per watt.
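
As a rough check on the figures above, the efficiency metric is simply sustained floating-point operations per second divided by power draw. The sketch below is illustrative only: the helper name `megaflops_per_watt` is hypothetical, and the inputs are the rounded values quoted in the text. Recomputing from those rounded inputs gives roughly 992 megaflops per watt rather than the quoted 958, presumably (an assumption) because the published figure is based on unrounded measurements.

```python
def megaflops_per_watt(petaflops: float, megawatts: float) -> float:
    """Convert sustained performance and power draw to megaflops per watt.

    Hypothetical helper for illustration; not part of any Green500 tooling.
    """
    flops = petaflops * 1e15   # petaflops -> floating-point operations per second
    watts = megawatts * 1e6    # megawatts -> watts
    return flops / watts / 1e6  # flops per watt -> megaflops per watt


# Rounded figures quoted in the text (1.19 petaflops, 1.2 megawatts).
tsubame_from_rounded = megaflops_per_watt(1.19, 1.2)

# Efficiency figures quoted directly in the text, in megaflops per watt.
tsubame_quoted = 958.0
cielo_quoted = 278.0
ratio = tsubame_quoted / cielo_quoted

print(f"Tsubame 2.0 from rounded inputs: {tsubame_from_rounded:.0f} MFLOPS/W")
print(f"Quoted Tsubame 2.0 / Cielo ratio: {ratio:.2f}x")
```

The ratio of the two quoted efficiency figures comes out at just under 3.45, consistent with the "3.4 times" claim.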

In the race to exascale computing, power efficiency has become the defining element of computing performance. Heterogeneous GPU-accelerated systems are inherently more energy efficient than CPU-only systems because applications can use each type of processor for the work it handles best. The sequential parts of the application run on CPUs, and the data- and compute-intensive parts are accelerated by the massively parallel GPUs.

Tsubame 2.0 is composed of HP ProLiant SL390 servers with Intel Xeon CPUs accelerated by NVIDIA Tesla GPUs. The Tesla GPUs provide more than 80 percent of its performance, enabling Tsubame 2.0 to achieve high levels of performance with very low power usage. This year, two of the five finalists for the prestigious Gordon Bell Prize ran on Tsubame 2.0, including the winner for Special Achievement in Scalability and Time-to-Solution.

The latest Green500 list underscores the energy efficiency of heterogeneous computer design. Five of the world's 10 most efficient systems, and 22 of the top 30 most efficient systems, combine GPUs with CPUs.

Tesla GPUs are massively parallel accelerators based on the CUDA parallel computing architecture. Application developers can accelerate their applications either by using CUDA C, CUDA C++, or CUDA Fortran, or by using easy-to-use directive-based compilers.

For more information about Tsubame 2.0, visit the Tokyo Institute of Technology, Global Scientific Information and Computing Center website. To learn more about Tesla GPUs, visit the Tesla website. To learn more about CUDA, visit the CUDA website.

 
Joined
Dec 9, 2007
Messages
746 (0.12/day)
I wish I could afford a Tesla rack; I'd be contributing a lot more to many Folding@Home/BOINC projects.
 
Joined
Nov 15, 2005
Messages
1,011 (0.14/day)
Processor 2500K @ 4.5GHz 1.28V
Motherboard ASUS P8P67 Deluxe
Cooling Corsair A70
Memory 8GB (2x4GB) Corsair Vengeance 1600 9-9-9-24 1T
Video Card(s) eVGA GTX 470
Storage Crucial m4 128GB + Seagate RAID 1 (1TB x 2)
Display(s) Dell 22" 1680x1050 nothing special
Case Antec 300
Audio Device(s) Onboard
Power Supply PC Power & Cooling 750W
Software Windows 7 64bit Pro
That's actually pretty cool, now if they could only make programming for those CUDA cores a little easier / mainstream.
 

newtekie1

Semi-Retired Folder
Joined
Nov 22, 2005
Messages
28,473 (4.07/day)
Location
Indiana, USA
Processor Intel Core i7 10850K@5.2GHz
Motherboard AsRock Z470 Taichi
Cooling Corsair H115i Pro w/ Noctua NF-A14 Fans
Memory 32GB DDR4-3600
Video Card(s) RTX 2070 Super
Storage 500GB SX8200 Pro + 8TB with 1TB SSD Cache
Display(s) Acer Nitro VG280K 4K 28"
Case Fractal Design Define S
Audio Device(s) Onboard is good enough for me
Power Supply eVGA SuperNOVA 1000w G3
Software Windows 10 Pro x64
That's actually pretty cool, now if they could only make programming for those CUDA cores a little easier / mainstream.

Wasn't there just an article on TPU in the last few weeks about nVidia working on that?

Edit: Found it. It is called OpenACC. http://www.techpowerup.com/forums/showthread.php?t=155267

Edit2: I believe CUDA 4.0 made a big step in the right direction of making programming for CUDA easier too. http://www.techpowerup.com/141283/New-CUDA-4.0-Release-Makes-Parallel-Programming-Easier.html
 
Joined
Jul 10, 2011
Messages
799 (0.16/day)
Processor Intel
Motherboard MSI
Cooling Cooler Master
Memory Corsair
Video Card(s) Nvidia
Storage Western Digital/Kingston
Display(s) Samsung
Case Thermaltake
Audio Device(s) On Board
Power Supply Seasonic
Mouse Glorious
Keyboard UniKey
Software Windows 10 x64
Fermi is not so power hungry after all.
 