
NVIDIA Breathes Life into Kepler with the GK210 Silicon

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
47,129 (7.57/day)
Location
Hyderabad, India
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard ASUS ROG Strix B450-E Gaming
Cooling DeepCool Gammax L240 V2
Memory 2x 8GB G.Skill Sniper X
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
NVIDIA's "Maxwell" architecture may have had a rather low-key debut with the GeForce GTX 750 Ti, but nobody saw its performance-segment derivative coming: the GM204 silicon, which drives the GeForce GTX 980 and the GTX 970. The new architecture makes its predecessor, "Kepler," look inefficient in comparison. Yet it looks like NVIDIA still thinks Kepler is competitive against AMD (GCN) and Intel (Knights Corner) in the high-performance computing (HPC) arena.

The problem is that NVIDIA has already launched a GK110-based Tesla HPC card, and its big "Maxwell" chip is nowhere in sight. The GM204 has limited memory bandwidth, and its texture-compression mojo can't bail out bandwidth-hogging HPC applications. The solution? Develop a new big chip based on "Kepler." Enter the GK210. That's right, the G-K-210. Launched today with the Tesla K80 dual-chip HPC accelerator, this chip could feature design improvements over the GK110, while offering memory bandwidth and capacities not possible on the GM204.



The Tesla K80 accelerator is a dual-chip solution, with two GK210 chips. Each of the two features 2,496 CUDA cores, for 4,992 in all. Each chip has a 384-bit wide GDDR5 memory interface, wired to 12 GB of memory. That gives the K80 a staggering 24 GB of memory, across two 240 GB/s memory interfaces. 240 GB/s may not look out of reach for a GM204, but we're beyond the consumer (GeForce) and enterprise (Quadro) market segments here, and into the mission-critical (Tesla) one. The Tesla isn't a graphics card to begin with, and NVIDIA is clocking it very conservatively. Its core runs at 562 MHz, which can spool up to 875 MHz, and the memory ticks at 5.00 GHz, less than the 6 GHz on the Tesla K40.
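For anyone who wants to check the memory figures, here is a rough sketch of the arithmetic in Python. It assumes "5.00 GHz" is the effective GDDR5 data rate (transfers per second per pin) and, for the comparison line, that the Tesla K40 uses the same 384-bit bus; the function and variable names are made up for illustration.

```python
def memory_bandwidth_gbs(bus_width_bits, effective_rate_ghz):
    """Peak bandwidth in GB/s: bytes moved per transfer times transfers per ns."""
    return bus_width_bits / 8 * effective_rate_ghz

# Figures quoted in the post for one GK210 on the Tesla K80.
per_gpu_bandwidth = memory_bandwidth_gbs(384, 5.0)

# Assumption: the K40 also has a 384-bit bus, running at the quoted 6 GHz.
k40_bandwidth = memory_bandwidth_gbs(384, 6.0)

# Card-level totals for the dual-chip K80.
k80_cuda_cores = 2 * 2496
k80_memory_gb = 2 * 12

print(f"K80 per GPU: {per_gpu_bandwidth:.0f} GB/s, K40: {k40_bandwidth:.0f} GB/s")
print(f"K80 totals: {k80_cuda_cores} CUDA cores, {k80_memory_gb} GB")
```

Keep in mind that the two K80 memory pools are separate, so each GPU only sees its own 12 GB at its own 240 GB/s.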

So what's changed between the GK210 and the GK110? For one, it appears to be extremely energy efficient. The Tesla K80 comes with passive cooling (it relies on the airflow of the rackmount blade it's part of), and has a TDP rating of 300W (150W per GPU subsystem). In comparison, the single-chip Tesla K40 is rated at 235W. The boost clocks of both chips are identical, even if the nominal clocks on the Tesla K80's GK210 are lower, and the memory clock is lower by about 17% (5 GHz versus 6 GHz). The other technical differences between the GK210 and the GK110 are under the hood.
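Working only from the clock and TDP figures quoted in the post, the relative differences can be sketched like this. Treat it as back-of-the-envelope math, not a measured efficiency comparison: splitting the 300W TDP evenly between the two GPUs is an assumption, and `percent_lower` is just a helper name invented here.

```python
def percent_lower(new, old):
    """How much lower `new` is than `old`, as a percentage of `old`."""
    return (old - new) / old * 100

# Memory clock: 5 GHz on the K80 versus 6 GHz on the K40.
memory_clock_drop = percent_lower(5.0, 6.0)

# TDP: 300W for the dual-chip K80, assumed to split evenly, versus 235W for the K40.
k80_tdp_per_gpu = 300 / 2
tdp_drop = percent_lower(k80_tdp_per_gpu, 235)

print(f"Memory clock is {memory_clock_drop:.1f}% lower")
print(f"Per-GPU TDP is {tdp_drop:.1f}% lower")
```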

While both chips are based on the "Kepler" architecture, the GK210 features double the shader cache. Each of the 15 streaming multiprocessors (SMXs) features 128 KB of shader cache, compared to 64 KB per SMX on the GK110. The GK210 also has a 512 KB register file per SMX, double the 256 KB register file of the GK110. A larger register file means a shader can keep more variables in registers. If an operation runs out of registers, those variables have to sit in the chip's limited last-level cache, taking more clock cycles to fetch, or even worse, in GPU memory, which is far slower. These two changes could step up the GPU's serial processing performance slightly, while retaining its inherent parallel processing advantages, which could really help in an HPC environment. In other words, we won't hold our breath for a consumer GeForce debut of this chip.
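To put the register file sizes in perspective, here is a small sketch of how many registers each thread can get when an SMX is fully loaded. It assumes 32-bit (4-byte) registers and a limit of 2,048 resident threads per Kepler SMX; neither number is stated in the post, and the names below are illustrative only.

```python
REGISTER_BYTES = 4            # assumption: 32-bit registers
MAX_THREADS_PER_SMX = 2048    # assumption: Kepler's resident-thread limit per SMX

def registers_per_thread(register_file_kb, resident_threads=MAX_THREADS_PER_SMX):
    """Registers available to each thread before values spill out of the register file."""
    total_registers = register_file_kb * 1024 // REGISTER_BYTES
    return total_registers // resident_threads

gk110_regs = registers_per_thread(256)  # 256 KB register file per SMX
gk210_regs = registers_per_thread(512)  # 512 KB register file per SMX

print(f"GK110: {gk110_regs} registers per thread at full occupancy")
print(f"GK210: {gk210_regs} registers per thread at full occupancy")
```

Under those assumptions, a kernel that needs more than 32 registers per thread would have to run fewer threads per SMX (or spill) on the GK110, while the GK210 gives it room for up to 64 at full occupancy.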

 
Joined
Nov 4, 2005
Messages
11,948 (1.73/day)
System Name Compy 386
Processor 7800X3D
Motherboard Asus
Cooling Air for now.....
Memory 64 GB DDR5 6400Mhz
Video Card(s) 7900XTX 310 Merc
Storage Samsung 990 2TB, 2 SP 2TB SSDs, 24TB Enterprise drives
Display(s) 55" Samsung 4K HDR
Audio Device(s) ATI HDMI
Mouse Logitech MX518
Keyboard Razer
Software A lot.
Benchmark Scores Its fast. Enough.
Does that mean the GK210 will come to the GeForce series?


Probably not, as they already had those cards, and the whole point is that the new color compression hardware on the 9XX series makes the core almost unusable for a compute card, since it starves the core of bandwidth when the data can't be compressed.

It also brings into question the new stacked memory being used on compute cards, since it's currently limited to 4 GB; perhaps we will see a new memory ring bus used to bridge more together.
 

the54thvoid

Super Intoxicated Moderator
Staff member
Joined
Dec 14, 2009
Messages
12,935 (2.38/day)
Location
Glasgow - home of formal profanity
Processor Ryzen 7800X3D
Motherboard MSI MAG Mortar B650 (wifi)
Cooling be quiet! Dark Rock Pro 4
Memory 32GB Kingston Fury
Video Card(s) Gainward RTX4070ti
Storage Seagate FireCuda 530 M.2 1TB / Samsumg 960 Pro M.2 512Gb
Display(s) LG 32" 165Hz 1440p GSYNC
Case Asus Prime AP201
Audio Device(s) On Board
Power Supply be quiet! Pure POwer M12 850w Gold (ATX3.0)
Software W10
Does that mean the GK210 will come to the GeForce series?

Perhaps, but unlikely. It wouldn't offer much more than a 780 Ti/Titan Black or GTX 980. It's been revamped specifically for HPC duty.

I think we'll see Maxwell move on from here for the top card, the GM200.
 
Joined
Apr 16, 2013
Messages
549 (0.13/day)
Location
Bulgaria
System Name Black Knight | White Queen
Processor Intel Core i9-10940X (28 cores) | Intel Core i7-5775C (8 cores)
Motherboard ASUS ROG Rampage VI Extreme Encore X299G | ASUS Sabertooth Z97 Mark S (White)
Cooling Noctua NH-D15 chromax.black | Xigmatek Dark Knight SD-1283 Night Hawk (White)
Memory G.SKILL Trident Z RGB 4x8GB DDR4 3600MHz CL16 | Corsair Vengeance LP 4x4GB DDR3L 1600MHz CL9 (White)
Video Card(s) ASUS ROG Strix GeForce RTX 4090 OC | KFA2/Galax GeForce GTX 1080 Ti Hall of Fame Edition
Storage Samsung 990 Pro 2TB, 980 Pro 1TB, 850 Pro 256GB, 840 Pro 256GB, WD 10TB+ (incl. VelociRaptors)
Display(s) Dell Alienware AW2721D 240Hz| LG OLED evo C4 48" 144Hz
Case Corsair 7000D AIRFLOW (Black) | NZXT ??? w/ ASUS DRW-24B1ST
Audio Device(s) ASUS Xonar Essence STX | Realtek ALC1150
Power Supply Enermax Revolution 1250W 85+ | Super Flower Leadex Gold 650W (White)
Mouse Razer Basilisk Ultimate, Razer Naga Trinity | Razer Mamba 16000
Keyboard Razer Blackwidow Chroma V2 (Orange switch) | Razer Ornata Chroma
Software Windows 10 Pro 64bit
Pretty interesting. Looks like they like Kepler a lot. :)
 