• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

NVIDIA GP100 Silicon to Feature 4 TFLOPs DPFP Performance

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
47,308 (7.52/day)
Location
Hyderabad, India
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard ASUS ROG Strix B450-E Gaming
Cooling DeepCool Gammax L240 V2
Memory 2x 8GB G.Skill Sniper X
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
NVIDIA's upcoming flagship GPU based on its next-generation "Pascal" architecture, codenamed GP100, is shaping up to be a number-crunching monster. According to a leaked slide by an NVIDIA research fellow, the company is designing the chip to serve up double-precision floating-point (DPFP) performance as high as 4 TFLOP/s, a 3-fold increase from the 1.31 TFLOP/s offered by the Tesla K20, based on the "Kepler" GK110 silicon.

The same slide also reveals single-precision floating-point (SPFP) performance to be as high as 12 TFLOP/s, four times that of the GK110, and nearly double that of the GM200. The slide also appears to settle the speculation on whether GP100 will use stacked HBM2 memory, or GDDR5X. Given the 1 TB/s memory bandwidth mentioned on the slide, we're inclined to hand it to stacked HBM2.



View at TechPowerUp Main Site
 

Eilifein

New Member
Joined
Feb 17, 2016
Messages
3 (0.00/day)
Just for a reference, since they compare a 7970 (LOL) to their newest cards.

FirePro W9100: http://www.amd.com/en-us/products/graphics/workstation/firepro-3d/9100#

  • 320 GB/s memory bandwidth
  • 5.24 TFLOPS peak single-precision floating-point performance
  • 2.62 TFLOPS peak dual-precision floating-point performance
SP BYTE/FLOP = 0.061068
DP BYTE/FLOP = 0.122137

Wow.... comparing the Tesla cards to a 7970.
 

64K

Joined
Mar 13, 2014
Messages
6,773 (1.72/day)
Processor i7 7700k
Motherboard MSI Z270 SLI Plus
Cooling CM Hyper 212 EVO
Memory 2 x 8 GB Corsair Vengeance
Video Card(s) Temporary MSI RTX 4070 Super
Storage Samsung 850 EVO 250 GB and WD Black 4TB
Display(s) Temporary Viewsonic 4K 60 Hz
Case Corsair Obsidian 750D Airflow Edition
Audio Device(s) Onboard
Power Supply EVGA SuperNova 850 W Gold
Mouse Logitech G502
Keyboard Logitech G105
Software Windows 10
This year will be an exciting year for GPUs. Big increases in performance from both teams.
 
Joined
Aug 15, 2008
Messages
5,941 (0.99/day)
Location
Watauga, Texas
System Name Univac SLI Edition
Processor Intel Xeon 1650 V3 @ 4.2GHz
Motherboard eVGA X99 FTW K
Cooling EK Supremacy EVO, Swiftech MCP50x, Alphacool NeXXos UT60 360, Black Ice GTX 360
Memory 2x16GB Corsair Vengeance LPX 3000MHz
Video Card(s) Nvidia Titan X Tri-SLI w/ EK Blocks
Storage HyperX Predator 240GB PCI-E, Samsung 850 Pro 512GB
Display(s) Dell UltraSharp 34" Ultra-Wide (U3415W) / (Samsung 48" Curved 4k)
Case Phanteks Enthoo Pro M Acrylic Edition
Audio Device(s) Sound Blaster Z
Power Supply Thermaltake 1350watt Toughpower Modular
Mouse Logitech G502
Keyboard CODE 10 keyless MX Clears
Software Windows 10 Pro
NVIDIA's upcoming flagship GPU based on its next-generation "Pascal" architecture, codenamed GP100, is shaping up to be a number-crunching monster. According to a leaked slide by an NVIDIA research fellow, the company is designing the chip to serve up double-precision floating-point (DPFP) performance as high as 4 TFLOP/s, a 3-fold increase from the 1.31 TFLOP/s offered by the Tesla K20, based on the "Kepler" GK110 silicon.

The same slide also reveals single-precision floating-point (SPFP) performance to be as high as 12 TFLOP/s, four times that of the GK110, and nearly double that of the GM200. The slide also appears to settle the speculation on whether GP100 will use stacked HBM2 memory, or GDDR5X. Given the 1 TB/s memory bandwidth mentioned on the slide, we're inclined to hand it to stacked HBM2.



Source: 3DCenter.org
Or maybe the fact that it says stacked 3D DRAM?

Just for a reference, since they compare a 7970 (LOL) to their newest cards.

FirePro W9100: http://www.amd.com/en-us/products/graphics/workstation/firepro-3d/9100#

  • 320 GB/s memory bandwidth
  • 5.24 TFLOPS peak single-precision floating-point performance
  • 2.62 TFLOPS peak dual-precision floating-point performance
SP BYTE/FLOP = 0.061068
DP BYTE/FLOP = 0.122137

Wow.... comparing the Tesla cards to a 7970.
Tesla K20x = OG Titan so comparing it to 7970 makes a bit more sense.
 
Joined
Sep 15, 2011
Messages
6,772 (1.40/day)
Processor Intel® Core™ i7-13700K
Motherboard Gigabyte Z790 Aorus Elite AX
Cooling Noctua NH-D15
Memory 32GB(2x16) DDR5@6600MHz G-Skill Trident Z5
Video Card(s) ZOTAC GAMING GeForce RTX 3080 AMP Holo
Storage 2TB SK Platinum P41 SSD + 4TB SanDisk Ultra SSD + 500GB Samsung 840 EVO SSD
Display(s) Acer Predator X34 3440x1440@100Hz G-Sync
Case NZXT PHANTOM410-BK
Audio Device(s) Creative X-Fi Titanium PCIe
Power Supply Corsair 850W
Mouse Logitech Hero G502 SE
Software Windows 11 Pro - 64bit
Benchmark Scores 30FPS in NFS:Rivals
Yeah, but the question is, are there any games out there worth the investment of buying a new top video card??
 
Joined
Aug 20, 2007
Messages
21,560 (3.40/day)
System Name Pioneer
Processor Ryzen R9 9950X
Motherboard GIGABYTE Aorus Elite X670 AX
Cooling Noctua NH-D15 + A whole lotta Sunon and Corsair Maglev blower fans...
Memory 64GB (4x 16GB) G.Skill Flare X5 @ DDR5-6000 CL30
Video Card(s) XFX RX 7900 XTX Speedster Merc 310
Storage Intel 5800X Optane 800GB boot, +2x Crucial P5 Plus 2TB PCIe 4.0 NVMe SSDs
Display(s) 55" LG 55" B9 OLED 4K Display
Case Thermaltake Core X31
Audio Device(s) TOSLINK->Schiit Modi MB->Asgard 2 DAC Amp->AKG Pro K712 Headphones or HDMI->B9 OLED
Power Supply FSP Hydro Ti Pro 850W
Mouse Logitech G305 Lightspeed Wireless
Keyboard WASD Code v3 with Cherry Green keyswitches + PBT DS keycaps
Software Gentoo Linux x64 / Windows 11 Enterprise IoT 2024
Yeah, but the question is, are there any games out there worth the investment of buying a new top video card??

There never is, game developers target what already exists. New releases will take advantage though.

The exception of course is Crysis, but that's about it.
 
Joined
Aug 15, 2008
Messages
5,941 (0.99/day)
Location
Watauga, Texas
System Name Univac SLI Edition
Processor Intel Xeon 1650 V3 @ 4.2GHz
Motherboard eVGA X99 FTW K
Cooling EK Supremacy EVO, Swiftech MCP50x, Alphacool NeXXos UT60 360, Black Ice GTX 360
Memory 2x16GB Corsair Vengeance LPX 3000MHz
Video Card(s) Nvidia Titan X Tri-SLI w/ EK Blocks
Storage HyperX Predator 240GB PCI-E, Samsung 850 Pro 512GB
Display(s) Dell UltraSharp 34" Ultra-Wide (U3415W) / (Samsung 48" Curved 4k)
Case Phanteks Enthoo Pro M Acrylic Edition
Audio Device(s) Sound Blaster Z
Power Supply Thermaltake 1350watt Toughpower Modular
Mouse Logitech G502
Keyboard CODE 10 keyless MX Clears
Software Windows 10 Pro
Yeah, but the question is, are there any games out there worth the investment of buying a new top video card??
I don't see any for this year, yet, but anybody gaming at 4k should want one of these. Maxwell V2 just isn't enough for 4k.
 

Eilifein

New Member
Joined
Feb 17, 2016
Messages
3 (0.00/day)
Or maybe the fact that it says stacked 3D DRAM?

Tesla K20x = OG Titan so comparing it to 7970 makes a bit more sense.

I'm sorry if it came out a bit weird, but i meant the Pascal one, not the K20x. In any case, I really don't understand what the graph wants to communicate. They don't even pit Pascal with the top Tesla dogs, K40 and K80(dual gpu).

Edit: To reiterate, i quote the OP: NVIDIA's upcoming flagship GPU based on its next-generation "Pascal" architecture, codenamed GP100. Specifically mentioning "flagship", then comparing it to K20x and 7970 is at the very least misleading.
 

64K

Joined
Mar 13, 2014
Messages
6,773 (1.72/day)
Processor i7 7700k
Motherboard MSI Z270 SLI Plus
Cooling CM Hyper 212 EVO
Memory 2 x 8 GB Corsair Vengeance
Video Card(s) Temporary MSI RTX 4070 Super
Storage Samsung 850 EVO 250 GB and WD Black 4TB
Display(s) Temporary Viewsonic 4K 60 Hz
Case Corsair Obsidian 750D Airflow Edition
Audio Device(s) Onboard
Power Supply EVGA SuperNova 850 W Gold
Mouse Logitech G502
Keyboard Logitech G105
Software Windows 10
Yeah, but the question is, are there any games out there worth the investment of buying a new top video card??

Depends on what you want. There are games that you can't max at 4K and average 60 FPS with a single Titan X or a single GTX 980 Ti and there will no doubt be more in the next couple of years. I imagine for people that want 4K and a single GPU they will like it. Worth the investment is relative to what a person wants and is willing to spend to get it.
 
Joined
Aug 15, 2008
Messages
5,941 (0.99/day)
Location
Watauga, Texas
System Name Univac SLI Edition
Processor Intel Xeon 1650 V3 @ 4.2GHz
Motherboard eVGA X99 FTW K
Cooling EK Supremacy EVO, Swiftech MCP50x, Alphacool NeXXos UT60 360, Black Ice GTX 360
Memory 2x16GB Corsair Vengeance LPX 3000MHz
Video Card(s) Nvidia Titan X Tri-SLI w/ EK Blocks
Storage HyperX Predator 240GB PCI-E, Samsung 850 Pro 512GB
Display(s) Dell UltraSharp 34" Ultra-Wide (U3415W) / (Samsung 48" Curved 4k)
Case Phanteks Enthoo Pro M Acrylic Edition
Audio Device(s) Sound Blaster Z
Power Supply Thermaltake 1350watt Toughpower Modular
Mouse Logitech G502
Keyboard CODE 10 keyless MX Clears
Software Windows 10 Pro
I'm sorry if it came out a bit weird, but i meant the Pascal one, not the K20x. In any case, I really don't understand what the graph wants to communicate. They don't even pit Pascal with the top Tesla dogs, K40 and K80(dual gpu).
Double precision. DP in Maxwell is nonexistent which is why the M40/ect isn't on there. The K40 compared to K20x is the difference between Titan and Titan Black so it makes sense. Assuming this "leak" is a real leak I'd be willing to bet it's comparing Pascal to K20x to make the #s seem higher to investors? That's just a guess, but most internal should know at least ballpark figures for both cards when looking at a graph like this. I personally don't care what they're on about with the graph, I like that SP performance if it's true.
 

Eilifein

New Member
Joined
Feb 17, 2016
Messages
3 (0.00/day)
Double precision. DP in Maxwell is nonexistent which is why the M40/ect isn't on there. The K40 compared to K20x is the difference between Titan and Titan Black so it makes sense. Assuming this "leak" is a real leak I'd be willing to bet it's comparing Pascal to K20x to make the #s seem higher to investors? That's just a guess, but most internal should know at least ballpark figures for both cards when looking at a graph like this. I personally don't care what they're on about with the graph, I like that SP performance if it's true.
In that sense, i can agree with.
 
Joined
Aug 15, 2008
Messages
5,941 (0.99/day)
Location
Watauga, Texas
System Name Univac SLI Edition
Processor Intel Xeon 1650 V3 @ 4.2GHz
Motherboard eVGA X99 FTW K
Cooling EK Supremacy EVO, Swiftech MCP50x, Alphacool NeXXos UT60 360, Black Ice GTX 360
Memory 2x16GB Corsair Vengeance LPX 3000MHz
Video Card(s) Nvidia Titan X Tri-SLI w/ EK Blocks
Storage HyperX Predator 240GB PCI-E, Samsung 850 Pro 512GB
Display(s) Dell UltraSharp 34" Ultra-Wide (U3415W) / (Samsung 48" Curved 4k)
Case Phanteks Enthoo Pro M Acrylic Edition
Audio Device(s) Sound Blaster Z
Power Supply Thermaltake 1350watt Toughpower Modular
Mouse Logitech G502
Keyboard CODE 10 keyless MX Clears
Software Windows 10 Pro
In that sense, i can agree with.
If you look at the graph DP is bold and the side says performance on double precision. Then you got guys like me who give 0 Fs about DP and all I'm looking at is that SP figure which looks juicy.
 
Joined
Apr 19, 2011
Messages
2,198 (0.44/day)
Location
So. Cal.
With like some probably six+ years between designs and now on a shrink, I'd hope it can more than double/triple some of the numbers! How else is science and other types of professional workload find "more" faster, this is why they call Super computers... Super! For probably the past three years the scientific community has had to do with Stagnate Computers, pretty much.
 
Joined
Dec 18, 2005
Messages
8,253 (1.19/day)
System Name money pit..
Processor Intel 9900K 4.8 at 1.152 core voltage minus 0.120 offset
Motherboard Asus rog Strix Z370-F Gaming
Cooling Dark Rock TF air cooler.. Stock vga air coolers with case side fans to help cooling..
Memory 32 gb corsair vengeance 3200
Video Card(s) Palit Gaming Pro OC 2080TI
Storage 150 nvme boot drive partition.. 1T Sandisk sata.. 1T Transend sata.. 1T 970 evo nvme m 2..
Display(s) 27" Asus PG279Q ROG Swift 165Hrz Nvidia G-Sync, IPS.. 2560x1440..
Case Gigabyte mid-tower.. cheap and nothing special..
Audio Device(s) onboard sounds with stereo amp..
Power Supply EVGA 850 watt..
Mouse Logitech G700s
Keyboard Logitech K270
Software Win 10 pro..
Benchmark Scores Firestike 29500.. timepsy 14000..
unless they can produce at least 50% more pixel driving power for the same wattage it aint gonna achieve much.. somehow i dont see it.. but time will tell.. :)

i think those looking for huge gains or cost savings are gonna be a little disappointed..

trog
 
Joined
Aug 15, 2008
Messages
5,941 (0.99/day)
Location
Watauga, Texas
System Name Univac SLI Edition
Processor Intel Xeon 1650 V3 @ 4.2GHz
Motherboard eVGA X99 FTW K
Cooling EK Supremacy EVO, Swiftech MCP50x, Alphacool NeXXos UT60 360, Black Ice GTX 360
Memory 2x16GB Corsair Vengeance LPX 3000MHz
Video Card(s) Nvidia Titan X Tri-SLI w/ EK Blocks
Storage HyperX Predator 240GB PCI-E, Samsung 850 Pro 512GB
Display(s) Dell UltraSharp 34" Ultra-Wide (U3415W) / (Samsung 48" Curved 4k)
Case Phanteks Enthoo Pro M Acrylic Edition
Audio Device(s) Sound Blaster Z
Power Supply Thermaltake 1350watt Toughpower Modular
Mouse Logitech G502
Keyboard CODE 10 keyless MX Clears
Software Windows 10 Pro
unless they can produce at least 50% more pixel driving power for the same wattage it aint gonna achieve much.. somehow i dont see it.. but time will tell.. :)

i think those looking for huge gains or cost savings are gonna be a little disappointed..

trog
The numbers and the quoted 50% per watt from Nvidia (rumor) lines up to current Titan X offering.
 
Joined
Sep 7, 2011
Messages
2,785 (0.57/day)
Location
New Zealand
System Name MoneySink
Processor 2600K @ 4.8
Motherboard P8Z77-V
Cooling AC NexXxos XT45 360, RayStorm, D5T+XSPC tank, Tygon R-3603, Bitspower
Memory 16GB Crucial Ballistix DDR3-1600C8
Video Card(s) GTX 780 SLI (EVGA SC ACX + Giga GHz Ed.)
Storage Kingston HyperX SSD (128) OS, WD RE4 (1TB), RE2 (1TB), Cav. Black (2 x 500GB), Red (4TB)
Display(s) Achieva Shimian QH270-IPSMS (2560x1440) S-IPS
Case NZXT Switch 810
Audio Device(s) onboard Realtek yawn edition
Power Supply Seasonic X-1050
Software Win8.1 Pro
Benchmark Scores 3.5 litres of Pale Ale in 18 minutes.
With like some probably six+ years between designs and now on a shrink, I'd hope it can more than double/triple some of the numbers!
Answer me this: Why would you expect this? TSMC have already explained that the process offers twice the transistor density OR a 70% reduction in power than CLN28HPM, and the last time ANY flagship GPU offered more than a doubling of FP32 and FP64 was 2009 (Cypress over RV 770XT/790XT)...and the last time both those parameters were tripled in the space of generation? Never.
How else is science and other types of professional workload find "more" faster, this is why they call Super computers... Super!
There's a reason that supers are referred to as clusters. There is also a reason that the interconnect plays a huge part in these clusters, and also a reason that OmniPath, HSA, and NVLink are seen as future performance multipliers.
Or maybe the fact that it says stacked 3D DRAM?.
The actual slide is probably quite old. The slide deck (PDF) it came from concentrates on HMC so your distinction is very much valid.

I'd be wary about taking too much Pascal info for granted in the slide if the information is that old.
 
Last edited:
Joined
Jan 8, 2016
Messages
34 (0.01/day)
Was there any doubt that hi-end Pascals would feature HBM2? :p Like if Nvidia could afford NOT to include it...

I think I read that lower-end Pascals would feature GDDR5X, but that's about all of it.
 

newtekie1

Semi-Retired Folder
Joined
Nov 22, 2005
Messages
28,473 (4.08/day)
Location
Indiana, USA
Processor Intel Core i7 10850K@5.2GHz
Motherboard AsRock Z470 Taichi
Cooling Corsair H115i Pro w/ Noctua NF-A14 Fans
Memory 32GB DDR4-3600
Video Card(s) RTX 2070 Super
Storage 500GB SX8200 Pro + 8TB with 1TB SSD Cache
Display(s) Acer Nitro VG280K 4K 28"
Case Fractal Design Define S
Audio Device(s) Onboard is good enough for me
Power Supply eVGA SuperNOVA 1000w G3
Software Windows 10 Pro x64
I don't think we will even see the high end Pascal GPU in the consumer space any time soon. I'm guessing nVidia will do the same thing they have done the past few generations, release the mid-range GPU as the top end. Then coast on that for a while, and then release the high end GPU later down the line.
 

the54thvoid

Super Intoxicated Moderator
Staff member
Joined
Dec 14, 2009
Messages
13,124 (2.39/day)
Location
Glasgow - home of formal profanity
Processor Ryzen 7800X3D
Motherboard MSI MAG Mortar B650 (wifi)
Cooling be quiet! Dark Rock Pro 4
Memory 32GB Kingston Fury
Video Card(s) Gainward RTX4070ti
Storage Seagate FireCuda 530 M.2 1TB / Samsumg 960 Pro M.2 512Gb
Display(s) LG 32" 165Hz 1440p GSYNC
Case Asus Prime AP201
Audio Device(s) On Board
Power Supply be quiet! Pure POwer M12 850w Gold (ATX3.0)
Software W10
I don't think we will even see the high end Pascal GPU in the consumer space any time soon. I'm guessing nVidia will do the same thing they have done the past few generations, release the mid-range GPU as the top end. Then coast on that for a while, and then release the high end GPU later down the line.

I think AMD will have a lot to do with that decision. If AMD release Polaris performance parts in Q2 that outshine Maxwell (which is pretty much assured given Fiji is a very close match) Nvidia will be forced to play their hand, if they even have it ready.
If AMD release a solid card, it will be humbling for Nvidia (which we all agree would be very good). It really depends what each company's moles know about each others tech. Perhaps it will be Tahiti versus GK104 all over again? Perhaps it will be 290X versus 780ti? I would like to see AMD come out with a better card and one that puts pressure on Nvidia.

But, if AMD have no Polaris performance part ready, Yeah, Nvidia will do exactly what they always do, milk the mid range as the best part until they need to release their top end. I doubt Nvidia will jump when AMD release the dual Fiji part. It will give AMD hands down the fastest card but it wont be seen as a 'valid' threat to Nvidia's 980ti (dual versus single arguments).
 
Joined
Jun 13, 2012
Messages
1,412 (0.31/day)
Processor i7-13700k
Motherboard Asus Tuf Gaming z790-plus
Cooling Coolermaster Hyper 212 RGB
Memory Corsair Vengeance RGB 32GB DDR5 7000mhz
Video Card(s) Asus Dual Geforce RTX 4070 Super ( 2800mhz @ 1.0volt, ~60mhz overlock -.1volts)
Storage 1x Samsung 980 Pro PCIe4 NVme, 2x Samsung 1tb 850evo SSD, 3x WD drives, 2 seagate
Display(s) Acer Predator XB273u 27inch IPS G-Sync 165hz
Audio Device(s) Logitech Z906 5.1
Power Supply Corsair RMx Series RM850x (OCZ Z series PSU retired after 13 years of service)
Mouse Logitech G502 hero
Keyboard Logitech G710+
If AMD release a solid card, it will be humbling for Nvidia (which we all agree would be very good). It really depends what each company's moles know about each others tech. Perhaps it will be Tahiti versus GK104 all over again? Perhaps it will be 290X versus 780ti? I would like to see AMD come out with a better card and one that puts pressure on Nvidia.

But, if AMD have no Polaris performance part ready, Yeah, Nvidia will do exactly what they always do, milk the mid range as the best part until they need to release their top end. I doubt Nvidia will jump when AMD release the dual Fiji part. It will give AMD hands down the fastest card but it wont be seen as a 'valid' threat to Nvidia's 980ti (dual versus single arguments).
Only "polaris card" amd has even shown off is low-mid range card that is 950/960 range card. Which likely is AMD just doing it ti create some hype/get ahead of nvidia's PR announcements. I don't think 3-4 months of being taped out is enough time for QA testing over AMD's new chip's. As for fiji, that being CF'ed gpu, which will put some people off Since if there is 50% boost that could put next gen gpu in that ball park of that dual gpu without CF/SLI drawbacks.
 
Joined
Sep 29, 2013
Messages
97 (0.02/day)
Processor Intel i7 4960x Ivy-Bridge E @ 4.6 Ghz @ 1.42V
Motherboard x79 AsRock Extreme 11.0
Cooling EK Supremacy Copper Waterblock
Memory 65.5 GBs Corsair Platinum Kit @ 666.7Mhz
Video Card(s) PCIe 3.0 x16 -- Asus GTX Titan Maxwell
Storage Samsung 840 500GBs + OCZ Vertex 4 500GBs 2x 1TB Samsung 850
Audio Device(s) Soundblaster ZXR
Power Supply Corsair 1000W
Mouse Razer Naga
Keyboard Corsair K95
Software Zbrush, 3Dmax, Maya, Softimage, Vue, Sony Vegas Pro, Acid, Soundforge, Adobe Aftereffects, Photoshop
Yeah, but the question is, are there any games out there worth the investment of buying a new top video card??

Star Citizens? New refreshes of Battlefield and Call of Duty sequels? Assassin Creed Sequels? New MMOs with D3D12.0? 4K eye candy that will have diminished value and desire as time approaches 2017 and 2018?

The exception of course is Crysis, but that's about it.

Is there a new Crysis Sequel coming out? C3 is cake on for high-end computers...

I'm sorry if it came out a bit weird, but i meant the Pascal one, not the K20x. In any case, I really don't understand what the graph wants to communicate. They don't even pit Pascal with the top Tesla dogs, K40 and K80(dual gpu).

Edit: To reiterate, i quote the OP: NVIDIA's upcoming flagship GPU based on its next-generation "Pascal" architecture, codenamed GP100. Specifically mentioning "flagship", then comparing it to K20x and 7970 is at the very least misleading.

1. The graph is basically stating a performance improvement in the 64 bit floating point precision area over CPUs and others. As you can see, there's no major improvements for gaming if you focus on 64bitFPP, but rendering and number crushing, that's a different story. 32bitFPP at 12 Tera-whatevers per sec is actually pretty significant for gaming. You can say one of NVidia's many points with this graph is they didn't skimp on the 64bitFPP area like the last 2 to 3 generations on Titan "this time."

2. The graph speaks of a correlation between memory usage and the 1st derivative aka bytes per flop. What NVidia is basically saying is that the point in which information is being stored to the framebuffer for 32 or 64 bit floating point precision executions, the usage is actually less if you compare it to other products with a similar relationship. Furthermore, I think it's a typo when the graph shows 0,256 and 0,805 for SP and DP on the new Pascal. It's probably meant to say 0.256 bytes per flop SP and 0.805 bytes per flop DP.

3. 7970 and above, 64bit FPP has actually gone up for AMD Graphic Cards probably because AMD saw a small niche in the market where AMD Consumers would use their discrete graphic cards to render videos and others in a time where NVidia was taking it away after the first Titan series generation. NVidia was thinking that they could remove the 64bitFPP in gaming cards, and this would probably boost the sells of Quandro Cards, but there wasn't really a big difference in sales (speculation), and you can see this in M4000 where you have a Maxwell Titan and Workstation card providing about the same performance/features to rendering. The only difference is the driver that was probably significant for the most part.

4. Tesla is more of a number cruncher, and it's contender is the Intel's knight's landing or any server CPUs. Simply put, it's an accelerator card, but it still acts as a Graphic Card: Offload GPU executions to the GPU for processing and image rendering, use CUDA, blah blah blah. Some would say that Knights Landing is a work in progress and Intel's failure at an Intel Graphics Card. Intel's Xeon PHI is future a proof toys that can't be used for practical applications because a lot of current softwares don't utilize multi-core coding, and in order to make it work, you need to be someone who knows how to code both for a program and on the Xeon Phi to make it work remotely (in theory). From my understanding, you can't just load a PC game, and 64 micro CPU cores from Knights Landing is going to make your bottleneck troubles disappear. Thus giving you an FPS of 3,000 on World of Warcraft on ultra high settings. NO! The PC game utilizes coding to function with the physical Core for your CPU, but other codes need to be implemented for Knights Landing--that's assuming it works properly when you do that, to make it work. While Intel has it's multicore coding for Knight's Landing, NVidia's Tesla line uses Cuda. They say it's more efficient, and it provides better performance than Knights Landing. Overall, I think it's just a glorified GPU with some Nitro or rocket boosters... Tesla can't act as a substitute CPU through your PCI bus for increased performance, but it can improve rendering times for programs that utilize GPU rendering, and the coding is less complicated??

5. Majority of CPUs have poor 64bit FPP in general. Take a look at the Sandy Bridge Xeon 2690 in the table. 64bitFPP is only what, 243.2Teraflops versus the AMD 7970 at 1010. TeraFLops in DP alone.

6. 64bitFPP isn't a major function for every, normal use and PC Gaming. So in a sense, Intel and AMD can say "big F***en Deal," but to renderers and CGI people who use NVidia's codes to render particle effects, we'll be like OMG, that's going to make my epeen super sexy. Frames times are cut down from 10 minutes to 10 seconds. Woot WOOT! I can hit the clubs a lot sooner.
 
Joined
Aug 20, 2007
Messages
21,560 (3.40/day)
System Name Pioneer
Processor Ryzen R9 9950X
Motherboard GIGABYTE Aorus Elite X670 AX
Cooling Noctua NH-D15 + A whole lotta Sunon and Corsair Maglev blower fans...
Memory 64GB (4x 16GB) G.Skill Flare X5 @ DDR5-6000 CL30
Video Card(s) XFX RX 7900 XTX Speedster Merc 310
Storage Intel 5800X Optane 800GB boot, +2x Crucial P5 Plus 2TB PCIe 4.0 NVMe SSDs
Display(s) 55" LG 55" B9 OLED 4K Display
Case Thermaltake Core X31
Audio Device(s) TOSLINK->Schiit Modi MB->Asgard 2 DAC Amp->AKG Pro K712 Headphones or HDMI->B9 OLED
Power Supply FSP Hydro Ti Pro 850W
Mouse Logitech G305 Lightspeed Wireless
Keyboard WASD Code v3 with Cherry Green keyswitches + PBT DS keycaps
Software Gentoo Linux x64 / Windows 11 Enterprise IoT 2024
Joined
Sep 15, 2011
Messages
6,772 (1.40/day)
Processor Intel® Core™ i7-13700K
Motherboard Gigabyte Z790 Aorus Elite AX
Cooling Noctua NH-D15
Memory 32GB(2x16) DDR5@6600MHz G-Skill Trident Z5
Video Card(s) ZOTAC GAMING GeForce RTX 3080 AMP Holo
Storage 2TB SK Platinum P41 SSD + 4TB SanDisk Ultra SSD + 500GB Samsung 840 EVO SSD
Display(s) Acer Predator X34 3440x1440@100Hz G-Sync
Case NZXT PHANTOM410-BK
Audio Device(s) Creative X-Fi Titanium PCIe
Power Supply Corsair 850W
Mouse Logitech Hero G502 SE
Software Windows 11 Pro - 64bit
Benchmark Scores 30FPS in NFS:Rivals
Guys, you are keep saying that the new ones will be better suited for 4K gaming. LOL. If you think that 0.07% of the users that are gaming in 4K are going to make nVidia/AMD ritch by buyng new cards, then we are all living an a dream world :))))
Common, lets be real for once.
I'm gaming with full details ALL existing games on 1080p with my (now) crappy 780 Ti card and so far there is zero reason to upgrade. If the rummors are true, then those new cards will be at least 700$ or more in East Asia/Europe...
Good luck with that.
 
Joined
Sep 7, 2011
Messages
2,785 (0.57/day)
Location
New Zealand
System Name MoneySink
Processor 2600K @ 4.8
Motherboard P8Z77-V
Cooling AC NexXxos XT45 360, RayStorm, D5T+XSPC tank, Tygon R-3603, Bitspower
Memory 16GB Crucial Ballistix DDR3-1600C8
Video Card(s) GTX 780 SLI (EVGA SC ACX + Giga GHz Ed.)
Storage Kingston HyperX SSD (128) OS, WD RE4 (1TB), RE2 (1TB), Cav. Black (2 x 500GB), Red (4TB)
Display(s) Achieva Shimian QH270-IPSMS (2560x1440) S-IPS
Case NZXT Switch 810
Audio Device(s) onboard Realtek yawn edition
Power Supply Seasonic X-1050
Software Win8.1 Pro
Benchmark Scores 3.5 litres of Pale Ale in 18 minutes.
1. The graph is basically stating a performance improvement in the 64 bit floating point precision area over CPUs and others. As you can see, there's no major improvements for gaming if you focus on 64bitFPP, but rendering and number crushing, that's a different story. 32bitFPP at 12 Tera-whatevers per sec is actually pretty significant for gaming. You can say one of NVidia's many points with this graph is they didn't skimp on the 64bitFPP area like the last 2 to 3 generations on Titan "this time."
It is actually only GM 200 that has reduced FP64. GK110 for Titan/Titan Black is at the same 1:3 (FP64:FP32) rate as it's Quadro and Tesla brethren
3. 7970 and above, 64bit FPP has actually gone up for AMD Graphic Cards...
Not actually true. Tahiti (HD 7970 / FirePro W8000/W9000) has a 1:4 FP64 rate (roughly 950-1000 GFLOP). Hawaii has a native rate of 1:2 (2.1-2.6TF) for FirePro and 1:8 rate for Radeon. Fiji has a native rate of 1:16, which works out to just over half (537 GFLOPS) that of the 7970 (1024 GFLOPS).
probably because AMD saw a small niche in the market where AMD Consumers would use their discrete graphic cards to render videos...
CG Render software seldom uses double precision (V-Ray and Deadpool- the current CG PR poster-boy being an exception), and it is virtually non existent for consumer applications. You also might want to check the state of OpenCL rendering. In general it is a mess.
NVidia was thinking that they could remove the 64bitFPP in gaming cards, and this would probably boost the sells of Quandro Cards, but there wasn't really a big difference in sales (speculation)
Numbers sold don't reflect value especially given the discrepancy in pricing.
and you can see this in M4000 where you have a Maxwell Titan and Workstation card providing about the same performance/features to rendering. The only difference is the driver that was probably significant for the most part.
The driver and the 24/7 support are the big differences between workstation and consumer graphics cards for both vendors. The warranty also guarantees a like for like replacement for a 3 year term.
4. Tesla is more of a number cruncher, and it's contender is the Intel's knight's landing or any server CPUs. Simply put, it's an accelerator card, but it still acts as a Graphic Card: Offload GPU executions to the GPU for processing and image rendering, use CUDA, blah blah blah. Some would say that Knights Landing is a work in progress and Intel's failure at an Intel Graphics Card. Intel's Xeon PHI is future a proof toys that can't be used for practical applications because a lot of current softwares don't utilize multi-core coding, and in order to make it work, you need to be someone who knows how to code both for a program and on the Xeon Phi to make it work remotely (in theory).
Xeon Phi represents a challenge to code for to maximize its potential. Xeon Phi also has an annoying issue of performance decreasing as the job size increases - not a great selling point for supercomputer workloads as a general rule (it also isn't overly efficient). Intel gains market share through its generous support and basically giving the things away (or in some cases, actually giving them away). Certainly a great way to fast track market share even if they aren't actually used ( a la Tiahne-2). I still suspect both Nvidia and Intel (and AMD if they have the resources) will need to address dedicated non-graphics pipelined GPGPU like PEZY-SC
 
Joined
Dec 14, 2011
Messages
1,096 (0.23/day)
Location
South-Africa
Processor AMD Ryzen 9 5900X
Motherboard ASUS ROG STRIX B550-F GAMING (WI-FI)
Cooling Noctua NH-D15 G2
Memory 32GB G.Skill DDR4 3600Mhz CL18
Video Card(s) ASUS GTX 1650 TUF
Storage SAMSUNG 990 PRO 2TB
Display(s) Dell S3220DGF
Case Corsair iCUE 4000X
Audio Device(s) ASUS Xonar D2X
Power Supply Corsair AX760 Platinum
Mouse Razer DeathAdder V2 - Wireless
Keyboard Corsair K70 PRO - OPX Linear Switches
Software Microsoft Windows 11 - Enterprise (64-bit)
I think I read that lower-end Pascals would feature GDDR5X, but that's about all of it.

If they are, they can keep it. HBM2 has better power efficiency, performance as well as making the PCB smaller. It would be a big mistake if they decide to go with GDDR5X, even for the lower tiered cards; just think Media PC's etc.
 
Top