The GeForce 8800 GTS 640 was a high-end graphics card by NVIDIA, launched on November 8th, 2006. Built on the 90 nm process, and based on the G80 graphics processor, in its G80-100-K0-A2 variant, the card supports DirectX 11.1. Even though it supports DirectX 11, the feature level is only 10_0, which can be problematic with many DirectX 11 & DirectX 12 titles. The G80 graphics processor is a large chip with a die area of 484 mm² and 681 million transistors. Unlike the fully unlocked GeForce 8800 GTX, which uses the same GPU but has all 128 shaders enabled, NVIDIA has disabled some shading units on the GeForce 8800 GTS 640 to reach the product's target shader count. It features 96 shading units, 24 texture mapping units, and 20 ROPs. NVIDIA has paired 640 MB GDDR3 memory with the GeForce 8800 GTS 640, which are connected using a 320-bit memory interface. The GPU is operating at a frequency of 513 MHz, memory is running at 792 MHz. Being a dual-slot card, the NVIDIA GeForce 8800 GTS 640 draws power from 1x 6-pin power connector, with power draw rated at 143 W maximum. Display outputs include: 2x DVI, 1x S-Video. GeForce 8800 GTS 640 is connected to the rest of the system using a PCI-Express 1.0 x16 interface. The card measures 267 mm in length, and features a dual-slot cooling solution. Its price at launch was 449 US Dollars.
Based on TPU review data: "Performance Summary" at 1920x1080, 4K for 2080 Ti and faster.
Performance estimated based on architecture, shader count and clocks.
Clock Speeds
GPU Clock
513 MHz
Shader Clock
1188 MHz
Memory Clock
792 MHz
1584 Mbps effective
Memory
Memory Size
640 MB
Memory Type
GDDR3
Memory Bus
320 bit
Bandwidth
63.36 GB/s
Render Config
Shading Units
96
TMUs
24
ROPs
20
SM Count
12
L2 Cache
80 KB
Theoretical Performance
Pixel Rate
10.26 GPixel/s
Texture Rate
24.62 GTexel/s
FP32 (float)
228.1 GFLOPS
Board Design
Slot Width
Dual-slot
Length
267 mm
10.5 inches
TDP
143 W
Suggested PSU
300 W
Outputs
2x DVI 1x S-Video
Power Connectors
1x 6-pin
Board Number
P356
Graphics Features
DirectX
11.1 (10_0)
OpenGL
3.3
OpenCL
1.1 (1.0)
Vulkan
N/A
CUDA
1.0
Shader Model
4.0
G80 GPU Notes
Each Streaming Multiprocessor(SM) in the chip architecture contains 8 SPs and 2 SFUs.
Each SP can fulfill up to two single precision operations MAD per clock.
Each SFU can fulfill up to four operations SF per clock (these units can also handle single-precision floating-point multiplications per clock).
The approximate ratio of operations MAD to operations SF is equal 2:1.
The theoretical SP + SFU performance in single-precision floating point operations
[FLOPSsp + sfu, GFLOPS] of the graphics card with shader count [n] and shader frequency [f, GHz], is estimated by the following formula: FLOPSsp+sfu; f × n × 3.
Alternative formula: FLOPSsp+sfu; f × m × ( 8 SPs * 2 (MAD) + 4 * 2 SFUs ). [m] - SM count.
SP - Shader Processor (Unified Shader, CUDA Core)
SFU - Special Function Unit
SM - Streaming Multiprocessor
MAD - ADD+MUL
Latest Drivers:
Windows 2000:
Quadro Release 261.19
Windows XP / Server 2003 x64:
GeForce Release 340.52
Quadro Release R319 U2 (321.01)
Tesla Release 258.96
Windows Vista / 7 / 8 / 8.1 / 10 32-bit:
GeForce Release 342.01
Quadro Release R340 U15 (342.01)
Tesla Release 258.96