The GeForce 8800 Ultra was an enthusiast-class graphics card by NVIDIA, launched on May 2nd, 2007. Built on the 90 nm process and based on the G80 graphics processor in its G80-450-A3 variant, the card supports DirectX 11.1. Even though it supports DirectX 11, the feature level is only 10_0, which can be problematic with many DirectX 11 and DirectX 12 titles. The G80 graphics processor is a large chip with a die area of 484 mm² and 681 million transistors. It features 128 shading units, 32 texture mapping units, and 24 ROPs. NVIDIA has paired 768 MB of GDDR3 memory with the GeForce 8800 Ultra, connected through a 384-bit memory interface. The GPU operates at a frequency of 612 MHz, and the memory runs at 1080 MHz. Being a dual-slot card, the NVIDIA GeForce 8800 Ultra draws power from 2x 6-pin power connectors, with power draw rated at 171 W maximum. Display outputs include 2x DVI and 1x S-Video. The GeForce 8800 Ultra is connected to the rest of the system using a PCI-Express 1.0 x16 interface. The card measures 270 mm in length. Its price at launch was 829 US Dollars.
Clock Speeds
GPU Clock: 612 MHz
Shader Clock: 1512 MHz
Memory Clock: 1080 MHz (2.2 Gbps effective)
Memory
Memory Size: 768 MB
Memory Type: GDDR3
Memory Bus: 384 bit
Bandwidth: 103.7 GB/s
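The bandwidth figure can be reproduced from the memory clock and bus width listed on this page. The following is a minimal sketch, assuming GDDR3 transfers data twice per memory clock and that 1 GB means 10^9 bytes; the function and parameter names are illustrative, not taken from any vendor tool.

```python
def memory_bandwidth_gb_s(memory_clock_mhz: float,
                          bus_width_bits: int,
                          transfers_per_clock: int = 2) -> float:
    """Peak memory bandwidth in GB/s (1 GB = 10^9 bytes).

    transfers_per_clock=2 is an assumption for double-data-rate GDDR3.
    """
    # Effective per-pin data rate in Gbit/s (1080 MHz -> 2.16 Gbps).
    effective_rate_gbps = memory_clock_mhz * transfers_per_clock / 1000
    # Every bus line moves one bit per transfer; divide by 8 for bytes.
    return effective_rate_gbps * bus_width_bits / 8


bandwidth = memory_bandwidth_gb_s(memory_clock_mhz=1080, bus_width_bits=384)
print(f"{bandwidth:.1f} GB/s")  # prints "103.7 GB/s"
```

Under these assumptions the effective rate works out to 2.16 Gbps, which the listing rounds to 2.2 Gbps.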
Render Config
Shading Units: 128
TMUs: 32
ROPs: 24
SM Count: 16
L2 Cache: 96 KB
Theoretical Performance
Pixel Rate: 14.69 GPixel/s
Texture Rate: 39.17 GTexel/s
FP32 (float): 387.1 GFLOPS
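The pixel rate and FP32 figures follow from the unit counts and clocks listed on this page. The sketch below assumes one pixel per ROP per GPU clock and two floating-point operations (one MAD) per shading unit per shader clock; the function names are illustrative. The listed texture rate is left out on purpose: 32 TMUs × 612 MHz gives 19.58 GTexel/s, while 39.17 GTexel/s corresponds to 64 × 612 MHz, so the counting rule behind it is not clear from the listing alone.

```python
def pixel_rate_gpixel_s(rops: int, gpu_clock_mhz: float) -> float:
    """Peak pixel fill rate, assuming one pixel per ROP per GPU clock."""
    return rops * gpu_clock_mhz / 1000


def fp32_gflops(shading_units: int, shader_clock_mhz: float,
                flops_per_unit_per_clock: int = 2) -> float:
    """Peak FP32 throughput, assuming one MAD (2 FLOPs) per unit per clock."""
    return shading_units * shader_clock_mhz * flops_per_unit_per_clock / 1000


pixel_rate = pixel_rate_gpixel_s(rops=24, gpu_clock_mhz=612)
fp32 = fp32_gflops(shading_units=128, shader_clock_mhz=1512)

print(f"Pixel rate: {pixel_rate:.2f} GPixel/s")  # prints "Pixel rate: 14.69 GPixel/s"
print(f"FP32: {fp32:.1f} GFLOPS")                # prints "FP32: 387.1 GFLOPS"
```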
Board Design
Slot Width: Dual-slot
Length: 270 mm (10.6 inches)
TDP: 171 W
Suggested PSU: 450 W
Outputs: 2x DVI, 1x S-Video
Power Connectors: 2x 6-pin
Board Number: P355 SKU 2
Graphics Features
DirectX: 11.1 (10_0)
OpenGL: 3.3
OpenCL: 1.1 (1.0)
Vulkan: N/A
CUDA: 1.0
Shader Model: 4.0
G80 GPU Notes
Each Streaming Multiprocessor (SM) in the chip architecture contains 8 SPs and 2 SFUs.
Each SP can perform up to two single-precision operations per clock (one MAD).
Each SFU can perform up to four SF operations per clock (these units can also handle single-precision floating-point multiplications).
The approximate ratio of MAD operations to SF operations is 2:1.
The theoretical SP + SFU performance in single-precision floating-point operations [FLOPSsp+sfu, GFLOPS] of a graphics card with shader count [n] and shader frequency [f, GHz] is estimated by the following formula: FLOPSsp+sfu ≈ f × n × 3.
Alternative formula: FLOPSsp+sfu ≈ f × m × (8 SPs × 2 (MAD) + 4 (SF) × 2 SFUs), where [m] is the SM count.
SP - Shader Processor (Unified Shader, CUDA Core)
SFU - Special Function Unit
SM - Streaming Multiprocessor
MAD - Multiply-Add (MUL+ADD)
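The two estimation formulas in these notes can be checked against each other with the values listed for this card (shader clock 1512 MHz, 128 shading units, 16 SMs). This is a minimal sketch with illustrative function names; the per-SM constants come straight from the notes.

```python
SPS_PER_SM = 8       # shader processors per SM, per the notes
SFUS_PER_SM = 2      # special function units per SM, per the notes
OPS_PER_SP = 2       # one MAD per clock, counted as two operations
OPS_PER_SFU = 4      # up to four SF operations per clock


def flops_from_shader_count(f_ghz: float, n: int) -> float:
    """First formula: f x n x 3, in GFLOPS."""
    return f_ghz * n * 3


def flops_from_sm_count(f_ghz: float, m: int) -> float:
    """Alternative formula: f x m x (8 x 2 + 4 x 2), in GFLOPS."""
    ops_per_sm = SPS_PER_SM * OPS_PER_SP + OPS_PER_SFU * SFUS_PER_SM  # 24
    return f_ghz * m * ops_per_sm


by_shaders = flops_from_shader_count(f_ghz=1.512, n=128)
by_sms = flops_from_sm_count(f_ghz=1.512, m=16)

print(f"{by_shaders:.1f} GFLOPS")  # prints "580.6 GFLOPS"
print(f"{by_sms:.1f} GFLOPS")      # prints "580.6 GFLOPS"
```

Both forms agree because each SM contributes 24 operations per clock across 8 SPs, which is 3 operations per SP. The result is higher than the listed FP32 figure, which counts only the MAD throughput of the SPs.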
Latest Drivers:
Windows 2000:
Quadro Release 261.19
Windows XP / Server 2003 x64:
GeForce Release 340.52
Quadro Release R319 U2 (321.01)
Tesla Release 258.96
Windows Vista / 7 / 8 / 8.1 / 10 32-bit:
GeForce Release 342.01
Quadro Release R340 U15 (342.01)
Tesla Release 258.96