LocalAIPrices

NVIDIA GeForce RTX 4080 Super

Ada Lovelace · RTX 40 Series · ACTIVE

Prices updated 5 days ago
New price (Amazon): 98% of MSRP
MSRP: $999
Used avg (eBay 30d): 20% below new
$/GB VRAM (best): $49/GB
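The $/GB figure can be reproduced from the other numbers in this block. The sketch below is only an illustration: it assumes the new price is exactly 98% of MSRP, that the used average is exactly 20% below that, and that "best" means the cheaper of the two. The function and constant names are hypothetical, not taken from the site.

```python
# Figures taken from the listing above.
MSRP_USD = 999.0
NEW_FRACTION_OF_MSRP = 0.98   # "98% of MSRP" (assumed to describe the new price)
USED_DISCOUNT = 0.20          # "20% below new"
VRAM_GB = 16


def price_per_gb(price_usd: float, vram_gb: float) -> float:
    """Dollars paid per gigabyte of VRAM."""
    return price_usd / vram_gb


new_price = MSRP_USD * NEW_FRACTION_OF_MSRP
used_price = new_price * (1 - USED_DISCOUNT)

# Assumption: "best" is the lowest price available, new or used.
best = min(new_price, used_price)
print(f"new ~${new_price:.0f}, used ~${used_price:.0f}, "
      f"best ~${price_per_gb(best, VRAM_GB):.0f}/GB")
```

Under these assumptions the used price gives about $49/GB, which matches the listed value.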

Specifications

VRAM: 16 GB
Memory type: GDDR6X
Bus width: 256-bit
Memory bandwidth: 736 GB/s
CUDA cores: 10,240
Tensor cores: 320
FP16: 104.4 TFLOPS
TDP: 320 W
Power connector: 16-pin (12VHPWR)
Card length: 304 mm
Slot width: 2.5 slots
PCIe: Gen 4 x16
CUDA compute: 8.9
Max model (Q4): ~28B parameters
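One way to arrive at a figure like "~28B parameters" is to divide the VRAM capacity by the storage cost of one weight. The sketch below assumes 4.5 bits per weight (the llama.cpp Q4_0 layout: 32 four-bit weights plus one FP16 scale per block), counts 1 GB as 10^9 bytes, and ignores KV cache and runtime overhead. Whether the site uses this exact method is not stated; `max_params_billion` is a hypothetical helper.

```python
def max_params_billion(vram_gb: float, bits_per_weight: float) -> float:
    """Largest parameter count (in billions) whose weights alone fill vram_gb.

    Ignores KV cache, activations and runtime overhead, so it is an
    upper bound rather than a practical limit.
    """
    vram_bits = vram_gb * 1e9 * 8
    return vram_bits / bits_per_weight / 1e9


# 16 GB card, assumed 4.5 bits per weight for a 4-bit quantization.
print(f"{max_params_billion(16, 4.5):.1f}B parameters")
```

Because this is a weights-only bound, a model close to the limit may still fail to load once context memory is added.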

Inference Benchmarks (Q4_K_M)

Llama 3.3 8B: 95.0 tok/s
Qwen 3 32B: 28.0 tok/s*
Llama 3.3 70B: no result listed

llama.cpp, batch_size=1, ctx=4096, single GPU. Values marked with * are estimated.
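At batch size 1, token generation is usually limited by how fast the weights can be read from VRAM, so memory bandwidth divided by model size gives a rough ceiling on tok/s. The sketch below applies that rule of thumb to the 8B result. The 4.85 bits per weight for Q4_K_M is an approximate average and an assumption here, and `decode_ceiling_tok_s` is a hypothetical helper, not part of llama.cpp.

```python
def decode_ceiling_tok_s(bandwidth_gb_s: float,
                         params_billion: float,
                         bits_per_weight: float) -> float:
    """Rough upper bound on single-stream decode speed.

    Assumes every weight is read from VRAM once per generated token and
    that nothing else (KV cache reads, kernel launch overhead) costs time.
    """
    weights_gb = params_billion * bits_per_weight / 8
    return bandwidth_gb_s / weights_gb


# 736 GB/s from the spec list; 8B parameters at an assumed ~4.85 bits/weight.
ceiling = decode_ceiling_tok_s(736, 8.0, 4.85)
print(f"ceiling ~{ceiling:.0f} tok/s, listed 95.0 tok/s")
```

A measured value well below the ceiling is expected, since real runs also read the KV cache and never reach peak bandwidth.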

What Can You Run?

| Model | Parameters | Q4_K_M | Q8_0 | FP16 |
| --- | --- | --- | --- | --- |
| Llama 3.3 8B | 8B | Excellent (~95 tok/s) | Usable | Won't fit |
| Llama 3.3 70B | 70.6B | Won't fit | Won't fit | Won't fit |
| Qwen 3 8B | 8.2B | Usable | Usable | Won't fit |
| Qwen 3 32B | 32.8B | Won't fit (~28 tok/s) | Won't fit | Won't fit |
| DeepSeek R1 70B | 70.6B | Won't fit | Won't fit | Won't fit |
| Mistral Nemo 12B | 12.2B | Usable | Usable | Won't fit |
| Phi-4 14B | 14B | Usable | Won't fit | Won't fit |
| Gemma 3 27B | 27.4B | Won't fit | Won't fit | Won't fit |
| Codestral 25B | 25.3B | Won't fit | Won't fit | Won't fit |
| Command R 35B | 35B | Won't fit | Won't fit | Won't fit |
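The fit verdicts in this table can be approximated by comparing each model's weight size against the card's 16 GB. The sketch below is a rough check, not the site's actual method: the bits-per-weight values (about 4.85 for Q4_K_M, 8.5 for Q8_0, 16 for FP16) and the 2 GB headroom for KV cache and runtime overhead are assumptions chosen for illustration, and `fits` is a hypothetical helper.

```python
# Assumed average storage cost per weight for each format.
BITS_PER_WEIGHT = {"Q4_K_M": 4.85, "Q8_0": 8.5, "FP16": 16.0}
VRAM_GB = 16.0
HEADROOM_GB = 2.0  # assumed allowance for KV cache and runtime overhead

# Parameter counts (billions) as listed in the table.
MODELS = {
    "Llama 3.3 8B": 8.0,
    "Qwen 3 8B": 8.2,
    "Mistral Nemo 12B": 12.2,
    "Phi-4 14B": 14.0,
    "Codestral 25B": 25.3,
    "Gemma 3 27B": 27.4,
    "Qwen 3 32B": 32.8,
    "Llama 3.3 70B": 70.6,
}


def fits(params_billion: float, quant: str,
         vram_gb: float = VRAM_GB, headroom_gb: float = HEADROOM_GB) -> bool:
    """True if the weights plus headroom fit entirely in VRAM."""
    weights_gb = params_billion * BITS_PER_WEIGHT[quant] / 8
    return weights_gb + headroom_gb <= vram_gb


for name, params in MODELS.items():
    row = ["fits" if fits(params, q) else "won't fit" for q in BITS_PER_WEIGHT]
    print(f"{name:<18}", *row, sep=" | ")
```

With these assumptions the fits/won't-fit split agrees with the table for the models listed, including the Phi-4 14B case where Q4_K_M fits but Q8_0 does not. Longer contexts need more headroom, so borderline models may flip.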