← Back to comparison
NVIDIA
NVIDIA GeForce RTX 4090
Ada Lovelace · RTX 40 Series · ACTIVE
We earn commissions from purchases made through links on this site. This doesn't affect our rankings or recommendations.
Prices updated 5 days ago
$/GB VRAM (best)
$65/GB
MSRP: $1,599 (138% of MSRP)
Specifications
| VRAM | 24 GB |
| Memory type | GDDR6X |
| Bus width | 384-bit |
| Memory bandwidth | 1,008 GB/s |
| CUDA cores | 16,384 |
| Tensor cores | 512 |
| FP16 | 165.2 TFLOPS |
| TDP | 450W |
| Power connector | 16-pin (12VHPWR) |
| Card length | 304 mm |
| Slot width | 3 slots |
| PCIe | Gen 4 x16 |
| CUDA compute | 8.9 |
| Max model (Q4) | ~44B parameters |
Inference Benchmarks (Q4_K_M)
Llama 3.3 8B
128.0 tok/s
Qwen 3 32B
42.0 tok/s
Llama 3.3 70B
10.0 tok/s
llama.cpp, batch_size=1, ctx=4096, single GPU.
What Can You Run?
| Model | Q4_K_M | Q8_0 | FP16 |
|---|---|---|---|
| Llama 3.3 8B8B | Excellent~128 tok/s | Usable | Usable |
| Llama 3.3 70B70.6B | Won't fit~10 tok/s | Won't fit | Won't fit |
| Qwen 3 8B8.2B | Usable | Usable | Usable |
| Qwen 3 32B32.8B | Excellent~42 tok/s | Won't fit | Won't fit |
| DeepSeek R1 70B70.6B | Won't fit | Won't fit | Won't fit |
| Mistral Nemo 12B12.2B | Usable | Usable | Won't fit |
| Phi-4 14B14B | Usable | Usable | Won't fit |
| Gemma 3 27B27.4B | Usable | Won't fit | Won't fit |
| Codestral 25B25.3B | Usable | Won't fit | Won't fit |
| Command R 35B35B | Usable | Won't fit | Won't fit |