NVIDIA GeForce RTX 4090 — Local AI Performance

Still the benchmark for consumer AI. 24 GB VRAM fits DeepSeek R1 32B and Llama 4 Scout fully. 165 t/s on Llama 3.1 8B.

Quick Specs

VRAM24 GB
Memory Bandwidth1008 GB/s
TDP450 W
ArchitectureAda Lovelace AD102
MSRP$1,599
Speed (Llama 3.1 8B Q4_K_M)~165 tokens/sec

Compatible LLMs

← Check your hardware | Full benchmarks