Written by Jakub Rusinowski · Last updated June 15, 2026
Yes — the Llama 4 family runs on the NVIDIA GeForce RTX 5090.
| Item | Details |
| --- | --- |
| VRAM | 32 GB |
| Memory Bandwidth | 1792 GB/s |
| Llama 4 Maverick 17B | Q4_K_M · 24 GB · ~400 tok/s (est.) |
| Llama 4 Scout 17B | Q4_K_M · 10.5 GB · ~400 tok/s (est.) |
Llama 4 Maverick 17B at Q4_K_M quantization (24 GB) runs at an estimated ~400 tokens/sec.
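
The fit verdict comes down to comparing each quantized model size against the card's 32 GB of VRAM. Below is a minimal sketch of that comparison, not the checker's actual logic: the function name `fits_in_vram` is hypothetical, and the 2 GB headroom for KV cache and runtime buffers is an assumed value. The model sizes and VRAM figure are the ones listed on this page.

```python
# Sketch of a VRAM fit check using the sizes listed on this page.
# The 2 GB headroom default is an assumption, not a figure from the page.

def fits_in_vram(model_gb: float, vram_gb: float, headroom_gb: float = 2.0) -> bool:
    """Return True if the model weights plus the assumed headroom fit in VRAM."""
    return model_gb + headroom_gb <= vram_gb


RTX_5090_VRAM_GB = 32.0

# Quantized model sizes (Q4_K_M) as listed above.
models = {
    "Llama 4 Maverick 17B (Q4_K_M)": 24.0,
    "Llama 4 Scout 17B (Q4_K_M)": 10.5,
}

for name, size_gb in models.items():
    free_gb = RTX_5090_VRAM_GB - size_gb
    verdict = "fits" if fits_in_vram(size_gb, RTX_5090_VRAM_GB) else "does not fit"
    print(f"{name}: {verdict}, {free_gb:.1f} GB left after weights")
```

With the listed sizes, both models pass the check, leaving 8.0 GB and 21.5 GB of VRAM respectively once the weights are loaded. Longer context windows need more headroom, so the assumed 2 GB may be too small for some workloads.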