Can I Run Llama 4 on NVIDIA GeForce RTX 5090?

Written by Jakub Rusinowski · Last updated June 15, 2026

Yes — the Llama 4 family runs on the NVIDIA GeForce RTX 5090.

NVIDIA GeForce RTX 5090 Specs

VRAM: 32 GB
Memory Bandwidth: 1792 GB/s

Llama 4 Variants That Fit

Llama 4 Maverick 17B · Q4_K_M · 24 GB · ~400 tok/s (est.)
Llama 4 Scout 17B · Q4_K_M · 10.5 GB · ~400 tok/s (est.)
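
The fit check behind this list is simple arithmetic: compare each variant's listed weight size against the card's 32 GB of VRAM and see how much is left over for the KV cache and runtime buffers. The sketch below uses only the figures listed on this page. The function names and the 2 GB reserve are illustrative assumptions, not a measured requirement.

```python
VRAM_GB = 32.0  # RTX 5090 VRAM, from the specs on this page

# (name, quantization, weight size in GB), copied from the list on this page
VARIANTS = [
    ("Llama 4 Maverick 17B", "Q4_K_M", 24.0),
    ("Llama 4 Scout 17B", "Q4_K_M", 10.5),
]


def headroom_gb(weights_gb: float, vram_gb: float = VRAM_GB) -> float:
    """VRAM left after loading the weights, in GB."""
    return vram_gb - weights_gb


def fits(weights_gb: float, vram_gb: float = VRAM_GB, reserve_gb: float = 2.0) -> bool:
    """True if the weights fit with at least reserve_gb to spare.

    reserve_gb is an assumed allowance for KV cache and runtime buffers;
    real usage grows with context length and batch size.
    """
    return headroom_gb(weights_gb, vram_gb) >= reserve_gb


if __name__ == "__main__":
    for name, quant, size_gb in VARIANTS:
        status = "fits" if fits(size_gb) else "does not fit"
        print(f"{name} ({quant}): {size_gb} GB, {status}, "
              f"{headroom_gb(size_gb)} GB headroom")
```

With the listed sizes, the 24 GB variant leaves 8 GB of headroom and the 10.5 GB variant leaves 21.5 GB, so the smaller one has more room for long contexts.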

FAQ

Can I run Llama 4 on the NVIDIA GeForce RTX 5090?

Yes — the Llama 4 family runs on the NVIDIA GeForce RTX 5090.

Which Llama 4 variant fits best on the NVIDIA GeForce RTX 5090?

Llama 4 Maverick 17B at Q4_K_M quantization (24 GB) is the best fit, at an estimated 400 tokens/sec.
