Can I Run Llama 3.2 Family on NVIDIA GeForce RTX 4080 Super?

By Jakub Rusinowski · Last updated: June 15, 2026

Yes. The Llama 3.2 family runs on the NVIDIA GeForce RTX 4080 Super: every variant listed below fits within the card's 16 GB of VRAM at Q4_K_M quantization.

NVIDIA GeForce RTX 4080 Super Specs

VRAM: 16 GB
Memory Bandwidth: 736 GB/s

Llama 3.2 Family Variants That Fit

Llama 3.2 11B Vision Instruct · Q4_K_M · 7.8 GB · ~94 tok/s (est.)
Llama 3.2 3B Instruct · Q4_K_M · 2.2 GB · ~335 tok/s (est.)
Llama 3.2 1B Instruct · Q4_K_M · 0.8 GB · ~400 tok/s (est.)
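
The estimates above are consistent with a simple memory-bandwidth-bound rule of thumb: tokens per second is roughly memory bandwidth divided by model size, since each generated token reads the full set of weights once. The page does not state its method, so the sketch below is an assumption, not the site's actual formula. In particular, the 400 tok/s ceiling (ASSUMED_CAP_TOK_S) is a hypothetical value inferred from the 1B row, and the fit check ignores KV cache and runtime overhead.

```python
# Specs and model sizes are copied from the page.
# The formula and the cap are ASSUMPTIONS, not the site's documented method.
VRAM_GB = 16.0
BANDWIDTH_GB_S = 736.0
ASSUMED_CAP_TOK_S = 400.0  # hypothetical ceiling inferred from the 1B row

VARIANTS_GB = {
    "Llama 3.2 11B Vision Instruct": 7.8,
    "Llama 3.2 3B Instruct": 2.2,
    "Llama 3.2 1B Instruct": 0.8,
}


def fits(size_gb, vram_gb=VRAM_GB):
    """Rough fit check: weights only, no KV cache or runtime overhead."""
    return size_gb < vram_gb


def estimate_tok_s(size_gb, bandwidth_gb_s=BANDWIDTH_GB_S, cap=ASSUMED_CAP_TOK_S):
    """Bandwidth-bound estimate: one full pass over the weights per token."""
    return min(bandwidth_gb_s / size_gb, cap)


for name, size_gb in VARIANTS_GB.items():
    rate = round(estimate_tok_s(size_gb))
    print(f"{name}: fits={fits(size_gb)}, ~{rate} tok/s (est.)")
```

Under these assumptions the sketch reproduces the three figures listed above (94, 335, and 400 tok/s). Real throughput depends on the inference runtime, context length, and batch size, so treat the numbers as upper-bound estimates rather than benchmarks.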

FAQ

Can I run Llama 3.2 Family on the NVIDIA GeForce RTX 4080 Super?

Yes. The Llama 3.2 family runs on the NVIDIA GeForce RTX 4080 Super.

Which Llama 3.2 Family variant fits best on the NVIDIA GeForce RTX 4080 Super?

Llama 3.2 11B Vision Instruct at Q4_K_M quantization (7.8 GB), at an estimated 94 tokens/sec. It is the largest variant listed and uses about half of the card's 16 GB of VRAM.
