Author: Jakub Rusinowski · Last updated: June 15, 2026
Yes, the Llama 3.3 family runs on the NVIDIA GeForce RTX 5090.
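| Specification | Value |
| --- | --- |
| VRAM | 32 GB |
| Memory Bandwidth | 1792 GB/s |

| Model | Quantization | Fit | Size | Est. speed |
| --- | --- | --- | --- | --- |
| Llama 3.3 70B Instruct | Q2_K_XS | Tight | 26 GB | ~69 tok/s |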
Llama 3.3 70B Instruct fits at Q2_K_XS quantization (26 GB, a tight fit in 32 GB of VRAM), with an estimated throughput of ~69 tokens/sec.
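
The page does not say how the ~69 tok/s estimate is derived. A common rule of thumb for single-stream decoding is that every generated token requires reading all model weights once, so throughput is bounded by memory bandwidth divided by model size. The sketch below applies that assumption to the figures quoted above; the function names are hypothetical, and the result is an upper-bound estimate rather than a benchmark.

```python
def estimate_tokens_per_sec(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Bandwidth-bound decode estimate (assumed rule of thumb, not the page's stated method).

    Assumes each generated token reads the full set of weights once.
    """
    if model_size_gb <= 0:
        raise ValueError("model_size_gb must be positive")
    return bandwidth_gb_s / model_size_gb


def vram_headroom_gb(vram_gb: float, model_size_gb: float) -> float:
    """VRAM left after loading the weights (for KV cache and runtime buffers)."""
    return vram_gb - model_size_gb


# Figures taken from the page above.
VRAM_GB = 32.0            # RTX 5090 VRAM
BANDWIDTH_GB_S = 1792.0   # RTX 5090 memory bandwidth
MODEL_SIZE_GB = 26.0      # Llama 3.3 70B Instruct at Q2_K_XS

tok_s = estimate_tokens_per_sec(BANDWIDTH_GB_S, MODEL_SIZE_GB)
headroom = vram_headroom_gb(VRAM_GB, MODEL_SIZE_GB)

print(f"Estimated decode speed: ~{tok_s:.0f} tok/s")
print(f"VRAM headroom after weights: {headroom:.0f} GB")
```

Under this assumption the estimate comes out close to the quoted ~69 tok/s, and the remaining 6 GB has to hold the KV cache and runtime buffers, which is likely why the fit is labeled "Tight". Real throughput will typically be lower and depends on context length, the inference runtime, and kernel efficiency.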