Written by Jakub Rusinowski · Last updated June 15, 2026
Yes — the Qwen 3.5 family runs on the NVIDIA GeForce RTX 4070.
| Specification | Value |
| --- | --- |
| VRAM | 12 GB |
| Memory Bandwidth | 504 GB/s |

| Model | Quantization | Size | Estimated speed |
| --- | --- | --- | --- |
| Qwen 3.5 9B | Q4_K_M | 6.6 GB | ~76 tok/s |
| Qwen 3.5 4B | Q4_K_M | 3.4 GB | ~148 tok/s |
| Qwen 3.5 2B | Q4_K_M | 2.7 GB | ~187 tok/s |
| Qwen 3.5 0.8B | Q4_K_M | 1 GB | ~400 tok/s |
The largest model listed here, Qwen 3.5 9B at Q4_K_M quantization (6.6 GB), fits within the card's 12 GB of VRAM and is estimated to run at ~76 tokens/sec.
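The page does not say how the speed estimates are derived. As a rough sanity check, the 9B, 4B, and 2B figures are consistent with a common rule of thumb: token generation is memory-bandwidth bound, so tokens/sec is approximately memory bandwidth divided by model size. The sketch below applies that rule under this assumption; the function names and the 1 GB overhead allowance are hypothetical, chosen only for illustration.

```python
# Rough, bandwidth-bound estimate of generation speed.
# Assumption (not stated by the source): tokens/sec ~= bandwidth / model size.

def estimate_tokens_per_sec(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Upper-bound style estimate: each token reads all weights once."""
    return bandwidth_gb_s / model_size_gb


def fits_in_vram(model_size_gb: float, vram_gb: float, overhead_gb: float = 1.0) -> bool:
    """Check that weights plus an assumed overhead allowance fit in VRAM.

    overhead_gb is a hypothetical allowance for KV cache and runtime buffers.
    """
    return model_size_gb + overhead_gb <= vram_gb


if __name__ == "__main__":
    rtx_4070 = {"vram_gb": 12.0, "bandwidth_gb_s": 504.0}
    # Sizes taken from the listing above (Q4_K_M).
    models = {
        "Qwen 3.5 9B": 6.6,
        "Qwen 3.5 4B": 3.4,
        "Qwen 3.5 2B": 2.7,
    }
    for name, size_gb in models.items():
        speed = estimate_tokens_per_sec(rtx_4070["bandwidth_gb_s"], size_gb)
        fits = fits_in_vram(size_gb, rtx_4070["vram_gb"])
        print(f"{name}: fits={fits}, ~{speed:.0f} tok/s (est.)")
```

For the 0.8B model the same ratio gives about 504 tok/s, higher than the ~400 tok/s listed, so the page presumably applies a cap or some other correction for very small models. Real throughput also depends on the runtime, context length, and batch size, so treat these numbers as estimates only.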