Qwen 3 — Local AI Model by Alibaba Cloud

Qwen 3 is Alibaba Cloud's model series with a hybrid thinking mode. Each model can switch between a fast response mode and step-by-step Chain-of-Thought reasoning on demand: add /think to a prompt for hard problems, or /no_think for quick answers. The series is available in dense and Mixture-of-Experts (MoE) variants.
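As a minimal sketch of the toggle described above, the helper below appends the /think or /no_think switch to a user prompt before it is sent to the model. The function name with_thinking_switch is hypothetical and used only for illustration; placing the switch at the end of the prompt is an assumption about usage, not a requirement stated on this page.

```python
def with_thinking_switch(prompt: str, think: bool) -> str:
    """Return the prompt with Qwen 3's /think or /no_think switch appended.

    Hypothetical helper: it only edits the prompt text and does not
    contact any model.
    """
    switch = "/think" if think else "/no_think"
    # Strip trailing whitespace so the switch is separated by one space.
    return f"{prompt.rstrip()} {switch}"


# A hard problem gets deep reasoning, a simple lookup gets a fast answer.
hard = with_thinking_switch("Prove that the square root of 2 is irrational.", think=True)
quick = with_thinking_switch("What is the capital of France?", think=False)
print(hard)
print(quick)
```

The resulting strings can be typed into an ollama run session or passed as the prompt in a script.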

Hardware Requirements

Qwen 3 8B · Min 6 GB VRAM · Q4_K_M · 128,000 ctx · ollama run qwen3:8b
Qwen 3 14B · Min 10 GB VRAM · Q4_K_M · 128,000 ctx · ollama run qwen3:14b
Qwen 3 32B · Min 20 GB VRAM · Q4_K_M · 128,000 ctx · ollama run qwen3:32b
Qwen 3 30B-A3B (MoE) · Min 8 GB VRAM · Q4_K_M · 128,000 ctx · ollama run qwen3:30b-a3b
Qwen 3 235B-A22B (MoE) · Min 80 GB VRAM · Q4_K_M · 128,000 ctx · ollama run qwen3:235b-a22b
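To make the list above easier to act on, here is a small sketch that picks a model tag for a given amount of VRAM. The figures are copied from the list; the helper name pick_model and the rule of choosing the fitting model with the highest minimum-VRAM figure are assumptions made for illustration, and that figure is only a rough proxy for model capability.

```python
from typing import Optional

# (label, Ollama tag, minimum VRAM in GB at Q4_K_M), copied from the list above.
MODELS = [
    ("Qwen 3 8B", "qwen3:8b", 6),
    ("Qwen 3 14B", "qwen3:14b", 10),
    ("Qwen 3 32B", "qwen3:32b", 20),
    ("Qwen 3 30B-A3B (MoE)", "qwen3:30b-a3b", 8),
    ("Qwen 3 235B-A22B (MoE)", "qwen3:235b-a22b", 80),
]


def pick_model(vram_gb: float) -> Optional[str]:
    """Return the tag of the most demanding model that still fits, or None.

    Hypothetical selection rule: among models whose listed minimum VRAM
    is at most vram_gb, choose the one with the highest minimum.
    """
    fitting = [m for m in MODELS if m[2] <= vram_gb]
    if not fitting:
        return None
    return max(fitting, key=lambda m: m[2])[1]


for vram in (4, 8, 12, 24):
    print(vram, "GB ->", pick_model(vram))
```

A result of None means that no model in the list fits the stated minimum, so a smaller model or a different setup would be needed.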

How to Run Locally

Install Ollama, then run: ollama run qwen3:8b

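Minimum VRAM: 6 GB for qwen3:8b; the larger models need more, as listed under Hardware Requirements. Q4_K_M quantization is recommended, and all VRAM figures on this page assume it.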
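Once the model is running, it can also be queried from a script rather than the interactive prompt. The sketch below assumes Ollama is serving its local REST API at the default address http://localhost:11434 and that qwen3:8b has already been downloaded; the helper names build_request and generate are hypothetical.

```python
import json
import urllib.request

# Assumed default address of a locally running Ollama server.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str, model: str = "qwen3:8b") -> urllib.request.Request:
    """Build a non-streaming POST request for Ollama's generate endpoint."""
    body = json.dumps({"model": model, "prompt": prompt, "stream": False})
    return urllib.request.Request(
        OLLAMA_URL,
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = "qwen3:8b", timeout: float = 300.0) -> str:
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, model), timeout=timeout) as resp:
        return json.loads(resp.read())["response"]


# Example call (requires a running Ollama server with the model pulled):
# print(generate("Explain quantization in one sentence. /no_think"))
```

Building the request separately from sending it keeps the payload easy to inspect without a server running.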