DeepSeek V3.2 — Local AI Model by DeepSeek

DeepSeek V3.2 is DeepSeek's updated 685B-parameter Mixture-of-Experts (MoE) model, delivering approximately 90% of GPT-5 quality at 1/50th the price. It is a significant performance jump over V3, scoring 91.67% on AIME 2026 in thinking mode. The model is MIT-licensed and widely adopted. At ~$0.28 per 1M input tokens, it is one of the most cost-effective frontier models available via API. It can also be self-hosted on multi-GPU clusters (~370 GB at Q4).
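To make the quoted API price concrete, here is a minimal sketch that turns the ~$0.28 per 1M input tokens figure into a cost estimate. The function name and the sample token counts are illustrative assumptions. Output-token pricing is not stated above, so it is left out.

```python
# Rough input-token cost estimate based on the ~$0.28 per 1M input tokens
# quoted above. Output-token pricing is not given in the text and is ignored.
INPUT_PRICE_PER_MILLION_USD = 0.28


def input_cost_usd(input_tokens: int,
                   price_per_million: float = INPUT_PRICE_PER_MILLION_USD) -> float:
    """Return the approximate cost in USD for a number of input tokens."""
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    return input_tokens / 1_000_000 * price_per_million


if __name__ == "__main__":
    # Hypothetical workloads, chosen only for illustration.
    for tokens in (10_000, 1_000_000, 250_000_000):
        print(f"{tokens:>12,} input tokens ≈ ${input_cost_usd(tokens):,.2f}")
```

Because the price is approximate, treat the results as order-of-magnitude estimates rather than billing figures.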

Hardware Requirements

DeepSeek V3.2 685B: Min 370 GB VRAM · Q4_K_M · 128,000 ctx
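The 370 GB figure can be sanity-checked with simple arithmetic. The sketch below derives the average bits per weight implied by 685B parameters in 370 GB, then estimates how many GPUs are needed to hold the weights. It assumes decimal gigabytes (1 GB = 1e9 bytes). The 80 GB GPU size and the overhead fraction are hypothetical inputs, not values from this page. Memory for the KV cache at long context is not modeled.

```python
import math

PARAMS = 685e9     # parameter count stated above (685B)
WEIGHTS_GB = 370   # stated minimum VRAM at Q4_K_M, assumed to be decimal GB


def implied_bits_per_weight(params: float, size_gb: float) -> float:
    """Average bits stored per parameter for a given total size."""
    return size_gb * 1e9 * 8 / params


def gpus_needed(size_gb: float, gpu_vram_gb: float,
                overhead_fraction: float = 0.0) -> int:
    """Smallest GPU count whose combined VRAM covers the size plus overhead.

    overhead_fraction is a hypothetical allowance for activations and
    runtime buffers; the source does not give a value for it.
    """
    return math.ceil(size_gb * (1 + overhead_fraction) / gpu_vram_gb)


if __name__ == "__main__":
    bpw = implied_bits_per_weight(PARAMS, WEIGHTS_GB)
    print(f"Implied average: {bpw:.2f} bits per weight")
    print(f"80 GB GPUs, weights only: {gpus_needed(WEIGHTS_GB, 80)}")
    print(f"80 GB GPUs, 10% overhead: {gpus_needed(WEIGHTS_GB, 80, 0.10)}")
```

The implied average of a little over 4 bits per weight is consistent with a 4-bit quantization, which is why the footprint lands far below the size of the unquantized weights.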

How to Run Locally

Install Ollama then run: ollama run

Minimum VRAM: 370 GB. For best results use Q4_K_M quantization.
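Once a model is being served by Ollama, it can also be queried from a script through Ollama's local HTTP API. The sketch below assumes the default endpoint at http://localhost:11434/api/generate and a non-streaming request. The model tag is not given on this page, so it is passed in as a parameter, and the value in the usage example is a placeholder rather than a real tag.

```python
import json
import urllib.request

# Default local Ollama endpoint; adjust if your server listens elsewhere.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_request(model_tag: str, prompt: str) -> urllib.request.Request:
    """Build a non-streaming POST request for Ollama's generate endpoint."""
    payload = {"model": model_tag, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(model_tag: str, prompt: str, timeout: float = 600.0) -> str:
    """Send the prompt to a running Ollama server and return the reply text."""
    request = build_generate_request(model_tag, prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read())["response"]


if __name__ == "__main__":
    # "your-model-tag" is a placeholder: substitute the tag you actually pulled.
    print(generate("your-model-tag", "Summarize what a Mixture-of-Experts model is."))
```

Separating request construction from sending keeps the payload easy to inspect without a running server. The generous timeout is an assumption for large models that may respond slowly.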