Kimi K2.5 / K2.6 — Local AI Model by Moonshot AI

Moonshot AI's cutting-edge coding and agentic model series. Kimi K2.5 and K2.6 rank among the top models globally for coding tasks, multimodal understanding, and autonomous agent workflows. Built for developers who need a model that can reason, use tools, browse the web, write and debug code end-to-end.

Hardware Requirements

Kimi K2.5Min 20 GB VRAM · Q4_K_M · 128,000 ctx · ollama run hf.co/moonshotai/Kimi-K2.5-Instruct-Q4_K_M
Kimi K2.6Min 20 GB VRAM · Q4_K_M · 128,000 ctx · ollama run hf.co/moonshotai/Kimi-K2.6-Instruct-Q4_K_M

How to Run Locally

Install Ollama then run: ollama run hf.co/moonshotai/Kimi-K2.5-Instruct-Q4_K_M

Minimum VRAM: 20 GB. For best results use Q4_K_M quantization.