Qwen 2.5 VL — Local AI Model by Alibaba Cloud

Qwen 2.5 VL is Alibaba's family of vision-language models that understand images, documents, charts, and videos alongside text. Qwen 2.5 VL 72B rivals GPT-4V on multiple visual benchmarks, while the 7B version delivers strong OCR and document parsing in about 6 GB of VRAM.

Hardware Requirements

Qwen 2.5 VL 7B Instruct · Min 6 GB VRAM · Q4_K_M · 128,000 ctx · ollama run qwen2.5vl:7b
Qwen 2.5 VL 72B Instruct · Min 42 GB VRAM · Q4_K_M · 128,000 ctx · ollama run qwen2.5vl:72b
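
As a rough sanity check on the figures above, the size of the quantized weights can be approximated as parameters × bits per weight ÷ 8. The sketch below assumes the nominal parameter counts from the model names (7 billion and 72 billion) and an average of about 4.5 bits per weight for Q4_K_M. Both are illustrative assumptions, not measured values; real model files vary, and the vision encoder, KV cache, and runtime overhead need additional memory on top of the weights.

```python
def estimate_weight_gb(n_params: float, bits_per_weight: float = 4.5) -> float:
    """Approximate size of quantized weights in GB (1 GB = 10^9 bytes).

    bits_per_weight=4.5 is an assumed average for Q4_K_M, used for illustration.
    """
    return n_params * bits_per_weight / 8 / 1e9


# Nominal parameter counts taken from the model names (an assumption).
for name, n_params in [("7B", 7e9), ("72B", 72e9)]:
    print(f"{name}: ~{estimate_weight_gb(n_params):.1f} GB of weights")
```

Under these assumptions the weights alone come out below the listed minimums, which leaves some headroom for context and runtime overhead.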

How to Run Locally

Install Ollama, then run: ollama run qwen2.5vl:7b

Minimum VRAM: 6 GB for the 7B model at Q4_K_M quantization, the recommended setting.
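
Once the model has been pulled, it can also be queried from a script through Ollama's local HTTP API instead of the interactive prompt. The following is a minimal sketch, assuming the Ollama server is running on its default address (localhost:11434) and that images are passed as base64 strings in the `images` field of a `/api/generate` request. The helper names and the file name `invoice.png` are hypothetical, chosen only for illustration.

```python
import base64
import json
import urllib.request

# Default local Ollama endpoint (assumes an unmodified install).
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_payload(image_bytes: bytes, prompt: str, model: str = "qwen2.5vl:7b") -> dict:
    """Build a non-streaming generate request with one base64-encoded image."""
    return {
        "model": model,
        "prompt": prompt,
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,  # ask for a single JSON object instead of a stream
    }


def ask_about_image(image_path: str, prompt: str) -> str:
    """Send an image and a prompt to the local server and return the reply text."""
    with open(image_path, "rb") as f:
        payload = build_payload(f.read(), prompt)
    request = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]


# Example call (requires a running Ollama server and a local image file):
# print(ask_about_image("invoice.png", "Extract the text from this document."))
```

Setting `stream` to false keeps the example short; for long answers, streaming the response line by line may feel more responsive.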