Author: Jakub Rusinowski · Last updated: June 15, 2026
Yes, the Llama 3.2 family runs on the Apple M4 Max.
| Spec | Apple M4 Max |
| --- | --- |
| VRAM | 128 GB unified memory |
| Memory Bandwidth | 546 GB/s |

| Model | Quantization | Size | Speed (est.) |
| --- | --- | --- | --- |
| Llama 3.2 90B Vision Instruct | Q4_K_M | 54 GB | ~10 tok/s |
| Llama 3.2 11B Vision Instruct | Q4_K_M | 7.8 GB | ~70 tok/s |
| Llama 3.2 3B Instruct | Q4_K_M | 2.2 GB | ~248 tok/s |
| Llama 3.2 1B Instruct | Q4_K_M | 0.8 GB | ~400 tok/s |
Llama 3.2 90B Vision Instruct at Q4_K_M quantization (54 GB) fits within the 128 GB of unified memory and runs at an estimated ~10 tokens/sec.
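
The page does not say how its speed estimates are derived. One common rule of thumb, used here purely as an assumption, is that token generation is memory-bandwidth bound: each new token reads roughly all of the model weights once, so tokens/sec is at most bandwidth divided by model size. The sketch below applies that rule to the bandwidth and size figures listed above. The function names and the simple "size must be smaller than memory" fit check are hypothetical illustrations, and the fit check ignores KV cache and system overhead.

```python
# Assumption: decoding is memory-bandwidth bound, so
# tokens/sec <= memory bandwidth (GB/s) / model size (GB).
# This is a rule of thumb, not the page's documented method.

MEMORY_GB = 128.0        # unified memory, from the page
BANDWIDTH_GB_S = 546.0   # memory bandwidth, from the page

# Q4_K_M sizes in GB, from the page
MODEL_SIZES_GB = {
    "Llama 3.2 90B Vision Instruct": 54.0,
    "Llama 3.2 11B Vision Instruct": 7.8,
    "Llama 3.2 3B Instruct": 2.2,
    "Llama 3.2 1B Instruct": 0.8,
}


def fits_in_memory(model_gb: float, memory_gb: float = MEMORY_GB) -> bool:
    """Simplified check: weights alone must be smaller than total memory."""
    return model_gb < memory_gb


def estimate_tokens_per_sec(
    model_gb: float, bandwidth_gb_s: float = BANDWIDTH_GB_S
) -> float:
    """Upper-bound estimate: one full pass over the weights per token."""
    return bandwidth_gb_s / model_gb


if __name__ == "__main__":
    for name, size_gb in MODEL_SIZES_GB.items():
        speed = estimate_tokens_per_sec(size_gb)
        print(f"{name}: fits={fits_in_memory(size_gb)}, ~{speed:.0f} tok/s")
```

Under this assumption the ratio reproduces the listed figures for the 90B, 11B, and 3B models. For the 1B model it gives a much higher number than the listed ~400 tok/s, so the page's estimate evidently accounts for limits other than bandwidth at that size.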