Apple M4 Max — Local AI Performance

Up to 128 GB unified memory acts as VRAM — can run any quantized model. 35W TDP is extraordinary. Silent, fast, and runs 70B models at 38 t/s. Best all-around for Mac users.

Quick Specs

VRAM128 GB
Memory Bandwidth546 GB/s
TDP35 W
ArchitectureARM, 3nm TSMC
MSRP$3,499
Speed (Llama 3.1 8B Q4_K_M)~110 tokens/sec

Compatible LLMs

← Check your hardware | Full benchmarks