Can I Run Llama 3.2 Family on NVIDIA GeForce RTX 4070?

Name: LLM Configurator — GPU VRAM Checker
Author: LLM Configurator

Autor: Jakub Rusinowski · Ostatnia aktualizacja: 25 września 2024

Yes, comfortably — you'll have ~4.2 GB of headroom running Llama 3.2 11B Vision Instruct at Q4_K_M (7.8 GB, ~65 tok/s (est.)).

Ujawnienie afiliacyjne: Niektóre odnośniki na tej stronie to linki afiliacyjne — jeśli dokonasz zakupu za ich pośrednictwem, LLM Configurator może otrzymać prowizję bez dodatkowych kosztów dla Ciebie. Jako uczestnik programu Amazon Associates, LLM Configurator zarabia na kwalifikujących się zakupach.

Sprawdź cenę na Amazon — NVIDIA GeForce RTX 4070 12GB

NVIDIA GeForce RTX 4070 Specs

VRAM	12 GB
Memory Bandwidth	504 GB/s

Llama 3.2 Family Sizes That Fit the NVIDIA GeForce RTX 4070

Llama 3.2 11B Vision Instruct	Q4_K_M · 7.8 GB · ~65 tok/s (est.)
Llama 3.2 3B Instruct	Q4_K_M · 2.2 GB · ~229 tok/s (est.)
Llama 3.2 1B Instruct	Q4_K_M · 0.8 GB · ~400 tok/s (est.)

Buy vs. rent Llama 3.2 Family

Buy the GPU

~$599

NVIDIA GeForce RTX 4070 · MSRP

Rent by the hour

from $0.34/hr

RTX 4090 (24 GB) class

At 2 hrs/day, buying (~$599) beats renting at $0.34/hr after about 2.4 years.

Affiliate links — we may earn a commission if you sign up, at no extra cost to you.

RunPod $0.34/hr

Rent on RunPod →

Vast.ai $0.35/hr · typical low · varies