Can I Run Llama 3.2 Family on NVIDIA GeForce RTX 5080?

Name: LLM Configurator — GPU VRAM Checker
Author: LLM Configurator

Autor: Jakub Rusinowski · Ostatnia aktualizacja: 25 września 2024

Yes, comfortably — you'll have ~8.2 GB of headroom running Llama 3.2 11B Vision Instruct at Q4_K_M (7.8 GB, ~123 tok/s (est.)).

Ujawnienie afiliacyjne: Niektóre odnośniki na tej stronie to linki afiliacyjne — jeśli dokonasz zakupu za ich pośrednictwem, LLM Configurator może otrzymać prowizję bez dodatkowych kosztów dla Ciebie. Jako uczestnik programu Amazon Associates, LLM Configurator zarabia na kwalifikujących się zakupach.

Sprawdź cenę na Amazon — NVIDIA GeForce RTX 5080 16GB

NVIDIA GeForce RTX 5080 Specs

VRAM	16 GB
Memory Bandwidth	960 GB/s

Llama 3.2 Family Sizes That Fit the NVIDIA GeForce RTX 5080

Llama 3.2 11B Vision Instruct	Q4_K_M · 7.8 GB · ~123 tok/s (est.)
Llama 3.2 3B Instruct	Q4_K_M · 2.2 GB · ~400 tok/s (est.)
Llama 3.2 1B Instruct	Q4_K_M · 0.8 GB · ~400 tok/s (est.)

Buy vs. rent Llama 3.2 Family

Buy the GPU

~$999

NVIDIA GeForce RTX 5080 · MSRP

Rent by the hour

from $0.34/hr

RTX 4090 (24 GB) class

At 2 hrs/day, buying (~$999) beats renting at $0.34/hr after about 4.1 years.

Affiliate links — we may earn a commission if you sign up, at no extra cost to you.

RunPod $0.34/hr

Rent on RunPod →

Vast.ai $0.35/hr · typical low · varies