Can I Run Llama 3.2 Family on NVIDIA GeForce RTX 4080 Super?

Name: LLM Configurator — GPU VRAM Checker
Author: LLM Configurator

Author: Jakub Rusinowski · Last updated: September 25, 2024

Yes, comfortably: Llama 3.2 11B Vision Instruct at Q4_K_M needs about 7.8 GB, leaving ~8.2 GB of the card's 16 GB of VRAM as headroom, at an estimated ~94 tok/s.

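Affiliate disclosure: Some links on this page are affiliate links. If you purchase through them, LLM Configurator may earn a commission at no extra cost to you. As an Amazon Associate, LLM Configurator earns from qualifying purchases.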

NVIDIA GeForce RTX 4080 Super Specs

VRAM	16 GB
Memory Bandwidth	736 GB/s

Buy vs. rent for Llama 3.2 Family

Buy the GPU

~$999

NVIDIA GeForce RTX 4080 Super · MSRP

Rent by the hour

from $0.34/hr

RTX 4090 (24 GB) class

At 2 hrs/day, buying (~$999) becomes cheaper than renting at $0.34/hr after about 4 years, or roughly 2,940 rental hours.
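The break-even figure above is simple arithmetic: divide the purchase price by the hourly rental rate to get the number of rental hours that cost the same, then convert those hours to years at the stated daily usage. The sketch below reproduces it. The function name and the `days_per_year` parameter are illustrative assumptions, not something the page defines, and the page does not state which day-count convention it uses. Electricity, resale value, and the rest of the PC are ignored here.

```python
def break_even_years(gpu_price_usd, rent_usd_per_hour, hours_per_day, days_per_year=365):
    """Years of use after which cumulative rental cost equals the purchase price.

    Hypothetical helper: it ignores electricity, resale value and the rest of the PC.
    """
    if rent_usd_per_hour <= 0 or hours_per_day <= 0 or days_per_year <= 0:
        raise ValueError("rate, daily hours and days per year must be positive")
    # Rental hours that cost as much as buying the card outright.
    rental_hours = gpu_price_usd / rent_usd_per_hour
    # Convert those hours to years at the given daily usage.
    return rental_hours / (hours_per_day * days_per_year)


if __name__ == "__main__":
    # Figures from the page: ~$999 MSRP, $0.34/hr, 2 hrs/day.
    print(f"365-day year: {break_even_years(999, 0.34, 2):.2f} years")
    # Assumed alternative convention (12 months of 30 days).
    print(f"360-day year: {break_even_years(999, 0.34, 2, days_per_year=360):.2f} years")
```

With a 365-day year the result is just over 4.0 years. A 360-day year rounds to 4.1, so the exact figure depends on a convention the page does not spell out.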

Affiliate links — we may earn a commission if you sign up, at no extra cost to you.

RunPod $0.34/hr

Vast.ai $0.35/hr · typical low · varies

Cloud rates verified 2026-07. They are estimates, and marketplace prices vary. The purchase price is GPU MSRP only, not a full PC.

Llama 3.2 11B Vision Instruct at Q4_K_M quantization (7.8 GB), estimated ~94 tokens/sec.
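The headroom and speed figures can be checked against the specs listed earlier. Headroom is VRAM minus model size. The page does not say how it estimates tokens/sec, but the number matches a common rule of thumb for memory-bound decoding: each generated token reads all the weights once, so throughput is at most memory bandwidth divided by model size. The sketch below assumes that rule; the function names are illustrative, and real throughput is usually lower once KV cache reads, context length, and kernel overhead are included.

```python
def vram_headroom_gb(vram_gb, model_gb):
    """VRAM left over after loading the model weights, in GB."""
    return vram_gb - model_gb


def bandwidth_bound_tokens_per_sec(bandwidth_gb_per_s, model_gb):
    """Upper-bound decode speed, assuming every token reads all weights once.

    This is an assumed rule of thumb, not the page's documented method.
    """
    if model_gb <= 0:
        raise ValueError("model size must be positive")
    return bandwidth_gb_per_s / model_gb


if __name__ == "__main__":
    # Figures from the page: 16 GB VRAM, 736 GB/s, 7.8 GB model at Q4_K_M.
    print(f"headroom: {vram_headroom_gb(16, 7.8):.1f} GB")
    print(f"estimate: {bandwidth_bound_tokens_per_sec(736, 7.8):.0f} tok/s")
```

Under these assumptions the script reproduces the page's ~8.2 GB of headroom and ~94 tok/s.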