12GB

What Can You Actually Run on 12GB VRAM?
Qwen 3.5 9B at Q8_0 runs near-lossless on 12GB, Qwen 2.5 14B at Q4 hits 30 tok/s, and SDXL generates without workarounds. Every model that fits on an RTX 3060 12GB and the best upgrade path.
Jan 28, 2026