14B models at Q4 hit 25-32 tok/s, 7B-8B run at near-lossless Q6-Q8, and SDXL generates without workarounds. Every model that fits on an RTX 3060 12GB and the best upgrade path.