Tag: gguf

All the articles with the tag "gguf".

llm-concepts
15 May, 2026 7 min read

Quantization: How a 70B Model Fits on Your Laptop

Quantization shrinks a 70B model from 140 GB to 20 GB with almost no quality loss. What it actually does, and why the trick works.