Tag: scaling
All the articles with the tag "scaling".
-
llm-concepts · 6 min read · Parameter Counts and Scaling Laws: What 70B Actually Means
What does 70B actually mean? It tells you about memory requirements, inference speed, and training costs, but almost nothing about model quality on its own.
-
ai · 8 min read · AI Inference and Scaling: From Training to Serving Billions
How trained AI models serve billions of requests through inference optimization, scaling infrastructure, and cost engineering.