Tag: machine-learning
All the articles with the tag "machine-learning".
-
llm-concepts · 8 min read · Mixture of Experts: Why 671B Does Not Equal 671B
A 671B Mixture of Experts model can be faster and cheaper to run than a dense 70B. The headline parameter count stopped meaning what it used to mean.
-
llm-concepts · 6 min read · Parameter Counts and Scaling Laws: What 70B Actually Means
What does 70B actually mean? It tells you about memory requirements, inference speed, and training costs, but almost nothing about model quality on its own.
-
ai · 8 min read · AI Inference and Scaling: From Training to Serving Billions
How trained AI models serve billions of requests through inference optimization, scaling infrastructure, and cost engineering.
-
ai · 9 min read · AI Training: How Models Get Smart
How AI training works, from gradient descent and backpropagation to RLHF, synthetic data, and the emerging possibility of recursive self-improvement.
-
neural-networks · Updated · 11 min read · Neural Networks: How AI Mimics the Brain
How artificial neurons combine into networks that recognize images, understand language, and generate text. From perceptrons to transformers.
-
machine-learning · 12 min read · Machine Learning: How Computers Learn from Data
How computers find patterns without being told the rules. Supervised learning, gradient descent, and why data quality matters more than algorithms.