Tag: rlhf

All the articles with the tag "rlhf".

llm-concepts
23 Apr, 2026 7 min read

Modern Alignment: RLHF, DPO, and Constitutional AI

A base model just predicts tokens. Alignment turns it into an assistant that follows instructions and refuses harmful ones.