Emergent Capabilities refers to the phenomenon where certain LLM capabilities are nearly zero before a model reaches a specific scale threshold, then suddenly appear and rapidly improve after crossing it. Most typical cases: multi-step arithmetic reasoning, CoT effectiveness, analogy reasoning, code semantic understanding. This non-linear capability growth pattern explains why LLM generational upgrades often bring not just "more accurate" but entirely new capabilities.
Emergent capability discoveries have profound AI safety implications: if AI capabilities emerge non-linearly, monitoring and predicting AI capabilities becomes dramatically harder. You might think a model "doesn't yet have the capability to do something dangerous" — but once its scale crosses a threshold, that dangerous capability might suddenly appear. This is part of the rationale behind Anthropic's RSP ASL classification system: safety assessments need to happen before capabilities emerge, not in reaction to their appearance.
Understanding emergent capabilities helps you make smarter model choices when using Claude. When Sonnet doesn't handle a task well, before switching to Opus, ask yourself: "Is the capability this task requires one that hasn't fully emerged at Sonnet's scale?" If so, upgrading to Opus may bring not just a linear accuracy improvement but a qualitative capability change. Conversely, if the required capability is already fully emerged at Sonnet's scale, the marginal benefit of upgrading to Opus may be limited.
To go deeper on emergent capabilities: (1) "Emergent Abilities of Large Language Models" (Wei et al., 2022, Google) — the landmark paper; (2) "Are Emergent Abilities of Large Language Models a Mirage?" (Schaeffer et al., 2023) — critical analysis suggesting evaluation methods may influence observed emergence; (3) Anthropic's Model Cards — document capability assessments across Claude versions, showing non-linear improvements between generations.