Core concepts

Reasoning Model

A model variant trained or tuned to spend more compute on internal reasoning before answering — better on math, code, and multi-step problems.

01 ——

In plain English

A reasoning model is one optimised to "think before it speaks." It generates an extended internal chain of reasoning before the final answer, often producing dramatically better results on problems that require multi-step logic (math, coding, scientific questions).

The reasoning model wave (2024–26):

OpenAI o1 → o3 — kicked off the category in late 2024
Claude Sonnet 4.x / Opus 4.x with Extended Thinking
Gemini 2.5 Thinking
DeepSeek R1 — open-source reasoning model, comparable to closed frontier
Qwen-Reasoning family

What they're good at:

Math olympiad problems
Competitive programming
Multi-step scientific or financial reasoning
Complex agentic planning

What they cost you:

Latency — 5–60+ seconds before any output
Tokens — the reasoning trace is billed even if hidden
Sometimes worse on simple queries — too much thinking can hurt

When to use one: For tasks where accuracy matters more than speed. For most consumer chat use cases, a non-reasoning model is faster and good enough.

02 ——