Temperature
A setting that controls how random or creative an AI model's responses are — lower values produce focused answers, higher values produce more varied ones.
In plain English
Temperature is a numerical parameter (usually 0 to 1, sometimes up to 2) that controls how much randomness the AI uses when picking the next token. It's the main creativity vs. consistency knob.
What different temperatures do:
- 0.0 — deterministic, picks the highest-probability token every time. Same input → same output. Best for code, classification, structured data.
- 0.3–0.7 — moderate variation. Good for chat and most tasks.
- 1.0+ — high randomness. Better for brainstorming, creative writing, generating diverse options.
Common pitfalls:
- Setting too high → incoherent, off-topic responses
- Setting too low → repetitive, robotic outputs
- Comparing models at different temperatures → invalid benchmark
Most APIs default to around 0.7. For production tools that need reliability, lower it; for creative work, raise it.