Core concepts

Context Window

The maximum amount of text (tokens) an AI model can read and remember at once during a single conversation.

01 ——

In plain English

The context window is the "working memory" of an AI model — it's the total amount of text the model can process in one go, including the conversation history, instructions, and any documents you've shared.

It's measured in tokens (roughly ¾ of a word each). A model with a 200,000-token context window can hold roughly 150,000 words at once — about two full novels.

Why it matters:

  • Short context = the model forgets earlier parts of a long conversation
  • Long context = the model can reason over entire codebases or research papers at once

Models like Claude and GPT-4 have pushed context windows from 4k tokens in 2022 to 1M+ tokens in 2025.

02 ——

Related terms

Back to glossaryLast reviewed May 2026
Vol. 4 · Issue 19 · Last reviewed 2026-05-30

Sign up for our newsletter

Receive weekly updates so you can stay up-to-date with the world of AI