The Plain-English Explanation
Think of a context window as the AI's working memory. Everything you type and everything the AI responds with must fit within this window. If a conversation exceeds the context window, the model loses access to earlier messages — it doesn't remember what you discussed at the beginning.
Context window sizes have grown dramatically. GPT-3 had 4,000 tokens (about 3,000 words). Claude 3 supports 200,000 tokens (about 150,000 words — roughly three novels). Larger context windows mean you can process longer documents, maintain longer conversations, and provide more detailed instructions.
Why It Matters
Context window size determines what you can do with an AI model. Need to analyse a 100-page report? You need a model with a large enough context window to hold it. Having a long, detailed conversation? You need enough context for the AI to remember your earlier instructions. Understanding this constraint helps you choose the right model and structure your interactions effectively.
Examples in Practice
- A lawyer uploading an entire 80-page contract to Claude (200K context) for analysis, knowing the full document fits within its context window.
- A user noticing that ChatGPT starts contradicting earlier instructions in a long conversation — a sign that the early messages have fallen outside the context window.
- A developer choosing to use a 128K context model for a summarisation task involving multiple long documents, versus a cheaper 8K model for simple Q&A.
Common Misconceptions
Myth: Larger context windows always mean better results.
Reality: Models can struggle with very long contexts — they may miss details in the middle of long documents (the 'lost in the middle' problem). Strategic structuring of your input matters more than just fitting everything in.
Myth: The context window resets between messages.
Reality: In a conversation, the context window accumulates everything — all your messages and all the AI's responses. It's a running total, not a per-message limit.
Myth: If it fits in the context window, the AI will use it all.
Reality: Models pay more attention to information at the beginning and end of the context. Important instructions should go at the start or end, not buried in the middle of a long document.
Related Terms
Further Reading
Explore these in-depth articles on the blog:
Learn Context Window in Depth
Module 3 of AI Fundamentals covers context windows and how to work effectively within them — essential knowledge for getting the most from any LLM.
Explore AI Fundamentals