The context window of a large language model is the amount of text, in tokens, that the model can consider or “remember” at any one time. An LLM’s context window can be thought of as the equivalent of its working memory - it determines how long of a conversation it can carry out without forgetting details, and the maximum size of documents or code samples it can process at once.Documentation Index
Fetch the complete documentation index at: https://docs.typingmind.com/llms.txt
Use this file to discover all available pages before exploring further.
Why it matters
Context length determines how much the model can “remember” during a single request. It affects:- long conversations
- large documents
- codebases
- multi-step reasoning
- retrieval-heavy workflows

What happen when you reached context length limit on TypingMind?
As you may know, each chat model has a different context window, for example::- GPT-5.4: 1M tokens
- Claude 4.5 Sonnet: 200,000 tokens
- Gemini 3.1 Pro: 1M tokens
You can control this context summary option by going to Settings → Internal Prompts → Auto summarize long conversations: