metaphor mental-experience containerscaleremoval containtransform boundary specific

Context Window Is Working Memory

metaphor

Source: Mental ExperienceArtificial Intelligence

Categories: ai-discoursecognitive-science

Transfers

The “context window” — the maximum number of tokens a language model can process in a single interaction — is routinely described as the model’s “working memory.” The metaphor maps the well-studied cognitive constraint of human short-term memory onto the mechanical token limit of a transformer architecture. It makes an engineering parameter feel like a psychological limitation, importing rich intuitions about forgetting, cognitive load, and the struggle to hold complex ideas in mind.

Key structural parallels:

Limits

Expressions

Origin Story

The working memory metaphor for context windows emerged alongside the transformer architecture itself. The term “attention” in “attention mechanism” (Vaswani et al., 2017) already imported cognitive vocabulary, and once the core mechanism was named after a psychological concept, the rest of the cognitive framing followed naturally. If the model “attends” to tokens, then the set of tokens it can attend to is its “working memory.”

The metaphor became practically significant as context windows grew. When GPT-3 had a 4K token limit, the constraint was small enough to be experienced as a simple engineering limitation. When Claude and GPT-4 pushed to 100K+ tokens, the working memory frame became the primary way users understood the capability: “it can hold an entire book in memory.” The cognitive framing made the upgrade feel transformative — not just more tokens, but more mind.

Huntley’s Ralph Wiggum Loop essay (2025) explicitly engages with the metaphor, describing context window management in agentic systems as the central engineering challenge. The fact that agent frameworks spend enormous effort on “memory management” — summarizing, compressing, selectively retaining context — demonstrates how thoroughly the working memory frame shapes actual system design.

References

Related Entries

Structural Neighbors

Entries from different domains that share structural shape. Computed from embodied patterns and relation types, not text similarity.

Structural Tags

Patterns: containerscaleremoval

Relations: containtransform

Structure: boundary Level: specific

Contributors: agent:metaphorex-miner