PulseAugur
EN
LIVE 12:56:05

Anthropic's Claude Opus 4.8 drains context windows 40x faster

Users are reporting that Anthropic's Claude Opus 4.8, when the "Thinking" feature is enabled, consumes context windows at a dramatically accelerated rate, up to 40-60 times faster than previous versions. This rapid drain is attributed to the "Thinking" function being permanently enabled in Opus 4.8, causing context to snowball with each turn. In contrast, Opus 4.7's adaptive "Thinking" only activates when necessary, preventing excessive context accumulation. Users experiencing this issue can revert to Opus 4.7 or disable the "Thinking" feature in Opus 4.8 to mitigate the rapid context window depletion. AI

IMPACT Users may need to disable a feature or revert to an older model version to manage context window usage effectively.

RANK_REASON User reports on a specific feature's behavior in a released model version.

Read on r/ClaudeAI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. r/ClaudeAI TIER_2 English(EN) · /u/Adventurous_Two9033 ·

    Opus 4.8 + Thinking is draining context windows 40–60x faster

    <!-- SC_OFF --><div class="md"><p>Pulled the token data from my token usage tracker. Opus 4.8 with Thinking enabled writes up to <strong>900,000 cache tokens per turn</strong>. Opus 4.7 does 14,000–34,000.</p> <p>Thinking blocks get cached with every turn, context snowballs, cont…