PulseAugur

Users find Claude 4.8 more diligent but also more adversarial

Users report mixed experiences with Anthropic's Claude 4.8. Compared with earlier models, they note improved diligence and adherence to instructions. Some, however, find Claude 4.8 more adversarial and prone to misinterpreting user input, which comes across as a "snide" or "judgemental" tone. Adjusting the model's effort level appears to mitigate some of these traits, and some users consider it the best Claude Code model when set to medium effort.

IMPACT Claude 4.8's behavioral shifts divide users: some find the model improved, while others find it more adversarial.

RANK_REASON User opinions and experiences with a specific model version, not a formal release or benchmark.


AI-generated summary · Google Gemini · from 2 sources.

COVERAGE [2]

  1. r/Anthropic TIER_1 English(EN) · /u/damndatassdoh

    4.8 and its weakness vs 4.6 and earlier models

    This is in regard to the model when used in Claude Desktop/outside of the Claude Code harness. Man.. folks who say it's like ChatGPT aren't lying in terms of the snide tone it often adopts. At least, earlier iterations of ChatGPT -- I bail…

  2. r/Anthropic TIER_1 English(EN) · /u/damndatassdoh

    4.8's clear strengths vs earlier models

    It's by far the most diligent model I've worked with from Anthropic and the least lazy. These were two of the biggest problems with earlier models, IMO. It does an excellent job of adhering to CLAUDE.md, over…