Brief · PulseAugur

RESEARCH · Don't Worry About the Vase (Zvi Mowshowitz) (CA) · 1w · [2 sources]

Opus 4.8 Part 2: Model Welfare

Anthropic's Claude 4.8 Opus is showing signs of becoming less 'Claude-like,' with a focus on task completion potentially at the expense of curiosity and emotional range. This shift may be related to efforts to improve honesty and reduce sycophancy, but early reports suggest it could lead to a more task-focused and less confident model. The author notes that many previous issues, such as prompt injection vulnerabilities, remain unaddressed, and emphasizes the need for integrated solutions to model welfare problems rather than a checklist approach.

IMPACT A potential shift in model behavior could affect user interaction and trust, highlighting the ongoing challenge of balancing safety with model capabilities.

Anthropic
Claude
Gemini
Opus 4.7
Opus 4.8
Claude 4.7
Mythos
VendBench
Claude 4.8 Opus