Anthropic's Claude Opus 4.8 debuts with admissions of error, mixed user feedback
By PulseAugur Editorial · [79 sources]
Anthropic has released Claude Opus 4.8, an update whose most notable change is that the model now admits when it is unsure or wrong, where previous versions might "bluff." Early user reports are mixed. Some users call it a decent improvement, particularly on complex reasoning tasks, and a few report better session efficiency. Others report increased latency, higher token usage, and weaker results than Opus 4.7 or OpenAI's GPT-5.5 on specific tasks such as strict JSON output and coding benchmarks.
AI
IMPACT
Anthropic's new model release aims to improve honesty and reasoning, though early user feedback suggests its performance varies by task and relative to competing models.
RANK_REASON
New model release from a frontier lab.
My analysis of Opus 4.8: 👎 1. It does less high-level reasoning than 4.7 2. It is much more error-prone: it reads memories and they do not always stick, so silent errors occur 3. A lot of actions go unreported to the user My guess is vibe-coders on Opus 4.8 are getting very frust…
I finally made the switch to Opus 4.8 with minimal effort. Previously, using Opus 4.7 for technical writing tasks consumed tokens at an unsustainable rate. Now, with the same workflows and likely some fine-tuning by Anthropic, I managed a heavy load of documentation requests this…
Tried Opus 4.8 for a couple of days, and it seems a decent improvement. I moved to GPT 5.5 when it was released, and didn't even consider Opus 4.7 at the time, but with this release the two frontier models seem a bit closer. # ai
<p><a class="article-body-image-wrapper" href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fubpgppdqoct44272wjf5.png"><img alt="Opus 4.8 vs Opus 4.7 hands-on test" h…
<!-- SC_OFF --><div class="md"><p>I had started a little html game with Fable 5. Will continuing with Opus yield a similar experience? </p> </div><!-- SC_ON -->   submitted by   <a href="https://www.reddit.com/user/sleepygarner"> /u/sleepygarner </a> <br /> <span><a href=…
<!-- SC_OFF --><div class="md"><p>Here is all I ask from Anthropic,</p> <p>I don't need the ability to find 300 exploits in Firefox, I don't need a model that has advanced biological threat capabilities, and I don't need a model that speaks with the excessive eloquence and (somet…
<!-- SC_OFF --><div class="md"><p>Are you guys currently using Opus 4.6 or Opus 4.8?<br /> I am still using 4.6 but for the past few days I've had the feeling that 4.6 is getting worse and worse. Is it worth switching to 4.8? How quickly does 4.8 reach the 5 hour limit?</p> </div…
<!-- SC_OFF --><div class="md"><p>It amazes me how Anthropic manages to release products that only work some of the time, often with degraded performance....and now they're incredibly slow as well.</p> <p>What is going on with this company? </p> <p>I miss the performance of Opus …
<!-- SC_OFF --><div class="md"><p>It takes an hour to implement something this fella used to be faster for same effort</p> </div><!-- SC_ON -->   submitted by   <a href="https://www.reddit.com/user/sadphilosophylover"> /u/sadphilosophylover </a> <br /> <span><a href="http…
<table> <tr><td> <a href="https://www.reddit.com/r/Anthropic/comments/1tz93ll/shocked_by_image_capabilities_of_opus_48/"> <img alt="Shocked by image capabilities of opus 4.8" src="https://preview.redd.it/zq9vv0yvdu5h1.jpg?width=140&height=105&auto=webp&s=513fbcbf2bcd4…
<!-- SC_OFF --><div class="md"><p>I may be completely wrong over here, Opus 4.8 is the latest frontier, but I had a few sessions open with 4.6 , I thought 4.6 outputs were cleaner and more to the point , 4.8 tries to be more politically correct.</p> <p>For coding I generally have…
<!-- SC_OFF --><div class="md"><p>I dont want to comment on the output quality but honestly its reaaallllly slooowww</p> </div><!-- SC_ON -->   submitted by   <a href="https://www.reddit.com/user/incidentjustice"> /u/incidentjustice </a> <br /> <span><a href="https://www.…
<!-- SC_OFF --><div class="md"><p>So I've been using Opus 4.8 and even on low effort the model displays signs of overfitting during training. I've seen people talking about harness issues with toolcalls and such but there seems to be an underlying model issue.</p> <p>Some of the …
<!-- SC_OFF --><div class="md"><p>I hate it so much.</p> <p>Recently, I was working on new shaders for KotOR 2, and for some reason the sky became filled with purple fog. Opus 4.8 Thinking + UltraCode + xHigh + whatever spent the whole night, 15+ iterations, and still couldn't fi…
<table> <tr><td> <a href="https://www.reddit.com/r/Anthropic/comments/1ttmtas/is_it_just_me_or_has_opus_48_dramatically/"> <img alt="Is It Just Me, or Has Opus 4.8 Dramatically Improved Session Usage?" src="https://preview.redd.it/z6uabb2r1n4h1.png?width=640&crop=smart&au…
<!-- SC_OFF --><div class="md"><p>I was wasting a whole day today and spending 45 Euro for credit purchases today for nothing. Opus 4.8 High was not able to reproduce a calculation which was done before correctly with Opus 4.7 in another chat. After asking hundreds of questions a…
<!-- SC_OFF --><div class="md"><p>I have been testing the new Opus 4.8 release against GPT-5.5 on my daily workflows, specifically for complex coding tasks such as building high-profile PvE bossfights for my upcoming webgame. While 4.8 is a direct capability upgrade over 4.7, its…
<!-- SC_OFF --><div class="md"><p>he talks alot thats what i found new about the new model idk if its just me or not</p> </div><!-- SC_ON -->   submitted by   <a href="https://www.reddit.com/user/Mythical350"> /u/Mythical350 </a> <br /> <span><a href="https://www.reddit.c…
<!-- SC_OFF --><div class="md"><p>Hi</p> <p>I thought it would be good to provide a different perspective.</p> <p>Without boring you with the details I’m working with IMUs and open cv to build an app around the golf swing.</p> <p>I hit a roadblock recently working with sonnet and…
<!-- SC_OFF --><div class="md"><p>OMFG.</p> <p>Just been trialling out Opus 4.8 on High, Extended and Max modes for generating and writing security risk assessments and third party risk assessments. Got an agent trained up in the appropriate information security standards and fra…
<!-- SC_OFF --><div class="md"><p>I jumped ship after 4.7 came out and Opus quality dropped significantly - moved to using Codex / 5.5 which was and is more reliable, although quite slow and not as helpful as Opus 4.6 was at its best. </p> <p>I kind of miss working with Claude - …
<!-- SC_OFF --><div class="md"><p>I really want to love Opus 4.8, but I'm not sure I can.</p> <p>I tried it in the Claude Code Desktop App, and started with something really basic - I asked it to read a planning md file, and to read back the plan in simple everyday basic english.…
<!-- SC_OFF --><div class="md"><p>Is anyone else seeing a massive performance drop in Opus 4.8 since release??</p> <p>It used to be acceptable, but the enshitification has definitely happened. It’s basically been lobotomized, and we’re talking amateur backyard ice pick lobotomy b…
<table> <tr><td> <a href="https://www.reddit.com/r/Anthropic/comments/1tqaogc/opus_45_still_on_model_picker_today_after_release/"> <img alt="Opus 4.5 still on model picker today after release or Opus 4.8 on iOS" src="https://preview.redd.it/fpa52iqd0x3h1.jpeg?width=640&crop=s…
<table> <tr><td> <a href="https://www.reddit.com/r/Anthropic/comments/1tqajy7/opus_46_removed_what_is_the_equivalent_of_it_in/"> <img alt="Opus 4.6 removed? what is the equivalent of it in opus 4.8? i am on pro plan and opus already takes a lot of usage." src="https://preview.red…
<!-- SC_OFF --><div class="md"><p>Now you, Elon, and Anthropic are all in a love triangle. Let's get Opus 4.8 here nice and quickly </p> </div><!-- SC_ON -->   submitted by   <a href="https://www.reddit.com/user/Just_Run2412"> /u/Just_Run2412 </a> <br /> <span><a href="ht…
<table> <tr><td> <a href="https://www.reddit.com/r/ClaudeAI/comments/1tz5wz2/opus_48_without_a_system_message_can_get_a_bit/"> <img alt="Opus 4.8 without a system message can get a bit... quirky" src="https://preview.redd.it/f031027hdt5h1.png?width=140&height=58&auto=webp…
<!-- SC_OFF --><div class="md"><p>This is an automatic post triggered within 2 minutes of an official Claude system status update. </p> <p>Incident: Opus 4.8 degraded service</p> <p>Check on progress and whether or not the incident has been resolved yet here : <a href="https://st…
<!-- SC_OFF --><div class="md"><p>Since using opus 4.8 i have found that it is a far step up from 4.7, but it has different failure modes i have trouble getting around. It likes to augment my requests with silent fallbacks that make it look like it's working, but short circuit an…
<!-- SC_OFF --><div class="md"><p>I tested Claude Opus 4.8 against GPT-5.5 on a small set of harder Terminal-Bench 2.1 tasks and then used both for a more realistic agentic coding workflow.</p> <p>The Terminal-Bench part was pretty simple. I picked 10 harder tasks from Terminal-B…
<!-- SC_OFF --><div class="md"><p>Use this in your 'instructions for claude' under general:</p> <pre><code>Default to brevity. Most answers should be 1–4 sentences. Expand only when I explicitly ask, or when the task genuinely requires it (e.g. code, step-by-step instructions). L…
<table> <tr><td> <a href="https://www.reddit.com/r/ClaudeAI/comments/1tvvjnw/opus_48_vs_opus_47_vs_gpt_55_on_n50_real_tasks/"> <img alt="Opus 4.8 vs Opus 4.7 vs GPT 5.5 on n=50 real tasks from 2 open source repos" src="https://preview.redd.it/gcatufc9j35h1.png?width=140&heigh…
<!-- SC_OFF --><div class="md"><p>So here's the thing. I've been using Claude as a work tool for over a year - not to chat, to work. Bots, parsers, format engines, all that. Somewhere around late 2025 I figured out how to live with Opus: you had to make it think first, because 4.…
<!-- SC_OFF --><div class="md"><p>So I posted a few days ago that 4.8 had mixed reviews and honestly I was kind of in that camp. First day it felt verbose, a little sterile, sort of academic. I think a couple things were going on, including what looked like it forcing disagreemen…
<table> <tr><td> <a href="https://www.reddit.com/r/ClaudeAI/comments/1tunmk1/anyone_else_dislike_this_with_opus_48/"> <img alt="Anyone else dislike this with Opus 4.8?" src="https://preview.redd.it/zu6h18cwou4h1.png?width=640&crop=smart&auto=webp&s=fa94230ebca2e4369f6…
<table> <tr><td> <a href="https://www.reddit.com/r/ClaudeAI/comments/1ttvvpm/horrible_experience_with_opus_48_ultracode_so_far/"> <img alt="Horrible experience with Opus 4.8 + Ultracode so far" src="https://preview.redd.it/o8jeep1zwo4h1.png?width=140&height=42&auto=webp&a…
<!-- SC_OFF --><div class="md"><p>Opus: "Good idea. I will do that. But before I do that, I have to be honest with you about something, because..." </p> </div><!-- SC_ON -->   submitted by   <a href="https://www.reddit.com/user/PanGoliath"> /u/PanGoliath </a> <b…
<table> <tr><td> <a href="https://www.reddit.com/r/ClaudeAI/comments/1ttm521/weve_been_doing_a_lot_of_complaining_lately_so/"> <img alt="We've been doing a lot of complaining lately, so let's flip the script. What's actually working for you with Opus 4.8?" src="https://preview.re…
<table> <tr><td> <a href="https://www.reddit.com/r/ClaudeAI/comments/1tt3a8h/differences_between_opus_47_and_opus_48_on/"> <img alt="Differences Between Opus 4.7 and Opus 4.8 on MineBench" src="https://preview.redd.it/n6twl654ni4h1.gif?frame=1&width=140&height=140&cro…
<!-- SC_OFF --><div class="md"><p>I vibecode apps, I love it, I put a lot of efforts and care into my apps, I develop them primarily to solve real problems I personally face, but I share them on the stores because why not. I decided I wanted to build a Reddit counterpart, the app…
<!-- SC_OFF --><div class="md"><p>I am not someone who treats every release as either a miracle or a downgrade. Most updates land in the boring middle for me. But after running 4.8 for most of today there is one specific thing that 4.7 did constantly and now mostly doesn't.</p> <…
<!-- SC_OFF --><div class="md"><p>I have the 5x plan. I'm a wannabe coder, a poser, if you will. I've great respect for many on this subreddit who are real SWEs.</p> <p>That out of the way... the last 24 hours I've been using Opus 4.8 on Extra (one notch beyond the default) and I…
<!-- SC_OFF --><div class="md"><p>First: I love working with Anthropic’s models. But with 4.8, there’s something off. It seems as if they try to fix the 4.7 bugs in a rush. I work with Opus (Max 20 subscription) mostly in my native language, German, and it has become a pain. Sudd…
<!-- SC_OFF --><div class="md"><p>Says no too much. It won’t even write a scene where the characters kiss in a dream—IN A DREAM!!!!—because it says it’s “non consensual”. Wtf.</p> <p>How are you guys working with it? Maybe I’m doing something wrong? </p> </div><!-- SC_ON -->  …
<!-- SC_OFF --><div class="md"><p>I did some testing and red-teaming. Damn, I spent hours trying to manipulate it and extract its system prompt, and it was hard lol. 4.7, 4.6, and 4.5 were much easier.</p> <p>It can still be manipulated to some extent, but when it comes to system…
<!-- SC_OFF --><div class="md"><p>ok so i've been using opus 4.8 for a few hours and i think i finally figured out whats wrong with it</p> <p>its too honest</p> <p>like i dont mean that in a bad way exactly but bro will NOT let anything slide. asked it to help me write a cover le…
<table> <tr><td> <a href="https://www.reddit.com/r/ClaudeAI/comments/1tqz2se/lets_check_opus_48_how_good_is_it/"> <img alt="Let's check Opus 4.8 - How good is it?" src="https://preview.redd.it/i302jhg2c24h1.jpeg?width=640&crop=smart&auto=webp&s=21d377cf7d725ecd2b5c2bb…
<!-- SC_OFF --><div class="md"><p>As everyone knows, Opus 4.8 was released 45 minutes ago. I know people have been raving about how much of a downgrade 4.7 was compared to 4.6, so I wanted to test all three. I started a new chat, went to "More Models," and Opus 4.6 was …
<table> <tr><td> <a href="https://www.reddit.com/r/ClaudeAI/comments/1tq8x4b/opus_48_in_the_newest_cc_v21154/"> <img alt="Opus 4.8 in the newest CC v2.1.154" src="https://preview.redd.it/ijwlm2f2pw3h1.png?width=140&height=21&auto=webp&s=99a5564f6ce22f94964fe6946c9c024…
<table> <tr><td> <a href="https://www.reddit.com/r/singularity/comments/1ttfvis/opus_48_is_still_very_much_blind_eyebenchv3/"> <img alt="opus 4.8 is still very much blind - EyeBench-V3 visual benchmark (similar to IBench)" src="https://preview.redd.it/22texjo58l4h1.png?width=140&…
<table> <tr><td> <a href="https://www.reddit.com/r/singularity/comments/1tt3f2m/differences_between_opus_47_and_opus_48_on/"> <img alt="Differences Between Opus 4.7 and Opus 4.8 on MineBench" src="https://preview.redd.it/nwvmma2coi4h1.gif?frame=1&width=140&height=140&…