English(EN) Did Opus 4.8 not even make it to the top 10 Overall of LM Arena?

Anthropic 的 Opus 4.8 未能进入聊天机器人竞技场排行榜前十名

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-04 08:29

Reddit 的 ClaudeAI 子版块上一位用户对 Anthropic 的 Opus 4.8 模型性能提出了质疑，指出其在 LMSYS 聊天机器人竞技场排行榜上未能进入前十名。讨论暗示，尽管 Opus 4.8 功能强大，但与其他领先的大型语言模型相比，其表现可能未达预期。 AI

影响关于模型在排行榜上表现的质疑可能会影响用户认知和采用。

排序理由用户讨论质疑模型在公开排行榜上的表现。

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

r/ClaudeAI TIER_2 English(EN) · /u/flarenz · 2026-06-04 08:29

Did Opus 4.8 not even make it to the top 10 Overall of LM Arena?

<table> <tr><td> <a href="https://www.reddit.com/r/ClaudeAI/comments/1twhikl/did_opus_48_not_even_make_it_to_the_top_10/"> <img alt="Did Opus 4.8 not even make it to the top 10 Overall of LM Arena?" src="https://preview.redd.it/v18ttcfa885h1.png?width=640&crop=smart&auto=…