PulseAugur
实时 17:32:53
English(EN) Today, Anthropic engineers on average ship 8x as much code per quarter as they did compared to 2021-2025. https://t.co/QCc9cqGgf4

Anthropic的Mythos Preview以52倍的速度提升加速AI开发

Anthropic最新的AI模型Mythos Preview在研究和编码能力方面取得了显著进展。该模型在训练AI模型方面实现了52倍的速度提升,远超之前的版本和人类的表现。此外,Mythos Preview在解决复杂编码问题上成功率达到76%,并有望在一年内达到与人类相当的代码质量。这些改进正在加速AI开发,可能为AI系统设计其后继者的递归自我改进铺平道路。 AI

影响 加速AI开发和研究,可能使AI系统能够设计其自身的后继者,并为AI能力设定新的基准。

排序理由 Anthropic发布的Mythos Preview,详细介绍了显著的性能改进和AI自我改进的潜力,符合前沿发布的标准。

在 X — Anthropic 阅读 →

AI 生成摘要 · Google Gemini · 来自 7 个来源。 我们如何撰写摘要 →

Anthropic的Mythos Preview以52倍的速度提升加速AI开发

报道来源 [7]

  1. X — Anthropic TIER_1 English(EN) · AnthropicAI ·

    None of this guarantees recursive self-improvement is on the horizon. It’s not yet clear that Claude is capable of research judgment—of choosing the right probl

    None of this guarantees recursive self-improvement is on the horizon. It’s not yet clear that Claude is capable of research judgment—of choosing the right problems to work on. But if these trends continue, AI systems designing and building their own successors is plausible. This

  2. X — Anthropic TIER_1 English(EN) · AnthropicAI ·

    AI research is a series of next-step decisions. We looked at sessions where a human researcher took a wrong turn, showed Claude the session up to that point, an

    AI research is a series of next-step decisions. We looked at sessions where a human researcher took a wrong turn, showed Claude the session up to that point, and asked it what to do next. Mythos Preview improved on humans 64% of the time—up from 22% in 2024. https://t.co/Y0HLoktx…

  3. X — Anthropic TIER_1 English(EN) · AnthropicAI ·

    Each time we release a model, we run the same test: give it code that trains a small AI model, ask the new model to speed it up. It takes a skilled human 4-8 ho

    Each time we release a model, we run the same test: give it code that trains a small AI model, ask the new model to speed it up. It takes a skilled human 4-8 hours to reach 4x faster. In May 2024, Claude Opus 4 averaged a ~3x speedup. This April, Mythos Preview achieved ~52x.

  4. X — Anthropic TIER_1 English(EN) · AnthropicAI ·

    The speedup isn’t just in volume. On open-ended coding problems where answers are unclear, Claude’s success rate is now 76%—a 50 point jump in just 6 months.

    The speedup isn’t just in volume. On open-ended coding problems where answers are unclear, Claude’s success rate is now 76%—a 50 point jump in just 6 months. Many engineers also say Claude’s code quality is now on par with human code; we expect it to be better within the year. h…

  5. X — Anthropic TIER_1 English(EN) · AnthropicAI ·

    Today, Anthropic engineers on average ship 8x as much code per quarter as they did compared to 2021-2025. https://t.co/QCc9cqGgf4

    Today, Anthropic engineers on average ship 8x as much code per quarter as they did compared to 2021-2025. https://t.co/QCc9cqGgf4

  6. X — Anthropic TIER_1 English(EN) · AnthropicAI ·

    Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successo

    Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. https://t.co/OVVPJO7VQx

  7. r/singularity TIER_2 English(EN) · /u/Educational_Grab_473 ·

    Anthropic - Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor.

    <table> <tr><td> <a href="https://www.reddit.com/r/singularity/comments/1twsm5g/anthropic_our_internal_data_shows_claude_is/"> <img alt="Anthropic - Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously bui…