We spent 2 hours working in the future

METR researchers simulate AI agents with a 200-hour time horizon, find 3-5x workflow speedups

By the PulseAugur editorial team · [1 source] · 2026-03-19 07:00

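Researchers at METR ran a tabletop exercise that simulated working with AI agents that have a 200-hour time horizon, a projection of capabilities that may be available in roughly 12-18 months. The exercise aimed to understand emerging workflows and potential productivity gains. Participants found that such agents could significantly speed up task completion and enable rapid prototyping and iteration, but the exercise also exposed bottlenecks in prioritization and organization.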

Ranking rationale: This cluster describes a tabletop exercise simulating future AI capabilities, not a current release or benchmark.

AI-generated summary · Google Gemini · based on 1 source.

Sources [1]

METR (Model Evaluation & Threat Research) TIER_1 English(EN) · 2026-03-19 07:00

We spent 2 hours working in the future

Introduction

METR aims to keep the public informed about the capabilities of and risks posed by AI, by some metrics the fastest-moving technology in history, and one that could speed up further as AI automates AI R&D. By late next year, the rate…