We Built the Hardest Test in Human History to Measure AI. It Lasted 18 Months.

Developing an AI intelligence benchmark to keep pace with rapidly advancing AI

By the PulseAugur Editorial Team · [1 source] · 2026-06-26 23:01

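Researchers built a demanding test to measure the intelligence of AI systems; according to the source headline, it lasted 18 months. The test was created because earlier AI benchmarks were surpassed quickly. The new, stricter evaluation is meant to give a more accurate and longer-lasting assessment of AI capabilities.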

Impact: This new benchmark could provide a more accurate and more durable measure of AI progress, helping to guide future development.

Ranking rationale: This cluster describes the creation of a new AI benchmark, which falls under research. [lever_c_demoted from research: ic=1 ai=1.0]

AI-generated summary · Google Gemini · from 1 source.

Coverage sources [1]

Towards AI TIER_1 English(EN) · Harish K · 2026-06-26 23:01

We Built the Hardest Test in Human History to Measure AI. It Lasted 18 Months.

https://pub.towardsai.net/we-built-the-hardest-test-in-human-history-to-measure-ai-it-lasted-18-months-d95631265107?source=rss----98111c9905da---4