Researchers have developed a challenging, 18-month-long test designed to measure the intelligence of AI systems. The test was created because AI systems quickly surpassed previous benchmarks. This more rigorous evaluation aims to provide a more accurate and enduring assessment of AI capabilities.
IMPACT This new benchmark could provide a more accurate and lasting measure of AI progress, guiding future development.
RANK_REASON The cluster describes the creation of a new benchmark for AI, which falls under research.
AI-generated summary · Google Gemini · from 1 source.