Analysis reveals severe errors in influential METR AI time horizons graph

By PulseAugur Editorial · [2 sources] · 2026-05-25 18:30

A recent analysis by Nathan Witkin, published in Transformer, identifies numerous severe errors in the widely cited METR AI time horizons graph. The flaws it cites include guesstimated human baseline data, hourly pay that gave human benchmarkers an incentive to take longer on tasks, a biased sample of human benchmarkers, and potential contamination between test and training data. The analysis concludes that the graph is too compromised to support meaningful conclusions and should be discarded in favor of more reliable information.

IMPACT Undermines claims of rapid AI advancement that rely on the graph and urges a focus on more rigorous research methodologies.

RANK_REASON The cluster critiques a widely cited graph, highlighting methodological flaws and calling for its dismissal, which constitutes commentary on AI research practices.

AI-generated summary · Google Gemini · from 2 sources.

COVERAGE [2]

r/MachineLearning TIER_1 English(EN) · /u/common_yarrow · 2026-05-25 18:30

The famous METR AI time horizons graph contains numerous severe errors [D]

Nathan Witkin, a research writer at NYU Stern’s Tech and Society Lab, writes (https://www.transformernews.ai/p/against-the-metr-graph-coding-capabilities-software-jobs-task-ai) damningly about the famous METR AI time horizons graph in…

r/Anthropic TIER_1 English(EN) · /u/common_yarrow · 2026-05-27 05:43

The famous METR AI time horizons graph contains numerous severe errors

https://www.reddit.com/r/Anthropic/comments/1towc1y/the_famous_metr_ai_time_horizons_graph_contains/
