metre
PulseAugur coverage of metre — every cluster mentioning metre across labs, papers, and developer communities, ranked by signal.
No coverage in the last 90 days.
- 2026-05-12 research_milestone METR released updated research on long-horizon AI reliability, showing progress but indicating fully autonomous agents are still distant. source
4 day(s) with sentiment data
-
OpenAI releases o3 and o4-mini models with advanced reasoning and tool capabilities
OpenAI has released its new o3 and o4-mini models, which represent a significant advancement in reasoning capabilities and tool integration within ChatGPT. The o3 model is positioned as OpenAI's most powerful reasoning …
-
METR finds GPT-4o shows impressive agent skills but suffers fixable failures
METR has released preliminary findings from an evaluation of GPT-4o's autonomous capabilities across 77 tasks. The model demonstrated impressive skills like systematic exploration but also exhibited failure modes such a…
-
METR proposes autonomy evaluation protocol for AI risks
The Model Evaluation & Threat Research (METR) initiative has released an example protocol for assessing AI models' potential for autonomy-related risks. This protocol focuses on systems capable of executing harmful task…
-
METR releases guidelines for eliciting AI model capabilities and risks
The Model Evaluation & Threat Research (METR) organization has published guidelines for assessing AI model capabilities, focusing on elicitation techniques. These guidelines aim to measure a model's potential performanc…
-
METR measures GPT-4 post-training enhancements, finding significant capability gains
Researchers at METR have conducted experiments to measure the impact of post-training enhancements on AI agent capabilities. Their findings indicate that OpenAI's own post-training efforts on GPT-4 significantly boosted…
-
2023 Year In Review
METR, an AI safety research organization, detailed its 2023 accomplishments, including developing methodologies for evaluating AI agents on autonomous tasks and contributing to OpenAI's GPT-4 system card. The organizati…
-
OpenAI partners with US National Labs, proposes AI policy to White House
OpenAI has submitted proposals to the White House Office of Science and Technology for the US AI Action Plan, focusing on strengthening American AI leadership through regulatory, export control, copyright, and infrastru…