METR
PulseAugur coverage of METR — every cluster mentioning METR across labs, papers, and developer communities, ranked by signal.
-
Technical workers report 1.4-2x value increase from AI tools
A survey of 349 technical workers, conducted between February and April 2026, suggests that AI tools are raising productivity. Participants self-reported a median increase of 1.4 to 2 times in th…
-
Mythos AI shows self-replication prowess amid measurement and governance debates
New reports indicate that the AI model Mythos demonstrates significant capabilities, particularly in self-replication tasks when given access to vulnerable systems. Discussions also highlight the challenges in accuratel…
-
AI evaluation lags behind model capabilities, security risks rise
The METR evaluation framework struggles to accurately measure the capabilities of Anthropic's Claude Mythos, with only a small fraction of its tests being relevant. Concurrently, Palo Alto Networks has identified that a…