ENTITY
Model Evaluation & Threat Research
Model Evaluation & Threat Research
PulseAugur coverage of Model Evaluation & Threat Research — every cluster mentioning Model Evaluation & Threat Research across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 2 TOTAL
-
METR proposes autonomy evaluation protocol for AI risks
The Model Evaluation & Threat Research (METR) initiative has released an example protocol for assessing AI models' potential for autonomy-related risks. This protocol focuses on systems capable of executing harmful task…
-
METR releases guidelines for eliciting AI model capabilities and risks
The Model Evaluation & Threat Research (METR) organization has published guidelines for assessing AI model capabilities, focusing on elicitation techniques. These guidelines aim to measure a model's potential performanc…