PulseAugur
LIVE 10:41:36
ENTITY Model Evaluation & Threat Research

Model Evaluation & Threat Research

PulseAugur coverage of Model Evaluation & Threat Research — every cluster mentioning Model Evaluation & Threat Research across labs, papers, and developer communities, ranked by signal.

Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 2 TOTAL
  1. RESEARCH · CL_12648 ·

    METR proposes autonomy evaluation protocol for AI risks

    The Model Evaluation & Threat Research (METR) initiative has released an example protocol for assessing AI models' potential for autonomy-related risks. This protocol focuses on systems capable of executing harmful task…

  2. RESEARCH · CL_12649 ·

    METR releases guidelines for eliciting AI model capabilities and risks

    The Model Evaluation & Threat Research (METR) organization has published guidelines for assessing AI model capabilities, focusing on elicitation techniques. These guidelines aim to measure a model's potential performanc…