Brief

last 24h

[2/2] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

RESEARCH · METR (Model Evaluation & Threat Research) Italiano(IT) · 27mo

Example autonomy evaluation protocol

The Model Evaluation & Threat Research (METR) initiative has released an example protocol for assessing AI models' potential for autonomy-related risks. This protocol focuses on systems capable of executing harmful tasks end-to-end without human bottlenecks, including those that autonomously procure human assistance. METR aims for the evaluation to be practical, cost-effective, and completed by a small team within a month, with a budget of a few million dollars. The goal is to provide a continuous metric of dangerous capabilities to inform mitigation strategies and allow for societal oversight. AI

IMPACT Provides a framework for evaluating AI autonomy risks, potentially guiding safety investments and development.
- Model Evaluation & Threat Research
RESEARCH · METR (Model Evaluation & Threat Research) Italiano(IT) · 27mo

Guidelines for capability elicitation

The Model Evaluation & Threat Research (METR) organization has published guidelines for assessing AI model capabilities, focusing on elicitation techniques. These guidelines aim to measure a model's potential performance after some level of post-training enhancement, rather than its raw state. The process involves initial basic elicitation, followed by analysis of remaining failure modes to determine if they can be easily fixed with further effort. METR emphasizes the importance of considering finetuning, prompting, and tooling in threat modeling, especially for open-source or potentially modifiable models. AI

IMPACT Provides a framework for evaluating AI model safety and potential risks through structured capability elicitation.
- Model Evaluation & Threat Research

Brief

Example autonomy evaluation protocol

Guidelines for capability elicitation