PulseAugur
EN
LIVE 01:15:07

AI safety outsiders urged to focus on model specs and constitutions

The AI safety community should prioritize contributing to model specifications and constitutions, as these documents are publicly accessible and require no specialized ML knowledge. This approach allows external contributors to influence AI behavior by suggesting changes to natural language documents, which can be easily integrated by lab insiders. Focusing on these specifications is seen as a tractable way for outsiders to impact AI safety, especially in areas like macrostrategy and threat modeling that labs may overlook. AI

IMPACT Provides a strategic direction for external AI safety researchers to influence model behavior through accessible documentation.

RANK_REASON The cluster discusses a strategy for external AI safety researchers, rather than announcing a new model, research finding, or product.

Read on LessWrong (AI tag) →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

AI safety outsiders urged to focus on model specs and constitutions

COVERAGE [2]

  1. LessWrong (AI tag) TIER_1 English(EN) · Cleo Nardo ·

    Outsiders should focus on specs/constitutions

    <p><span>I think model specs/ constitutions are a good focus for the external AI safety community because:</span></p><ol><li value="1"><span>It’s a natural language document. So you don’t need to know any ML or engineering.</span></li><li value="2"><span>You don’t need to know ab…

  2. LessWrong (AI tag) TIER_1 English(EN) · Cleo Nardo ·

    Outsiders should focus on specs/constitutions (among other things)

    <p><span>I think that the external AI safety community should prioritise model specs/constitutions over the next 12 months. It shouldn't be our top priority,</span><span class="footnote-reference" id="fnrefnuoi01kvkhq"><sup><a href="#fnnuoi01kvkhq">[1]</a></sup></span><span> but …