PulseAugur
EN
LIVE 14:17:32

New AI Safety Org Geodesic Research Targets Alignment Initialization

Geodesic Research, a new AI safety organization based in Cambridge, UK, is focusing on empirically building robust alignment initializations for large language models. The organization's research agenda targets the potential for misalignment to arise during long-horizon capabilities reinforcement learning, which may be difficult to correct later in the training process. Geodesic aims to develop methods for embedding persistent alignment priors early in model training, building on prior work in alignment pretraining and addressing its limitations in production environments. AI

IMPACT This new organization's focus on early alignment initialization could influence how future frontier models are trained and secured against misalignment.

RANK_REASON Launch of a new AI safety organization with a defined research agenda and empirical approach. [lever_c_demoted from significant: ic=1 ai=1.0]

Read on LessWrong (AI tag) →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New AI Safety Org Geodesic Research Targets Alignment Initialization

COVERAGE [1]

  1. LessWrong (AI tag) TIER_1 English(EN) · Puria ·

    Announcing Geodesic Research

    <p><a href="https://geodesicresearch.ai/" rel="noreferrer"><span>We</span></a><span>'re a Cambridge, UK-based AI safety organisation that’s asking: </span><i><span>how can we build the most robust alignment initialisations for capable LLMs</span></i><span>?</span></p><p><span>We’…