PulseAugur
EN
LIVE 08:47:46

Researchers pinpoint 'first-token broadcasters' controlling language identity in transformers

Researchers have identified specific attention heads in transformer models, termed 'first-token broadcasters,' that are crucial for maintaining a model's language identity. These heads, particularly prominent in models like GPT-2 and instruction-tuned Qwen2.5, persistently attend to the initial token of a prompt, thereby propagating the intended language signal throughout the generation process. Experiments using Language Identity Head Ablation (LIHA) demonstrate that instruction tuning significantly localizes this language signaling mechanism to early layers of the model, a contrast to base models where the influence is more distributed. AI

IMPACT Provides a mechanistic understanding of language drift in LLMs, potentially leading to improved control and robustness in multilingual models.

RANK_REASON Academic paper detailing a new mechanistic insight into transformer model behavior. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Researchers pinpoint 'first-token broadcasters' controlling language identity in transformers

COVERAGE [1]

  1. arXiv cs.AI TIER_1 English(EN) · Anjelo Jann Laroza ·

    First-Token Broadcasters: Mechanistic Origins of Language Identity and Distributed Robustness in Transformers

    Why do multilingual language models sometimes generate in the wrong language, and why is this so hard to fix? We introduce Language Identity Head Ablation (LIHA), a causal intervention that zeros each attention head individually and measures the resulting language switch rate acr…