PulseAugur
EN
LIVE 08:56:27

LLM function-vector heads split into 'writers' and 'cancellers'

Researchers have identified two distinct populations within function-vector (FV) heads in large language models, challenging the assumption that these heads are a homogeneous group. By employing a sign-preserving criterion instead of magnitude-only ranking, they found that FV heads either push correct logits up (writers) or push them down (cancellers). This dual nature was observed across multiple model families and scales, and zero-ablating cancellers led to improved accuracy. AI

IMPACT Reveals a more nuanced understanding of how LLMs process information, potentially impacting future model interpretability and design.

RANK_REASON Academic paper detailing novel findings about LLM internal mechanisms. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.LG TIER_1 English(EN) · Han-yu Wang ·

    Function-Vector Heads Are Two Populations: Writers and Cancellers in In-Context Learning

    arXiv:2606.07560v1 Announce Type: cross Abstract: Function-vector (FV) heads (Todd et al., 2024) are typically identified by the magnitude of their causal contribution to in-context rule tasks, under the implicit assumption that the top set is a homogeneous functional class. This…