Brief · PulseAugur

TOOL · arXiv cs.LG English(EN) · 7h

Function-Vector Heads Are Two Populations: Writers and Cancellers in In-Context Learning

Researchers have identified two distinct populations within function-vector (FV) heads in large language models, challenging the assumption that these heads are a homogeneous group. By employing a sign-preserving criterion instead of magnitude-only ranking, they found that FV heads either push correct logits up (writers) or push them down (cancellers). This dual nature was observed across multiple model families and scales, and zero-ablating cancellers led to improved accuracy. AI

IMPACT Reveals a more nuanced understanding of how LLMs process information, potentially impacting future model interpretability and design.

Pythia
Todd et al.
Function-vector heads