PulseAugur
EN
LIVE 02:23:49

Minimal circuits for indirect object identification found in attention-only transformers

Researchers have identified minimal computational circuits responsible for indirect object identification (IOI) in attention-only transformers. By training small, single-layer models from scratch on a symbolic IOI task, they discovered that just two attention heads were sufficient for perfect accuracy, even without MLPs or normalization layers. Further analysis revealed these heads specialize into additive and contrastive subcircuits that work together to resolve IOI, demonstrating that task-specific training can induce highly interpretable and minimal reasoning circuits in transformers. AI

IMPACT Provides insights into the fundamental reasoning capabilities and interpretability of transformer architectures.

RANK_REASON Academic paper detailing findings on transformer interpretability. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Minimal circuits for indirect object identification found in attention-only transformers

COVERAGE [1]

  1. arXiv cs.AI TIER_1 English(EN) · Rabin Adhikari ·

    Emergence of Minimal Circuits for Indirect Object Identification in Attention-Only Transformers

    arXiv:2510.25013v2 Announce Type: replace-cross Abstract: Mechanistic interpretability aims to reverse-engineer large language models (LLMs) into human-understandable computational circuits. However, the complexity of pretrained models often obscures the minimal mechanisms requir…