Brief · PulseAugur

RESEARCH · arXiv cs.CL English(EN) · 4d · [3 sources]

Multilingual Steering by Design: Multilingual Sparse Autoencoders and Principled Layer Selection

Researchers have developed a new method for improving multilingual language control in large language models using sparse autoencoders (SAEs). Their approach involves training SAEs on multilingual data to enhance cross-lingual representations and introduces a principled rule for selecting effective layers for intervention. This method stabilizes the balance between language identification accuracy and generation quality, offering a more reliable way to steer LLMs across different languages. AI

IMPACT This research offers a more principled and reliable method for controlling multilingual LLMs, potentially improving cross-lingual tasks like translation and summarization.

LLaMA-3.1-8B
Gemma-2-9B
Sparse Autoencoders
Yusser Al Ghussin
Large Language Models