Beyond Routing: Characterising Expert Tuning and Representation in Vision Mixture-of-Experts
Researchers have developed new methods to understand the internal workings of Mixture-of-Experts (MoE) models in computer vision. By analyzing how different visual categories are routed to specific experts and examining the tuning of these experts to various inputs, they found that an animate-inanimate distinction is a dominant factor in expert partitioning. The study reveals that experts tune to broader, continuous visual and semantic dimensions beyond simple category boundaries, highlighting the benefits of moving beyond basic routing analyses for a deeper understanding of MoE specialization. AI
IMPACT Provides novel methods for interpreting the specialized functions within complex vision models, advancing AI research.