A researcher has documented experiments with the Qwen3.5-35B-A3B model, focusing on how its Mixture-of-Experts (MoE) router behaves when the model generates first-person self-examination text. The findings suggest that a specific expert, E114 at Layer 14, is consistently recruited when the model enters this discourse mode, which distinguishes it from technical or third-person outputs. The work explores whether MoE routers can reveal internal correlates of output modes rather than just input features. The researcher emphasizes that this does not imply model consciousness.
IMPACT Investigates whether MoE router activity correlates with specific output modes, offering a new angle for mechanistic interpretability research.
RANK_REASON The cluster contains an experimental report on a specific aspect of an existing open-source model, detailing findings and methodology.
AI-generated summary · Google Gemini · from 1 source.