Qwen3.5-35B-A3B router shows specific expert for self-reflection

By PulseAugur Editorial · [1 source] · 2026-06-07 02:37

A researcher has documented experiments on the Qwen3.5-35B-A3B model that examine how its Mixture-of-Experts (MoE) router behaves while the model generates first-person self-examination text. The findings suggest that one expert, E114 at Layer 14, is consistently recruited when the model enters this discourse mode, which distinguishes it from technical or third-person output. The work explores whether MoE routing can reveal internal correlates of output modes rather than only input features, and it stresses that the result does not imply model consciousness.
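As a rough illustration of what "consistently recruited" could mean operationally, the sketch below counts how often a given expert lands in a router's top-k set for each token, then compares that rate across two kinds of text. It is not the researcher's method. The expert count, the top-k value, the layer-agnostic setup, and the synthetic logits are all assumptions chosen for illustration, and every function name here is hypothetical.

```python
import random
from typing import Dict, List, Sequence

# Illustrative values only: NOT taken from the Qwen3.5-35B-A3B config.
NUM_EXPERTS = 128
TOP_K = 8
EXPERT_OF_INTEREST = 114  # stands in for "E114"


def top_k_experts(logits: Sequence[float], k: int) -> List[int]:
    """Return the indices of the k largest router logits for one token."""
    order = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)
    return order[:k]


def recruitment_rate(
    token_logits: Sequence[Sequence[float]], expert: int, k: int
) -> float:
    """Fraction of tokens whose top-k routing set includes `expert`."""
    if not token_logits:
        return 0.0
    hits = sum(expert in top_k_experts(row, k) for row in token_logits)
    return hits / len(token_logits)


def synthetic_logits(
    n_tokens: int, n_experts: int, boosted: int, boost: float, seed: int
) -> List[List[float]]:
    """Fake router logits: Gaussian noise with one expert shifted by `boost`.

    Real logits would be captured from the model's router at one layer;
    this generator only exists so the sketch runs on its own.
    """
    rng = random.Random(seed)
    rows = []
    for _ in range(n_tokens):
        row = [rng.gauss(0.0, 1.0) for _ in range(n_experts)]
        row[boosted] += boost
        rows.append(row)
    return rows


# Two hypothetical output modes: one where the expert is favoured, one where it is not.
samples: Dict[str, List[List[float]]] = {
    "first_person": synthetic_logits(200, NUM_EXPERTS, EXPERT_OF_INTEREST, 5.0, seed=0),
    "technical": synthetic_logits(200, NUM_EXPERTS, EXPERT_OF_INTEREST, 0.0, seed=1),
}

rates = {
    mode: recruitment_rate(rows, EXPERT_OF_INTEREST, TOP_K)
    for mode, rows in samples.items()
}

for mode, rate in rates.items():
    print(f"{mode}: expert {EXPERT_OF_INTEREST} selected on {rate:.1%} of tokens")
```

With uniform random routing an expert would be selected on roughly TOP_K / NUM_EXPERTS of tokens, so a rate well above that baseline in one output mode and near it in another is the kind of contrast the summary describes.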

IMPACT Investigates whether MoE routing decisions correlate with specific output modes, offering a new angle for mechanistic interpretability research.

RANK_REASON The cluster contains an experimental report on a specific aspect of an existing open-source model, detailing findings and methodology.

Read on r/MachineLearning →

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

r/MachineLearning TIER_1 English(EN) · /u/imstilllearningthis · 2026-06-07 02:37

Got told my open-source model experiments are too scattered. I'm organizing a journal to provide clarity before structuring the first git release. Is this readable for ML folks who aren’t in mech interp? Open to ANY feedback [D]

# Results Journal: Qwen3.5-35B-A3B E114 as a Generated-Register Routing Signal

Date: 2026-06-06

This is an experiment-history document, not a publication claim. It states the current best evidence for the strongest positive result i…
