New framework analyzes concept representations in neural models

By the PulseAugur Editorial Team · [1 source] · 2026-05-05 04:00

Researchers have introduced a framework for analyzing how neural models represent human-interpretable concepts. It characterizes concept subspaces within a model along two axes: containment and disentanglement. In experiments on text and speech models, the choice of estimation method significantly affected both properties. In speech models, phone information was well represented, whereas speaker information was harder to isolate.

Impact: Introduces a new framework for understanding internal model representations, which could aid interpretability and bias detection.

Ranking rationale: A research paper describing a new framework for analyzing concept representations in neural models.

Read on arXiv cs.CL →

arXiv
HuBERT

AI-generated summary · Google Gemini · from 1 source. How we write summaries →

Sources [1]

arXiv cs.CL · TIER_1 · English (EN) · Burin Naowarat, Hao Tang, Sharon Goldwater · 2026-05-05 04:00

A framework for analyzing concept representations in neural models

arXiv:2605.01381v1 · Abstract: Understanding how neural models represent human-interpretable concepts is challenging. Prior work has explored linear concept subspaces from diverse perspectives, such as probing and concept erasure. We introduce a unified framework…

