English(EN) SoftSAE: Dynamic Top-K Selection for Adaptive Sparse Autoencoders

SoftSAE 引入动态稀疏性以实现自适应神经网络可解释性

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-07 17:28

研究人员推出 SoftSAE，这是一种新颖的自适应稀疏自编码器，旨在提高神经网络的可解释性。与使用固定数量特征的传统方法不同，SoftSAE 根据单个输入的复杂性动态调整稀疏度级别。这使得模型能够为每个数据样本选择适当数量的特征，从而获得更准确、信息量更大的表示。SoftSAE 的源代码是公开可用的。 AI

影响通过根据输入复杂性调整特征选择，增强了 LLM 和 ViT 的可解释性。

排序理由该集群包含一篇详细介绍新研究方法的 arXiv 预印本。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.LG TIER_1 English(EN) · Jakub St\k{e}pie\'n, Marcin Mazur, Jacek Tabor, Przemys{\l}aw Spurek · 2026-05-08 04:00

SoftSAE：自适应稀疏自编码器的动态 Top-K 选择

arXiv:2605.06610v1 Announce Type: new Abstract: Sparse Autoencoders (SAEs) have become an important tool in mechanistic interpretability, helping to analyze internal representations in both Large Language Models (LLMs) and Vision Transformers (ViTs). By decomposing polysemantic a…
arXiv cs.CV TIER_1 English(EN) · Przemysław Spurek · 2026-05-07 17:28

SoftSAE：自适应稀疏自编码器的动态 Top-K 选择

Sparse Autoencoders (SAEs) have become an important tool in mechanistic interpretability, helping to analyze internal representations in both Large Language Models (LLMs) and Vision Transformers (ViTs). By decomposing polysemantic activations into sparse sets of monosemantic feat…

报道来源 [2]

SoftSAE：自适应稀疏自编码器的动态 Top-K 选择

SoftSAE：自适应稀疏自编码器的动态 Top-K 选择

相关实体

相关话题