The Truth Lies Somewhere in the Middle (of the Generated Tokens)

New research suggests that mean pooling over generated tokens improves state representations of large language models

By the PulseAugur editorial team · [1 source] · 2026-05-11 04:20

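A new research paper proposes that mean pooling the hidden states of generated tokens is a better way to capture a language model's internal state. By aggregating information that is spread across many tokens, the method yields richer semantic representations than any single token used on its own. The study also finds that representations derived from generated tokens outperform those derived from prompt tokens, which offers insight into the dynamics of model behavior.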
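To make the pooling idea concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the paper's implementation: the helper names `mean_pool` and `pool_generated`, the `prompt_len` parameter, and the toy 2-dimensional hidden states are all hypothetical. In practice the hidden states would come from a chosen layer of a real model.

```python
from typing import List, Sequence

Vector = List[float]


def mean_pool(states: Sequence[Sequence[float]]) -> Vector:
    """Average a list of per-token hidden-state vectors, dimension by dimension."""
    if not states:
        raise ValueError("cannot pool over an empty set of tokens")
    dim = len(states[0])
    return [sum(row[i] for row in states) / len(states) for i in range(dim)]


def pool_generated(hidden_states: Sequence[Sequence[float]], prompt_len: int) -> Vector:
    """Mean-pool only the tokens that come after the prompt (hypothetical helper).

    hidden_states holds one vector per token: prompt tokens first,
    then the autoregressively generated tokens.
    """
    return mean_pool(hidden_states[prompt_len:])


# Toy data (made up for illustration): 2 prompt tokens, then 3 generated tokens.
hidden_states = [
    [1.0, 0.0],  # prompt token
    [3.0, 0.0],  # prompt token
    [0.0, 2.0],  # generated token
    [0.0, 4.0],  # generated token
    [0.0, 6.0],  # generated token
]
prompt_len = 2

generated_repr = pool_generated(hidden_states, prompt_len)  # mean over generated tokens
prompt_repr = mean_pool(hidden_states[:prompt_len])         # mean over prompt tokens
last_token_repr = list(hidden_states[-1])                   # single-token alternative

print(generated_repr, prompt_repr, last_token_repr)
```

The three vectors correspond to the alternatives the summary contrasts: pooling over generated tokens, pooling over prompt tokens, and relying on a single token. Which one works best on real data is an empirical claim of the paper that this toy example does not test.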

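Impact: This research may lead to more accurate and more interpretable internal representations of language models, which could in turn improve downstream applications.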

Ranking rationale: This cluster contains an academic paper that details a novel method for representing language model state.

AI-generated summary · Google Gemini · from 1 source.

Sources [1]

arXiv cs.CL TIER_1 English(EN) · Brian Cheung · 2026-05-11 04:20

The Truth Lies Somewhere in the Middle (of the Generated Tokens)

How should hidden states generated autoregressively be collapsed into a representation that reflects a language model's internal state? Despite tokens being generated under causal masking, we find that mean pooling across their hidden states yields more semantic representations t…