English(EN) Effective Context in Transformers: An Analysis of Fragmentation and Tokenization

论文分析数据表示如何影响Transformer上下文

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-13 13:08

一篇新论文分析了数据（如字节、字符或子词标记）的不同表示如何影响Transformer模型的性能。该研究引入了“分片”的概念，解释了为什么即使有更大的上下文窗口，更小的单元也会降低预测准确性。相反，该研究表明分词如何有效地扩展感知到的上下文窗口，为理解Transformer中的表示选择提供了一个框架。 AI

影响为理解数据表示选择如何影响Transformer模型性能和上下文处理提供了理论框架。

排序理由该集群包含一篇讨论Transformer模型及其数据表示的理论方面的学术论文。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 English(EN) · Aslan Tchamkerten · 2026-05-13 13:08

Effective Context in Transformers: An Analysis of Fragmentation and Tokenization

Transformers predict over a representation of a sequence. The same data can be written as bytes, characters, or subword tokens, and these representations may be lossless. Yet, under a fixed context window, they need not expose the same information to the model. This raises a basi…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-13 13:08

Effective Context in Transformers: An Analysis of Fragmentation and Tokenization

Transformers predict over a representation of a sequence. The same data can be written as bytes, characters, or subword tokens, and these representations may be lossless. Yet, under a fixed context window, they need not expose the same information to the model. This raises a basi…

报道来源 [2]

Effective Context in Transformers: An Analysis of Fragmentation and Tokenization

Effective Context in Transformers: An Analysis of Fragmentation and Tokenization

相关实体

相关话题