English(EN) GPT-3.5-Turbo drops from 90% accuracy to 50% when the answer sits in the middle of a 20k-token prompt instead of the sta

GPT-3.5-Turbo在长提示中处理中间信息时遇到困难

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-05 11:02

一项研究发现，当答案位于长提示（特别是20k token上下文窗口）的中间时，GPT-3.5-Turbo的准确率会显著下降。这一现象在论文“Lost in the Middle: How Language Models Use Long Contexts”中有记载，归因于Transformer模型中的注意力模式，这种模式偏好提示的开头或结尾的信息，而忽略中间部分。这个问题并非检索错误，而是由于训练数据限制导致模型注意力权重在中心区域衰减。 AI

影响突显了当前大型语言模型在需要从长文档中检索信息的任务中的一个关键限制，需要采用重新排序策略，而不是仅仅增加上下文窗口大小。

排序理由该集群描述了一篇研究论文的发现，该论文关于特定模型在长上下文中的行为。[lever_c_demoted from research: ic=1 ai=1.0]

在 dev.to — LLM tag 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

dev.to — LLM tag TIER_1 English(EN) · A3E Ecosystem · 2026-06-05 11:02

当答案位于 20k token 提示的中间时，GPT-3.5-Turbo 的准确率从 90% 下降到 50%，而不是在提示的开头

<p>GPT-3.5-Turbo drops from 90% accuracy to 50% when the answer sits in the middle of a 20k-token prompt instead of the start or end. Liu et al. (2023) documented this in "Lost in the Middle: How Language Models Use Long Contexts" at ACL. The edges of your context window are prim…

报道来源 [1]

当答案位于 20k token 提示的中间时，GPT-3.5-Turbo 的准确率从 90% 下降到 50%，而不是在提示的开头

相关实体

相关话题