PulseAugur
实时 21:42:46

LLMs struggle to retrieve info from middle of long context windows

Researchers have identified a significant drop in retrieval accuracy for LLMs when crucial information is placed in the middle of long context windows. This phenomenon, termed "lost in the middle," shows models perform well with information at the beginning or end of a prompt but struggle with data in the center. The issue stems from the attention mechanism's tendency to dilute positional signals and favor edge tokens, leading to degraded performance for middle-positioned content. Developers are advised to "edge-load" critical context, placing important facts and instructions at the prompt's start or end to improve retrieval accuracy. AI

影响 Developers must strategically position critical information at the beginning or end of prompts to ensure LLMs can accurately retrieve it from long context windows.

排序理由 The cluster describes a research finding about LLM behavior and provides practical advice based on that finding. [lever_c_demoted from research: ic=1 ai=1.0]

在 dev.to — LLM tag 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

LLMs struggle to retrieve info from middle of long context windows

报道来源 [1]

  1. dev.to — LLM tag TIER_1 English(EN) · A3E Ecosystem ·

    Retrieval accuracy falls roughly 50% when the answer sits in the middle of a long context window instead of at the edges

    <p>Retrieval accuracy falls roughly 50% when the answer sits in the middle of a long context window instead of at the edges. Liu et al. (2023) measured this across multiple transformer models in their "Lost in the Middle" study. The U-shaped performance curve is consistent. Model…