PulseAugur
LIVE 09:02:08
research · [46 sources] ·
0
research

New research explores advanced memory and retrieval for AI agents

Researchers are developing new methods to enhance the capabilities of AI agents, particularly in handling long contexts and complex reasoning tasks. Several papers propose novel approaches to memory management and retrieval, aiming to overcome limitations in current systems. These advancements include techniques for guided rereading, unified memory paradigms for network infrastructure, and benchmarks for multimodal agentic search, all contributing to more robust and efficient AI agents. AI

Summary written by gemini-2.5-flash-lite from 46 sources. How we write summaries →

IMPACT Advances in memory and retrieval for AI agents could lead to more capable systems for complex reasoning and enterprise knowledge management.

RANK_REASON Multiple arXiv papers introduce novel methods and benchmarks for AI agent capabilities.

Read on arXiv cs.AI →

COVERAGE [46]

  1. arXiv cs.CL TIER_1 · Kai-Wei Chang ·

    LongMemEval-V2: Evaluating Long-Term Agent Memory Toward Experienced Colleagues

    Long-term memory is crucial for agents in specialized web environments, where success depends on recalling interface affordances, state dynamics, workflows, and recurring failure modes. However, existing memory benchmarks for agents mostly focus on user histories, short traces, o…

  2. arXiv cs.AI TIER_1 · William Parris ·

    Semantic Reward Collapse and the Preservation of Epistemic Integrity in Adaptive AI Systems

    Recent advances in reinforcement learning from human feedback (RLHF) and preference optimization have substantially improved the usability, coherence, and safety of large language models. However, recurring behaviors such as performative certainty, hallucinated continuity, calibr…

  3. arXiv cs.CL TIER_1 · Qiuzhuang Sun ·

    PRISM: Pareto-Efficient Retrieval over Intent-Aware Structured Memory for Long-Horizon Agents

    Long-horizon language agents accumulate conversation history far faster than any fixed context window can hold, making memory management critical to both answer accuracy and serving cost. Existing approaches either expand the context window without addressing what is retrieved, p…

  4. arXiv cs.AI TIER_1 · Scott Sanner ·

    Goal-Oriented Reasoning for RAG-based Memory in Conversational Agentic LLM Systems

    LLM-based conversational AI agents struggle to maintain coherent behavior over long horizons due to limited context. While RAG-based approaches are increasingly adopted to overcome this limitation by storing interactions in external memory modules and performing retrieval from th…

  5. arXiv cs.AI TIER_1 · Zenglin Xu ·

    Remember the Decision, Not the Description: A Rate-Distortion Framework for Agent Memory

    Long-horizon language agents must operate under limited runtime memory, yet existing memory mechanisms often organize experience around descriptive criteria such as relevance, salience, or summary quality. For an agent, however, memory is valuable not because it faithfully descri…

  6. arXiv cs.AI TIER_1 · Jimmy Lin ·

    Rethinking Agentic Search with Pi-Serini: Is Lexical Retrieval Sufficient?

    Does a lexical retriever suffice as large language models (LLMs) become more capable in an agentic loop? This question naturally arises when building deep research systems. We revisit it by pairing BM25 with frontier LLMs that have better reasoning and tool-use abilities. To supp…

  7. arXiv cs.AI TIER_1 · Min Zhang ·

    MemReread: Enhancing Agentic Long-Context Reasoning via Memory-Guided Rereading

    To tackle long-context reasoning tasks without the quadratic complexity of standard attention mechanisms, approaches based on agent memory have emerged, which typically maintain a dynamically updated memory when linearly processing document chunks. To mitigate the potential loss …

  8. arXiv cs.AI TIER_1 · Tony Q. S. Quek ·

    Bridging the Cognitive Gap: A Unified Memory Paradigm for 6G Agentic AI-RAN

    As 6G evolves, the radio access network must transcend traditional automation to embrace agentic AI capable of perception, reasoning, and evolution. A fundamental cognitive gap persists in current disaggregated architectures, where interfaces force the physical layer to compress …

  9. arXiv cs.CL TIER_1 · Jianfei Yang ·

    InterLV-Search: Benchmarking Interleaved Multimodal Agentic Search

    Existing benchmarks for multimodal agentic search evaluate multimodal search and visual browsing, but visual evidence is either confined to the input or treated as an answer endpoint rather than part of an interleaved search trajectory. We introduce \textbf{InterLV-Search}, a ben…

  10. arXiv cs.CL TIER_1 · Junfeng Liao, Qizhou Wang, Jianing Zhu, Bo Du, Rui Yan, Xiuying Chen ·

    Belief Memory: Agent Memory Under Partial Observability

    arXiv:2605.05583v1 Announce Type: cross Abstract: LLM agents that operate over long context depend on external memory to accumulate knowledge over time. However, existing methods typically store each observation as a single deterministic conclusion (e.g., inferring "API~X failed"…

  11. arXiv cs.LG TIER_1 · Yijia Zheng, Marcel Worring ·

    LatentRAG: Latent Reasoning and Retrieval for Efficient Agentic RAG

    arXiv:2605.06285v1 Announce Type: cross Abstract: Single-step retrieval-augmented generation (RAG) provides an efficient way to incorporate external information for simple question answering tasks but struggles with complex questions. Agentic RAG extends this paradigm by replacin…

  12. arXiv cs.LG TIER_1 · Zeyu Yang, Qi Ma, Jason Chen, Anshumali Shrivastava ·

    Superintelligent Retrieval Agent: The Next Frontier of Information Retrieval

    arXiv:2605.06647v1 Announce Type: cross Abstract: Retrieval-augmented agents are increasingly the interface to large organizational knowledge bases, yet most still treat retrieval as a black box: they issue exploratory queries, inspect returned snippets, and iteratively reformula…

  13. arXiv cs.CL TIER_1 · Chunyu Li, Jingyi Kang, Ding Chen, Mengyuan Zhang, Jiajun Shen, Bo Tang, Xuanhe Zhou, Feiyu Xiong, Zhiyu Li ·

    MemReranker: Reasoning-Aware Reranking for Agent Memory Retrieval

    arXiv:2605.06132v1 Announce Type: new Abstract: In agent memory systems, the reranking model serves as the critical bridge connecting user queries with long-term memory. Most systems adopt the "retrieve-then-rerank" two-stage paradigm, but generic reranking models rely on semanti…

  14. arXiv cs.AI TIER_1 · Susheel Suresh, Hazel Mak, Shangpo Chou, Fred Kroon, Sahil Bhatnagar ·

    AgenticRAG: Agentic Retrieval for Enterprise Knowledge Bases

    arXiv:2605.05538v1 Announce Type: new Abstract: We present AgenticRAG, a practical agentic harness for retrieval and analysis over enterprise knowledge bases. Standard RAG pipelines place significant burden of grounding on the search stack, constraining the language model to a fi…

  15. arXiv cs.AI TIER_1 · Huyu Wu, Jun Liu, Xiaochi Wei, Yan Gao, Yi Wu, Yao Hu ·

    Knowledge-Graph Paths as Intermediate Supervision for Self-Evolving Search Agents

    arXiv:2605.05702v1 Announce Type: new Abstract: Self-evolving search agents reduce reliance on human-written training questions by generating and solving their own search tasks. We build on Search Self-Play (SSP), a representative Proposer and Solver framework in which questions …

  16. arXiv cs.AI TIER_1 · Zhuofeng Li, Haoxiang Zhang, Cong Wei, Pan Lu, Ping Nie, Yi Lu, Yuyang Bai, Shangbin Feng, Hangxiao Zhu, Ming Zhong, Yuyu Zhang, Jianwen Xie, Yejin Choi, James Zou, Jiawei Han, Wenhu Chen, Jimmy Lin, Dongfu Jiang, Yu Zhang ·

    Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

    arXiv:2605.05242v1 Announce Type: cross Abstract: Modern retrieval systems, whether lexical or semantic, expose a corpus through a fixed similarity interface that compresses access into a single top-k retrieval step before reasoning. This abstraction is efficient, but for agentic…

  17. arXiv cs.AI TIER_1 · Yuxiang Zhang, Jiangming Shu, Ye Ma, Xueyuan Lin, Shangxi Wu, Jitao Sang ·

    Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks

    arXiv:2510.12635v3 Announce Type: replace Abstract: Long-context Large Language Models, despite their expanded capacity, require careful working memory management to mitigate attention dilution during long-horizon tasks. Yet existing approaches rely on external mechanisms that la…

  18. arXiv cs.AI TIER_1 · Spyros Galanis ·

    Information Aggregation with AI Agents

    arXiv:2604.20050v2 Announce Type: replace-cross Abstract: Can Large Language Models (AI agents) aggregate dispersed private information through trading and reason about the knowledge of others by observing price movements? We conduct a controlled experiment where AI agents trade …

  19. arXiv cs.AI TIER_1 · Anshumali Shrivastava ·

    Superintelligent Retrieval Agent: The Next Frontier of Information Retrieval

    Retrieval-augmented agents are increasingly the interface to large organizational knowledge bases, yet most still treat retrieval as a black box: they issue exploratory queries, inspect returned snippets, and iteratively reformulate until useful evidence emerges. This approach re…

  20. arXiv cs.CL TIER_1 · Marcel Worring ·

    LatentRAG: Latent Reasoning and Retrieval for Efficient Agentic RAG

    Single-step retrieval-augmented generation (RAG) provides an efficient way to incorporate external information for simple question answering tasks but struggles with complex questions. Agentic RAG extends this paradigm by replacing single-step retrieval with a multi-step process,…

  21. Hugging Face Daily Papers TIER_1 ·

    LatentRAG: Latent Reasoning and Retrieval for Efficient Agentic RAG

    Single-step retrieval-augmented generation (RAG) provides an efficient way to incorporate external information for simple question answering tasks but struggles with complex questions. Agentic RAG extends this paradigm by replacing single-step retrieval with a multi-step process,…

  22. arXiv cs.CL TIER_1 · Zhiyu Li ·

    MemReranker: Reasoning-Aware Reranking for Agent Memory Retrieval

    In agent memory systems, the reranking model serves as the critical bridge connecting user queries with long-term memory. Most systems adopt the "retrieve-then-rerank" two-stage paradigm, but generic reranking models rely on semantic similarity matching and lack genuine reasoning…

  23. arXiv cs.CL TIER_1 · Joshua Adler, Guy Zehavi ·

    Storage Is Not Memory: A Retrieval-Centered Architecture for Agent Recall

    arXiv:2605.04897v1 Announce Type: new Abstract: Extraction at ingestion is the wrong primitive for agent memory: content discarded before the query is known cannot be recovered at retrieval time. We propose True Memory, a six-layer architecture that shifts the center of the syste…

  24. arXiv cs.AI TIER_1 · Siheng Chen ·

    LongSeeker: Elastic Context Orchestration for Long-Horizon Search Agents

    Long-horizon search agents must manage a rapidly growing working context as they reason, call tools, and observe information. Naively accumulating all intermediate content can overwhelm the agent, increasing costs and the risk of errors. We propose that effective context manageme…

  25. arXiv cs.CL TIER_1 · Guy Zehavi ·

    Storage Is Not Memory: A Retrieval-Centered Architecture for Agent Recall

    Extraction at ingestion is the wrong primitive for agent memory: content discarded before the query is known cannot be recovered at retrieval time. We propose True Memory, a six-layer architecture that shifts the center of the system from a storage schema to a multi-stage retriev…

  26. arXiv cs.AI TIER_1 · Altan Cakir, Ayca Yerlikaya ·

    From Experimental Limits to Physical Insight: A Retrieval-Augmented Multi-Agent Framework for Interpreting Searches Beyond the Standard Model

    arXiv:2605.02491v1 Announce Type: cross Abstract: Modern searches for physics beyond the Standard Model produce rapidly expanding literature containing heterogeneous information, including textual analyses, numerical datasets, and graphical exclusion limits. Integrating these dis…

  27. arXiv cs.CL TIER_1 · Yilun Zhao, Jinbiao Wei, Tingyu Song, Siyue Zhang, Chen Zhao, Arman Cohan ·

    Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems

    arXiv:2605.04018v1 Announce Type: new Abstract: Reasoning-intensive retrieval aims to surface evidence that supports downstream reasoning rather than merely matching topical similarity. This capability is increasingly important for agentic search systems, where retrievers must pr…

  28. arXiv cs.CL TIER_1 · Arman Cohan ·

    Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems

    Reasoning-intensive retrieval aims to surface evidence that supports downstream reasoning rather than merely matching topical similarity. This capability is increasingly important for agentic search systems, where retrievers must provide complementary evidence across iterative se…

  29. arXiv cs.AI TIER_1 · Ayca Yerlikaya ·

    From Experimental Limits to Physical Insight: A Retrieval-Augmented Multi-Agent Framework for Interpreting Searches Beyond the Standard Model

    Modern searches for physics beyond the Standard Model produce rapidly expanding literature containing heterogeneous information, including textual analyses, numerical datasets, and graphical exclusion limits. Integrating these distributed sources remains a time-consuming and manu…

  30. Hugging Face Daily Papers TIER_1 ·

    From Experimental Limits to Physical Insight: A Retrieval-Augmented Multi-Agent Framework for Interpreting Searches Beyond the Standard Model

    Modern searches for physics beyond the Standard Model produce rapidly expanding literature containing heterogeneous information, including textual analyses, numerical datasets, and graphical exclusion limits. Integrating these distributed sources remains a time-consuming and manu…

  31. Forbes — Innovation TIER_1 · Liran Zvibel, Forbes Councils Member ·

    AI’s Memory Crisis Is Here: Don’t Hoard, Optimize

    The AI industry has been papering over architectural inefficiency with raw capacity.

  32. dev.to — Claude Code tag TIER_1 Français(FR) · Michel Faure ·

    Six days, six seconds: a CI test against the semantic drift of an AI agent

    <p><a class="article-body-image-wrapper" href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fmrmmh12ksnvs4h7qnrww.png"><img alt="Strip BD — Françoise deman…

  33. dev.to — Claude Code tag TIER_1 · Michel Faure ·

    Six days, six seconds: a CI test against semantic-layer drift on an AI agent

    <p><a class="article-body-image-wrapper" href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fmrmmh12ksnvs4h7qnrww.png"><img alt="Comic strip — Françoise as…

  34. Towards AI TIER_1 · Subrat Pati ·

    Building the AI Memory Stack: Layered Storage, Async Extraction and Atomic Persistence

    <p>Every AI agent you build today can hold a conversation. It can reason, use tools, and chain together complex workflows. But the moment a session ends, everything disappears. The agent forgets who you are, what you were working on, and every preference it learned during the con…

  35. dev.to — MCP tag TIER_1 · Rumblingb ·

    Why Every AI Agent Needs Persistent Memory: Introducing Agent Memory MCP

    <h2> The Memory Problem in AI Agents </h2> <p>Modern LLMs are incredibly powerful, but they have a fundamental limitation: <strong>they forget everything between conversations</strong>. Every time you start a new session with an AI agent, it's like talking to someone with amnesia…

  36. dev.to — MCP tag TIER_1 · Gowtham S ·

    Building a Local Markdown Memory Layer for AI Agents

    <p>I kept running into the same problem with AI coding agents.</p> <p>The agents were getting better, but every new session still felt like starting<br /> from zero.</p> <p>I would explain the repo again. Then my preferences again. Then the decisions we<br /> already made. Then w…

  37. dev.to — MCP tag TIER_1 · Gowtham ·

    Building a Local Markdown Memory Layer for AI Agents

    <p>I kept running into the same problem with AI coding agents.</p> <p>The agents were getting better, but every new session still felt like starting<br /> from zero.</p> <p>I would explain the repo again. Then my preferences again. Then the decisions we<br /> already made. Then w…

  38. dev.to — LLM tag TIER_1 · Ken W Alger ·

    Engineering Agent Memory

    <h2>From Stateless Prompts to Persistent Intelligence</h2> <blockquote> <strong>Where this fits:</strong> This article bridges two series. It closes out the themes introduced in The Backyard Quarry — a data engineering exploration using physical objects as a teaching domain — and…

  39. Mastodon — fosstodon.org TIER_1 · [email protected] ·

    🧠 Graft provides a semantic memory system for AI agents that operates independently of large language models. The tool allows agents to store and retrieve infor

    🧠 Graft provides a semantic memory system for AI agents that operates independently of large language models. The tool allows agents to store and retrieve information based on meaning rather than exact text matching. 💬 Hacker News 🔗 https:// github.com/AEndrix03/Graft # AI # Mach…

  40. dev.to — LLM tag TIER_1 · vishalmysore ·

    ReasoningBank: Building AI Agents that Actually Learn from Experience

    <p>In the world of Large Language Models (LLMs), we often face a frustrating paradox: LLMs are incredibly capable at "reasoning" in the moment, but they are fundamentally <strong>stateless</strong>. Every time you start a new session, the agent has total amnesia. It doesn't remem…

  41. dev.to — LLM tag TIER_1 · Poniak Labs ·

    SubQ Model: Can Subquadratic Make Long-Context AI More Efficient?

    <p><em>Originally published on <a href="https://www.poniaktimes.com/subq-model-efficient-long-context-ai/" rel="noopener noreferrer">Poniak Times</a>. Reposted here for the developer and AI engineering community.</em></p> <p>Subquadratic’s SubQ model claims to make long-context A…

  42. dev.to — LLM tag TIER_1 · Jonathanfarrow ·

    The 10 Best AI Memory Layers for Agents in 2026

    <p>If you are building agents in 2026, you have already hit the wall. Bigger models do not fix forgetfulness. Context windows can grow forever, and the agent still cannot remember what a user told it last Tuesday, that the customer's address changed three months ago, or that a re…

  43. dev.to — LLM tag TIER_1 · 丁久 ·

    AI Agents Memory Patterns: Working, Episodic, Semantic, and Reflective Memory

    <blockquote> <p><em>This article was originally published on <a href="https://dingjiu1989-hue.github.io/en/ai/ai-agents-memory-patterns.html" rel="noopener noreferrer">AI Study Room</a>. For the full version with working code examples and related articles, visit the original post…

  44. dev.to — LLM tag TIER_1 Español(ES) · Tirso García ·

    Building Kernel Memory Protocol: Navigable Memory for AI Agents

    <blockquote> <p>English version: <a href="https://dev.to/tirsogarcia/building-kernel-memory-protocol-navigable-memory-for-ai-agents-315j">Building Kernel Memory Protocol: Navigable Memory for AI Agents</a></p> </blockquote> <p>El problema de muchos agentes de IA no es que les fal…

  45. dev.to — LLM tag TIER_1 · Tirso García ·

    Building Kernel Memory Protocol: Navigable Memory for AI Agents

    <blockquote> <p>Versión en español: <a href="https://dev.to/tirsogarcia/construyendo-kernel-memory-protocol-memoria-navegable-para-agentes-de-ia-24lc">Construyendo Kernel Memory Protocol: memoria navegable para agentes de IA</a></p> </blockquote> <p>The hard part with many AI age…

  46. dev.to — LLM tag TIER_1 · tokozen ·

    How Agentic Search Actually Works: The Research Loop Link-Fetching Agents Miss

    <h1> How Agentic Search Actually Works: The Research Loop Link-Fetching Agents Miss </h1> <p>Most agent tutorials show you the same pattern: take a user query, call a search API, grab the top result, stuff the text into your prompt. Done. Ship it.</p> <p>That works fine for trivi…