PulseAugur
LIVE 12:26:31
research · [2 sources] ·
0
research

Towards Long-horizon Agentic Multimodal Search

Two new research papers explore advancements in agentic search, focusing on how AI agents interact with information over extended sessions. The first paper analyzes over 14 million real search requests to understand user intents and query reformulation patterns, revealing that most sessions are short and query terms often trace back to retrieved evidence. The second paper introduces a framework for long-horizon multimodal search, addressing challenges of context management and token costs by using file-based visual representations and on-demand loading, achieving state-of-the-art results on complex benchmarks. AI

Summary written by None from 2 sources. How we write summaries →

IMPACT These papers offer insights into improving AI agent efficiency and capability in complex information-seeking tasks, potentially leading to more effective search tools.

RANK_REASON Two academic papers published on arXiv detailing new methods and analyses for agentic search systems.

Read on arXiv cs.CV →

COVERAGE [2]

  1. arXiv cs.CL TIER_1 · Jingjie Ning, Jo\~ao Coelho, Yibo Kong, Yunfan Long, Bruno Martins, Jo\~ao Magalh\~aes, Jamie Callan, Chenyan Xiong ·

    Agentic Search in the Wild: Intents and Trajectory Dynamics from 14M+ Real Search Requests

    arXiv:2601.17617v3 Announce Type: replace-cross Abstract: LLM-powered search agents are increasingly being used for multi-step information seeking tasks, yet the IR community lacks empirical understanding of how agentic search sessions unfold and how retrieved evidence is reflect…

  2. arXiv cs.CV TIER_1 · Yifan Du, Zikang Liu, Jinbiao Peng, Jie Wu, Junyi Li, Jinyang Li, Wayne Xin Zhao, Ji-Rong Wen ·

    Towards Long-horizon Agentic Multimodal Search

    arXiv:2604.12890v2 Announce Type: replace Abstract: Multimodal deep search agents have shown great potential in solving complex tasks by iteratively collecting textual and visual evidence. However, managing the heterogeneous information and high token costs associated with multim…