PulseAugur
LIVE 16:56:32
research · [2 sources] ·
0
research

DRACULA dataset introduces user feedback on intermediate actions for research agents

Researchers have introduced DRACULA, a new dataset designed to improve deep research agents by capturing user feedback on intermediate actions. The dataset, collected from 19 expert CS researchers, includes over 8,000 action preferences and 5,000 execution judgments related to synthesizing research papers. Initial findings indicate that large language models struggle to predict user-preferred actions, especially when users have unstated goals, highlighting the need for better action feedback mechanisms for long-horizon agents. AI

Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →

IMPACT Provides a new dataset for training and evaluating AI agents on complex, multi-step tasks.

RANK_REASON The cluster describes a new dataset and paper published on arXiv.

Read on arXiv cs.CL →

COVERAGE [2]

  1. arXiv cs.CL TIER_1 · Nishant Balepur, Malachi Hamada, Varsha Kishore, Sergey Feldman, Amanpreet Singh, Pao Siangliulue, Joseph Chee Chang, Rachel Rudinger, Eunsol Choi, Jordan Lee Boyd-Graber, Doug Downey, Aakanksha Naik ·

    DRACULA: Hunting for the Actions Users Want Deep Research Agents to Execute

    arXiv:2604.23815v1 Announce Type: new Abstract: Scientific Deep Research (DR) agents answer user queries by synthesizing research papers into multi-section reports. User feedback can improve their utility, but existing protocols only score the final report, making it hard to stud…

  2. arXiv cs.CL TIER_1 · Aakanksha Naik ·

    DRACULA: Hunting for the Actions Users Want Deep Research Agents to Execute

    Scientific Deep Research (DR) agents answer user queries by synthesizing research papers into multi-section reports. User feedback can improve their utility, but existing protocols only score the final report, making it hard to study and learn which intermediate actions DR agents…