AgentHER framework boosts LLM agent training with failed trajectory relabeling

By PulseAugur Editorial · [1 sources] · 2026-04-28 04:00

Researchers have developed AgentHER, a new framework designed to improve the training of LLM agents by repurposing failed trajectories. The system adapts Hindsight Experience Replay to natural language, identifying alternative achievable goals within failed attempts. This method converts discarded data into valuable training material, significantly boosting agent performance and data efficiency across various model sizes. AI

IMPACT Enhances LLM agent training efficiency by leveraging failed trajectories, potentially improving performance on complex real-world tasks.

RANK_REASON Academic paper introducing a novel framework for LLM agent training.

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

AgentHER framework boosts LLM agent training with failed trajectory relabeling

COVERAGE [1]

arXiv cs.CL TIER_1 English(EN) · Liang Ding · 2026-04-28 04:00

AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling

arXiv:2603.21357v3 Announce Type: replace-cross Abstract: LLM agents fail on the majority of real-world tasks -- GPT-4o succeeds on fewer than 15% of WebArena navigation tasks and below 55% pass@1 on ToolBench (Zhou et al., 2024; Qin et al., 2024) -- yet every failed trajectory i…

COVERAGE [1]

AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling

RELATED ENTITIES

RELATED TOPICS