Researchers have developed a multi-agent large language model that learns to defer to human input. The model is trained using GRPO on a reward system that accounts for costs, and each instance of deferral is used as supervised fine-tuning data. This allows the model to gradually incorporate human expertise, with a tunable cost parameter enabling a trade-off between accuracy and the budget for human intervention during deployment. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
IMPACT Introduces a novel training methodology for multi-agent LLMs, enabling adaptive collaboration with human experts.
RANK_REASON The cluster describes a novel research paper detailing a new method for training multi-agent LLMs. [lever_c_demoted from research: ic=1 ai=1.0]