A new deep research agent called Code Researcher has been developed to tackle complex systems code by analyzing large codebases and their commit histories. This agent significantly outperforms existing methods on benchmarks like kBenchSyz, achieving a 48% crash-resolution rate with GPT-4o and even higher rates with Gemini 2.5-Flash. The research highlights the critical role of gathering extensive global context and employing multi-faceted reasoning for effective code modification in large systems.
Summary written by gemini-2.5-flash-lite from 1 source.
IMPACT New agent significantly improves code repair rates, potentially accelerating software development and maintenance.
RANK_REASON Academic paper detailing a new AI agent for code analysis and patch generation.