A new study on arXiv investigates the gap between simulated and real-world developer behavior for proactive coding assistants. Researchers collected data from 1,246 industry developers using a custom Visual Studio Code extension over three days. Their findings indicate that simulated traces do not accurately reflect real development patterns, potentially overestimating the performance of current AI assistants. The study also introduces ProCodeBench, a benchmark based on real-world data, and suggests that while simulated data can complement real data, it cannot replace it for training and evaluation. AI
影响 Highlights the critical need for real-world data in developing and evaluating AI coding assistants, potentially impacting future tool development.
排序理由 The cluster contains an academic paper detailing an empirical study and a new benchmark for AI coding assistants. [lever_c_demoted from research: ic=1 ai=1.0]
AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →