Entity
SWE-QA
PulseAugur coverage of SWE-QA — every cluster mentioning SWE-QA across labs, papers, and developer communities, ranked by signal.
Total · 30 days
2
2 in the last 90 days
Releases · 30 days
0
0 in the last 90 days
Papers · 30 days
2
2 in the last 90 days
Tier distribution · 90 days
Sentiment · 30 days
1 day with sentiment data
Recent · Page 1 of 1 · 2 items
-
New LaMR framework prunes code context for LLM agents
Researchers have developed a new framework called LaMR (Latent Multi-Rubric) to improve the efficiency of LLM-powered coding agents. Current agents often waste token budgets on irrelevant code snippets, but LaMR address…
-
SWE-QA benchmark tests LLMs on repository-level code questions
Researchers have introduced SWE-QA, a new benchmark designed to evaluate language models' ability to answer questions about entire software repositories. This benchmark addresses limitations of previous datasets by focu…