Entity: SWE-QA

SWE-QA

PulseAugur coverage of SWE-QA — every cluster mentioning SWE-QA across labs, papers, and developer communities, ranked by signal.

Total · 30 days

2

2 in the past 90 days

Releases · 30 days

0

0 in the past 90 days

Papers · 30 days

2

2 in the past 90 days

Tier distribution · 90 days

Sentiment · 30 days

1 day with sentiment data

Recent · Page 1 of 1 · 2 items

TOOL · CL_36579 · May 14 · 18:30

New LaMR framework prunes code context for LLM agents

Researchers have developed a new framework called LaMR (Latent Multi-Rubric) to improve the efficiency of LLM-powered coding agents. Current agents often waste token budgets on irrelevant code snippets, but LaMR address…

RESEARCH · CL_06686 · Apr 28 · 04:00

SWE-QA benchmark tests LLMs on repository-level code questions

Researchers have introduced SWE-QA, a new benchmark designed to evaluate language models' ability to answer questions about entire software repositories. This benchmark addresses limitations of previous datasets by focu…