ENTITY
SWE-bench Lite
PulseAugur coverage of SWE-bench Lite — every cluster mentioning SWE-bench Lite across labs, papers, and developer communities, ranked by signal.
Total · 30d: 2 (2 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 2 (2 over 90d)
SENTIMENT · 30D
1 day with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
- AI agents evaluated for goal-directedness and state binding
Two new research papers explore the internal workings and evaluation of language agents. The first paper introduces a "causal state binding" framework to assess if agents' actions are truly driven by relevant internal s…
- ARISE toolset enhances AI agents for code fault localization and repair
Researchers have developed ARISE, a new system designed to improve the accuracy of AI agents in localizing and repairing software faults. ARISE enhances large language models by providing a detailed program graph that i…