ENTITY SWE-bench Lite

SWE-bench Lite

PulseAugur coverage of SWE-bench Lite — every cluster mentioning SWE-bench Lite across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

9 over 90d

Releases · 30d

0 over 90d

Papers · 30d

8 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

2 day(s) with sentiment data

RECENT · PAGE 1/1 · 9 TOTAL

RESEARCH · CL_158657 · Jul 23 · 04:00

New methods compress LLM agent context for improved security and efficiency

Researchers have developed new methods for compressing context in large language model (LLM) agents to improve efficiency and security. One approach, "Twin Agent," separates agents into an "Explore Agent" for untrusted …
TOOL · CL_123146 · Jul 3 · 04:00

BLAgent framework enhances file-level bug localization with agentic RAG

Researchers have developed BLAgent, a novel agentic retrieval-augmented generation (RAG) framework designed to improve file-level bug localization in software maintenance. BLAgent integrates code structure-aware encodin…
TOOL · CL_122986 · Jul 3 · 04:00

New AI memory layer ContextSniper cuts token use for code repair

Researchers have developed ContextSniper, a new token-efficient memory layer for AntTrail's AI agent designed to improve repository-level program repair. ContextSniper precisely selects and ranks code and runtime eviden…
RESEARCH · CL_107786 · Jun 23 · 17:05

New SHERLOC framework boosts LLM code repair efficiency and accuracy

Researchers have developed SHERLOC, a novel framework designed to improve the efficiency and accuracy of Large Language Model (LLM) agents in code repair tasks. This training-free framework utilizes a reasoning LLM with…
TOOL · CL_105034 · Jun 18 · 13:56

Multi-agent LLM system Phoenix automates GitHub issue resolution

Researchers have developed Phoenix, a multi-agent LLM system designed to automatically resolve GitHub issues. The system utilizes six specialized agents, including a planner, coder, and tester, to manage the process fro…
TOOL · CL_99538 · Jun 18 · 13:56

Phoenix LLM system automates GitHub issue resolution with safety controls

A new multi-agent LLM system named Phoenix has been developed to automate the resolution of GitHub issues, from initial triage to the creation of pull requests. This system incorporates seven layers of safety controls a…
RESEARCH · CL_84467 · Jun 10 · 06:01

New Autopilot firewall drastically cuts LLM agent fabrication

Researchers have developed a new execution model called Autopilot designed to prevent large language model agents from fabricating success when operating without human supervision. This system acts as a firewall by exte…
RESEARCH · CL_62821 · Jun 1 · 04:00

AI agents evaluated for goal-directedness and state binding

Two new research papers explore the internal workings and evaluation of language agents. The first paper introduces a "causal state binding" framework to assess if agents' actions are truly driven by relevant internal s…
TOOL · CL_20730 · May 7 · 04:00

ARISE toolset enhances AI agents for code fault localization and repair

Researchers have developed ARISE, a new system designed to improve the accuracy of AI agents in localizing and repairing software faults. ARISE enhances large language models by providing a detailed program graph that i…

New methods compress LLM agent context for improved security and efficiency

BLAgent framework enhances file-level bug localization with agentic RAG

New AI memory layer ContextSniper cuts token use for code repair

New SHERLOC framework boosts LLM code repair efficiency and accuracy

Multi-agent LLM system Phoenix automates GitHub issue resolution

Phoenix LLM system automates GitHub issue resolution with safety controls

New Autopilot firewall drastically cuts LLM agent fabrication

AI agents evaluated for goal-directedness and state binding

ARISE toolset enhances AI agents for code fault localization and repair