Defects4J
PulseAugur coverage of Defects4J — every cluster mentioning Defects4J across labs, papers, and developer communities, ranked by signal.
2 day(s) with sentiment data
-
New LLM speeds up bug detection in software development
Researchers have developed a new multi-task large language model (LLM) called MLC designed for efficient line-level bug classification in software development. This model addresses the limitations of existing bug locali…
-
New framework enhances AI-generated software testing reliability
Researchers have developed a new framework called GATF to improve the reliability and transparency of AI-generated test artifacts in autonomous software testing. This framework addresses issues like hallucinations, comp…
-
LLMs show significant performance drops on transformed benchmarks, indicating memorization
Researchers have developed a new method combining metamorphic testing with negative log-likelihood to diagnose data leakage in large language models used for program repair. By creating variant benchmarks through semant…
-
LLMs advance code editing, generation, and bug detection with new techniques
Researchers are exploring various methods to enhance Large Language Models (LLMs) for code-related tasks. One study evaluates locally deployed LLMs like LLaMA 3.2 and Mistral for Python bug detection, finding they can i…