code generation
PulseAugur coverage of code generation — every cluster mentioning code generation across labs, papers, and developer communities, ranked by signal.
1 day(s) with sentiment data
-
New method improves LLM code generation uncertainty estimation
Researchers have developed a new method for estimating uncertainty in code generated by large language models, addressing the risks associated with silently incorrect code. The approach, detailed in a new paper, recogni…
-
New methods refine LLM fine-tuning for better performance
Researchers have developed new methods to improve supervised fine-tuning (SFT) for large language models. One approach, FisherAdapTune, uses the Fisher information geometry to dynamically select parameter groups for ada…
-
New research explores advanced RL for LLM code generation
Three new research papers explore advanced reinforcement learning techniques for improving large language models (LLMs) in code generation. One paper introduces offline reinforcement learning to leverage existing code d…
-
AI-generated code security remains a concern despite advanced prompting
New research indicates that while advanced prompting techniques can influence the types of security vulnerabilities present in AI-generated code, they do not reliably reduce the overall number or severity of these issue…
-
Luddite computer scientist questions AI code generation methods
An organic computer scientist and self-proclaimed Luddite, Anthony, argues that gradient ascent is a superior method to generate-and-test for software development, especially when dealing with potentially harmful outcom…
-
Pass-rate rewards fail to boost AI code generation, study finds
A new research paper explores the effectiveness of using pass-rate rewards in reinforcement learning for code generation tasks. The study found that while pass-rate rewards can alleviate the issue of sparse rewards, the…
-
New research tackles LLM factuality, safety, and complex task performance
Researchers are developing new methods to improve the reliability and safety of large language models (LLMs). Google Research introduced SLED, a decoding strategy that uses all LLM layers to enhance factual accuracy wit…