Automating code optimization with LLMs

New techniques advance large language models in code editing, generation, and bug detection

By the PulseAugur editorial team · [19 sources] · 2023-05-04 00:00

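Researchers are exploring a range of methods to improve how large language models (LLMs) handle code-related tasks. One study evaluated locally deployed LLMs such as LLaMA 3.2 and Mistral for Python bug detection and found that they can identify bugs but struggle to localize them precisely. Another paper introduces TreeCoder, a framework that optimizes LLM code generation by treating decoding strategies and constraints as optimizable components, improving accuracy on benchmarks such as MBPP and SQL-Spider. In addition, a case study at BMW shows that fine-tuning LLMs such as Qwen2.5-Coder and DeepSeek-Coder enables them to generate and modify an enterprise domain-specific language across multiple files. Finally, a new method called CAT uses call-chain awareness to improve LLM-based unit test generation for Java projects, significantly increasing code coverage.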

Impact: Advances in LLM code generation and analysis techniques could lead to more powerful and efficient software development tools.

Ranking rationale: Multiple arXiv papers detail new research on and evaluations of LLMs for code-related tasks.


AI-generated summary · Google Gemini · from 19 sources.

Sources [19]

Hugging Face Blog TIER_1 English(EN) · 2023-05-04 00:00

StarCoder: A State-of-the-Art LLM for Code
arXiv cs.CL TIER_1 English(EN) · Wei Cheng, Yongchang Cao, Chen Shen, Binhua Li, Jue Chen, Yongbin Li, Wei Hu · 2026-05-01 04:00

To Diff or Not to Diff? Structure-Aware and Adaptive Output Formats for Efficient LLM Code Editing

arXiv:2604.27296v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly used for code editing, yet the prevalent full-code generation paradigm suffers from severe efficiency bottlenecks, posing challenges for interactive coding assistants that demand low l…
arXiv cs.AI TIER_1 English(EN) · Rongliang Fu, Yi Liu, Qiang Xu, Tsung-Yi Ho · 2026-04-30 04:00

MappingEvolve: LLM-Driven Code Evolution for Technology Mapping

arXiv:2604.26591v1 Announce Type: cross Abstract: Technology mapping is a critical yet challenging stage in logic synthesis. While Large Language Models (LLMs) have been applied to generate optimization scripts, their potential for core algorithm enhancement remains untapped. We …
arXiv cs.CL TIER_1 English(EN) · Wei Hu · 2026-04-30 01:14

To Diff or Not to Diff? Structure-Aware and Adaptive Output Formats for Efficient LLM Code Editing

Large Language Models (LLMs) are increasingly used for code editing, yet the prevalent full-code generation paradigm suffers from severe efficiency bottlenecks, posing challenges for interactive coding assistants that demand low latency and cost. Despite the predominant focus on …
arXiv cs.AI TIER_1 English(EN) · Tsung-Yi Ho · 2026-04-29 12:17

MappingEvolve: LLM-Driven Code Evolution for Technology Mapping

Technology mapping is a critical yet challenging stage in logic synthesis. While Large Language Models (LLMs) have been applied to generate optimization scripts, their potential for core algorithm enhancement remains untapped. We introduce MappingEvolve, an open-source framework …
arXiv cs.LG TIER_1 English(EN) · Fernando Reitich · 2026-04-28 04:00

Correction and Corruption: A Two-Rate View of Error Flow in LLM Protocols

arXiv:2604.18245v2 Announce Type: replace Abstract: Large language models are increasingly deployed as protocols: structured multi-call procedures that spend additional computation to transform a baseline answer into a final one. These protocols are evaluated only by end-to-end a…
arXiv cs.AI TIER_1 English(EN) · Amal Akli, Mike Papadakis, Maxime Cordy, Yves Le Traon · 2026-04-28 04:00

Defective Task Descriptions in LLM-Driven Code Generation: Detection and Analysis

arXiv:2604.24703v1 Announce Type: cross Abstract: Large language models are widely used for code generation, yet they rely on an implicit assumption that the task descriptions are sufficiently detailed and well-formed. However, in practice, users may provide defective description…
arXiv cs.AI TIER_1 English(EN) · Sivajeet Chand, Kevin Nguyen, Peter Kuntz, Alexander Pretschner · 2026-04-28 04:00

Multi-File Domain-Specific Language Code Generation with Large Language Models: An Industrial Case Study

arXiv:2604.24678v1 Announce Type: cross Abstract: Large language models (LLMs) perform strongly on general-purpose code generation, yet their applicability to enterprise domain-specific languages (DSLs) remains underexplored, especially for repository-scale change generation span…
arXiv cs.LG TIER_1 English(EN) · Jelena Ilić Vulićević · 2026-04-28 04:00

An Empirical Evaluation of Locally Deployed LLMs for Bug Detection in Python Code

arXiv:2604.23361v1 Announce Type: cross Abstract: Large language models (LLMs) have demonstrated strong performance on a wide range of software engineering tasks, including code generation and analysis. However, most prior work relies on cloud-based models or specialized hardware…
arXiv cs.AI TIER_1 English(EN) · Yves Le Traon · 2026-04-27 17:07

Defective Task Descriptions in LLM-Driven Code Generation: Detection and Analysis

Large language models are widely used for code generation, yet they rely on an implicit assumption that the task descriptions are sufficiently detailed and well-formed. However, in practice, users may provide defective descriptions, which can have a strong effect on code correctn…
arXiv cs.AI TIER_1 English(EN) · Alexander Pretschner · 2026-04-27 16:38

Multi-File Domain-Specific Language Code Generation with Large Language Models: An Industrial Case Study

Large language models (LLMs) perform strongly on general-purpose code generation, yet their applicability to enterprise domain-specific languages (DSLs) remains underexplored, especially for repository-scale change generation spanning multiple files and folder structures from a s…
arXiv cs.LG TIER_1 English(EN) · Henrijs Princis, Arindam Sharma, Cristina David · 2026-04-27 04:00

TreeCoder: Systematic Exploration and Optimization of Decoding and Constraints for LLM Code Generation

arXiv:2511.22277v2 Announce Type: replace Abstract: Large language models (LLMs) have shown remarkable ability to generate code, yet their outputs often violate syntactic or semantic constraints when guided only through natural language prompts. We introduce TreeCoder, the most g…
arXiv cs.AI TIER_1 English(EN) · Guancheng Wang, Qinghua Xu, Lionel C. Briand, Zhaoqiang Guo, Kui Liu · 2026-04-27 04:00

Call-Chain-Aware LLM Test Case Generation for Java Projects

arXiv:2604.22046v1 Announce Type: cross Abstract: Large language models (LLMs) have recently shown strong potential for generating project-level unit tests. However, existing state-of-the-art approaches primarily rely on execution-path information to guide prompt construction, wh…
arXiv cs.AI TIER_1 English(EN) · Kui Liu · 2026-04-23 20:03

Call-Chain-Aware LLM Test Case Generation for Java Projects

Large language models (LLMs) have recently shown strong potential for generating project-level unit tests. However, existing state-of-the-art approaches primarily rely on execution-path information to guide prompt construction, which is often insufficient for complex software sys…
arXiv cs.AI TIER_1 English(EN) · Srinath Perera · 2026-04-23 12:21

DryRUN: The Role of Public Tests in LLM-Driven Code Generation

Multi-agent frameworks are widely used in autonomous code generation and have applications in complex algorithmic problem-solving. Recent work has addressed the challenge of generating functionally correct code by incorporating simulation-driven planning and debugging, where lang…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-04-23 12:21

DryRUN: The Role of Public Tests in LLM-Driven Code Generation

Multi-agent frameworks are widely used in autonomous code generation and have applications in complex algorithmic problem-solving. Recent work has addressed the challenge of generating functionally correct code by incorporating simulation-driven planning and debugging, where lang…
arXiv cs.CL TIER_1 English(EN) · Jakub Simko · 2026-04-23 07:29

mcdok at SemEval-2026 Task 13: Fine-Tuning LLMs to Detect Machine-Generated Code

Multi-domain detection of the machine-generated code snippets in various programming languages is a challenging task. SemEval-2026 Task 13 copes with this challenge in various angles, as a binary detection problem as well as attribution of the source. Specifically, its subtasks a…
Practical AI TIER_1 English(EN) · Practical AI LLC · 2023-08-29 21:30

Automating code optimization with LLMs

You might have heard a lot about code generation tools using AI, but could LLMs and generative AI make our existing code better? In this episode, we sit down with Mike from TurinTech to hear about practical code optimizations using AI “translation” of slow to fast code. We lea…
Lobsters — AI tag TIER_1 English(EN) · arxiv.org via mpweiher · 2026-04-04 13:34

Embarrassingly Simple Self-Distillation Improves Code Generation

Comments: https://lobste.rs/s/bor4wy/embarrassingly_simple_self
