PulseAugur
实时 07:05:27
English(EN) Automating code optimization with LLMs

大型语言模型通过新技术在代码编辑、生成和错误检测方面取得进展

研究人员正在探索各种方法来增强大型语言模型(LLM)在代码相关任务中的应用。一项研究评估了本地部署的 LLM,如 LLaMA 3.2Mistral,用于 Python 错误检测,发现它们可以识别错误但难以精确定位。另一篇论文介绍了 TreeCoder,一个通过将解码策略和约束视为可优化组件来优化 LLM 代码生成的框架,提高了在 MBPPSQL-Spider 等基准测试上的准确性。此外,宝马(BMW)的一项案例研究表明,微调 Qwen2.5-CoderDeepSeek-Coder 等 LLM 可以跨多个文件生成和修改企业领域特定语言。最后,一种名为 CAT 的新方法使用调用链感知来改进基于 LLM 的 Java 项目单元测试生成,显著提高了代码覆盖率。 AI

影响 LLM 代码生成和分析技术的进步可能带来更强大、更高效的软件开发工具。

排序理由 多篇 arXiv 论文详细介绍了 LLM 在代码相关任务上的新研究和评估。

在 Practical AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 19 个来源。 我们如何撰写摘要 →

大型语言模型通过新技术在代码编辑、生成和错误检测方面取得进展

报道来源 [19]

  1. Hugging Face Blog TIER_1 English(EN) ·

    StarCoder: A State-of-the-Art LLM for Code

  2. arXiv cs.CL TIER_1 English(EN) · Wei Cheng, Yongchang Cao, Chen Shen, Binhua Li, Jue Chen, Yongbin Li, Wei Hu ·

    To Diff or Not to Diff? Structure-Aware and Adaptive Output Formats for Efficient LLM-based Code Editing

    arXiv:2604.27296v1 Announce Type: cross Abstract: Large Language Models (LLMs) are increasingly used for code editing, yet the prevalent full-code generation paradigm suffers from severe efficiency bottlenecks, posing challenges for interactive coding assistants that demand low l…

  3. arXiv cs.AI TIER_1 English(EN) · Rongliang Fu, Yi Liu, Qiang Xu, Tsung-Yi Ho ·

    MappingEvolve: LLM-Driven Code Evolution for Technology Mapping

    arXiv:2604.26591v1 Announce Type: cross Abstract: Technology mapping is a critical yet challenging stage in logic synthesis. While Large Language Models (LLMs) have been applied to generate optimization scripts, their potential for core algorithm enhancement remains untapped. We …

  4. arXiv cs.CL TIER_1 English(EN) · Wei Hu ·

    To Diff or Not to Diff? Structure-Aware and Adaptive Output Formats for Efficient LLM-based Code Editing

    Large Language Models (LLMs) are increasingly used for code editing, yet the prevalent full-code generation paradigm suffers from severe efficiency bottlenecks, posing challenges for interactive coding assistants that demand low latency and cost. Despite the predominant focus on …

  5. arXiv cs.AI TIER_1 English(EN) · Tsung-Yi Ho ·

    MappingEvolve: LLM-Driven Code Evolution for Technology Mapping

    Technology mapping is a critical yet challenging stage in logic synthesis. While Large Language Models (LLMs) have been applied to generate optimization scripts, their potential for core algorithm enhancement remains untapped. We introduce MappingEvolve, an open-source framework …

  6. arXiv cs.LG TIER_1 English(EN) · Fernando Reitich ·

    Correction and Corruption: A Two-Rate View of Error Flow in LLM Protocols

    arXiv:2604.18245v2 Announce Type: replace Abstract: Large language models are increasingly deployed as protocols: structured multi-call procedures that spend additional computation to transform a baseline answer into a final one. These protocols are evaluated only by end-to-end a…

  7. arXiv cs.AI TIER_1 English(EN) · Amal Akli, Mike Papadakis, Maxime Cordy, Yves Le Traon ·

    Defective Task Descriptions in LLM-Based Code Generation: Detection and Analysis

    arXiv:2604.24703v1 Announce Type: cross Abstract: Large language models are widely used for code generation, yet they rely on an implicit assumption that the task descriptions are sufficiently detailed and well-formed. However, in practice, users may provide defective description…

  8. arXiv cs.AI TIER_1 English(EN) · Sivajeet Chand, Kevin Nguyen, Peter Kuntz, Alexander Pretschner ·

    Leveraging LLMs for Multi-File DSL Code Generation: An Industrial Case Study

    arXiv:2604.24678v1 Announce Type: cross Abstract: Large language models (LLMs) perform strongly on general-purpose code generation, yet their applicability to enterprise domain-specific languages (DSLs) remains underexplored, especially for repository-scale change generation span…

  9. arXiv cs.LG TIER_1 English(EN) · Jelena Ili\'c Vuli\'cevi\'c ·

    An Empirical Evaluation of Locally Deployed LLMs for Bug Detection in Python Code

    arXiv:2604.23361v1 Announce Type: cross Abstract: Large language models (LLMs) have demonstrated strong performance on a wide range of software engineering tasks, including code generation and analysis. However, most prior work relies on cloud-based models or specialized hardware…

  10. arXiv cs.AI TIER_1 English(EN) · Yves Le Traon ·

    Defective Task Descriptions in LLM-Based Code Generation: Detection and Analysis

    Large language models are widely used for code generation, yet they rely on an implicit assumption that the task descriptions are sufficiently detailed and well-formed. However, in practice, users may provide defective descriptions, which can have a strong effect on code correctn…

  11. arXiv cs.AI TIER_1 English(EN) · Alexander Pretschner ·

    Leveraging LLMs for Multi-File DSL Code Generation: An Industrial Case Study

    Large language models (LLMs) perform strongly on general-purpose code generation, yet their applicability to enterprise domain-specific languages (DSLs) remains underexplored, especially for repository-scale change generation spanning multiple files and folder structures from a s…

  12. arXiv cs.LG TIER_1 English(EN) · Henrijs Princis, Arindam Sharma, Cristina David ·

    TreeCoder: Systematic Exploration and Optimisation of Decoding and Constraints for LLM Code Generation

    arXiv:2511.22277v2 Announce Type: replace Abstract: Large language models (LLMs) have shown remarkable ability to generate code, yet their outputs often violate syntactic or semantic constraints when guided only through natural language prompts. We introduce TreeCoder, the most g…

  13. arXiv cs.AI TIER_1 English(EN) · Guancheng Wang, Qinghua Xu, Lionel C. Briand, Zhaoqiang Guo, Kui Liu ·

    Call-Chain-Aware LLM-Based Test Generation for Java Projects

    arXiv:2604.22046v1 Announce Type: cross Abstract: Large language models (LLMs) have recently shown strong potential for generating project-level unit tests. However, existing state-of-the-art approaches primarily rely on execution-path information to guide prompt construction, wh…

  14. arXiv cs.AI TIER_1 English(EN) · Kui Liu ·

    Call-Chain-Aware LLM-Based Test Generation for Java Projects

    Large language models (LLMs) have recently shown strong potential for generating project-level unit tests. However, existing state-of-the-art approaches primarily rely on execution-path information to guide prompt construction, which is often insufficient for complex software sys…

  15. arXiv cs.AI TIER_1 English(EN) · Srinath Perera ·

    DryRUN: On the Role of Public Tests in LLM-Driven Code Generation

    Multi-agent frameworks are widely used in autonomous code generation and have applications in complex algorithmic problem-solving. Recent work has addressed the challenge of generating functionally correct code by incorporating simulation-driven planning and debugging, where lang…

  16. Hugging Face Daily Papers TIER_1 English(EN) ·

    DryRUN: On the Role of Public Tests in LLM-Driven Code Generation

    Multi-agent frameworks are widely used in autonomous code generation and have applications in complex algorithmic problem-solving. Recent work has addressed the challenge of generating functionally correct code by incorporating simulation-driven planning and debugging, where lang…

  17. arXiv cs.CL TIER_1 English(EN) · Jakub Simko ·

    mcdok at SemEval-2026 Task 13: Finetuning LLMs for Detection of Machine-Generated Code

    Multi-domain detection of the machine-generated code snippets in various programming languages is a challenging task. SemEval-2026 Task~13 copes with this challenge in various angles, as a binary detection problem as well as attribution of the source. Specifically, its subtasks a…

  18. Practical AI TIER_1 English(EN) · Practical AI LLC ·

    Automating code optimization with LLMs

    <p>You might have heard a lot about code generation tools using AI, but could LLMs and generative AI make our existing code better? In this episode, we sit down with Mike from TurinTech to hear about practical code optimizations using AI “translation” of slow to fast code. We lea…

  19. Lobsters — AI tag TIER_1 English(EN) · arxiv.org via mpweiher ·

    Embarrassingly Simple Self-Distillation Improves Code Generation

    <p><a href="https://lobste.rs/s/bor4wy/embarrassingly_simple_self">Comments</a></p>