English(EN) CODEBLOCK: Learning to Supervise Code at the Right Granularity

新的CodeBlock框架通过结构感知增强代码LLM的监督

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-18 04:00

研究人员开发了CodeBlock，一种用于微调代码大型语言模型（LLM）的新型框架，该框架侧重于在结构层面而非逐个标记的层面监督代码。这种方法将代码响应划分为语法上连贯的“代码项”，并优先处理那些对学习最有信息量的代码项，利用数据流信号识别关键依赖关系。实验表明，与传统方法相比，CodeBlock在代码生成基准测试上的性能显著提高，同时仅使用了监督标记的一小部分。 AI

影响这种结构化监督方法可能导致更高效、更有效的代码生成模型训练，从而提高开发人员的生产力。

排序理由该集群包含一篇详细介绍训练代码LLM新方法的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.LG TIER_1 English(EN) · Zhijie Deng, Ling Li, Jinlong Pang, Kaiqin Hu, Qi Xuan, Zhaowei Zhu, Jiaheng Wei · 2026-06-18 04:00

CODEBLOCK: Learning to Supervise Code at the Right Granularity

arXiv:2606.18286v1 Announce Type: new Abstract: Supervised fine-tuning of code LLMs typically applies uniform cross-entropy loss to all response tokens, implicitly assuming that every token provides equally useful learning signal. Recent token-level selection methods challenge th…

报道来源 [1]

CODEBLOCK: Learning to Supervise Code at the Right Granularity

相关实体

相关话题