English(EN) Beyond Layer Importance in Layer-wise Sparsity: An Inter-Layer Perturbation-Absorption Perspective

新方法通过层吸收洞察改进大型语言模型剪枝

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-16 04:00

研究人员开发了一种名为“吸收感知校正”的新方法，通过剪枝来提高大型语言模型（LLMs）的效率。该技术描述了LLMs中的不同层如何响应扰动，发现早期层倾向于放大扰动，而后期层则倾向于吸收扰动。通过结合这种理解，该方法增强了现有的剪枝技术，如OWL和AlphaPruning，在70%稀疏度下，跨不同模型家族，困惑度降低了7.13%，零样本准确率提高了1.02%。 AI

影响通过改进剪枝技术来提高LLM效率，从而在更高的稀疏度水平下获得更好的性能。

排序理由在arXiv上发表的学术论文，详细介绍了一种新的LLM剪枝方法。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Tao Jing, Ningxin Wu, Chen Kang, Dong Yu, Changliang Li, Pengyuan Liu · 2026-06-16 04:00

Beyond Layer Importance in Layer-wise Sparsity: An Inter-Layer Perturbation-Absorption Perspective

arXiv:2606.15161v1 Announce Type: new Abstract: The considerable layer-wise redundancy in large language models (LLMs) has established non-uniform sparsity allocation across layers as the standard pruning approach for efficient compression. Existing layer-wise allocation methods …

报道来源 [1]

Beyond Layer Importance in Layer-wise Sparsity: An Inter-Layer Perturbation-Absorption Perspective

相关实体

相关话题