OpenPangu LLM 量化在 Ascend NPU 上的研究：8 位无损，4 位导致 1B 模型性能下降

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-29 04:00

一项新研究调查了在 Ascend NPU 上部署 OpenPangu 大型语言模型时，各种训练后量化方法的有效性。研究人员发现，8 位仅权重量化对于 1B 和 7B 参数模型几乎是无损的。然而，4 位量化在 1B 模型上表现出更显著的性能下降，尤其是在推理和编码任务中，而对于 7B 模型则仍然可行。研究还强调了超低精度量化的挑战，大多数 2 位和二值化设置导致性能接近随机。 AI

影响为选择 OpenPangu 量化设置提供了面向 NPU 的精度图，有助于高效的国内 LLM 部署。

排序理由该集群包含一篇详细介绍模型量化技术实证研究的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

OpenPangu LLM 量化在 Ascend NPU 上的研究：8 位无损，4 位导致 1B 模型性能下降

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Tong Shi, Jiacheng Wang, Hui Xie, Ying Li, Aishan Liu, Jinyang Guo, Xianglong Liu · 2026-06-29 04:00

An Empirical Study of OpenPangu Quantization on Ascend NPUs

arXiv:2606.21257v2 Announce Type: replace-cross Abstract: OpenPangu models are attractive targets for private and domestic large-language-model deployment, yet their robustness under aggressive post-training quantization on Ascend NPUs has not been systematically characterized. T…

报道来源 [1]

An Empirical Study of OpenPangu Quantization on Ascend NPUs

相关实体

相关话题