English(EN) Learning Fine-grained Parameter Sharing via Sparse Tensor Decomposition

新的FiPS框架以最小的精度损失压缩Transformer模型

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-26 04:00

研究人员开发了一个名为细粒度参数共享（FiPS）的新框架，用于压缩大型Transformer模型。FiPS在一个单一的优化过程中结合了跨块参数共享、低秩分解和稀疏性。该方法有效地减小了Vision Transformers（ViTs）和大型语言模型（LLMs）的尺寸，同时精度或性能损失极小，优于现有的压缩技术。 AI

影响这项研究为减小大型AI模型的尺寸提供了一种实用的方法，有可能在资源受限的设备上实现更广泛的部署。

排序理由该集群包含一篇详细介绍模型压缩新方法的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.LG TIER_1 English(EN) · Cem \"Uy\"uk, Mike Lasby, Mohamed Yassin, Utku Evci, Yani Ioannou · 2026-05-26 04:00

通过稀疏张量分解学习细粒度参数共享

arXiv:2411.09816v4 Announce Type: replace Abstract: Large neural networks achieve state-of-the-art performance on many tasks, yet their sheer size hinders deployment on resource-constrained devices. Among existing compression approaches, cross-layer parameter sharing remains rela…

报道来源 [1]

通过稀疏张量分解学习细粒度参数共享

相关实体

相关话题