English(EN) Adaptive and Fine-grained Module-wise Expert Pruning for Efficient LoRA-MoE Fine-Tuning

新的DMEP框架剪枝LoRA-MoE专家以提高效率和准确性

作者 PulseAugur 编辑部 · [2 个来源] · 2026-04-29 06:45

研究人员开发了一个名为DMEP的新框架，用于高效微调LoRA-MoE模型。该方法在每个模块的基础上动态剪枝低效专家，从而创建更紧凑和专业化的模型结构。通过在初始训练后移除负载均衡约束，DMEP允许剩余专家进一步专业化。实验表明，DMEP可将可训练参数减少高达43%，并将训练吞吐量提高约10%，同时保持准确性。 AI

影响减少了LoRA-MoE模型的可训练参数并提高了训练效率，可能降低微调成本。

排序理由这是一篇研究论文，详细介绍了一种高效微调现有模型架构的新方法。

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.LG TIER_1 English(EN) · Weihang Li, Jianchun Liu, Hongli Xu · 2026-04-30 04:00

Adaptive and Fine-grained Module-wise Expert Pruning for Efficient LoRA-MoE Fine-Tuning

arXiv:2604.26340v1 Announce Type: new Abstract: LoRA-MoE has emerged as an effective paradigm for parameter-efficient fine-tuning, combining the low training cost of LoRA with the increased adaptation capacity of Mixture-of-Experts (MoE). However, existing LoRA-MoE frameworks typ…
arXiv cs.LG TIER_1 English(EN) · Hongli Xu · 2026-04-29 06:45

Adaptive and Fine-grained Module-wise Expert Pruning for Efficient LoRA-MoE Fine-Tuning

LoRA-MoE has emerged as an effective paradigm for parameter-efficient fine-tuning, combining the low training cost of LoRA with the increased adaptation capacity of Mixture-of-Experts (MoE). However, existing LoRA-MoE frameworks typically adopt a fixed and uniform expert configur…

报道来源 [2]

Adaptive and Fine-grained Module-wise Expert Pruning for Efficient LoRA-MoE Fine-Tuning

Adaptive and Fine-grained Module-wise Expert Pruning for Efficient LoRA-MoE Fine-Tuning

相关实体

相关话题