PulseAugur
实时 20:23:57
English(EN) Adaptive and Fine-grained Module-wise Expert Pruning for Efficient LoRA-MoE Fine-Tuning

新的DMEP框架剪枝LoRA-MoE专家以提高效率和准确性

研究人员开发了一个名为DMEP的新框架,用于高效微调LoRA-MoE模型。该方法在每个模块的基础上动态剪枝低效专家,从而创建更紧凑和专业化的模型结构。通过在初始训练后移除负载均衡约束,DMEP允许剩余专家进一步专业化。实验表明,DMEP可将可训练参数减少高达43%,并将训练吞吐量提高约10%,同时保持准确性。 AI

影响 减少了LoRA-MoE模型的可训练参数并提高了训练效率,可能降低微调成本。

排序理由 这是一篇研究论文,详细介绍了一种高效微调现有模型架构的新方法。

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

新的DMEP框架剪枝LoRA-MoE专家以提高效率和准确性

报道来源 [2]

  1. arXiv cs.LG TIER_1 English(EN) · Weihang Li, Jianchun Liu, Hongli Xu ·

    Adaptive and Fine-grained Module-wise Expert Pruning for Efficient LoRA-MoE Fine-Tuning

    arXiv:2604.26340v1 Announce Type: new Abstract: LoRA-MoE has emerged as an effective paradigm for parameter-efficient fine-tuning, combining the low training cost of LoRA with the increased adaptation capacity of Mixture-of-Experts (MoE). However, existing LoRA-MoE frameworks typ…

  2. arXiv cs.LG TIER_1 English(EN) · Hongli Xu ·

    Adaptive and Fine-grained Module-wise Expert Pruning for Efficient LoRA-MoE Fine-Tuning

    LoRA-MoE has emerged as an effective paradigm for parameter-efficient fine-tuning, combining the low training cost of LoRA with the increased adaptation capacity of Mixture-of-Experts (MoE). However, existing LoRA-MoE frameworks typically adopt a fixed and uniform expert configur…