PulseAugur
实时 22:35:05

CRAFT方法加速了序列到序列模型的训练数据选择

研究人员开发了一种名为CRAFT(Clustered Regression for Adaptive Filtering of Training data)的新方法,用于高效地为序列到序列模型选择高质量的训练数据子集。该方法分解了联合源-目标分布,并使用两阶段选择过程来匹配验证分布并最小化预期距离。CRAFT在英-印翻译任务中表现出显著的改进,取得了比现有方法更高的BLEU分数,同时大大缩短了选择时间。 AI

影响 通过能够快速选择最优训练数据子集,加速了序列到序列模型的微调。

排序理由 关于训练数据选择新方法的学术论文。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

CRAFT方法加速了序列到序列模型的训练数据选择

报道来源 [2]

  1. arXiv cs.CL TIER_1 English(EN) · Parthasarathi Panda, Asheswari Swain, Subhrakanta Panda ·

    CRAFT: Clustered Regression for Adaptive Filtering of Training data

    arXiv:2604.22693v1 Announce Type: new Abstract: Selecting a small, high-quality subset from a large corpus for fine-tuning is increasingly important as corpora grow to tens of millions of datapoints, making full fine-tuning expensive and often unnecessary. We propose CRAFT (Clust…

  2. arXiv cs.CL TIER_1 English(EN) · Subhrakanta Panda ·

    CRAFT: Clustered Regression for Adaptive Filtering of Training data

    Selecting a small, high-quality subset from a large corpus for fine-tuning is increasingly important as corpora grow to tens of millions of datapoints, making full fine-tuning expensive and often unnecessary. We propose CRAFT (Clustered Regression for Adaptive Filtering of Traini…