English(EN) Preference-aware Influence-function-based Data Selection Method for Efficient Fine-Tuning

新的PRISM方法通过偏好感知数据选择提升LLM微调效率

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-20 17:15

研究人员开发了一种新颖的PRISM方法，通过优先选择最能有效引导模型实现期望行为的数据样本，来高效微调大型语言模型。与之前平等对待所有目标示例的方法不同，PRISM根据当前模型的偏好对这些示例进行加权，从而创建更精确的目标表示。这使得PRISM能够将训练预算集中在最具影响力的Ландау数据上，从而在通用微调和面向安全的任务中都能提高性能。 AI

影响通过优化数据选择来提高LLM训练效率，可能降低计算成本并加速模型开发。

排序理由该集群包含一篇详细介绍LLM微调新方法的学术论文。

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.LG TIER_1 English(EN) · Jing Shao · 2026-05-20 17:15

Preference-aware Influence-function-based Data Selection Method for Efficient Fine-Tuning

As LLMs continue to scale, improving training efficiency increasingly depends on using data more effectively. Data selection addresses this problem by allocating a limited training budget to samples that best promote a target behavior. Existing methods usually represent the targe…

报道来源 [1]

Preference-aware Influence-function-based Data Selection Method for Efficient Fine-Tuning

相关实体

相关话题