New DRIFT method refines LLM training data for improved performance

By PulseAugur Editorial · [1 sources] · 2026-06-18 04:00

Researchers have developed DRIFT, a novel method for refining instruction data to improve the performance ceiling of large language models. Unlike existing data curation techniques that focus on subset selection, DRIFT aims to enhance the data distribution itself. It utilizes on-policy influence functions, leveraging the model's own rollouts as validation targets to address limitations like proximity gaps and gradient norm bias found in standard influence function formulations. Experiments with 7B-parameter models demonstrate that DRIFT effectively raises performance on instruction and reasoning tasks, outperforming current data curation baselines. AI

IMPACT This research could lead to more capable LLMs by improving the efficiency and effectiveness of training data curation.

RANK_REASON The cluster contains a research paper detailing a new method for improving LLM training data. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.AI TIER_1 English(EN) · Zefan Wang, Lincheng Li, Tianyu Yu, Yuan Yao · 2026-06-18 04:00

DRIFT: Refining Instruction Data via On-Policy Data Attribution

arXiv:2606.18307v1 Announce Type: cross Abstract: Optimizing the training data distribution for Supervised Fine-Tuning (SFT) dictates the capability of Large Language Models (LLMs). While existing data curation methods excel at accelerating training under constrained budgets, the…

COVERAGE [1]

DRIFT: Refining Instruction Data via On-Policy Data Attribution

RELATED ENTITIES

RELATED TOPICS