Brief

last 24h

[2/2] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

RESEARCH · Hugging Face Daily Papers English(EN) · 6d · [3 sources]

Trajectory-Refined Distillation

Researchers have introduced Trajectory-Refined Distillation (TRD), a new method to improve the post-training process for large language models. TRD addresses a problem called "prefix failure" in on-policy distillation, where dense per-token supervision leads to fragmented gradients. By correcting student model rollouts at the trajectory level before distillation, TRD mitigates this issue and enhances exploration. The method has demonstrated consistent performance improvements across various benchmarks and model scales. AI

IMPACT Enhances LLM reasoning and accuracy by refining distillation techniques.
TOOL · arXiv cs.AI English(EN) · 1w

Constitutional On-Policy Safe Distillation

Researchers have developed a new method called Constitutional On-Policy Safe Distillation (COPSD) to improve the safety and helpfulness of AI models. Existing on-policy self-distillation techniques can lead to a collapse in performance, particularly in reasoning tasks, by overly contracting the model's responses towards conservative outputs. COPSD addresses this by first calibrating the teacher model and then performing distillation conditioned on high-level constitutions, resulting in a better safety-helpfulness trade-off without significantly sacrificing general reasoning abilities. AI

IMPACT Introduces a novel technique to improve AI safety and helpfulness, potentially leading to more reliable and less biased AI systems.

Brief

Trajectory-Refined Distillation

Constitutional On-Policy Safe Distillation