Researchers have developed a new training framework called ProxyCoT to improve the long-context reasoning abilities of large language models. This method transfers reasoning capabilities from shorter "proxy" contexts to full, extended contexts. By first generating high-quality reasoning traces on proxy contexts and then fine-tuning on full contexts, ProxyCoT has demonstrated consistent performance improvements over existing baselines with lower computational costs. The models trained using this approach also show better generalization to out-of-domain tasks. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
IMPACT Enhances LLM performance on complex, long-context tasks, potentially improving applications requiring deep understanding of extensive data.
RANK_REASON The cluster contains an academic paper detailing a new method for improving LLM reasoning. [lever_c_demoted from research: ic=1 ai=1.0]