tool · [1 source] · 2026-05-22 04:00

New ProxyCoT framework enhances LLM long-context reasoning

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

Researchers have developed a new training framework called ProxyCoT to improve the long-context reasoning abilities of large language models. This method transfers reasoning capabilities from shorter "proxy" contexts to full, extended contexts. By first generating high-quality reasoning traces on proxy contexts and then fine-tuning on full contexts, ProxyCoT has demonstrated consistent performance improvements over existing baselines with lower computational costs. The models trained using this approach also show better generalization to out-of-domain tasks. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT Enhances LLM performance on complex, long-context tasks, potentially improving applications requiring deep understanding of extensive data.

RANK_REASON The cluster contains an academic paper detailing a new method for improving LLM reasoning. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

COVERAGE [1]

arXiv cs.AI TIER_1 · Miao Li, Irina Saparina, Alexander Gurung, Mirella Lapata · 2026-05-22 04:00

Long-Context Reasoning Through Proxy-Based Chain-of-Thought Tuning

arXiv:2605.20201v1 Announce Type: cross Abstract: Recent large language models support inputs of up to 10 million tokens, yet they perform poorly on long-context tasks that require complex reasoning. Such tasks can be solved using only a subset of the input -- a proxy context -- …

COVERAGE [1]

Long-Context Reasoning Through Proxy-Based Chain-of-Thought Tuning

RELATED ENTITIES

RELATED TOPICS