On the Cost and Benefit of Chain of Thought: A Learning-Theoretic Perspective
Researchers have developed a new learning-theoretic framework to understand Chain of Thought (CoT) reasoning in AI models. This framework models CoT as an interaction between an answer map and a chain rule that generates intermediate questions. The framework decomposes the reasoning risk into two components: the benefit of CoT (oracle-trajectory risk) and the cost of CoT (trajectory-mismatch risk) due to error accumulation. AI
IMPACT Provides a theoretical understanding of Chain of Thought, potentially guiding future model development for more reliable reasoning.