ENTITY mathematical reasoning

mathematical reasoning

PulseAugur coverage of mathematical reasoning — every cluster mentioning mathematical reasoning across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

4 over 90d

Releases · 30d

0 over 90d

Papers · 30d

4 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

2 day(s) with sentiment data

RECENT · PAGE 1/1 · 4 TOTAL

RESEARCH · CL_115242 · Jun 26 · 05:25

New SMMD training method enhances numerical accuracy in LLMs

Researchers have developed a new training objective called Smooth Maximum Mean Discrepancy (SMMD) to improve the numerical precision of large language models (LLMs). Standard cross-entropy training treats numerical toke…
TOOL · CL_91401 · Jun 15 · 04:00

New LLM Reinforcement Learning Strategy Enhances Exploration

Researchers have introduced Deep Dense Exploration (DDE), a novel strategy designed to improve reinforcement learning for large language models. DDE focuses on exploring deep, recoverable states within unsuccessful traj…
TOOL · CL_40802 · May 19 · 12:37

Code does not improve LLM math reasoning; structured traces do

A new research paper explores the impact of code on mathematical reasoning in large language models. The study found that while code improves programming abilities, it does not generally enhance mathematical reasoning a…
RESEARCH · CL_20433 · May 6 · 15:31

New self-distillation methods enhance LLM reasoning and training stability

Two new papers explore advanced self-distillation techniques for large language models, aiming to improve reasoning and efficiency. The first paper introduces "Power Distribution Bridges," which connects sampling, self-…

New SMMD training method enhances numerical accuracy in LLMs

New LLM Reinforcement Learning Strategy Enhances Exploration

Code does not improve LLM math reasoning; structured traces do

New self-distillation methods enhance LLM reasoning and training stability