GPT-2 Models Struggle to Discover Math Concepts Without Examples

By PulseAugur Editorial · [2 sources] · 2026-06-15 20:54

A new research paper explores the ability of language models, specifically GPT-2 sized models, to discover mathematical concepts like zero. The study found that these models, even with language pretraining, struggle with out-of-distribution generalization for mathematical discovery. However, performance significantly improves when models are trained on examples of zero, with language pretraining reducing the number of required examples by about 50%. AI

IMPACT Investigates the limits of current language models in abstract mathematical reasoning and discovery.

RANK_REASON The cluster contains an academic paper detailing research findings on AI model capabilities.

Read on arXiv cs.CL →

GPT-2

paper
other

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

GPT-2 Models Struggle to Discover Math Concepts Without Examples

COVERAGE [2]

arXiv cs.AI TIER_1 English(EN) · Phoebe Zeng, Thomas L. Griffiths, Brenden M. Lake · 2026-06-17 04:00

Nothing from Something: Can a Language Model Discover 0?

arXiv:2606.17289v1 Announce Type: new Abstract: AI systems based on artificial neural networks are being developed with aspirations of pushing the boundary of human mathematical knowledge. A key question for these systems is how much they can reach beyond their training data. Mat…
arXiv cs.CL TIER_1 English(EN) · Brenden M. Lake · 2026-06-15 20:54

Nothing from Something: Can a Language Model Discover 0?

AI systems based on artificial neural networks are being developed with aspirations of pushing the boundary of human mathematical knowledge. A key question for these systems is how much they can reach beyond their training data. Mathematical discovery requires a strong form of ou…

COVERAGE [2]

Nothing from Something: Can a Language Model Discover 0?

Nothing from Something: Can a Language Model Discover 0?

RELATED ENTITIES

RELATED TOPICS