PulseAugur
实时 14:30:36

Neural program synthesis models struggle with generalization beyond training data

Researchers have developed a controlled environment to rigorously test the generalization capabilities of neural program synthesis models. Their experiments reveal that while transformers perform well on known data, they struggle significantly with generating novel programs, showing a performance drop of over 30%. The study indicates that increasing compute power yields diminishing returns, following a log-linear relationship, and suggests that maximizing training diversity across various manifolds is crucial for robust generalization. The findings highlight the need for new search-based methods to overcome current scaling limitations. AI

影响 Highlights limitations in transformer generalization for novel program synthesis, suggesting new approaches are needed.

排序理由 Academic paper on generalization boundaries in neural program synthesis.

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

Neural program synthesis models struggle with generalization beyond training data

报道来源 [2]

  1. arXiv cs.AI TIER_1 English(EN) · Henrik Voigt, Michael Habeck, Joachim Giesen ·

    Beyond the Training Distribution: Mapping Generalization Boundaries in Neural Program Synthesis

    arXiv:2604.27551v1 Announce Type: cross Abstract: Large-scale transformers achieve impressive results on program synthesis benchmarks, yet their true generalization capabilities remain obscured by data contamination and opaque training corpora. To rigorously assess whether models…

  2. arXiv cs.CL TIER_1 English(EN) · Joachim Giesen ·

    Beyond the Training Distribution: Mapping Generalization Boundaries in Neural Program Synthesis

    Large-scale transformers achieve impressive results on program synthesis benchmarks, yet their true generalization capabilities remain obscured by data contamination and opaque training corpora. To rigorously assess whether models are truly generalizing or merely retrieving memor…