PulseAugur

Gradient-free EntropyBeam model outperforms nanoGPT on Shakespeare benchmark

A language model called EntropyBeam has outperformed nanoGPT on the nanoGPT Shakespeare benchmark, achieving lower cross-entropy. EntropyBeam has no trainable parameters and learns in a single pass, computing count tables that map character contexts to next-character frequencies. It stores far more data than nanoGPT (2.7 million context-transition entries versus 60,192 learned parameters), but by combining the predictions of multiple context orders through a weighted geometric mean it reaches a lower cross-entropy.
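The count-table idea can be illustrated with a minimal Python sketch. This is not the author's implementation: the function names `build_count_tables` and `predict`, the add-alpha smoothing, the chosen orders, and the equal weights are all assumptions made for illustration. The source only states that count tables map character contexts to next-character frequencies and that several orders are combined with a weighted geometric mean.

```python
import math
from collections import Counter, defaultdict


def build_count_tables(text, max_order):
    """Single pass over the text: for each order k, count which
    character follows each k-character context."""
    tables = {k: defaultdict(Counter) for k in range(1, max_order + 1)}
    for i in range(len(text)):
        for k in range(1, max_order + 1):
            if i - k >= 0:
                tables[k][text[i - k:i]][text[i]] += 1
    return tables


def predict(tables, context, vocab, weights, alpha=1.0):
    """Combine per-order next-character distributions with a weighted
    geometric mean (a weighted sum of log-probabilities), then renormalize.
    The add-alpha smoothing is an assumption, not taken from the source."""
    log_scores = {c: 0.0 for c in vocab}
    for k, w in weights.items():
        if len(context) < k:
            continue  # not enough history for this order
        counts = tables[k].get(context[-k:], Counter())
        total = sum(counts.values()) + alpha * len(vocab)
        for c in vocab:
            log_scores[c] += w * math.log((counts[c] + alpha) / total)
    # Renormalize in a numerically stable way.
    top = max(log_scores.values())
    unnorm = {c: math.exp(s - top) for c, s in log_scores.items()}
    z = sum(unnorm.values())
    return {c: v / z for c, v in unnorm.items()}


# Toy usage with hypothetical data and weights.
text = "abababab"
vocab = sorted(set(text))
tables = build_count_tables(text, max_order=2)
probs = predict(tables, "ab", vocab, weights={1: 0.5, 2: 0.5})
print(probs)
```

Summing weighted log-probabilities and renormalizing is the standard way to compute a weighted geometric mean of distributions. No gradient step is involved: all "learning" happens in the counting pass.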

IMPACT Introduces a novel, parameter-free approach to language modeling that achieves competitive results on character-level tasks.

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. LessWrong (AI tag) · TIER_1 · English (EN) · zw5

    Gradient-free Single-pass Model Beats nanoGPT on Shakespeare

    Beam is a character-level language model that computes count tables mapping character contexts to next-character frequencies.

    At prediction time, each order looks up the current context in its count table and…