PulseAugur
EN
LIVE 17:12:47
中文(ZH) HuggingFace CEO力荐,Bengio团队也押注:这个1500美元训出的HRM模型,凭什么火了?

HRM-Text: 1B parameter model with novel architecture challenges LLM paradigms

A new language model called HRM-Text, developed by Sapient Intelligence, is gaining attention for its innovative architecture that focuses on internal reasoning rather than simply increasing model size or training data. This model, with only 1 billion parameters and a training cost of approximately $1500, has achieved impressive scores on benchmarks like MATH and GSM8K. The architecture, known as Hierarchical Reasoning Model (HRM), emphasizes latent reasoning, allowing the model to perform multi-round, hierarchical, and recursive computations within its internal state before producing an output, a concept also explored in research by Yoshua Bengio's team. AI

IMPACT This model's focus on internal reasoning could shift future LLM development towards more efficient computation over sheer scale.

RANK_REASON Novel AI model architecture release with significant benchmark performance and endorsement from prominent AI researchers. [lever_c_demoted from significant: ic=1 ai=1.0]

Read on 量子位 (QbitAI) →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. 量子位 (QbitAI) TIER_1 中文(ZH) · 鹭羽 ·

    HuggingFace CEO strongly recommends, Bengio team also bets on: Why is this HRM model, trained with $1500, so popular?

    模型参数量只有1B