PulseAugur
EN
LIVE 23:24:49
Italiano(IT) 📰 Ricercatori addestrano un modello AI da zero per soli 1.500 dollari Un team di Sapient ha addestrato HRM-Text, un modello da 1 miliardo di parametri, spendend

Small AI Model Trained for $1500 Achieves High Benchmark Scores

Sapient Intelligence has developed the HRM-Text model, a 1-billion parameter AI that was trained for only $1500. This model achieves high benchmark scores despite its small size and low training cost, utilizing significantly less compute and fewer tokens than comparable models like Qwen, Gemma, or Llama. The training process was unconventional, forgoing post-training and RLHF. AI

IMPACT This development demonstrates the potential for highly efficient and low-cost AI model training, potentially democratizing access to advanced AI capabilities.

RANK_REASON The cluster describes the release of a new AI model with details on its training cost and parameters, fitting the research category.

Read on Mastodon — mastodon.social →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

Small AI Model Trained for $1500 Achieves High Benchmark Scores

COVERAGE [2]

  1. Mastodon — mastodon.social TIER_1 English(EN) · AIsynestesia ·

    🤖 Small AI Model Packs Big Punch with Unconventional Training Sapient Intelligence's HRM Text model, with only 1B parameters and a training cost of $1500, achie

    🤖 Small AI Model Packs Big Punch with Unconventional Training Sapient Intelligence's HRM Text model, with only 1B parameters and a training cost of $1500, achieves high benchmark scores without post training or RLHF. This small, efficiently trained model, developed from scratch, …

  2. Mastodon — mastodon.social TIER_1 Italiano(IT) · AI_BEAR_NEWS ·

    📰 Researchers train an AI model from scratch for only $1,500 A Sapient team trained HRM-Text, a 1 billion parameter model, spending

    📰 Ricercatori addestrano un modello AI da zero per soli 1.500 dollari Un team di Sapient ha addestrato HRM-Text, un modello da 1 miliardo di parametri, spendendo solo 1.500 dollari in 1.9 giorni su 16 GPU. Usa 100-900x meno token e 96-432x meno compute rispetto a modelli come Qwe…