Zaya1-8B model beats GPT-5-High on math test without NVIDIA GPUs

By PulseAugur Editorial · [1 sources] · 2026-05-19 05:39

A new language model named Zaya1-8B, featuring 760 million active parameters in a Mixture-of-Experts architecture, has demonstrated impressive performance on the HMMT '25 math competition. Notably, this model achieved its results without any training on NVIDIA GPUs, a significant departure from typical high-performance AI training. Zaya1-8B surpassed the performance of GPT-5-High on this specific math benchmark, scoring 89.6%. AI

IMPACT Demonstrates novel training approaches can yield competitive results, potentially reducing reliance on expensive GPU infrastructure.

RANK_REASON The cluster reports on a new model's performance on a specific benchmark, which falls under research. [lever_c_demoted from research: ic=1 ai=1.0]

Read on Towards AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Zaya1-8B model beats GPT-5-High on math test without NVIDIA GPUs

COVERAGE [1]

Towards AI TIER_1 English(EN) · Chew Loong Nian - AI ENGINEER · 2026-05-19 05:39

I Tested ZAYA1-8B — Trained on Zero NVIDIA GPUs, Its 760M Active Params Cheated GPT-5-High on Math

<div class="medium-feed-item"><p class="medium-feed-snippet">A 760-million-active-parameter MoE that never touched a single NVIDIA H100 in training scored 89.6% on HMMT ’25 math — 1.3 points higher…</p><p class="medium-feed-link"><a href="https://pub.towardsa…

COVERAGE [1]

I Tested ZAYA1-8B — Trained on Zero NVIDIA GPUs, Its 760M Active Params Cheated GPT-5-High on Math

RELATED ENTITIES

RELATED TOPICS