A new language model named Zaya1-8B, featuring 760 million active parameters in a Mixture-of-Experts architecture, has demonstrated impressive performance on the HMMT '25 math competition. Notably, this model achieved its results without any training on NVIDIA GPUs, a significant departure from typical high-performance AI training. Zaya1-8B surpassed the performance of GPT-5-High on this specific math benchmark, scoring 89.6%. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
IMPACT Demonstrates novel training approaches can yield competitive results, potentially reducing reliance on expensive GPU infrastructure.
RANK_REASON The cluster reports on a new model's performance on a specific benchmark, which falls under research. [lever_c_demoted from research: ic=1 ai=1.0]