ENTITY ZAYA1-8B

PulseAugur coverage of ZAYA1-8B — every cluster mentioning ZAYA1-8B across labs, papers, and developer communities, ranked by signal.

Total · 10 over 90d

Releases · 0 over 90d

Papers · 5 over 90d

TOPICS

model release 10
paper 5
infra 4
other 1

RELATIONSHIPS

developed by AMD Instinct MI300X 90%
used by AMD Instinct MI300X 90%

TIMELINE

2026-05-22 product_launch Zyphra released the ZAYA1-8B Mixture-of-Experts model.
2026-05-19 research_milestone ZAYA1-8B achieves a high score on a math benchmark without being trained on NVIDIA GPUs.

SENTIMENT · 30D

2 days with sentiment data

RECENT · PAGE 1/1 · 10 TOTAL

SIGNIFICANT · CL_105330 · Jun 23 · 07:02

Zyphra's ZAYA1-8B model shows strong reasoning on AMD hardware

Zyphra has released ZAYA1-8B, an Apache 2.0 licensed Mixture-of-Experts reasoning model with 8.4 billion total parameters and approximately 760 million active parameters. Notably, the model was trained entirely on AMD I…

TOOL · CL_89886 · Jun 14 · 03:00

LLM Architectures Innovate with KV Sharing, Compressed Attention for Long Context

Recent advancements in Large Language Model (LLM) architectures are focusing on improving efficiency for long context windows, addressing resource constraints like KV cache size and memory bandwidth. Techniques such as …

TOOL · CL_45245 · May 22 · 20:30

New 8B LLM ZAYA1-8B signals major design shift

A new 8-billion-parameter local LLM, ZAYA1-8B, is being hailed as a significant design shift in the field. Its architecture appears to represent a major departure from previous small reasoning models, potentially markin…

SIGNIFICANT · CL_43334 · May 22 · 03:28

Zyphra releases ZAYA1-8B MoE with sub-billion active parameters

Zyphra has released ZAYA1-8B, an 8.4-billion-parameter Mixture-of-Experts model that activates only approximately 760 million parameters per token. This architecture allows it to achieve performance comparable to much l…

TOOL · CL_38440 · May 19 · 05:39

ZAYA1-8B model beats GPT-5-High on math test without NVIDIA GPUs

A new language model named ZAYA1-8B, featuring 760 million active parameters in a Mixture-of-Experts architecture, has demonstrated impressive performance on the HMMT '25 math competition. Notably, this model achieved i…

RESEARCH · CL_34518 · May 16 · 11:33

LLM Architectures Innovate for Long-Context Efficiency

Sebastian Raschka's analysis highlights recent architectural innovations in open-weight LLMs aimed at improving long-context efficiency. Key developments include KV sharing and per-layer embeddings in Google's Gemma 4 m…

RESEARCH · CL_23622 · May 8 · 23:21

AMD-trained ZAYA1-8B model challenges NVIDIA's dominance

XenoSpectrum reports that Zyphra has released ZAYA1-8B, a lightweight inference-focused model trained on AMD GPUs. This release aims to challenge NVIDIA's dominance in the GPU market by demonstrating the practical utility of AMD hardware f…

TOOL · CL_22192 · May 8 · 04:00

Zyphra's ZAYA1-8B model matches larger rivals with 700M active parameters

Zyphra has released ZAYA1-8B, a reasoning-focused mixture-of-experts model with 700 million active parameters. The model was trained from scratch on an AMD compute platform and utilizes a novel four-stage reinforcement …

TOOL · CL_20915 · May 7 · 09:00

Zyphra's ZAYA1-8B model matches top AI benchmarks with under 1B active parameters

Zyphra has released ZAYA1-8B, an open-source model that achieves performance comparable to DeepSeek-R1 on math benchmarks. The model also demonstrates competitive reasoning capabilities against Claude Sonnet 4.5 and app…

TOOL · CL_20870 · May 7 · 05:44

Zyphra's ZAYA1-8B MoE model trained on AMD hardware outperforms larger rivals

Zyphra AI has released ZAYA1-8B, a Mixture of Experts (MoE) language model with 760 million active parameters and 8.4 billion total parameters. Trained on AMD hardware, this model demonstrates competitive performance ag…
