PulseAugur
实时 23:40:06
实体 ZAYA1-8B

ZAYA1-8B

PulseAugur coverage of ZAYA1-8B — every cluster mentioning ZAYA1-8B across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
8
90 天内 8
发布 · 30天
0
90 天内 0
论文 · 30天
5
90 天内 5
层级分布 · 90 天
时间线
  1. 2026-05-22 product_launch Zyphra released the ZAYA1-8B Mixture-of-Experts model. 来源
  2. 2026-05-19 research_milestone Zaya1-8B model achieves a high score on a math benchmark without NVIDIA GPU training. 来源
情绪 · 30 天

3 天有情绪数据

最近 · 第 1/1 页 · 共 8 条
  1. TOOL · CL_45245 ·

    New 8B LLM Zaya1-8B signals major design shift

    A new 8-billion parameter local LLM, Zaya1-8B, is being hailed as a significant design shift in the field. Its architecture appears to represent a major departure from previous small reasoning models, potentially markin…

  2. SIGNIFICANT · CL_43334 ·

    Zyphra releases ZAYA1-8B MoE with sub-billion active parameters

    Zyphra has released ZAYA1-8B, an 8.4 billion parameter Mixture-of-Experts model that only activates approximately 760 million parameters per token. This architecture allows it to achieve performance comparable to much l…

  3. TOOL · CL_38440 ·

    Zaya1-8B model beats GPT-5-High on math test without NVIDIA GPUs

    A new language model named Zaya1-8B, featuring 760 million active parameters in a Mixture-of-Experts architecture, has demonstrated impressive performance on the HMMT '25 math competition. Notably, this model achieved i…

  4. RESEARCH · CL_34518 ·

    LLM Architectures Innovate for Long-Context Efficiency

    Sebastian Raschka's analysis highlights recent architectural innovations in open-weight LLMs aimed at improving long-context efficiency. Key developments include KV sharing and per-layer embeddings in Google's Gemma 4 m…

  5. RESEARCH · CL_23622 ·

    AMD-trained ZAYA1-8B model challenges NVIDIA's dominance

    XenoSpectrum has released ZAYA1-8B, a lightweight inference-focused model trained on AMD GPUs. This release aims to challenge NVIDIA's dominance in the GPU market by demonstrating the practical utility of AMD hardware f…

  6. TOOL · CL_22192 ·

    Zyphra's ZAYA1-8B model matches larger rivals with 700M active parameters

    Zyphra has released ZAYA1-8B, a reasoning-focused mixture-of-experts model with 700 million active parameters. The model was trained from scratch on an AMD compute platform and utilizes a novel four-stage reinforcement …

  7. TOOL · CL_20915 ·

    Zyphra's ZAYA1-8B model matches top AI benchmarks with under 1B parameters

    Zyphra has released ZAYA1-8B, an open-source model that achieves performance comparable to DeepSeek-R1 on math benchmarks. The model also demonstrates competitive reasoning capabilities against Claude Sonnet 4.5 and app…

  8. TOOL · CL_20870 ·

    Zyphra's ZAYA1-8B MoE model trained on AMD hardware outperforms larger rivals

    Zyphra AI has released ZAYA1-8B, a Mixture of Experts (MoE) language model with 760 million active parameters and 8.4 billion total parameters. Trained on AMD hardware, this model demonstrates competitive performance ag…