PulseAugur / Brief
EN
LIVE 10:38:50

Brief

last 24h
[2/2] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. ZAYA1-8B: a 760M-active MoE trained on AMD MI300x

    Zyphra has released ZAYA1-8B, an 8.4 billion parameter Mixture-of-Experts model that only activates approximately 760 million parameters per token. This architecture allows it to achieve performance comparable to much larger models on math and coding benchmarks, including Claude 4.5 Sonnet. The model incorporates architectural changes like Compressed Convolutional Attention and an MLP-based router for expert selection, and was trained on a large cluster of AMD Instinct MI300x nodes. AI

    IMPACT Achieves frontier-level performance with significantly reduced active parameters, potentially lowering inference costs for advanced models.

  2. Technical Deep Dive: How DigitalOcean and AMD Delivered a 2x Production Inference Performance Increase for Character.ai

    Character.ai, in collaboration with DigitalOcean and AMD, has achieved a twofold increase in production inference performance for its AI entertainment platform. This significant improvement was realized through deep technical optimization of AMD Instinct MI300X and MI325X GPU platforms, utilizing advanced techniques like parallelization for Mixture-of-Experts models and efficient FP8 execution. The collaboration resulted in a multi-year, eight-figure annual agreement with DigitalOcean for GPU infrastructure, enabling Character.ai to scale inference predictably and cost-effectively. AI

    Technical Deep Dive: How DigitalOcean and AMD Delivered a 2x Production Inference Performance Increase for Character.ai

    IMPACT Accelerates AI inference performance and reduces costs, enabling more efficient scaling of large language models.