PulseAugur / Brief
EN
LIVE 07:59:06

Brief

last 24h
[2/2] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. @Zai_org GLM-5.2 is live on Together AI.

    Together AI has announced the release of GLM-5.2, a new open-source model developed by Zai_org. This model is designed for long-horizon tasks and features a 1 million token context window. GLM-5.2 also boasts improved coding capabilities with adjustable effort levels to balance latency and depth, and an architecture that reduces FLOPs by 2.9x at its maximum context length. AI

    IMPACT This release offers enhanced long-context capabilities and improved efficiency, potentially impacting research and applications requiring extensive context.

  2. Zai replaced the network architecture running GLM-5.1 inference and the gains are pretty wild

    Zai has significantly improved the performance and reduced costs of its GLM-5.1 inference cluster by implementing a new network architecture called ZCube. This custom design, developed with Tsinghua University and HarnetsAI, replaces the standard ROFT setup and addresses inefficiencies in traffic patterns during disaggregated inference. The result is a 33% reduction in hardware costs and a 15% increase in GPU inference throughput, alongside a substantial decrease in latency. AI

    Zai replaced the network architecture running GLM-5.1 inference and the gains are pretty wild

    IMPACT Optimized network architecture for AI inference can lead to lower operational costs and faster model deployment.