PulseAugur / Brief

last 24h
[1/1] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. Zai replaced the network architecture running GLM-5.1 inference and the gains are pretty wild

    Zai has cut costs and raised performance on its GLM-5.1 inference cluster by deploying a new network architecture called ZCube. The custom design, developed with Tsinghua University and HarnetsAI, replaces the standard ROFT setup and addresses inefficient traffic patterns during disaggregated inference. The reported results are a 33% reduction in hardware costs and a 15% increase in GPU inference throughput, along with substantially lower latency.

    IMPACT Optimized network architecture for AI inference can lead to lower operational costs and faster model deployment.