Together AI has released GLM 5.1, an open-source inference model. The optimization of GLM 5.1 focused on rewriting and fusing the indexer topk kernel to reduce memory and launch overhead. Additionally, CPU overhead was eliminated to improve prefill throughput, with significant gains attributed to the indexer improvements. AI
IMPACT Together AI's release of GLM 5.1 offers an open-source option for inference, potentially lowering costs and increasing accessibility for developers.
RANK_REASON Frontier-lab model release with system card. [lever_c_demoted from frontier_release: ic=2 ai=1.0]
Read on X — Together (inference / OSS) →
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →