Chinese (ZH) Fastest among the top-tier players! Zhipu, are you "spraying" code?

Zhipu AI launches the GLM-5.1-highspeed API, reaching 400 tokens/s

By the PulseAugur editorial team · [3 sources] · 2026-05-22 03:05

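Zhipu AI has released GLM-5.1-highspeed, a new API for its GLM-5.1 model with an inference speed of 400 tokens per second. The offering is positioned as the fastest among the world's leading LLM providers and reportedly performed well in hands-on testing, including fast code generation and content summarization. The speed gain is attributed to substantial systems-engineering optimizations across the inference engine, the scheduling system, and the underlying infrastructure. The goal is to improve the user experience of AI agents by reducing wait times and increasing feedback frequency.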
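To make the reported figure concrete, the sketch below works through the arithmetic of how long a response takes to stream at a steady decode rate. Only the 400 tokens/s figure comes from the coverage. The 50 tokens/s comparison rate, the token counts, and the function names are illustrative assumptions, and the model ignores time-to-first-token and network latency.

```python
def generation_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Seconds needed to stream num_tokens at a steady decode rate.

    Simplified model: ignores time-to-first-token and network latency.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


def agent_loop_seconds(steps: int, tokens_per_step: int, tokens_per_second: float) -> float:
    """Total generation time for an agent that emits tokens_per_step in each of `steps` turns."""
    return steps * generation_seconds(tokens_per_step, tokens_per_second)


REPORTED_RATE = 400.0  # tokens/s, the figure reported for GLM-5.1-highspeed
BASELINE_RATE = 50.0   # tokens/s, hypothetical comparison rate (assumption, not from the article)

if __name__ == "__main__":
    # A 2,000-token code file at each rate.
    print(f"{generation_seconds(2000, REPORTED_RATE):.1f} s at the reported rate")
    print(f"{generation_seconds(2000, BASELINE_RATE):.1f} s at the hypothetical baseline")
    # A 10-step agent run emitting 800 tokens per step.
    print(f"{agent_loop_seconds(10, 800, REPORTED_RATE):.1f} s for the agent run")
```

Under these assumptions the per-step wait shrinks in proportion to the decode rate, which is the mechanism the summary points to when it mentions shorter waits and more frequent feedback for agents.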

Impact: Speeds up AI agent responsiveness and real-time interaction across a wide range of applications.

Ranking rationale: A model release from a frontier lab that sets a new speed benchmark.

AI-generated summary · Google Gemini · from 3 sources.

Sources [3]

量子位 (QbitAI) TIER_1 Chinese (ZH) · 十三 · 2026-05-22 03:05

Fastest among the top-tier players! Zhipu, are you "spraying" code?

400 tokens/s

Pandaily TIER_1 English (EN) · Pandaily · 2026-05-22 08:07

Zhipu AI Launches GLM-5.1 High-Speed API: 400 Tokens/s Sets New Global Benchmark

Zhipu AI has launched GLM-5.1-highspeed, an API variant of its GLM-5.1 model delivering 400 tokens per second — reportedly the fastest inference speed among major global LLM providers.

Mastodon (mastodon.social) TIER_1 English (EN) · 2026-05-22 08:35

Zhipu AI has launched GLM-5.1-highspeed, a high-speed API variant of its GLM-5.1 large language model, delivering 400 tokens per second and reportedly setting a new global benchmark for inference speed among major LLM providers. The API targets enterprise applications requiring r…

Link: pandaily.com/zhipu-ai-glm-5.1-high-speed-…
