GLM-5.2 model released with speeds over 200 tps on Together platform

By PulseAugur Editorial · [1 source] · 2026-06-18 04:16

GLM-5.2 has been released and is now available on the Together platform. According to the announcement, inference speeds exceed 200 tokens per second, and users are invited to try the model and share feedback.

IMPACT The reported inference speeds of over 200 tokens per second on the Together platform could enable faster AI application development.

RANK_REASON This is a model release, but not from a frontier lab, and is being distributed via a platform.

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

X — Together (inference / OSS) TIER_1 English(EN) · togethercompute · 2026-06-18 04:16

RT @nutlope: GLM 5.2 is available now on @togethercompute! Very fast speeds (200+ tps), try it out & let me know what you think! Video is…
