The GLM-5.2 model has been released and is now available on the Together platform. The new version reportedly runs inference at more than 200 tokens per second, and users can test it and provide feedback.
AI IMPACT This release offers improved inference speeds for users on the Together platform, potentially enabling faster AI application development.
RANK_REASON This is a model release, but not from a frontier lab, and is being distributed via a platform.
Read on X — Together (inference / OSS) →
AI-generated summary · Google Gemini · from 1 source.