PulseAugur
EN
LIVE 10:16:01

Together AI boosts AI training 90% with NVIDIA Blackwell

Together AI has launched new GPU clusters featuring NVIDIA's Blackwell platform, offering significant speedups for AI training and inference. These clusters, powered by the Together Kernel Collection, achieve up to 90% faster training speeds compared to previous NVIDIA H100 hardware, processing over 15,000 tokens per second for large models. Early access customers like Salesforce and Zoom have reported substantial performance gains, with some experiencing double the training speed. Together AI's optimization efforts span custom kernels, inference engines, and speculative decoding, aiming to redefine efficiency in AI model development and deployment. AI

IMPACT Accelerates AI training and inference, potentially lowering costs and increasing the pace of model development and deployment for enterprises.

RANK_REASON This cluster details a significant infrastructure upgrade and performance improvement for AI workloads by a major cloud provider, leveraging new hardware from a leading chip manufacturer.

Read on Together AI blog →

AI-generated summary · Google Gemini · from 3 sources. How we write summaries →

Together AI boosts AI training 90% with NVIDIA Blackwell

COVERAGE [3]

  1. Together AI blog TIER_1 English(EN) ·

    Together AI Achieves 90% Faster BF16 Training with NVIDIA Blackwell Platform and Together Kernel Collection

  2. Together AI blog TIER_1 Nederlands(NL) ·

    Together AI Delivers Top Speeds for DeepSeek-R1-0528 Inference on NVIDIA Blackwell

    Together AI inference is now among the world’s fastest, most capable platforms for running open-source reasoning models like DeepSeek-R1 at scale, thanks to our new inference engine designed for NVIDIA HGX B200.

  3. Together AI blog TIER_1 English(EN) ·

    Salesforce, Zoom, InVideo Train Faster with Together AI Turbocharged with NVIDIA Blackwell