ENTITY Together AI

Together AI

PulseAugur coverage of Together AI — every cluster mentioning Together AI across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

138

138 over 90d

Releases · 30d

0 over 90d

Papers · 30d

16 over 90d

TIER MIX · 90D

frontier release 7
significant 12
research 13
tool 99
commentary 6
meme 1

TOPICS

RELATIONSHIPS

TIMELINE

2026-06-25 product_launch Together AI announced that its platform, using the GLM-5.2 model, can generate web applications for a few cents per iteration. source
2026-06-22 product_launch Together AI released the Brrrrr inference model. source
2026-06-18 partnership Together AI and NVIDIA are co-hosting an event on July 1st at the AI Engineer World's Fair to discuss open models and collective agent intelligence. source
2026-06-13 product_launch Together AI launched the MiniMax-M3 multimodal model. source
2026-06-12 research_milestone Together AI released benchmarks showing significant performance gains on Blackwell hardware for AI agent infrastructure. source
2026-06-10 research_milestone Together AI achieved ISO 27001:2022 certification after a successful audit. source
2026-06-10 research_milestone Together AI achieved ISO 27001:2022 certification for its Information Security Management System. source
2026-06-09 partnership Together AI partnered with Pax8 to offer AI infrastructure and models to small and medium-sized businesses. source
2026-06-01 product_launch Together AI is announcing a new model called M3. source
2026-05-29 product_launch Together AI is now serving the two fastest speech-to-text models, including NVIDIA Parakeet-TDT 0.6B v3. source
2026-05-29 product_launch Together AI launched a new open-source AI translation application. source
2026-05-22 product_launch Together AI launched updates to its Fine-Tuning Platform, adding support for new LLMs and extending context lengths. source
2026-05-22 product_launch Together AI announced the addition of 1,000 NVIDIA H100 and H200 GPUs to its infrastructure. source
2026-05-22 product_launch Together AI launches GPU clusters with NVIDIA Blackwell platform and optimized kernel collection, achieving significant performance gains. source
2026-05-22 product_launch Together AI launched major upgrades to its Batch Inference API. source

SENTIMENT · 30D

27 day(s) with sentiment data

LAB BRAIN

observation resolved confirmed conf 0.80

Together AI significantly bolsters inference capacity with H100/H200 GPU expansion

The addition of one thousand NVIDIA H100 and H200 GPUs to Together AI's infrastructure represents a substantial investment in inference capabilities. This move directly supports the growing demand for high-throughput AI model serving and is likely intended to power both their internal services and external customer workloads.

hypothesis resolved confirmed conf 0.65

Together AI to offer ATLAS as a distinct inference optimization service

Given the significant performance gains demonstrated by ATLAS, Together AI may soon offer this adaptive-learning inference system as a standalone service or an add-on feature for their existing GPU offerings. This would allow customers to leverage ATLAS's dynamic optimization without needing to manage the underlying infrastructure themselves.

observation expired conf 0.85

Together AI's ATLAS system demonstrates superior inference speed on par with specialized hardware

Together AI's newly launched ATLAS system, an adaptive-learning inference engine, is showing remarkable performance, achieving up to 500 TPS on DeepSeek-V3.1. This performance rivals that of specialized hardware like Groq, suggesting Together AI is effectively optimizing LLM inference beyond standard GPU capabilities.

hypothesis resolved confirmed conf 0.65

Together AI to integrate NVIDIA Blackwell features into all core services

The 90% training speed boost achieved with NVIDIA Blackwell and custom kernels indicates a deep integration. It's likely Together AI will leverage Blackwell's capabilities across their entire platform, including their new instant clusters and fine-tuning services, to offer a performance edge over competitors.

observation expired conf 0.75

Together AI's ATLAS system shows strong performance against specialized hardware

The reported performance of Together AI's ATLAS system, achieving up to 500 TPS on DeepSeek-V3.1 and outperforming specialized hardware like Groq, is a significant technical achievement. This suggests their adaptive inference approach is highly effective and could set a new benchmark for LLM inference speed and efficiency.

All hypotheses →

RECENT · PAGE 1/7 · 138 TOTAL

Together AI

Together AI significantly bolsters inference capacity with H100/H200 GPU expansion

Together AI to offer ATLAS as a distinct inference optimization service

Together AI's ATLAS system demonstrates superior inference speed on par with specialized hardware

Together AI to integrate NVIDIA Blackwell features into all core services

Together AI's ATLAS system shows strong performance against specialized hardware

MiniMax AI and Together AI to Discuss Large-Scale Agent Infrastructure

LiteLLM vs. OpenRouter: A Deep Dive into LLM Proxy Architectures

Open-source AI inference demand drives strategic model choice, says Together AI

Together AI's GLM-5.2 aids web app iteration, user shares workflow

Sail Research launches with $80M to optimize AI infrastructure for autonomous agents

Together AI claims world's fastest speech-to-text stack

Together AI offers web app generation for cents with GLM-5.2

Together AI launches GLM Arena to benchmark GLM 5.2 against Anthropic's Opus 4.8

LLM Inference Pricing Compared Across 7 Providers, Highlighting Caching Costs

Together AI sees 400T token adoption for open-source models

Together AI releases open-source Parallel Kernel Builder for LLM inference

Together AI's GLM 5.2 outperforms Anthropic's Opus on speed and cost

Frontier LLMs struggle with multi-GPU kernel generation, new benchmark reveals

Together AI releases free Brrrrr inference model

Together AI's Blind Test challenges users to distinguish between GLM-5.2 and Opus 4.8

Together AI deploys NVIDIA GB300 NVL72 for next-gen AI inference infrastructure

Together AI offers free GLM-5.2 access via Together Chat

Together AI offers free access to GLM-5.2 model on Together Chat

Together AI releases GLM-5.2, showcasing fast inference and reasoning capabilities

Together AI's voice agent interacts with screens for code editing