ENTITY Together AI

PulseAugur coverage of Together AI — every cluster mentioning Together AI across labs, papers, and developer communities, ranked by signal.

Total · 30d

100

100 over 90d

Releases · 30d

0 over 90d

Papers · 30d

14 over 90d

TIER MIX · 90D

frontier release 4
significant 11
research 9
tool 71
commentary 4
meme 1

TIMELINE

2026-06-13 product_launch Together AI launched the MiniMax-M3 multimodal model.
2026-06-12 research_milestone Together AI released benchmarks showing significant performance gains on Blackwell hardware for AI agent infrastructure.
2026-06-10 research_milestone Together AI achieved ISO 27001:2022 certification for its Information Security Management System after a successful audit.
2026-06-09 partnership Together AI partnered with Pax8 to offer AI infrastructure and models to small and medium-sized businesses.
2026-06-01 product_launch Together AI announced a new model called M3.
2026-05-29 product_launch Together AI began serving the two fastest speech-to-text models, including NVIDIA Parakeet-TDT 0.6B v3.
2026-05-29 product_launch Together AI launched a new open-source AI translation application.
2026-05-22 product_launch Together AI launched updates to its Fine-Tuning Platform, adding support for new LLMs and extending context lengths.
2026-05-22 product_launch Together AI announced the addition of 1,000 NVIDIA H100 and H200 GPUs to its infrastructure.
2026-05-22 product_launch Together AI launched GPU clusters built on the NVIDIA Blackwell platform with an optimized kernel collection, achieving significant performance gains.
2026-05-22 product_launch Together AI launched major upgrades to its Batch Inference API.
2026-05-22 product_launch Together AI released FlashAttention-3 and FlashAttention-4, optimized attention mechanisms for GPUs.
2026-05-22 product_launch Together AI launched access to the Qwen3.7-Max model.
2026-05-15 partnership Together AI and Pearl Research Labs formed a partnership to integrate blockchain for AI inference cost reduction.

SENTIMENT · 30D

20 days with sentiment data

LAB BRAIN

observation active conf 0.85

Together AI's ATLAS system demonstrates inference speed on par with specialized hardware

Together AI's newly launched ATLAS system, an adaptive-learning inference engine, reportedly reaches up to 500 tokens per second (TPS) on DeepSeek-V3.1. That rivals specialized hardware such as Groq and suggests Together AI is optimizing LLM inference beyond what standard GPU serving delivers.

observation resolved confirmed conf 0.80

Together AI significantly bolsters inference capacity with H100/H200 GPU expansion

Adding 1,000 NVIDIA H100 and H200 GPUs to Together AI's infrastructure is a substantial investment in inference capacity. The expansion supports growing demand for high-throughput AI model serving and is likely intended to power both its internal services and external customer workloads.

hypothesis resolved confirmed conf 0.65

Together AI to offer ATLAS as a distinct inference optimization service

Given the significant performance gains demonstrated by ATLAS, Together AI may soon offer this adaptive-learning inference system as a standalone service or as an add-on to its existing GPU offerings. Customers could then use ATLAS's dynamic optimization without managing the underlying infrastructure themselves.

hypothesis resolved confirmed conf 0.65

Together AI to integrate NVIDIA Blackwell features into all core services

The 90% training speed boost achieved with NVIDIA Blackwell and custom kernels indicates deep integration. Together AI will likely use Blackwell's capabilities across its entire platform, including its new instant clusters and fine-tuning services, to gain a performance edge over competitors.

observation expired conf 0.75

Together AI's ATLAS system shows strong performance against specialized hardware

Together AI's ATLAS system reportedly reaches up to 500 TPS on DeepSeek-V3.1 and outperforms specialized hardware such as Groq, a significant technical achievement. This suggests its adaptive inference approach is highly effective and could set a new benchmark for LLM inference speed and efficiency.

RECENT · PAGE 4/5 · 100 TOTAL

Together AI releases Mamba-3, prioritizing inference speed over training

Together AI launches NVIDIA's multimodal and 1M-context Nemotron 3 models

Together AI enhances GPU clusters with multi-tenancy and autoscaling

New methods tackle LLM KV cache compression for long contexts

Speech models fail on street names, especially for non-native speakers

Together AI expands LLM fine-tuning, adds longer contexts

Together AI launches Rime V3 models for natural voice code-switching

DSGym framework standardizes data science agent evaluation and training

Together AI rebrands, focuses on efficient AI inference infrastructure

Cursor and Together AI optimize AI coding assistant with NVIDIA Blackwell

Multi-node training enables scaling foundation models across GPU clusters

Guide details choosing open-source AI models for production

Together AI launches MiniMax Speech 2.8 Turbo for natural voice agents

Together AI adds Rime voice models for expressive, controlled AI conversations

Together AI VP: AI not hitting hardware wall, efficiency gains untapped

NVIDIA Nemotron Diffusion models offer 6.4x faster AI inference

Together AI releases new Python SDK v2.0 RC

Together AI introduces AutoJudge for faster LLM inference

Together AI Cloud enhances RL pipelines with TorchForge and tool integrations

Together AI launches unified platform for real-time voice agents