Together AI
PulseAugur coverage of Together AI — every cluster mentioning Together AI across labs, papers, and developer communities, ranked by signal.
- uses Gemma-4-31B-it-Pearl 90%
- uses Deepgram 90%
- founded Vipul Ved Prakash 90%
- partners with Pearl Research Labs 90%
- uses Nvidia Blackwell B200 90%
- developed Together Code Interpreter 90%
- used by NVIDIA Parakeet-TDT 0.6B v3 90%
- developed Gemma-4-31B-it-Pearl 90%
- employed by Dan Fu 90%
- partners with MiniMax AI 80%
- used by MiniMax AI 75%
- used by DeepSeek-R1 70%
- 2026-06-13 product_launch Together AI launched the MiniMax-M3 multimodal model. source
- 2026-06-12 research_milestone Together AI released benchmarks showing significant performance gains on Blackwell hardware for AI agent infrastructure. source
- 2026-06-10 research_milestone Together AI achieved ISO 27001:2022 certification after a successful audit. source
- 2026-06-10 research_milestone Together AI achieved ISO 27001:2022 certification for its Information Security Management System. source
- 2026-06-09 partnership Together AI partnered with Pax8 to offer AI infrastructure and models to small and medium-sized businesses. source
- 2026-06-01 product_launch Together AI is announcing a new model called M3. source
- 2026-05-29 product_launch Together AI is now serving the two fastest speech-to-text models, including NVIDIA Parakeet-TDT 0.6B v3. source
- 2026-05-29 product_launch Together AI launched a new open-source AI translation application. source
- 2026-05-22 product_launch Together AI launched updates to its Fine-Tuning Platform, adding support for new LLMs and extending context lengths. source
- 2026-05-22 product_launch Together AI announced the addition of 1,000 NVIDIA H100 and H200 GPUs to its infrastructure. source
- 2026-05-22 product_launch Together AI launches GPU clusters with NVIDIA Blackwell platform and optimized kernel collection, achieving significant performance gains. source
- 2026-05-22 product_launch Together AI launched major upgrades to its Batch Inference API. source
- 2026-05-22 product_launch Together AI released FlashAttention-3 and FlashAttention-4, optimized attention mechanisms for GPUs. source
- 2026-05-22 product_launch Together AI launched access to the Qwen3.7-Max model. source
- 2026-05-15 partnership Together AI and Pearl Research Labs formed a partnership to integrate blockchain for AI inference cost reduction. source
20 day(s) with sentiment data
Together AI's ATLAS system demonstrates superior inference speed on par with specialized hardware
Together AI's newly launched ATLAS system, an adaptive-learning inference engine, is showing remarkable performance, achieving up to 500 TPS on DeepSeek-V3.1. This performance rivals that of specialized hardware like Groq, suggesting Together AI is effectively optimizing LLM inference beyond standard GPU capabilities.
Together AI significantly bolsters inference capacity with H100/H200 GPU expansion
The addition of one thousand NVIDIA H100 and H200 GPUs to Together AI's infrastructure represents a substantial investment in inference capabilities. This move directly supports the growing demand for high-throughput AI model serving and is likely intended to power both their internal services and external customer workloads.
Together AI to offer ATLAS as a distinct inference optimization service
Given the significant performance gains demonstrated by ATLAS, Together AI may soon offer this adaptive-learning inference system as a standalone service or an add-on feature for their existing GPU offerings. This would allow customers to leverage ATLAS's dynamic optimization without needing to manage the underlying infrastructure themselves.
Together AI to integrate NVIDIA Blackwell features into all core services
The 90% training speed boost achieved with NVIDIA Blackwell and custom kernels indicates a deep integration. It's likely Together AI will leverage Blackwell's capabilities across their entire platform, including their new instant clusters and fine-tuning services, to offer a performance edge over competitors.
Together AI's ATLAS system shows strong performance against specialized hardware
The reported performance of Together AI's ATLAS system, achieving up to 500 TPS on DeepSeek-V3.1 and outperforming specialized hardware like Groq, is a significant technical achievement. This suggests their adaptive inference approach is highly effective and could set a new benchmark for LLM inference speed and efficiency.
-
New research targets LLM reasoning improvements via context, efficiency, and robustness
Several recent research papers explore methods to enhance the reasoning capabilities of large language models (LLMs). One study suggests that increasing a model's long-context capacity improves reasoning performance acr…
-
Together AI adds 40+ image and video models, including FLUX.2
Together AI has expanded its platform to include advanced multimedia generation capabilities, integrating over 40 new image and video models. This move aims to simplify development by offering a unified API for text, im…
-
Together AI launches accelerator for AI-native app startups
Together AI has launched a new startup accelerator program specifically designed for companies building AI-native applications. The accelerator will provide selected startups with platform credits, engineering expertise…
-
Together AI hires Mahadev Konar to lead GPU infrastructure
Together AI has appointed Mahadev Konar as its new SVP of Infrastructure Engineering to bolster its GPU cloud services. Konar, a key figure in Apache Hadoop's development and formerly VP of Infrastructure at Instacart, …
-
Together AI boosts custom model inference speed, optimizes open-source LLMs
Together AI has launched a new service called Dedicated Container Inference, designed to optimize the deployment and performance of custom generative media models. This platform handles complex orchestration tasks like …
-
Together AI launches LLM evaluation tool with open-source judges
Together AI has launched Together Evaluations, a new platform designed to help developers benchmark large language models for specific tasks. The service allows users to define custom benchmarks and utilize leading open…
-
Together AI integrates Deepgram voice models, launches fast Whisper STT
Together AI has launched new speech-to-text (STT) and text-to-speech (TTS) capabilities, integrating Deepgram's advanced voice models and its own high-performance Whisper V3 API. This move aims to streamline the develop…
-
Together AI achieves SOC 2 Type 2 compliance for secure AI workloads
Together AI has achieved SOC 2 Type 2 compliance, demonstrating a strong commitment to security and data protection. This rigorous process involved an independent audit of their infrastructure, validating controls for a…
-
Together AI deploys 100,000 GPUs in Europe via Hypertec, 5C
Together AI is significantly expanding its infrastructure in Europe through a partnership with Hypertec and 5C Group. This initiative aims to provide up to 2 gigawatts of AI-dedicated data center capacity and nearly 100…
-
Together AI champions open-source models driving AI frontier
Together AI argues that the future of AI development lies in open-source models, challenging the notion that proprietary labs are the sole drivers of innovation. The company highlights that open-source platforms offer g…
-
Together AI unveils YAQA for improved LLM quantization
Together AI has introduced YAQA, a novel post-training quantization technique for large language models. This method aims to preserve the original model's outputs more effectively than existing algorithms by directly mi…
-
New AI research tackles multimodal finetuning, image editing, and verification
Researchers have developed TRACER, a novel method for robust multimodal finetuning that addresses catastrophic forgetting by using a Weighted Moving Average (WMA) teacher. This approach improves out-of-distribution accu…
-
Together AI launches API to execute LLM-generated code
Together AI has launched Together Code Interpreter (TCI), an API designed to securely execute code generated by large language models. This tool addresses the limitation of LLMs being unable to run the code they produce…
-
Together AI launches code execution tools for AI-generated code
Together AI has launched two new products, Together Code Sandbox and Together Code Interpreter, aimed at improving the execution of AI-generated code. Together Code Sandbox offers customizable virtual machine environmen…
-
Together AI acquires Refuel.ai to boost enterprise AI data capabilities
Together AI has acquired Refuel.ai, a company specializing in data cleaning and structuring for AI applications. This acquisition aims to integrate Refuel.ai's models and platform into Together AI's existing infrastruct…
-
Arcee AI moves to Together Endpoints for cost-efficient SLMs
Arcee AI has migrated its specialized small language models (SLMs) from AWS to Together Dedicated Endpoints, seeking improved cost, performance, and operational agility. The company focuses on training efficient models …
-
Together AI boosts AI training 90% with NVIDIA Blackwell
Together AI has launched new GPU clusters featuring NVIDIA's Blackwell platform, offering significant speedups for AI training and inference. These clusters, powered by the Together Kernel Collection, achieve up to 90% …
-
Together AI launches platform for continuous LLM fine-tuning
Together AI has launched a new fine-tuning platform that allows users to continuously improve open-weight language models. The platform now supports preference optimization and continued training, enabling models to ada…
-
Google AI optimizes cloud computing with LAVA, Together AI expands GPU cloud, and Modal streamlines AI/ML deployment
Google DeepMind researchers have developed LAVA, a new AI-driven scheduling algorithm designed to optimize resource allocation in cloud data centers. LAVA continuously re-predicts virtual machine (VM) lifetimes, adaptin…
-
Social AI with Hugging Face
Hugging Face has announced a series of partnerships and product updates aimed at enhancing the accessibility, security, and scalability of AI models. Collaborations with Google, VirusTotal, JFrog, Wiz Research, and Prot…