Together AI unveils 8 new LLM inference and training system advancements

By PulseAugur Editorial · [8 sources] · 2026-06-30 19:54

Together AI has released a series of research papers detailing advancements in LLM inference and training systems. These include methods for optimizing Mixture-of-Experts (MoE) models with Batch-Aware Expert Routing (OEA), and memory-efficient context parallelism with Ulysses. The company also presented Aurora, a unified system for adaptive speculative training, and V1, which unifies generation and self-verification for parallel reasoners. Further innovations include RARO for learning to reason via demonstrations, TTT-Discover for AI-driven scientific discovery, ThunderAgent for program-aware agentic inference, and DSGym for evaluating and training data science agents. AI

IMPACT These advancements aim to improve LLM efficiency, reasoning capabilities, and agentic workflows, potentially accelerating AI-driven discovery and complex task execution.

RANK_REASON Multiple research papers detailing new LLM inference and training techniques released by Together AI.

Read on X — Together (inference / OSS) →

AI-generated summary · Google Gemini · from 8 sources. How we write summaries →

Together AI unveils 8 new LLM inference and training system advancements

COVERAGE [8]

X — Together (inference / OSS) TIER_1 English(EN) · togethercompute · 2026-06-30 19:54

8/ Opportunistic Expert Activation: Batch-Aware Expert Routing for Faster Decode Without Retraining (OEA)

8/ Opportunistic Expert Activation: Batch-Aware Expert Routing for Faster Decode Without Retraining (OEA) https://t.co/dw33plIoxW
X — Together (inference / OSS) TIER_1 English(EN) · togethercompute · 2026-06-30 19:54

7/ Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking

7/ Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking https://t.co/LgGqu8vl97
X — Together (inference / OSS) TIER_1 English(EN) · togethercompute · 2026-06-30 19:54

6/ When RL Meets Adaptive Speculative Training: A Unified Training-Serving System (Aurora)

6/ When RL Meets Adaptive Speculative Training: A Unified Training-Serving System (Aurora) https://t.co/fvLuHrqDbX
X — Together (inference / OSS) TIER_1 English(EN) · togethercompute · 2026-06-30 19:54

5/ V1: Unifying Generation and Self-Verification for Parallel Reasoners

5/ V1: Unifying Generation and Self-Verification for Parallel Reasoners https://t.co/X1zUsS7gY8
X — Together (inference / OSS) TIER_1 English(EN) · togethercompute · 2026-06-30 19:54

4/ Escaping the Verifier: Learning to Reason via Demonstrations (RARO)

4/ Escaping the Verifier: Learning to Reason via Demonstrations (RARO) https://t.co/gQZCEav8Nb
X — Together (inference / OSS) TIER_1 English(EN) · togethercompute · 2026-06-30 19:54

3/ Learning to Discover at Test Time (TTT-Discover)

3/ Learning to Discover at Test Time (TTT-Discover) https://t.co/pKeadv4DHl
X — Together (inference / OSS) TIER_1 English(EN) · togethercompute · 2026-06-30 19:54

2/ ThunderAgent: A Simple, Fast and Program-Aware Agentic Inference System

2/ ThunderAgent: A Simple, Fast and Program-Aware Agentic Inference System https://t.co/7I1Yf5s8B8
X — Together (inference / OSS) TIER_1 English(EN) · togethercompute · 2026-06-30 19:54

1/ DSGym: A Holistic Framework for Evaluating and Training Data Science Agents

1/ DSGym: A Holistic Framework for Evaluating and Training Data Science Agents https://t.co/jV4uMB1g48

COVERAGE [8]

RELATED ENTITIES

RELATED TOPICS