Together AI has released a series of research papers detailing advancements in LLM inference and training systems. These include methods for optimizing Mixture-of-Experts (MoE) models with Batch-Aware Expert Routing (OEA), and memory-efficient context parallelism with Ulysses. The company also presented Aurora, a unified system for adaptive speculative training, and V1, which unifies generation and self-verification for parallel reasoners. Further innovations include RARO for learning to reason via demonstrations, TTT-Discover for AI-driven scientific discovery, ThunderAgent for program-aware agentic inference, and DSGym for evaluating and training data science agents. AI
IMPACT These advancements aim to improve LLM efficiency, reasoning capabilities, and agentic workflows, potentially accelerating AI-driven discovery and complex task execution.
RANK_REASON Multiple research papers detailing new LLM inference and training techniques released by Together AI.
Read on X — Together (inference / OSS) →
AI-generated summary · Google Gemini · from 8 sources. How we write summaries →