PulseAugur
EN
LIVE 00:07:24

Together AI unveils 8 new LLM inference and training system advancements

Together AI has released a series of research papers detailing advancements in LLM inference and training systems. These include methods for optimizing Mixture-of-Experts (MoE) models with Batch-Aware Expert Routing (OEA), and memory-efficient context parallelism with Ulysses. The company also presented Aurora, a unified system for adaptive speculative training, and V1, which unifies generation and self-verification for parallel reasoners. Further innovations include RARO for learning to reason via demonstrations, TTT-Discover for AI-driven scientific discovery, ThunderAgent for program-aware agentic inference, and DSGym for evaluating and training data science agents. AI

IMPACT These advancements aim to improve LLM efficiency, reasoning capabilities, and agentic workflows, potentially accelerating AI-driven discovery and complex task execution.

RANK_REASON Multiple research papers detailing new LLM inference and training techniques released by Together AI.

Read on X — Together (inference / OSS) →

AI-generated summary · Google Gemini · from 8 sources. How we write summaries →

Together AI unveils 8 new LLM inference and training system advancements

COVERAGE [8]

  1. X — Together (inference / OSS) TIER_1 English(EN) · togethercompute ·

    8/ Opportunistic Expert Activation: Batch-Aware Expert Routing for Faster Decode Without Retraining (OEA)

    8/ Opportunistic Expert Activation: Batch-Aware Expert Routing for Faster Decode Without Retraining (OEA) https://t.co/dw33plIoxW

  2. X — Together (inference / OSS) TIER_1 English(EN) · togethercompute ·

    7/ Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking

    7/ Untied Ulysses: Memory-Efficient Context Parallelism via Headwise Chunking https://t.co/LgGqu8vl97

  3. X — Together (inference / OSS) TIER_1 English(EN) · togethercompute ·

    6/ When RL Meets Adaptive Speculative Training: A Unified Training-Serving System (Aurora)

    6/ When RL Meets Adaptive Speculative Training: A Unified Training-Serving System (Aurora) https://t.co/fvLuHrqDbX

  4. X — Together (inference / OSS) TIER_1 English(EN) · togethercompute ·

    5/ V1: Unifying Generation and Self-Verification for Parallel Reasoners

    5/ V1: Unifying Generation and Self-Verification for Parallel Reasoners https://t.co/X1zUsS7gY8

  5. X — Together (inference / OSS) TIER_1 English(EN) · togethercompute ·

    4/ Escaping the Verifier: Learning to Reason via Demonstrations (RARO)

    4/ Escaping the Verifier: Learning to Reason via Demonstrations (RARO) https://t.co/gQZCEav8Nb

  6. X — Together (inference / OSS) TIER_1 English(EN) · togethercompute ·

    3/ Learning to Discover at Test Time (TTT-Discover)

    3/ Learning to Discover at Test Time (TTT-Discover) https://t.co/pKeadv4DHl

  7. X — Together (inference / OSS) TIER_1 English(EN) · togethercompute ·

    2/ ThunderAgent: A Simple, Fast and Program-Aware Agentic Inference System

    2/ ThunderAgent: A Simple, Fast and Program-Aware Agentic Inference System https://t.co/7I1Yf5s8B8

  8. X — Together (inference / OSS) TIER_1 English(EN) · togethercompute ·

    1/ DSGym: A Holistic Framework for Evaluating and Training Data Science Agents

    1/ DSGym: A Holistic Framework for Evaluating and Training Data Science Agents https://t.co/jV4uMB1g48