PulseAugur

Arcee AI moves to Together Endpoints for cost-efficient SLMs

Arcee AI has migrated its specialized small language models (SLMs) from AWS to Together Dedicated Endpoints, seeking lower costs, better performance, and greater operational agility. The company trains efficient models of under 72 billion parameters for tasks such as coding and general text generation. Arcee AI also developed Arcee Conductor, an inference routing system that directs each query to the most suitable model, including third-party options such as GPT-4.1 and Claude 3.7 Sonnet, to balance cost and performance.

IMPACT Enables more cost-effective deployment of specialized AI models for enterprise tasks.

RANK_REASON This is a customer story about a company running its AI models on a cloud provider's infrastructure, not the release of a new frontier model or a significant industry-wide event.

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. Together AI blog · TIER_1 · English (EN)

    From AWS to Together Dedicated Endpoints: Arcee AI's journey to greater inference flexibility