Arcee AI has migrated its specialized small language models (SLMs) from AWS to Together Dedicated Endpoints, seeking lower cost, better performance, and greater operational agility. The company focuses on training efficient models under 72 billion parameters for specific tasks like coding and general text generation. Arcee AI also developed Arcee Conductor, an inference routing system that directs each query to the most suitable model, including third-party options like GPT-4.1 and Claude 3.7 Sonnet, to optimize cost and performance.
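To make the routing idea concrete, here is a minimal sketch of cost-aware model routing in Python. It is not Arcee Conductor's actual implementation, which the source does not describe. The model names, prices, capability scores, and the keyword-based difficulty heuristic are all hypothetical placeholders chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelOption:
    name: str
    cost_per_1k_tokens: float  # hypothetical price, for illustration only
    capability: int            # hypothetical 1-10 score, for illustration only


# Hypothetical catalogue: names and numbers are placeholders, not real pricing.
CATALOGUE = [
    ModelOption("small-specialized-slm", 0.05, 4),
    ModelOption("mid-size-slm", 0.20, 6),
    ModelOption("third-party-frontier-model", 2.00, 9),
]

# Assumed keywords that hint at a harder query (toy heuristic).
HARD_KEYWORDS = ("prove", "derive", "refactor", "analyze")


def estimate_difficulty(query: str) -> int:
    """Toy heuristic: longer queries and reasoning keywords score higher (max 10)."""
    score = 2 + min(len(query.split()) // 20, 4)
    if any(word in query.lower() for word in HARD_KEYWORDS):
        score += 3
    return min(score, 10)


def route(query: str, catalogue=CATALOGUE) -> ModelOption:
    """Pick the cheapest model whose capability meets the estimated difficulty."""
    needed = estimate_difficulty(query)
    capable = [m for m in catalogue if m.capability >= needed]
    if not capable:
        # Nothing is rated high enough: fall back to the most capable model.
        return max(catalogue, key=lambda m: m.capability)
    return min(capable, key=lambda m: m.cost_per_1k_tokens)


if __name__ == "__main__":
    for q in ("What is the capital of France?",
              "Prove that the sum of two even numbers is even"):
        print(q, "->", route(q).name)
```

A production router would more plausibly use a learned classifier than keyword matching, but the selection step shown here, the cheapest model that clears a capability bar, is one simple way to trade cost against quality.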
IMPACT Enables more cost-effective deployment of specialized AI models for enterprise tasks.
RANK_REASON This is a customer story about a company using a cloud provider's infrastructure for their AI models, not a release of a new frontier model or significant industry-wide event.
- Arcee AI
- Arcee Conductor
- AWS
- Claude 3.7 Sonnet
- DeepSeek-R1
- GPT-4.1
- Mark McQuade
- Qwen2.5-Instruct
- Qwen2.5-VL
- Together AI
- Together Dedicated Endpoints
AI-generated summary · Google Gemini · from 1 source.