Brief · PulseAugur

TOOL · Together AI blog English(EN) · 2w

Deploy and run inference on any model from Hugging Face

Together AI has launched a feature that lets developers deploy and run any model from Hugging Face on its Dedicated Container Inference (DCI) infrastructure. An agent-based CLI tool called Goose automates the setup, including inference server configuration and container generation. The system aims to significantly shorten the lag between a model's release and its practical use, as demonstrated by the rapid deployment of Netflix's void-model.

IMPACT Accelerates the adoption of new AI models by drastically reducing deployment complexity and time.

Hugging Face
Netflix
Together AI
Goose
void-model