Together AI has announced significant improvements to its inference capabilities, achieving a sixfold reduction in cost per turn and a p95 latency under 400 milliseconds. The company is also committed to shipping new models on a weekly basis, indicating a rapid development and deployment cycle. AI
IMPACT Accelerates the availability of more cost-effective and faster AI models for developers and researchers.
RANK_REASON The item details technical improvements and a release cadence for an AI infrastructure provider, fitting the research/infra category. [lever_c_demoted from research: ic=1 ai=0.7]
Read on X — Together (inference / OSS) →
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →