Together AI has released benchmarks demonstrating the performance of its inference stack on NVIDIA's Blackwell hardware, showing a 31% increase in transactions per second compared to other open-source engines. This performance boost is attributed to custom kernels optimized for Blackwell's Tensor Cores. The company's coding agents, which are used by Cursor, run on this infrastructure, and Together AI has also introduced AgentPerf, a new benchmark for evaluating AI agent performance.
IMPACT Demonstrates hardware optimization for AI agent infrastructure, potentially improving real-time coding agent performance.
RANK_REASON The cluster details benchmarks and performance metrics for AI inference infrastructure, which falls under research.
AI-generated summary · Google Gemini · from 2 sources.