Brief

last 24h

[2/2] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · Together AI blog English(EN) · 2w

How Together AI built the world’s fastest speech-to-text stack

Together AI has developed a highly efficient speech-to-text system, significantly outperforming existing models in speed. Their approach addresses the unique challenges of audio data processing, which is substantially larger than text and requires extensive preprocessing. By optimizing the entire data path, from CPU preprocessing to GPU execution, Together AI achieved record-low latency and high throughput for both streaming and offline transcription tasks. AI

IMPACT Sets new SOTA for speech-to-text latency and throughput, potentially lowering costs for AI applications requiring audio processing.
RESEARCH · AWS Machine Learning Blog English(EN) · 1mo

Cost-effective multilingual audio transcription at scale with Parakeet-TDT and AWS Batch

NVIDIA has released Parakeet-TDT-0.6B-v3, an open-source multilingual audio transcription model capable of processing 25 European languages. The model, deployed on AWS Batch with GPU instances, achieves high inference speeds by predicting text tokens and durations simultaneously, enabling transcription at a significantly reduced cost. This solution architecture is designed to be cost-effective and scalable, processing audio files uploaded to Amazon S3 and utilizing EC2 Spot Instances for further savings. AI

IMPACT Offers a cost-effective solution for large-scale multilingual audio transcription, potentially lowering barriers for data processing and AI training.

Brief

How Together AI built the world’s fastest speech-to-text stack

Cost-effective multilingual audio transcription at scale with Parakeet-TDT and AWS Batch