Loka has developed a conversational AI voice agent using Amazon Nova 2 Sonic, a speech-to-speech model from AWS. This speech-to-speech approach processes audio end-to-end, capturing nuances lost in traditional text-based pipelines and significantly reducing latency. The system achieved a high score on the Big Bench Audio benchmark, outperforming competitors such as Gemini 2.5 Flash Native Audio and GPT Realtime, while also offering cost efficiencies for large-scale deployment.
AI IMPACT Demonstrates advancements in speech-to-speech models for more natural and efficient voice agent interactions.
RANK_REASON This article describes a company using an AI model to build a product, rather than the release of a new model or significant research finding from a frontier lab.
Read on AWS Machine Learning Blog →
AI-generated summary · Google Gemini · from 1 source.