Gemini Live
PulseAugur coverage of Gemini Live — every cluster mentioning Gemini Live across labs, papers, and developer communities, ranked by signal.
5 天有情绪数据
Gemini Live to offer selectable AI models for users
Google is testing multiple Gemini Live AI models with varied capabilities. This suggests a future where users might be able to select different models for Gemini Live, potentially optimizing for speed, thoughtfulness, or specific features like location awareness.
Gemini Live's interactive UI is in experimental phase
Google is actively testing an interactive UI for Gemini Live that responds visually to user input. This indicates a focus on enhancing user engagement and a current experimental stage for this feature.
Gemini Live voice replication capabilities may face legal challenges
A lawsuit has been filed against Google alleging misuse of voice recordings to train AI models, including those powering Gemini Live. This could lead to potential restrictions or changes in how Gemini Live replicates voices.
-
AI cuts film 'Bitcoin' post-production from 40 years to 6 months
Director Doug Liman and his production company 30 Ninjas have significantly accelerated the post-production process for their film 'Bitcoin.' By leveraging Google's AI tools, specifically Veo and Gemini Live, they reduc…
-
Google tests interactive Gemini Live UI with responsive interface
Google is testing a new interactive user interface for its Gemini Live feature. This updated experience allows Gemini to respond visually to user interactions, creating a more dynamic and engaging interface. The rollout…
-
Google sued for using voices to train AI models
A group of journalists, podcasters, and audiobook narrators have filed a lawsuit against Google in Illinois federal court. They allege that Google misused thousands of hours of their voice recordings without permission …
-
Google tests hidden Gemini Live AI models with varied capabilities
Google appears to be testing at least seven new AI models for its Gemini Live voice assistant, as revealed by code within the Google app. These models, some with codenames like "Capybara" and "Nitrogen," offer varied ca…
-
AI model leaderboard ranks top performers in coding, video, and more
Bindu Reddy compiled a list of top-performing AI models across various domains as of May. The compilation includes leading models for coding, factual search, video generation, image creation, voice synthesis, and low-co…
-
Mira Murati's Thinking Machines ships interactive AI model
Thinking Machines Lab, founded by former OpenAI CTO Mira Murati, has unveiled its first AI model, focusing on "interaction models" designed for real-time collaboration across voice, video, and text. Unlike current AI th…
-
Google DeepMind enhances Gemini audio models for natural voice interactions and translation
Google DeepMind has released upgraded Gemini 2.5 audio models, enhancing capabilities for both live voice agents and text-to-speech generation. The Gemini 2.5 Flash Native Audio model now offers improved function callin…
-
Google DeepMind launches Gemini 3.1 Flash TTS, Live, and Lite models
Google DeepMind has unveiled a suite of Gemini 3.1 Flash models, including Flash TTS for advanced text-to-speech, Flash Live for real-time dialogue, and Flash-Lite for cost-efficient, high-volume workloads. These models…