Google upgrades Gemini Live for enhanced user interaction
What's the story
Google is poised to enhance its Gemini Live platform, making user conversations more lively and engaging.
The tech giant announced the development in an email to select users today.
The update, powered by an unnamed "latest model," will allow Gemini Live to understand different languages, dialects, and accents better during a conversation.
It will also help with translation requirements.
Multimodal API
Gemini 2.0: A new era of communication
Along with Gemini 2.0, Google had also launched a Multimodal Live API for developers.
This groundbreaking capability can take text, audio, and video input and provide text and audio output.
The power of this API will likely be leveraged in the new avatar of Gemini Live, making it even more powerful and useful.
Upcoming additions
New features and data storage in Gemini Live
The email from Google also emphasized on upcoming features for the Gemini app, such as screen sharing and live video streaming capabilities. These enhancements were previously demonstrated with Astra.
As part of this improved user experience, Gemini will now store audio, video, and screen shares in the Gemini Apps Activity section.
This is a shift from the current practice where only transcripts of Live chats are stored and processed.
Privacy assurance
Gemini Live's privacy policy and data storage
Google's current privacy support, updated in December 2024, reads "Live voice and audio data is not saved to Google servers at this time. We'll be transparent about any changes."
The email explaining the updates hasn't been sent to all users yet. However, it promises that user data will be treated according to the Gemini Apps Privacy Notice and stored in the Gemini Apps Activity section if enabled by the user.