Page Loader
Summarize
Google launches voice chat feature for Gemini AI assistant
Gemini Live is now accessible to Gemini Advanced subscribers

Google launches voice chat feature for Gemini AI assistant

Aug 13, 2024
11:47 pm

What's the story

Google has introduced a new voice chat mode named Gemini Live, for its artificial intelligence (AI) assistant, Gemini. The feature was unveiled at the 'Made By Google' event and is now accessible to Gemini Advanced subscribers. It operates in a similar manner to ChatGPT's voice chat feature, providing users with multiple voices and the ability to engage in conversational speech.

User interaction

It facilitates 'in-depth' voice chats

Gemini Live enables users to have "in-depth" voice chats with Google's generative AI-powered chatbot on their phones. The advanced speech engine ensures consistent, emotionally expressive, and realistic multi-turn dialog. Users can interrupt Gemini mid-speech to ask follow-up questions, and the chatbot will adapt to their speech patterns in real time.

Feature flexibility

It offers a 'free-flowing' conversation experience

Google describes Gemini Live as "free-flowing," allowing users to interrupt an answer mid-sentence or pause the conversation and return to it later. The feature also works in the background or when a phone is locked. This was first announced during Google's I/O developer conference earlier this year. Additionally, 10 new Gemini voices have been introduced for users to choose from.

Rollout plan

Initial rollout and future expansion

Currently, Gemini Live is being rolled out in English for Android devices. However, Google plans to extend this feature to iOS and more languages in the coming weeks. Alongside Gemini Live, Google also announced new extensions for apps like Keep, Tasks, Utilities, and YouTube Music as part of its AI assistant enhancements.

Usage scenarios

Gemini Live's practical applications and accessibility

Gemini Live can be used hands-free if desired. Google suggests this could be useful for rehearsing job interviews as Gemini Live can provide speaking tips and suggest skills to highlight when speaking with a hiring manager. The feature is exclusive to Gemini Advanced, a more sophisticated version of Gemini that's part of the Google One AI Premium Plan, priced at $20 per month.

Superior performance

Gemini Live outperforms ChatGPT's advanced voice mode

Gemini Live has an edge over ChatGPT's Advanced Voice Mode due to its superior memory. The generative AI models underpinning Live, Gemini 1.5 Pro and Gemini 1.5 Flash, have a longer-than-average "context window," meaning they can take in and reason over a lot of data before crafting a response. This feature enhances the user experience by providing more comprehensive responses based on extensive data analysis.