Google is gradually introducing a real-time captions feature for its voice-based assistant, Gemini Live, aimed at making AI interactions more discreet and accessible.
Highlights
- Silent AI Interaction Enabled: Gemini Live now supports real-time captions, letting users view spoken responses as text—even with volume muted.
- Hands-Free and Discreet Usage: Ideal for use in meetings, libraries, or public settings, the feature boosts flexibility and privacy in voice-based AI interaction.
- Quick Toggle Option: A new icon in Gemini Live’s interface lets users switch captions on or off without leaving the experience.
- Personalized Caption Settings: Users can adjust text size and style through a new “Caption Preferences” menu in Gemini’s settings.
- Part of a Larger Accessibility Push: The update builds on Google’s broader initiatives like Expressive Captions and TalkBack enhancements, reinforcing its accessibility-first design philosophy.
- Gradual Rollout Ongoing: The feature is arriving slowly with version 16.21.57 of the Google app and may currently be limited to beta users or select regions.
- User Feedback Influencing Development: Google appears to be gathering community feedback from early users before a full-scale release.
The update enables users to view live transcriptions of spoken responses on-screen, even when their device’s audio is muted or turned down—providing a practical option for hands-free and silent usage scenarios.
The feature was first observed by Reddit users and later confirmed by reports from Android Authority and 9to5Google. It is being rolled out with version 16.21.57 of the Google app, although availability currently appears to be limited to a subset of users.
A New Option for Silent Interactions
Previously, Gemini Live delivered its responses only as audio, so users had to turn up their device’s volume to follow them, which limited its usability in quiet or public settings.
The new caption feature removes that barrier, displaying text-based replies in real time and allowing for silent interaction in environments such as meetings, libraries, or public transport.
Once the feature is available on their device, users will see a dialogue box icon in the top-right corner of Gemini Live’s full-screen view. Tapping this icon toggles captions on or off, so users can switch between audio and text responses without leaving the interface.
Customization and Accessibility
In addition to basic on/off functionality, Google has added a “Caption Preferences” menu within Gemini’s settings. This allows users to adjust text size and style, tailoring the visual experience to their comfort and accessibility needs.
Screenshots published by Android Authority show a clean, user-friendly interface for caption customization, in line with Google’s broader accessibility efforts across platforms.
Part of a Broader Accessibility Strategy
The real-time captions update builds on Google’s ongoing work in accessibility, including initiatives such as Expressive Captions on Android and Chrome.
Expressive Captions uses AI to add emotional context and ambient cues to transcribed speech. Other recent developments include image descriptions in TalkBack and enhanced screen reader support for ChromeOS.
This Gemini Live update complements those efforts by offering a more inclusive and adaptable experience for users who rely on silent or visual-first communication methods.
Gradual Rollout Reflects Beta Testing Strategy
The rollout appears to be phased, with users in beta channels or select regions receiving access first. While Google has not made a formal announcement, activity on Reddit and other forums suggests a slow and deliberate release pattern.