Anthropic has started rolling out a new voice mode for its Claude AI assistant, bringing real-time spoken conversations to the platform.
Highlights
- Claude gets voice mode — now supports real-time spoken conversations via the mobile app (in English).
- Dual-mode interaction: Speak to Claude and get both audible replies and on-screen summaries for better comprehension.
- Powered by Sonnet 4 model — optimized for dialogue, with smooth transitions between voice and text input.
- Five voice styles: Choose between Buttery, Airy, Mellow, Glassy, and Rounded for a more personalized voice assistant experience.
- Push-to-talk functionality: Ensures control over input, but no real-time voice interruption support (yet).
- Google Workspace integration: Claude Pro users can access Gmail, Calendar, and Docs via voice commands.
- Multimodal support: Talk to Claude about documents and images — ideal for multitasking and collaboration.
- Gradual rollout: Currently in beta, expanding to more users in the coming weeks. Free users get limited access.
- Competitive positioning: Anthropic joins Google, OpenAI, and xAI in pushing voice-first AI assistant experiences.
- Development led by Mike Krieger: With past hints of Amazon/ElevenLabs partnerships, Claude’s voice journey reflects serious innovation.
The feature, currently in beta, is being gradually introduced to users of the Claude mobile app over the coming weeks and is available in English for now.
Voice Mode: What It Offers
The new voice mode allows users to speak directly to Claude and receive audible responses, creating a more hands-free and natural interaction experience.
Alongside spoken responses, the app also presents on-screen summaries and key points from the conversation, blending audio and visual communication for better comprehension and retention.
The voice functionality is powered by Claude’s Sonnet 4 model, which has been optimized for dialogue-based interactions. Users can switch between voice and text input mid-conversation, offering more flexibility than traditional voice assistants.
Feature Highlights
Personalized Voice Options
Claude offers five unique voice personas—Buttery, Airy, Mellow, Glassy, and Rounded—each featuring distinct tones and speaking styles. These voices allow users to personalize their experience by choosing a style that best suits their preferences.
Real-Time Visual Summaries
During voice interactions, Claude displays on-screen highlights of the conversation in real time. This dual-mode interaction supports easier note-taking and helps users track important information as the discussion unfolds.
Push-to-Talk Mechanism
Voice mode uses a push-to-talk system, requiring users to tap to initiate a spoken command. While this ensures more intentional input and reduces accidental activations, it does not yet support real-time voice interruption, a feature found in some competing tools.
Google Workspace Integration
For premium users, Claude voice mode integrates with Google Workspace, enabling voice-activated access to services like Gmail, Google Calendar, and Google Docs. This feature is designed to enhance productivity, especially in professional or enterprise settings.
Document and Image Conversation Support
Users can converse with Claude about documents and images using voice mode, making it a useful tool for multitasking, collaborative reviews, or on-the-go content analysis.
Access and Availability
- The feature is currently available to beta testers and will expand to more users over the next few weeks.
- Voice interactions count toward existing usage caps, with free-tier users limited to approximately 20–30 conversations.
- Claude Pro and Enterprise subscribers receive access to extended features, including Workspace connectors and expanded context support.
Industry Context and Competitive Landscape
Anthropic’s launch of voice mode places it among other major AI players embracing voice-first experiences.
Competitors such as OpenAI’s ChatGPT, Google’s Gemini Live, and xAI’s Grok already offer similar features, signaling a broader industry trend toward more human-like, natural interaction models.
Though voice assistants are not new, the integration of AI reasoning with real-time audio is becoming increasingly important in productivity, accessibility, and multitasking scenarios. Claude’s implementation—with both spoken and visual output—reflects this evolution.
Development Background
The voice mode has been in development for several months. In a March 2025 interview with the Financial Times, Anthropic CPO Mike Krieger hinted at ongoing work involving voice technology and mentioned potential collaborations with Amazon and ElevenLabs, both known for advancements in AI and voice synthesis.
While it’s unclear how these partnerships have influenced the final product, the current rollout shows Anthropic’s intent to remain competitive in the fast-evolving AI assistant space.