Microsoft has expanded access to its AI-powered computer vision tool, Copilot Vision, making it available to all users of the Microsoft Edge browser.
Highlights
Previously restricted to Copilot Pro subscribers, the feature is now open to a broader audience, aiming to enhance browsing experiences through real-time visual analysis.
Originally introduced in December 2024, Copilot Vision is designed to interpret webpage content visually and assist users with tasks such as summarizing information, identifying product details, and answering follow-up questions.
The tool integrates directly into the Edge sidebar and supports both voice and text-based prompts, allowing users to engage naturally with the AI assistant.
The update was announced by Microsoft AI CEO Mustafa Suleyman, who described Copilot Vision as capable of “thinking out loud” with users as they browse. By analyzing on-screen content in real time, the assistant removes the need for copying text or crafting detailed prompts manually.
Capabilities
Copilot Vision functions as an embedded assistant within Microsoft Edge. It uses computer vision and natural language understanding to respond to on-screen information. Users can interact with the tool by clicking the microphone icon in the Edge Copilot sidebar or typing their prompts.
Use cases include:
- Summarizing product reviews or long-form content
- Identifying visual elements on shopping or travel websites
- Answering context-specific questions about the content currently visible on screen
To use the feature, users must opt in via a dedicated link. Once enabled, Copilot Vision appears as a floating panel within the browser.
It is entirely user-activated and does not operate in the background or monitor browser activity passively.
Website Compatibility
At launch, Copilot Vision is optimized for a select set of websites, including:
- Wikipedia
- Amazon
- Target
- OpenTable
- Wayfair
- Williams Sonoma
- Food & Wine
- Tripadvisor
- Geoguessr
- The Verge
These sites were chosen to showcase the assistant’s capabilities in real-world browsing contexts, such as travel planning, shopping, and content summarization.
Privacy and Data Usage
Microsoft has emphasized its commitment to user privacy in the rollout of Copilot Vision. The feature operates on an opt-in basis and requires explicit user permission before analyzing screen content.
No user data, browsing activity, or session inputs are stored during use. However, Copilot’s responses are logged internally to help improve performance and safety systems.
Hardware and Platform Considerations
While the tool is now available to all Edge users, performance may vary depending on hardware capabilities. Users with older or less powerful devices might experience slower responses or limited functionality.
In addition to Edge, Microsoft is exploring broader platform integration. Internal testing of “Xbox Copilot” suggests that visual assistance may expand beyond web browsers in future updates.
Microsoft’s AI Strategy
The release of Copilot Vision reflects Microsoft’s ongoing strategy to make AI more accessible and context-aware. The tool has drawn comparisons to Google’s Gemini Live, particularly for its ability to analyze and respond to visual input in real time.