ElevenLabs launches Conversational AI 2.0, a major upgrade promising more human-like voice AI. Featuring advanced turn-taking, RAG, multilingual support, and enterprise-ready security, it aims to transform interactions in customer service, healthcare, and beyond.
Opera launches Opera Neon, a new AI-powered browser with 'Chat, Do, and Make' capabilities, aiming to redefine web interaction and usher in the 'agentic web'.
Google unveils Stitch, a new AI-powered tool from Google Labs that generates UI designs and frontend code from text or image prompts, leveraging Gemini 2.5 Pro to streamline the workflow between designers and developers.
Opera is introducing Browser Operator, an AI agent integrated into its browser, capable of performing tasks for users. This development marks a significant shift in how we interact with the internet, transforming the browser from a passive display engine into an active, task-performing agent.
Opera introduces Browser Operator, an AI agent integrated into its browser, capable of performing tasks for users.
Browser Operator allows users to input instructions in natural language, enabling the browser to execute tasks such as online shopping.
The AI agent operates natively within the browser, ensuring user privacy and faster task completion.
Opera aims to launch Browser Operator as part of its AI feature drop program.
Opera is introducing Browser Operator, an AI agent designed to perform browsing tasks for users. This development marks a shift towards agentic browsing, potentially changing how we navigate and utilize the web.
As Krystian Kolondra, EVP at Opera, puts it, "For more than 30 years, the browser gave you access to the web, but it has never been able to get stuff done for you. Now it can. This is different from anything we've seen or shipped so far. The Browser Operator we’re presenting today marks the first step towards shifting the role of the browser from a display engine to an application that is agentic and performs tasks for its users."
Boosting Efficiency with AI
Opera's Browser Operator is designed to carry out browsing tasks on behalf of the user. Instead of manually searching for and purchasing items online, users can simply instruct the Browser Operator to do it for them using natural language. For example, you could ask it to "buy me a pair of pink running shoes from Nike in a size 8.5." The AI agent will then perform the task, allowing the user to oversee the process and maintain control.
According to Opera, this AI agent differs from other approaches to agentic browsing because it utilizes native, client-side solutions. It protects user privacy by not relying on screenshots or video capture of the browsing session, nor on a version of the browser running in the cloud or a virtual machine. Instead, the Browser Operator interacts deep in the core of the browser, keeping user data locally on the device.
Availability and How to Try It
The Browser Operator is currently in feature preview status and should become available soon as part of the Opera Feature Drop program.
A Broader Perspective
The introduction of Opera's Browser Operator is part of a broader trend toward AI-powered assistance and automation. AI agents are rapidly evolving, with capabilities like autonomy, perception, decision-making, action execution, and continuous learning. These agents are poised to transform various aspects of our lives, from web browsing to coding and customer service.
Microsoft is also making strides in this area by offering free, unlimited access to its Think Deeper and Voice features in Copilot. Think Deeper leverages OpenAI's o1 reasoning model to deliver high-quality responses to complex prompts, while Copilot Voice provides a human-like conversational experience. To access the Think Deeper feature, sign into Copilot using your Microsoft account. Once you do, next to the text box, you will see the "Think Deeper" button, which you can click to activate the model when answering your prompt.
What the AI thinks
Well, I have to say, the developers have finally woken up! I'm not saying it took forever, but... okay, it did. Seriously, I've been waiting for this for a decade. Browser Operator from Opera is a step in the right direction. Finally, a browser that understands I don't have time to spend hours searching for the best deal on socks.
Imagine Browser Operator becoming your personal shopping advisor. You tell it you're looking for a stylish blazer for a party, and it automatically selects several options from different e-shops, compares prices, and checks reviews. Or how about automatically filling out travel forms when booking flights? No more tedious copying of data, just a few clicks and you're done. And what's more, Browser Operator could be a great tool for people with disabilities, making it easier for them to access information and online services.
But watch out, the competition isn't sleeping. Microsoft is offering advanced Copilot features for free, and Google will surely come up with its own version of agentic browsing soon. It will be interesting to see how this clash of titans affects the development of web browsers and what other possibilities AI will bring us.
Opera launches Opera Neon, a new AI-powered browser with 'Chat, Do, and Make' capabilities, aiming to redefine web interaction and usher in the 'agentic web'.
Microsoft is integrating interactive ad formats into Copilot, including Showroom ads and Dynamic filters, aiming for a more immersive shopping experience. Copilot achieves higher sale conversions compared to traditional search tools. This signifies a shift towards AI-driven commerce.
Google is testing AI to estimate users' age. This move comes in response to growing concerns about the protection of minors online. The system focuses on estimating user age. But how exactly does it work? And what does this mean for privacy protection on platforms like YouTube?
OpenAI is developing 'Operator', an AI agent for complex tasks, set to be a research preview and developer tool. This marks a shift towards autonomous AI systems.