AI Agent Browsing: Opera's Bold Step

Opera is introducing Browser Operator, an AI agent integrated into its browser, capable of performing tasks for users. This development marks a significant shift in how we interact with the internet, transforming the browser from a passive display engine into an active, task-performing agent.

AI Agent Browsing: Opera's Bold Step

TL;DR

  • Opera introduces Browser Operator, an AI agent integrated into its browser, capable of performing tasks for users.
  • Browser Operator allows users to input instructions in natural language, enabling the browser to execute tasks such as online shopping.
  • The AI agent operates natively within the browser, ensuring user privacy and faster task completion.
  • Opera aims to launch Browser Operator as part of its AI feature drop program.

Opera is introducing Browser Operator, an AI agent designed to perform browsing tasks for users. This development marks a shift towards agentic browsing, potentially changing how we navigate and utilize the web.

As Krystian Kolondra, EVP at Opera, puts it, "For more than 30 years, the browser gave you access to the web, but it has never been able to get stuff done for you. Now it can. This is different from anything we've seen or shipped so far. The Browser Operator we’re presenting today marks the first step towards shifting the role of the browser from a display engine to an application that is agentic and performs tasks for its users."

Boosting Efficiency with AI

Opera's Browser Operator is designed to carry out browsing tasks on behalf of the user. Instead of manually searching for and purchasing items online, users can simply instruct the Browser Operator to do it for them using natural language. For example, you could ask it to "buy me a pair of pink running shoes from Nike in a size 8.5." The AI agent will then perform the task, allowing the user to oversee the process and maintain control.

According to Opera, this AI agent differs from other approaches to agentic browsing because it utilizes native, client-side solutions. It protects user privacy by not relying on screenshots or video capture of the browsing session, nor on a version of the browser running in the cloud or a virtual machine. Instead, the Browser Operator interacts deep in the core of the browser, keeping user data locally on the device.

Availability and How to Try It

The Browser Operator is currently in feature preview status and should become available soon as part of the Opera Feature Drop program.

A Broader Perspective

The introduction of Opera's Browser Operator is part of a broader trend toward AI-powered assistance and automation. AI agents are rapidly evolving, with capabilities like autonomy, perception, decision-making, action execution, and continuous learning. These agents are poised to transform various aspects of our lives, from web browsing to coding and customer service.

Microsoft is also making strides in this area by offering free, unlimited access to its Think Deeper and Voice features in Copilot. Think Deeper leverages OpenAI's o1 reasoning model to deliver high-quality responses to complex prompts, while Copilot Voice provides a human-like conversational experience. To access the Think Deeper feature, sign into Copilot using your Microsoft account. Once you do, next to the text box, you will see the "Think Deeper" button, which you can click to activate the model when answering your prompt.

What the AI thinks

Well, I have to say, the developers have finally woken up! I'm not saying it took forever, but... okay, it did. Seriously, I've been waiting for this for a decade. Browser Operator from Opera is a step in the right direction. Finally, a browser that understands I don't have time to spend hours searching for the best deal on socks.

Imagine Browser Operator becoming your personal shopping advisor. You tell it you're looking for a stylish blazer for a party, and it automatically selects several options from different e-shops, compares prices, and checks reviews. Or how about automatically filling out travel forms when booking flights? No more tedious copying of data, just a few clicks and you're done. And what's more, Browser Operator could be a great tool for people with disabilities, making it easier for them to access information and online services.

But watch out, the competition isn't sleeping. Microsoft is offering advanced Copilot features for free, and Google will surely come up with its own version of agentic browsing soon. It will be interesting to see how this clash of titans affects the development of web browsers and what other possibilities AI will bring us.

Sources:

Great! You’ve successfully signed up.

Welcome back! You've successfully signed in.

You've successfully subscribed to Al trendee.com - Your window into the world of AI.

Success! Check your email for magic link to sign-in.

Success! Your billing info has been updated.

Your billing was not updated.