OpenAI’s Operator Lets ChatGPT Use the Web for You

OpenAI is letting some users try a new ChatGPT feature that uses its artificial intelligence to operate a web browser to book trips, buy groceries, hunt for bargains, and do many other online chores.

The new tool, called Operator, is an AI agent: It relies on an AI model trained on both text and images to interpret commands and figure out how to use a web browser to execute them. OpenAI claims it has the potential to automate many day-to-day tasks and workday errands.

OpenAI’s Operator follows rival releases by both Google and Anthropic, which have demonstrated ones capable of using the web. AI agents are widely seen as the next evolutionary stage for AI following chatbots, and many companies have hopped on the hype train by touting them. In most cases, these are very limited in their abilities and simply use a language model to automate things normally done with regular software.

“AI is evolving from this tool that could answer your questions to one that is also able to take action in the world, carrying out complex, multistep workflows,” says Peter Welinder, VP of product at OpenAI. “We’ll see a lot of impact on people’s productivity—but also the quality of work that people are able to accomplish.”

OpenAI admits that giving ChatGPT access to a web browser does introduce new risks, and it says that Operator may sometimes misbehave. It says it has implemented various new safeguards and plans to extend Operator’s capabilities gradually.

Welinder and Yash Kumar, product and engineering lead for OpenAI’s Computer Using Agent, say the plan is to learn from how people use the tool. They acknowledge that the tool could make unwanted bookings or purchases but add that a lot of work has gone into ensuring that it asks before doing anything risky. “It will come back to me and ask for confirmations before taking steps that might be irreversible,” Kumar says.

Source : Wired