Last updated on January 26, 2025
Open AI’s Operator AI agent removed one of hurdle towards the path of Artificial General Intelligence. Are we close to AGI? Let’s check it out.
The Operator is powered by a new model called CUA (Computer Using Agent). The CUA is trained to inspect and use graphical user interface such as button, scroll etc. Open AI explained it detects the error and has the capability to correct it. This is available for pro users. Like in browsers we can open multiple tabs, similar way we can open multiple operators in different tabs and can perform different different tasks.
It is currently available for United States pro users. It’s been expected to be launched in other places within coming weeks.
TechNical aspect of Operator AI
Operator AI Agent uses GPT 4o LLM. It has its own browser that performs the tasks for the user. This uses keyboard and mouse on it own. The interface is much similar to ChatGPT. A user can do all kind of daily tasks by giving instructions to this AI Agent. You can book a cab on Uber, buy groceries or as general as booking a table at the nearby restaurant. It’s been said that it is one of the first kind of AI Agent that actually is believed to possess PHD Level intelligence.
This AI Agent will be having some nominal charges for usage. There are many such agents in pipeline will be launched with in some weeks or months. This agent is so smart that you can upload a handwritten list of grocery or items which you want to buy and it will go to the store website and check for the items. If the item is available it will buy them if it is not available it will return to the user and ask them for the further instructions.

Mechanism behind Operator AI Agent
The AI Agent go to the page of the website where it is shopping for the grocery for example. It takes the screenshot of the webpage after every prompt and does the need by inspecting the screenshot. It actually creates several steps. Every step it takes up the screenshot and does the required job.
It goes like performing the steps, taking the snapshot and achieving the step mentioned. It is in infinite loop till the task is completed.
Find out more in the below video:
A research preview of Operator, an agent that can use its own browser to perform tasks for you. pic.twitter.com/wkBBDIlVqj
— OpenAI (@OpenAI) January 23, 2025
We will post updates regarding the coming agents in the future.
The arrival of Super Agents
Artificial Intelligence experts and architects were speculating something big is coming. PHD level intelligence based agents were in waiting list. The super agents are expected to be a breakthrough in the generative artificial intelligence to replace humans workforce.