What Is Operator?

On January 23, 2025, OpenAI officially launched Operator for ChatGPT Pro subscribers in the United States. Operator is a browser-based AI agent that can navigate websites, click buttons, fill in forms, and complete multi-step online tasks on behalf of users — all within a sandboxed Chrome browser that OpenAI controls server-side.

Unlike earlier AI tools that only generated text instructions for humans to follow, Operator actually executes tasks. You type a goal — "Order my usual groceries from Instacart" or "Book a table for two at any Italian restaurant near me on Friday at 8pm" — and Operator opens a browser, navigates to the relevant site, logs in using credentials you provide, and completes the booking. The model powering Operator is a new variant called Computer-Using Agent (CUA), which was trained specifically on browser interaction data.

How CUA Works

The Computer-Using Agent model is distinct from GPT-4o. CUA is trained using a combination of reinforcement learning from human feedback (RLHF) and novel reinforcement signals derived from successful task completion rates. The model perceives the browser through screenshots taken at high frequency — essentially seeing the web the same way a human does, by looking at rendered pixels rather than reading underlying HTML.

This screenshot-based approach means Operator can work with any website, including JavaScript-heavy single-page applications that are notoriously difficult to automate with traditional scraping tools. CUA then issues keyboard and mouse commands to interact with what it sees, with a feedback loop that lets it recover when pages load unexpectedly or pop-ups appear.

OpenAI published benchmark results showing CUA achieves 58.1% on WebArena (a suite of 812 real-world web tasks) and 87% on WebVoyager — both significantly ahead of any previously published browser agent. For context, the prior best on WebArena was around 36%, achieved by Google's research agent in late 2024.

Safety Guardrails

Given the obvious risks of an AI that can click buttons and submit forms, OpenAI built several layers of protection into Operator. The system always asks for explicit confirmation before completing irreversible actions — purchases, form submissions with personal data, account changes. It refuses to perform tasks involving sensitive account credentials unless users explicitly store them through OpenAI's encrypted credential vault.

Operator also has a "takeover" mode where users can grab control of the browser at any point mid-task, correct an error, and hand control back to the agent. This hybrid human-AI execution model is a practical acknowledgement that current agents still make mistakes on complex, ambiguous tasks.

Initial Use Cases and Limitations

At launch, OpenAI highlighted partnerships with DoorDash, Instacart, Uber, OpenTable, and several travel booking platforms that had pre-authorised Operator to interact with their systems. These partnerships guarantee smoother operation on those sites and, importantly, confirmed compliance with the sites' terms of service — a legal grey area that purely automated bots often violate.

Limitations at launch included: inability to handle CAPTCHAs reliably, difficulty with two-factor authentication flows that require real-time phone access, and occasional "hallucination" of clicks on buttons that don't exist on the page. OpenAI was upfront about these constraints and positioned Operator as a "research preview" even for Pro subscribers.

Pricing and Availability

Operator was initially exclusive to ChatGPT Pro subscribers paying $200/month in the US. OpenAI announced a phased global rollout throughout Q1 2025, with Plus subscribers ($20/month) getting access to a more limited version with fewer autonomous capabilities and more frequent confirmation prompts.

What This Means for Indian Businesses

Operator signals the beginning of the agentic AI era — where AI doesn't just advise but acts. For Indian SMEs, this means repetitive browser-based work (supplier price comparisons, booking travel, filing routine online forms) could soon be delegated entirely to AI. As Operator or equivalent tools become available at lower price points, the 40 hours a month many business owners spend on administrative web tasks could be reclaimed. Watch this space — the open-source equivalent will arrive on Indian servers within the year.