OpenAI Launches Operator to Perform Tasks on User’s Behalf

Published by: Insights Desk | Released: Jan 24, 2025 | Source: DemandTalk

Highlights:

The agent is powered by a newly introduced OpenAI model called CUA, which is partially based on the company’s multimodal GPT-4o large language model.
OpenAI explained that users can take control at any point during the process. For sensitive actions, such as entering login credentials, Operator prompts users to switch to manual mode.

OpenAI has launched Operator, an AI agent that can autonomously perform tasks on behalf of users.

Meanwhile, two major competitors also announced updates to their offerings. Perplexity AI Inc., known for its popular AI search engine, introduced a similar agent for its Android app. Anthropic PBC, which already offers automation features, launched a tool to enhance citation quality in AI-generated responses.

Initially available as a research preview in the premium Pro tier of ChatGPT, OpenAI’s Operator can perform multistep tasks like ordering groceries, booking flights, and filling out forms. Users can simply provide instructions using natural language prompts.

The agent is powered by a newly introduced OpenAI model called CUA, which is partially based on the company’s multimodal GPT-4o large language model. OpenAI states that CUA integrates the LLM with “advanced reasoning through reinforcement learning.”

When users instruct Operator to perform tasks on a website, the agent navigates to the appropriate URL using its built-in browser. It can type, click, and scroll to complete the requested actions, taking regular screenshots to ensure everything is functioning correctly.

OpenAI explained that users can take control at any point during the process. For sensitive actions, such as entering login credentials, Operator prompts users to switch to manual mode. While the user is in control, the agent stops taking screenshots until the manual step is finished.
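The loop described above (act, take a screenshot, hand sensitive steps to the user) can be illustrated with a small simulation. This is only a sketch under stated assumptions: OpenAI has not published Operator's internals in this article, and every name here (Action, FakeBrowser, run_task) is hypothetical. No real browser or OpenAI API is involved; the code just records who performed each step and how many screenshots were taken.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Action:
    """One browser step. All fields are illustrative assumptions."""
    kind: str                 # "navigate", "type", "click", or "scroll"
    target: str = ""          # URL, field name, or button label
    sensitive: bool = False   # True for steps such as entering credentials


@dataclass
class FakeBrowser:
    """Hypothetical stand-in for a browser; it only records events."""
    log: List[str] = field(default_factory=list)
    screenshots: int = 0

    def perform(self, actor: str, action: Action) -> None:
        self.log.append(f"{actor}:{action.kind}:{action.target}")

    def screenshot(self) -> None:
        self.screenshots += 1


def run_task(actions: List[Action], browser: FakeBrowser) -> FakeBrowser:
    """Run each step, handing sensitive ones to the user."""
    for action in actions:
        if action.sensitive:
            # Manual mode: the user performs the step, no screenshot is taken.
            browser.perform("user", action)
            continue
        browser.perform("agent", action)
        browser.screenshot()  # check the page after each agent action
    return browser


if __name__ == "__main__":
    steps = [
        Action("navigate", "https://example.com"),
        Action("type", "password", sensitive=True),
        Action("click", "Checkout"),
    ]
    result = run_task(steps, FakeBrowser())
    print(result.log)
    print(result.screenshots)
```

In this sketch the sensitive step is logged under "user" and produces no screenshot, which mirrors the manual-mode behavior the article attributes to Operator.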

Operator includes several data protection features. Users can log it out of all accounts with a single click and opt out of having their data used for AI training. Additionally, a security system is in place to detect and block malicious websites that attempt to trick the agent into revealing sensitive information.

Operator offers customizable features. For instance, users can save a shopping list and have the agent purchase the specified items whenever it visits a particular e-commerce site. Additionally, users can set up customization options that apply universally across all websites the agent interacts with.

Looking ahead, OpenAI intends to extend Operator’s availability beyond ChatGPT Pro to other subscription tiers. The company also plans to make the agent accessible through its application programming interface (API). Future updates will include enhancements designed to improve Operator’s ability to handle more complex tasks.

“Operator is currently in an early research preview, and while it’s already capable of handling a wide range of tasks, it’s still learning, evolving and may make mistakes,” OpenAI researchers said. “Early user feedback will play a vital role in enhancing its accuracy, reliability, and safety.”

OpenAI competitor Perplexity AI has introduced its own agent, Perplexity Assistant, now available in its Android app. The assistant can automate tasks such as making e-commerce purchases, booking taxis, and more. It also features multimodal processing, enabling it to analyze content from a user’s smartphone camera and screen.

At launch, Perplexity Assistant supports actions in apps like Spotify, YouTube, and Uber, as well as email, messaging, and clock applications. The company plans to expand support to additional services in the future.

Another OpenAI rival, Anthropic, also unveiled a product update. Anthropic offers the enterprise-focused Claude LLM series via an API. A new feature, Citations, allows users to upload documents to a Claude model and have it highlight the exact sentences referenced when generating responses to prompts.
