How Much Traffic do you Really Need?
How Much Traffic do you Really Need?
12 Steps to Create Videos

OpenAI launches Operator, an AI agent that can operate your computer [Video]

Categories
User Experience (UX) Design

While it’s working, Operator shows a miniature browser window of its actions.

However, the technology behind Operator is still relatively new and far from perfect. The model reportedly performs best at repetitive web tasks like creating shopping lists or playlists. It struggles more with unfamiliar interfaces like tables and calendars, and does poorly with complex text editing (with a 40 percent success rate), according to OpenAI’s internal testing data.

OpenAI reported the system achieved an 87 percent success rate on the WebVoyager benchmark, which tests live sites like Amazon and Google Maps. On WebArena, which uses offline test sites for training autonomous agents, Operator’s success rate dropped to 58.1 percent. For computer operating system tasks, CUA set an apparent record of 38.1 percent success on the OSWorld benchmark, surpassing previous models but still falling short of human performance at 72.4 percent.

With this imperfect research preview, OpenAI hopes to gather user feedback …

How Desire Paths can Transform your Branding and Public Relations
How Desire Paths can Transform your Branding and Public Relations
5 Steps to Creating Successful Ads