There are two ways companies can create AI agents: APIs and “Computer Use” Silicon Valley is all the rage with API based AI agents, because that’s how tech companies are built. For most of the rest of the world, that isn’t the case, and Computer Use offers a much more intuitive way to interact with AI. Computer Use operates by clicking on a desktop screen “visually”. The benefit is that you can talk/type to the AI and watch it perform its work just like a human colleague. No programming required. Sounds great, so why hasn’t this been more common? Computer Use isn’t new (Anthropic first introduced the term in 2024), but it has historically been challenged in the following ways: 1) Basic mistakes - Early on, if you would ask the AI to scroll down a webpage it would sometimes get stuck in an “infinite scroll”. Now you can give the AI very general requests and it will figure out on its own where to click (one of the prompts in the video below is “add Apex Solutions as a new company with the lead details from the spreadsheet”) 2) AI agents vs copilots - In the past the AI could only do one prompt command at a time. Now, with tools like Cowork and Manus AI, you can have the AI perform an end to end task like updating a CRM record after a sales call. 3) Expensive - It can cost upwards of $10 to perform a multi-step task like gathering leads off of Facebook using a top of the line model like Claude Opus. Now, with services like StaffAI that offer “model routing” you can have AI agents operate a desktop at 1/1000 the cost and at the same accuracy of Opus. Computer Use is ready for prime time to solve many every day office tasks that involve operating a desktop. If you are a business leader of a non-tech company, ask your team about Computer Use and you may find your first AI agent that your company will actually use.

Computer use is a game changer for the customer if you can get token rate down. But I would rather put the tech burden on the tech vendor like StaffAI than use APIs which are easier on the vendor but shifts the burden to the customer - especially for those who don't even know what API stands for

The interesting thing about Computer Use is that it flips the traditional AI adoption curve. Most non technical teams don’t think in API's or workflows. They think in screens, tabs, spreadsheets, CRMs and email inboxes. That’s why 'watching the AI operate the desktop' is such a breakthrough moment for many companies. The interaction model finally matches how people already work.

AI agents are transforming industries by boosting efficiency and insights. Looking forward to seeing the innovative solutions they bring.

Like
Reply

Computer Use is good for Personal Agent

Like
Reply
See more comments

To view or add a comment, sign in

Explore content categories