Skip to content

lemanschik/puppeteer-harness

 
 

Repository files navigation

Browser Harness

Browser Harness ♞

Connect an LLM directly to your real browser with a thin, editable CDP harness. For browser tasks where you need complete freedom.

One websocket to Chrome, nothing between. The agent writes what's missing during execution. The harness improves itself every run.

  ● agent: wants to upload a file
  │
  ● agent-workspace/agent_helpers.js → helper missing
  │
  ● agent writes it                         agent_helpers.js
  │                                                       + custom helper
  ✓ file uploaded

You will never use the browser again.

Setup prompt

Paste into Claude Code or Codex:

Set up https://github.com/browser-use/browser-harness for me.

Read `install.md` and follow the steps to install browser-harness and connect it to my browser.

The agent will open chrome://inspect/#remote-debugging. Tick the checkbox so the agent can connect to your browser:

Remote debugging setup

Click Allow when the per-attach popup appears (Chrome 144+):

Allow remote debugging popup

See agent-workspace/domain-skills/ for example tasks.

Free Browser Use Cloud browsers

Stealth, sub-agents, or headless deployment.
Browser Use Cloud free tier: 3 concurrent browsers, proxies, captcha solving, and more. No card required.

Architecture (~2k lines of modern Javascript)

  • src/browser_harness/ — core daemon and connection management
  • src/browser_use/hard-ported browser-use framework (Agent, DOM, Controller)
  • helpers.jsPuppeteer-powered browser control primitives
  • agent-workspace/agent_helpers.js — helper code the agent edits
  • agent-workspace/domain-skills/ — reusable site-specific skills the agent edits

Features

  • Puppeteer Backend: Reliable interaction via the world's most popular automation library.
  • Autonomous Agent: Integrated Agent loop that observes, thinks, and acts using vision and DOM data.
  • Smart DOM Extraction: Extracts only interactive and visible elements to minimize LLM context.
  • Action Controller: High-level actions like search, scroll, and upload_file.
  • Multi-Tab Support: Manage and switch between multiple browser sessions seamlessly.

The Bitter Lesson of Agent Harnesses · Web Agents That Actually Learn

About

Browser Harness | Self-healing harness that enables LLMs to complete any task.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • JavaScript 100.0%