HubLensLLMbrowser-use/browser-harness
browser-use

browser-harness

AI🌱 NEW PROJECT BOOST#LLM#Automation#Browser Automation#Python#CDP
View on GitHub
80

// summary

Browser Harness is a lightweight, self-healing automation tool that provides LLMs with direct control over browser tasks via the Chrome DevTools Protocol. The agent dynamically updates its own helper functions mid-task to handle missing capabilities without relying on rigid frameworks or pre-defined recipes. Users can contribute by sharing agent-generated domain skills that capture successful interaction patterns for specific websites.

// technical analysis

Browser Harness is a minimalist, self-healing automation framework designed to provide LLMs with direct, unconstrained control over browser tasks via the Chrome DevTools Protocol (CDP). By eliminating complex abstractions and rigid frameworks, it allows the agent to dynamically modify its own helper functions mid-task, effectively learning and adapting to new requirements on the fly. This design philosophy prioritizes flexibility and agent autonomy, ensuring that the system can handle complex browser interactions without being limited by pre-defined recipes or rails.

// key highlights

01
Features a self-healing architecture where the agent can write or modify its own helper functions during execution to overcome missing capabilities.
02
Operates as a thin wrapper directly over the Chrome DevTools Protocol, ensuring minimal latency and maximum control by removing unnecessary middleware.
03
Utilizes an agent-driven skill acquisition model where the system automatically generates and saves domain-specific skills based on successful task completion.
04
Provides a lightweight codebase of approximately 592 lines of Python, making it highly transparent and easy to audit or extend.
05
Offers integration with free remote browser services that include built-in proxy support and captcha solving for stealth and deployment needs.
06
Encourages a collaborative ecosystem where users contribute agent-generated domain skills to help the community handle specific site edge cases.

// use cases

01
Autonomous browser task execution with self-healing capabilities
02
Agent-driven development of reusable domain-specific browser skills
03
Remote browser deployment with integrated proxy and captcha support

// getting started

To begin, follow the instructions in install.md to bootstrap the browser and connect the repository to your local environment. Use the provided setup prompt with an LLM like Claude Code or Codex to initialize the agent and verify the connection. Once configured, you can explore the domain-skills directory to see existing task implementations and use SKILL.md to guide your daily interactions.