Connects Claude to a live Chrome browser session through a WebSocket bridge and extension. Instead of dumping the entire DOM into context after every action, it sends a stripped down HTML snapshot that keeps text and interactive elements but drops the bloat. You install the extension, point it at an LLM (they recommend Gemini Flash), and invoke the browser-agent tool with a natural language prompt like "search for MCP protocol on Google." Good fallback when a site doesn't have a dedicated MCP server. Runs locally in your browser, no remote calls except to your LLM provider. The extension listens on port 9003 and executes whatever automation task you throw at it.
An MCP (Model Context Protocol) server that provides browser automation capabilities through a Chrome extension. It allows terminal-based agents like Claude Code to interact with any website through your live browser session.
Part of the Runbook AI ecosystem. Join the Discord community to provide your feedback and get involved in the development!
https://github.com/user-attachments/assets/a43fba64-bc40-4ef6-9840-e100203e2cf5
Most browser-based MCP tools (like chrome-devtools-mcp) blow up your LLM context window by sending the entire DOM after every browser action.
Runbook AI is different:
eval() or shady scripts (enforced by the Chrome extension sandbox).Add to your MCP settings configuration:
{
"mcpServers": {
"runbook-ai": {
"command": "npx",
"args": ["-y", "runbook-ai-mcp@latest"]
}
}
}
Install the Runbook AI extension from Chrome Web Store.
Enable MCP in the extension settings opened from extension side panel.
Set LLM API key, and model name, base URL. Use of Gemini 3 Flash (gemini-3-flash-preview) is recommended. Get your free API key from Google AI Studio.
By default the extension has access to all websites. If you want to limit the access, go to Chrome Extension Details, and add individual sites to Site access setting.
Open Chrome and keep the extension side panel open.
Start the MCP server (it will automatically start when invoked by your MCP client).
The server exposes a single tool:
browser-agentRun a task in Chrome browser with AI and automation capabilities.
Parameters:
prompt (string, required): The task prompt for the AI agent to executeExample:
{
"name": "browser-agent",
"arguments": {
"prompt": "Go to google.com and search for 'MCP protocol'"
}
}
# Install dependencies
npm install
# Build
npm run build
# Run in development mode
npm run dev
# Run tests
npm test
When a tool is invoked:
Contributions are welcome! Feel free to send out a PR.
therealtimex/browser-use
jae-jae/fetcher-mcp
merajmehrabi/puppeteer-mcp-server
com.thenextgennexus/playwright-mcp-server
saik0s/mcp-browser-use