Browser Agent

3 · STDIO · registry active

Summary

Connects Claude to a live Chrome browser session through a WebSocket bridge and a Chrome extension. Instead of dumping the entire DOM into context after every action, it sends a stripped-down HTML snapshot that keeps text and interactive elements but drops everything else. You install the extension, point it at an LLM (the project recommends Gemini 3 Flash), and invoke the browser-agent tool with a natural-language prompt like "search for MCP protocol on Google." It is a good fallback when a site doesn't have a dedicated MCP server. Automation runs locally in your browser, with no remote calls except to your LLM provider. The MCP server listens on port 9003 for the extension's WebSocket connection, and the extension executes the task you send.

Runbook AI MCP Server

An MCP (Model Context Protocol) server that provides browser automation capabilities through a Chrome extension. It allows terminal-based agents like Claude Code to interact with any website through your live browser session.

Part of the Runbook AI ecosystem. Join the Discord community to provide your feedback and get involved in the development!

https://github.com/user-attachments/assets/a43fba64-bc40-4ef6-9840-e100203e2cf5

Why Runbook AI?

Most browser-based MCP tools (like chrome-devtools-mcp) blow up your LLM context window by sending the entire DOM after every browser action.

Runbook AI is different:

Optimized Context: It generates a highly simplified version of the HTML. It strips the junk but keeps essential text and interaction elements. It’s condensed, fast, and won’t eat your tokens.
The Ultimate Catch-all: If a site doesn't have a dedicated MCP server (for example Expedia, LinkedIn, or internal tools), Runbook AI can still operate it through your browser.
Privacy First: It runs entirely in your browser. No remote calls except to your chosen LLM provider. No eval() or shady scripts (enforced by the Chrome extension sandbox).
Efficient Navigation: The simplified HTML goes beyond the viewport, making scrolling and multi-page tasks much more efficient.
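
To make the "Optimized Context" idea concrete, here is a minimal sketch of one way a page tree could be reduced to text plus interactive elements. It is not the extension's actual algorithm: the `SketchNode` type, the tag and attribute lists, and the `simplify` function are all assumptions made for illustration.

```typescript
// Hypothetical node shape; the real extension works on the live DOM.
type SketchNode =
  | { kind: "text"; text: string }
  | { kind: "element"; tag: string; attrs: Record<string, string>; children: SketchNode[] };

// Assumed lists, chosen for illustration only.
const DROPPED_TAGS = new Set(["script", "style", "svg", "noscript", "iframe"]);
const INTERACTIVE_TAGS = new Set(["a", "button", "input", "select", "textarea"]);
const KEPT_ATTRS = ["href", "type", "name", "value", "placeholder", "aria-label"];

function simplify(node: SketchNode): string {
  if (node.kind === "text") {
    // Collapse whitespace so layout-only text costs nothing.
    return node.text.replace(/\s+/g, " ").trim();
  }
  // Drop subtrees that carry no readable or actionable content.
  if (DROPPED_TAGS.has(node.tag)) return "";
  const inner = node.children
    .map(simplify)
    .filter((part) => part.length > 0)
    .join(" ");
  // Non-interactive wrappers (div, span, ...) are flattened to their content.
  if (!INTERACTIVE_TAGS.has(node.tag)) return inner;
  // Interactive elements keep their tag and a short allowlist of attributes.
  const attrs = KEPT_ATTRS
    .filter((name) => name in node.attrs)
    .map((name) => ` ${name}="${node.attrs[name]}"`)
    .join("");
  return `<${node.tag}${attrs}>${inner}</${node.tag}>`;
}

// Example input: a wrapper div with styling, text, a link, and a script.
const page: SketchNode = {
  kind: "element",
  tag: "div",
  attrs: { class: "hero hero--wide" },
  children: [
    { kind: "element", tag: "style", attrs: {}, children: [{ kind: "text", text: ".hero{}" }] },
    { kind: "text", text: "  Hello   world " },
    {
      kind: "element",
      tag: "a",
      attrs: { href: "/docs", class: "btn" },
      children: [{ kind: "text", text: "Docs" }],
    },
    { kind: "element", tag: "script", attrs: {}, children: [{ kind: "text", text: "track()" }] },
  ],
};

const simplified = simplify(page);
console.log(simplified);
```

The wrapper, its class names, the stylesheet, and the script all disappear, while the visible text and the link target survive.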

Installation

MCP Server

Add to your MCP settings configuration:

{
  "mcpServers": {
    "runbook-ai": {
      "command": "npx",
      "args": ["-y", "runbook-ai-mcp@latest"]
    }
  }
}
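
If your settings file already lists other servers, the entry has to be merged in rather than pasted over the existing content. A minimal sketch, assuming the configuration is plain JSON with a top-level `mcpServers` key as shown above; `addRunbookServer` and the `other-server` entry are hypothetical names used only for this example.

```typescript
interface McpServerEntry {
  command: string;
  args: string[];
}

interface McpConfig {
  mcpServers: Record<string, McpServerEntry>;
}

// The entry from the configuration snippet above.
const RUNBOOK_ENTRY: McpServerEntry = {
  command: "npx",
  args: ["-y", "runbook-ai-mcp@latest"],
};

// Returns a new config object; the input is left unmodified.
function addRunbookServer(existing: Partial<McpConfig>): McpConfig {
  return {
    ...existing,
    mcpServers: { ...(existing.mcpServers ?? {}), "runbook-ai": RUNBOOK_ENTRY },
  };
}

// Hypothetical pre-existing configuration with one unrelated server.
const before: McpConfig = {
  mcpServers: { "other-server": { command: "node", args: ["server.js"] } },
};

const merged = addRunbookServer(before);
console.log(JSON.stringify(merged, null, 2));
```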

Chrome Extension

Install the Runbook AI extension from the Chrome Web Store.

Enable MCP in the extension settings, which open from the extension side panel.

Set the LLM API key, model name, and base URL. Gemini 3 Flash (gemini-3-flash-preview) is recommended. You can get a free API key from Google AI Studio.

By default, the extension has access to all websites. To limit access, go to the extension's Details page in Chrome and add individual sites under the Site access setting.

Usage

Open Chrome and keep the extension side panel open.

Start the MCP server (it will automatically start when invoked by your MCP client).

Tool Schema

The server exposes a single tool:

`browser-agent`

Runs a task in the Chrome browser using AI and automation capabilities.

Parameters:

prompt (string, required): The task prompt for the AI agent to execute

Example:

{
  "name": "browser-agent",
  "arguments": {
    "prompt": "Go to google.com and search for 'MCP protocol'"
  }
}
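
On the wire, an MCP client wraps the name and arguments above in a JSON-RPC 2.0 `tools/call` request and, over stdio, writes it as a single newline-terminated line. A minimal sketch of that envelope follows; the `buildBrowserAgentCall` helper and the empty-prompt check are assumptions for illustration, and MCP clients normally build this message for you.

```typescript
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: { prompt: string } };
}

// Hypothetical helper: builds the request for the browser-agent tool.
function buildBrowserAgentCall(id: number, prompt: string): ToolCallRequest {
  // The tool schema marks `prompt` as required, so reject empty input early.
  if (prompt.trim().length === 0) {
    throw new Error("prompt is required");
  }
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name: "browser-agent", arguments: { prompt } },
  };
}

const request = buildBrowserAgentCall(1, "Go to google.com and search for 'MCP protocol'");

// The stdio transport sends one JSON message per line.
const wireLine = JSON.stringify(request) + "\n";
console.log(wireLine);
```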

Development

# Install dependencies
npm install

# Build
npm run build

# Run in development mode
npm run dev

# Run tests
npm test

Architecture

MCP Server: Communicates with MCP clients via stdio
WebSocket Server: Listens for Chrome extension connections on port 9003
Chrome Extension: Executes browser automation tasks

When a tool is invoked:

1. The MCP client sends a request to the MCP server via stdio.
2. The MCP server forwards the request to the Chrome extension via WebSocket.
3. The extension executes the task and returns the result.
4. The result is sent back to the MCP client.
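
The request flow above implies that the server must match each result coming back from the extension to the tool call that triggered it. A minimal sketch of that correlation, with the WebSocket replaced by a plain callback so the example runs on its own: the message shapes `TaskRequest` and `TaskResult` and the `BridgeSketch` class are hypothetical, since the actual wire format between the server and the extension is not described here.

```typescript
// Hypothetical message shapes for the server <-> extension link.
interface TaskRequest {
  id: number;
  prompt: string;
}

interface TaskResult {
  id: number;
  result: string;
}

class BridgeSketch {
  private nextId = 1;
  private pending = new Map<number, (result: string) => void>();
  private sendToExtension: (msg: TaskRequest) => void;

  // `sendToExtension` stands in for writing to the WebSocket on port 9003.
  constructor(sendToExtension: (msg: TaskRequest) => void) {
    this.sendToExtension = sendToExtension;
  }

  // Step 2: forward the tool call and remember who is waiting for it.
  forward(prompt: string, onResult: (result: string) => void): number {
    const id = this.nextId++;
    this.pending.set(id, onResult);
    this.sendToExtension({ id, prompt });
    return id;
  }

  // Steps 3 and 4: route the extension's reply back to the waiting caller.
  handleExtensionMessage(msg: TaskResult): boolean {
    const callback = this.pending.get(msg.id);
    if (!callback) return false; // unknown or already answered request
    this.pending.delete(msg.id);
    callback(msg.result);
    return true;
  }
}

// In-memory stand-ins for the extension and the MCP client.
const sentToExtension: TaskRequest[] = [];
const deliveredResults: string[] = [];

const bridge = new BridgeSketch((msg) => {
  sentToExtension.push(msg);
});

bridge.forward("Go to google.com and search for 'MCP protocol'", (result) => {
  deliveredResults.push(result);
});

// Simulate the extension finishing the task.
const firstDelivery = bridge.handleExtensionMessage({ id: sentToExtension[0].id, result: "done" });
// A duplicate reply for the same id is ignored.
const secondDelivery = bridge.handleExtensionMessage({ id: sentToExtension[0].id, result: "done" });
console.log(deliveredResults, firstDelivery, secondDelivery);
```

Keying pending calls by id lets several tool invocations be in flight at once without their results getting mixed up.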

Contributing

Contributions are welcome! Feel free to open a PR.
