AI-First Scraper

1STDIOregistry active

Summary

Drops three web scraping tools into Claude Desktop, Cursor, or Cline: fetch_page for single URLs, fetch_pages_batch for up to 25 in parallel, and search_web that runs queries and returns results already converted to Markdown. Strips ads and clutter so your agent gets clean text instead of raw HTML soup. Runs via uvx with zero install, calls the public ai-first-scraper and ai-first-search APIs by default, or point it at your own backend with environment variables. Useful when you need Claude to pull live web content or search results without choking on markup.

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

ai-first-scraper-mcp

Plug Claude Desktop, Cursor, or Cline straight into an ad-free web scraper + search engine. Three tools, one line of config.

What it does

Adds three tools to any MCP-compatible agent:

Tool	What it does
`fetch_page`	Fetch one URL → return clean Markdown (HTML or PDF).
`fetch_pages_batch`	Fetch up to 25 URLs in parallel → return Markdown for each.
`search_web`	Run a web search and return the top-k result pages already converted to Markdown.

No more "the model called curl and then tried to parse 80kB of ad HTML." Your agent receives clean Markdown ready to reason about.

Backed by the ai-first-scraper and ai-first-search APIs.

Install

Fastest — `uvx` (no install, runs from PyPI on demand)

// claude_desktop_config.json  /  cline_mcp_settings.json  /  ~/.cursor/mcp.json
{
  "mcpServers": {
    "ai-first-scraper": {
      "command": "uvx",
      "args": ["ai-first-scraper-mcp"]
    }
  }
}

Restart your client (Claude Desktop / Cursor / Cline). The three tools above will appear automatically.

Alternative — pip install

pip install ai-first-scraper-mcp

{
  "mcpServers": {
    "ai-first-scraper": {
      "command": "ai-first-scraper-mcp"
    }
  }
}

Where the config file lives

Client	Config path
Claude Desktop (macOS)	`~/Library/Application Support/Claude/claude_desktop_config.json`
Claude Desktop (Windows)	`%APPDATA%\Claude\claude_desktop_config.json`
Cursor	`~/.cursor/mcp.json`
Cline (VS Code)	`~/Library/Application Support/Code/User/globalStorage/saoudrizwan.claude-dev/settings/cline_mcp_settings.json`

Point at your own backend (optional)

By default this server calls the public ai-first-scraper.onrender.com and ai-first-search.onrender.com instances. If you want to self-host, set env vars in your MCP config:

{
  "mcpServers": {
    "ai-first-scraper": {
      "command": "uvx",
      "args": ["ai-first-scraper-mcp"],
      "env": {
        "SCRAPER_URL": "https://your-scraper.example.com",
        "SEARCH_URL":  "https://your-search.example.com",
        "AFS_TIMEOUT": "60"
      }
    }
  }
}

Verify it works

Open your MCP client and ask the agent:

"Use the search_web tool to find the top 3 recent articles about MCP and summarize them in 5 bullets each."

You should see the agent call search_web, get back Markdown for each result, and produce the summary without ever touching raw HTML.

Companion projects

ai-first-scraper — the per-URL Markdown cleaner this MCP server fans out to.
ai-first-search — search → scrape → markdown pipeline.
mcp-rec — record & replay any MCP server's traffic for tests and bug reports.
llm-cache-proxy — local cache for OpenAI/Anthropic API calls.
promptlocker — lockfile for prompts.
context-diff — see what blew up your Claude Code context window.
agentwatch — overlay for browser AI agents.

Develop locally

git clone https://github.com/yubinkim444/ai-first-scraper-mcp.git
cd ai-first-scraper-mcp

uv sync                    # or: pip install -e .
ai-first-scraper-mcp       # speaks MCP over stdio

To test against a local client, point its MCP config at the same command.

License

MIT © yubinkim444

Featured

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

Configuration

SCRAPER_URL

URL of the ai-first-scraper instance to use. Default: https://ai-first-scraper.onrender.com

SEARCH_URL

URL of the ai-first-search instance to use. Default: https://ai-first-search.onrender.com

AFS_TIMEOUT

Upstream HTTP timeout in seconds. Default: 45

ai-first-scraper-mcp

Plug Claude Desktop, Cursor, or Cline straight into an ad-free web scraper + search engine. Three tools, one line of config.

What it does

Adds three tools to any MCP-compatible agent:

Tool	What it does
`fetch_page`	Fetch one URL → return clean Markdown (HTML or PDF).
`fetch_pages_batch`	Fetch up to 25 URLs in parallel → return Markdown for each.
`search_web`	Run a web search and return the top-k result pages already converted to Markdown.

No more "the model called curl and then tried to parse 80kB of ad HTML." Your agent receives clean Markdown ready to reason about.

Backed by the ai-first-scraper and ai-first-search APIs.

Install

Fastest — `uvx` (no install, runs from PyPI on demand)

// claude_desktop_config.json  /  cline_mcp_settings.json  /  ~/.cursor/mcp.json
{
  "mcpServers": {
    "ai-first-scraper": {
      "command": "uvx",
      "args": ["ai-first-scraper-mcp"]
    }
  }
}

Restart your client (Claude Desktop / Cursor / Cline). The three tools above will appear automatically.

Alternative — pip install

pip install ai-first-scraper-mcp

{
  "mcpServers": {
    "ai-first-scraper": {
      "command": "ai-first-scraper-mcp"
    }
  }
}

Where the config file lives

Client	Config path
Claude Desktop (macOS)	`~/Library/Application Support/Claude/claude_desktop_config.json`
Claude Desktop (Windows)	`%APPDATA%\Claude\claude_desktop_config.json`
Cursor	`~/.cursor/mcp.json`
Cline (VS Code)	`~/Library/Application Support/Code/User/globalStorage/saoudrizwan.claude-dev/settings/cline_mcp_settings.json`

Point at your own backend (optional)

By default this server calls the public ai-first-scraper.onrender.com and ai-first-search.onrender.com instances. If you want to self-host, set env vars in your MCP config:

{
  "mcpServers": {
    "ai-first-scraper": {
      "command": "uvx",
      "args": ["ai-first-scraper-mcp"],
      "env": {
        "SCRAPER_URL": "https://your-scraper.example.com",
        "SEARCH_URL":  "https://your-search.example.com",
        "AFS_TIMEOUT": "60"
      }
    }
  }
}

Verify it works

Open your MCP client and ask the agent:

"Use the search_web tool to find the top 3 recent articles about MCP and summarize them in 5 bullets each."

You should see the agent call search_web, get back Markdown for each result, and produce the summary without ever touching raw HTML.

Companion projects

ai-first-scraper — the per-URL Markdown cleaner this MCP server fans out to.
ai-first-search — search → scrape → markdown pipeline.
mcp-rec — record & replay any MCP server's traffic for tests and bug reports.
llm-cache-proxy — local cache for OpenAI/Anthropic API calls.
promptlocker — lockfile for prompts.
context-diff — see what blew up your Claude Code context window.
agentwatch — overlay for browser AI agents.

Develop locally

git clone https://github.com/yubinkim444/ai-first-scraper-mcp.git
cd ai-first-scraper-mcp

uv sync                    # or: pip install -e .
ai-first-scraper-mcp       # speaks MCP over stdio

To test against a local client, point its MCP config at the same command.

AI-First Scraper

ai-first-scraper-mcp

What it does

Install

Fastest — `uvx` (no install, runs from PyPI on demand)

Alternative — pip install

Where the config file lives

Point at your own backend (optional)

Verify it works

Companion projects

Develop locally

License

Configuration

AI-First Scraper

ai-first-scraper-mcp

What it does

Install

Fastest — `uvx` (no install, runs from PyPI on demand)

Alternative — pip install

Where the config file lives

Point at your own backend (optional)

Verify it works

Companion projects

Develop locally

License

Configuration

Related Documents & Knowledge MCP Servers

Related Documents & Knowledge MCP Servers

AI-First Scraper

ai-first-scraper-mcp

What it does

Install

Fastest — uvx (no install, runs from PyPI on demand)

Alternative — pip install

Where the config file lives

Point at your own backend (optional)

Verify it works

Companion projects

Develop locally

License

Configuration

AI-First Scraper

ai-first-scraper-mcp

What it does

Install

Fastest — uvx (no install, runs from PyPI on demand)

Alternative — pip install

Where the config file lives

Point at your own backend (optional)

Verify it works

Companion projects

Develop locally

License

Configuration

Related Documents & Knowledge MCP Servers

Related Documents & Knowledge MCP Servers

Fastest — `uvx` (no install, runs from PyPI on demand)

Fastest — `uvx` (no install, runs from PyPI on demand)