Octen Mcp


Summary

Wraps the Octen Extract API so Claude can pull live web pages as clean markdown, with three filters most extract tools skip: category labels (tech, health, finance), page structure flags (article, homepage, login wall, no main content), and query-driven highlights instead of full-body dumps. The main benefit is filtering upstream. When a RAG pipeline fetches 100 URLs, perhaps 20 turn out to be index pages or paywalls, and you normally spend LLM tokens just to discover they are useless. Octen flags them at fetch time via page_structure, so you can skip the embedding step for those pages entirely. Pass a query parameter and you get ranked snippets per page instead of paying to process the full content. It supports batches of up to 20 URLs, a configurable cache TTL, and optional image or video URL extraction.

octen-mcp

MCP server for Octen. Plug it into Claude, Cursor, VS Code, Windsurf, or any MCP client to give your agent live web search and URL extraction.

Core capabilities:

search / news_search: search the live web with domain, text, and time filters.
extract: turn one or more URLs into clean, LLM-ready content.

What makes Octen useful for agents is that extract returns more than page text. Each successful result also includes:

category: what the page is about
page_structure: what kind of page it is
highlights: ranked snippets when you pass a query

That lets an agent skip login walls, nav pages, and off-topic URLs before spending tokens on the full body.
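As a rough illustration of that triage step, the sketch below filters a list of extract results before any full-body processing. The result shape is an assumption: only `category`, `page_structure`, and `highlights` are named here, so the `url` key, the string form of `category`, and the "Article" value are hypothetical stand-ins. Check the Extract API reference for the real response format.

```python
from typing import Any

# Hypothetical results. Only `category` and `page_structure` come from the
# README; the `url` key and the exact label values are assumptions.
SAMPLE_RESULTS: list[dict[str, Any]] = [
    {"url": "https://example.com/earnings", "category": "Finance",
     "page_structure": {"primary": "Article"}},
    {"url": "https://example.com/login", "category": "Finance",
     "page_structure": {"primary": "No Main Content"}},
    {"url": "https://example.com/recipes", "category": "Food",
     "page_structure": {"primary": "Article"}},
]


def keep_result(result: dict[str, Any], wanted_categories: set[str]) -> bool:
    """Return True when a page looks worth reading in full."""
    structure = result.get("page_structure") or {}
    if structure.get("primary") == "No Main Content":
        return False  # login wall, empty shell, or similar non-content page
    # Drop pages whose topic label is outside the target vertical.
    return result.get("category") in wanted_categories


kept = [r["url"] for r in SAMPLE_RESULTS if keep_result(r, {"Finance"})]
print(kept)  # ['https://example.com/earnings']
```

Only the first URL survives: the second is dropped on structure, the third on category, and neither needs its body sent to a model.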

Quick start

You need an OCTEN_API_KEY from octen.ai.

For most MCP clients, the config is:

{
  "mcpServers": {
    "octen": {
      "command": "npx",
      "args": ["-y", "octen-mcp"],
      "env": {
        "OCTEN_API_KEY": "your-key-here"
      }
    }
  }
}

Common config locations:

Claude Desktop: ~/Library/Application Support/Claude/claude_desktop_config.json
Cursor: ~/.cursor/mcp.json
VS Code workspace: .vscode/mcp.json using servers instead of mcpServers
Windsurf / Cline / other clients: use the same command and env var in that client's MCP settings
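Since the only difference noted above between clients is the top-level key, a small helper can generate either variant. This is a minimal sketch: `build_mcp_config` is a hypothetical helper name, and it changes nothing beyond the key described above, so confirm against your client's documentation whether any further fields are expected.

```python
import json


def build_mcp_config(api_key: str, top_level_key: str = "mcpServers") -> str:
    """Render the octen-mcp server entry under the given top-level key."""
    server = {
        "command": "npx",
        "args": ["-y", "octen-mcp"],
        "env": {"OCTEN_API_KEY": api_key},
    }
    return json.dumps({top_level_key: {"octen": server}}, indent=2)


# Default layout, as used by Claude Desktop and Cursor.
default_config = build_mcp_config("your-key-here")

# VS Code workspace layout: same entry under "servers".
vscode_config = build_mcp_config("your-key-here", top_level_key="servers")
print(vscode_config)
```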

For Claude Code:

claude mcp add --scope user octen \
  -e OCTEN_API_KEY=your-key-here \
  -- npx -y octen-mcp

Tools

| Tool | What it does | Best for |
| --- | --- | --- |
| `search` | Search the live web with domain, text, time, and content controls | broad web search |
| `news_search` | Same engine as `search`, fixed to news | current events and timely reporting |
| `extract` | Fetch 1-20 URLs and return clean content, labels, and optional highlights | summarization, RAG, fact lookup |
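Because `extract` takes between 1 and 20 URLs per call, a pipeline with a longer list needs to split it first. A minimal sketch of that batching, where `batch_urls` is a hypothetical helper and not part of octen-mcp:

```python
from typing import Iterator

MAX_URLS_PER_CALL = 20  # upper bound stated for `extract`


def batch_urls(urls: list[str], size: int = MAX_URLS_PER_CALL) -> Iterator[list[str]]:
    """Yield consecutive slices of at most `size` URLs."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(urls), size):
        yield urls[start:start + size]


urls = [f"https://example.com/page/{i}" for i in range(45)]
batch_sizes = [len(batch) for batch in batch_urls(urls)]
print(batch_sizes)  # [20, 20, 5]
```

Each yielded batch can then be passed to one `extract` call.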

Reference docs:

Search: docs.octen.ai/api-reference/search
Extract: docs.octen.ai/api-reference/extract

Why agents like this

Most extract tools stop at "here is the page body." Octen helps one step earlier:

Skip bad pages early: page_structure.primary == "No Main Content" tells the agent it hit a login wall, empty shell, or similar non-content page.
Filter by topic early: category helps a pipeline ignore pages outside the target vertical before embedding or summarizing.
Use less context: passing a query returns highlights, so the agent can answer a specific question without loading the full page.
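One way to act on the last point is to prefer highlights whenever they are present and fall back to the body otherwise. The sketch below assumes `highlights` is a list of strings and that the page body lives under a `content` key. Both are assumptions for illustration, not the documented response shape.

```python
from typing import Any


def context_for(result: dict[str, Any], max_highlights: int = 3) -> str:
    """Pick the smallest useful text to hand to the model."""
    highlights = result.get("highlights") or []
    if highlights:
        # A query was passed, so ranked snippets are available.
        return "\n".join(highlights[:max_highlights])
    # No query: fall back to the full body (assumed `content` key).
    return result.get("content", "")


with_query = {
    "content": "A long page body. " * 200,
    "highlights": ["Snippet one.", "Snippet two."],
}
without_query = {"content": "Short body."}

print(context_for(with_query))
print(context_for(without_query))
```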

For the full decision tree and integration patterns, see docs/best-practices.md.

Example prompts

Fetch octen.ai and summarize the main product features.
Search for recent MCP news from the last week.
Fetch these URLs and only summarize the ones whose category is Finance.
Search site:docs.anthropic.com prompt caching and return only the relevant highlights.

Environment variables

| Variable | Required | Default |
| --- | --- | --- |
| `OCTEN_API_KEY` | yes | — |
| `OCTEN_API_URL` | no | `https://api.octen.ai` |
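If you script against the same variables, for example in a wrapper that launches the server, the required/default rules above could be mirrored as follows. This is a sketch of the convention only: `load_settings` is a hypothetical helper and does not reflect how octen-mcp reads its environment internally.

```python
import os
from collections.abc import Mapping

DEFAULT_API_URL = "https://api.octen.ai"  # default listed for OCTEN_API_URL


def load_settings(env: Mapping[str, str] = os.environ) -> dict[str, str]:
    """Read the two Octen variables, applying the documented default."""
    api_key = env.get("OCTEN_API_KEY", "").strip()
    if not api_key:
        raise RuntimeError("OCTEN_API_KEY is required")
    api_url = env.get("OCTEN_API_URL", "").strip() or DEFAULT_API_URL
    return {"api_key": api_key, "api_url": api_url}


settings = load_settings({"OCTEN_API_KEY": "your-key-here"})
print(settings["api_url"])  # https://api.octen.ai
```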

Local development

git clone https://github.com/Octen-Team/octen-mcp.git
cd octen-mcp
npm install
npm run build
OCTEN_API_KEY=<key> npm run inspect

More docs

Best practices for agent integration: docs/best-practices.md
Search API reference: docs.octen.ai/api-reference/search
Extract API reference: docs.octen.ai/api-reference/extract

License

MIT © Octen


Configuration

OCTEN_API_KEY (required, secret)

Octen API key. Get one at https://octen.ai (self-serve).

OCTEN_API_URL (optional)

Override the Octen API base URL. Default: https://api.octen.ai.


