Docalyze

STDIOregistry active

Summary

Connects Claude to your local filesystem to read and analyze documents without sending them to external APIs. Exposes four tools: list_documents for directory browsing with glob patterns, document_info for metadata, read_document for text extraction with pagination, and visual_evaluate_document that returns page images so the AI can reason about charts and layouts directly. Handles PDFs, Excel, CSV, Word, PowerPoint, and images out of the box. The visual analysis is the standout feature here. Point it at a configurable documents root and you can ask Claude to summarize spreadsheets, extract data from forms, or explain what's in a slide deck without any manual export step.

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

Docalyze MCP Server

An MCP (Model Context Protocol) server that lets AI assistants read and visually analyze local documents — PDFs, Excel spreadsheets, CSV files, Word documents, PowerPoint presentations, and images.

No API keys required. The host AI (GitHub Copilot, Claude, etc.) does all the reasoning directly.

Supported Formats

Format	Extensions	Read	Visual
PDF	`.pdf`	✅	✅
Excel	`.xlsx`, `.xls`	✅	✅
CSV / TSV	`.csv`, `.tsv`	✅	—
JSON	`.json`	✅	—
Word	`.docx`	✅	✅
PowerPoint	`.pptx`	✅	✅
Plain text	`.txt`, `.md`	✅	—
Images	`.png`, `.jpg`, `.jpeg`, `.gif`, `.bmp`, `.tiff`, `.webp`	—	✅

Tools

Tool	Description
`list_documents`	List files under a directory, filtered by glob pattern
`document_info`	Get metadata (size, modified date, sheets) for a file
`read_document`	Extract text content from a document with pagination
`visual_evaluate_document`	Return page images inline so the AI can analyze charts, tables, and diagrams

Installation

From VS Code (recommended)

Search for docalyze in the MCP server gallery (Extensions sidebar → MCP tab) and click Install.

From PyPI

pip install docalyze-mcp-server

From npm

npx docalyze-mcp-server

This requires uv or pipx installed — the npm wrapper calls uvx to run the Python package automatically.

Manual setup

Add to your VS Code mcp.json (or settings.json):

{
  "servers": {
    "docalyze": {
      "type": "stdio",
      "command": "python",
      "args": ["-m", "docalyze_mcp_server"],
      "env": {
        "PYTHONIOENCODING": "utf-8"
      }
    }
  }
}

Or, if you installed via pip and want to use the entry point:

{
  "servers": {
    "docalyze": {
      "type": "stdio",
      "command": "docalyze-mcp-server"
    }
  }
}

Optional Dependencies

The base install handles PDF, Excel, CSV, JSON, and plain text. For additional formats:

# Word documents
pip install docalyze-mcp-server[docx]

# PowerPoint
pip install docalyze-mcp-server[pptx]

# OCR (requires Tesseract installed on your system)
pip install docalyze-mcp-server[ocr]

# Everything
pip install docalyze-mcp-server[all]

Configuration

The server reads documents from a configurable root directory. Set the DOCUMENTS_ROOT environment variable to change it:

{
  "servers": {
    "docalyze": {
      "type": "stdio",
      "command": "docalyze-mcp-server",
      "env": {
        "DOCUMENTS_ROOT": "/path/to/your/documents"
      }
    }
  }
}

If not set, it defaults to the directory containing the server script.

License

MIT

Featured

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

Docalyze MCP Server

An MCP (Model Context Protocol) server that lets AI assistants read and visually analyze local documents — PDFs, Excel spreadsheets, CSV files, Word documents, PowerPoint presentations, and images.

No API keys required. The host AI (GitHub Copilot, Claude, etc.) does all the reasoning directly.

Supported Formats

Format	Extensions	Read	Visual
PDF	`.pdf`	✅	✅
Excel	`.xlsx`, `.xls`	✅	✅
CSV / TSV	`.csv`, `.tsv`	✅	—
JSON	`.json`	✅	—
Word	`.docx`	✅	✅
PowerPoint	`.pptx`	✅	✅
Plain text	`.txt`, `.md`	✅	—
Images	`.png`, `.jpg`, `.jpeg`, `.gif`, `.bmp`, `.tiff`, `.webp`	—	✅

Tools

Tool	Description
`list_documents`	List files under a directory, filtered by glob pattern
`document_info`	Get metadata (size, modified date, sheets) for a file
`read_document`	Extract text content from a document with pagination
`visual_evaluate_document`	Return page images inline so the AI can analyze charts, tables, and diagrams

Installation

From VS Code (recommended)

Search for docalyze in the MCP server gallery (Extensions sidebar → MCP tab) and click Install.

From PyPI

pip install docalyze-mcp-server

From npm

npx docalyze-mcp-server

This requires uv or pipx installed — the npm wrapper calls uvx to run the Python package automatically.

Manual setup

Add to your VS Code mcp.json (or settings.json):

{
  "servers": {
    "docalyze": {
      "type": "stdio",
      "command": "python",
      "args": ["-m", "docalyze_mcp_server"],
      "env": {
        "PYTHONIOENCODING": "utf-8"
      }
    }
  }
}

Or, if you installed via pip and want to use the entry point:

{
  "servers": {
    "docalyze": {
      "type": "stdio",
      "command": "docalyze-mcp-server"
    }
  }
}

Optional Dependencies

The base install handles PDF, Excel, CSV, JSON, and plain text. For additional formats:

# Word documents
pip install docalyze-mcp-server[docx]

# PowerPoint
pip install docalyze-mcp-server[pptx]

# OCR (requires Tesseract installed on your system)
pip install docalyze-mcp-server[ocr]

# Everything
pip install docalyze-mcp-server[all]

Configuration

The server reads documents from a configurable root directory. Set the DOCUMENTS_ROOT environment variable to change it:

{
  "servers": {
    "docalyze": {
      "type": "stdio",
      "command": "docalyze-mcp-server",
      "env": {
        "DOCUMENTS_ROOT": "/path/to/your/documents"
      }
    }
  }
}

If not set, it defaults to the directory containing the server script.

License

MIT

Docalyze

Docalyze MCP Server

Supported Formats

Tools

Installation

From VS Code (recommended)

From PyPI

From npm

Manual setup

Optional Dependencies

Configuration

License

Docalyze

Docalyze MCP Server

Supported Formats

Tools

Installation

From VS Code (recommended)

From PyPI

From npm

Manual setup

Optional Dependencies

Configuration

License

Related Documents & Knowledge MCP Servers

Related Documents & Knowledge MCP Servers