Mcp Context Server

8authSTDIO, HTTPregistry active

Summary

Provides persistent multimodal context storage that multiple LLM agents can share through thread-based scoping. You get 13 tools including store_context, search_context, and update_context for managing text and images with metadata filtering, tag-based organization, and date range queries. The real power is in the optional search capabilities: semantic search with embeddings from Ollama, OpenAI, or HuggingFace, full-text search with stemming and boolean queries, or hybrid search combining both with reciprocal rank fusion. Includes automatic LLM-based summarization for search results and cross-encoder reranking for precision. Runs on SQLite by default or PostgreSQL for high-concurrency deployments. Reach for this when you need agents to build and query shared context across conversations without losing state.

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

MCP Context Server

A high-performance Model Context Protocol (MCP) server providing persistent multimodal context storage for LLM agents. Built with FastMCP, this server enables seamless context sharing across multiple agents working on the same task through thread-based scoping.

Key Features

Multimodal Context Storage: Store and retrieve both text and images
Thread-Based Scoping: Agents working on the same task share context through thread IDs
Flexible Metadata Filtering: Store custom structured data with any JSON-serializable fields and filter using 16 powerful operators
Date Range Filtering: Filter context entries by creation timestamp using ISO 8601 format
Tag-Based Organization: Efficient context retrieval with normalized, indexed tags
Summary Generation: Optional automatic LLM-based summarization returned alongside truncated text_content in all search tool results for better agent context efficiency (enabled by default with Ollama)
Full-Text Search: Optional linguistic search with stemming, ranking, boolean queries (FTS5/tsvector), and cross-encoder reranking
Semantic Search: Optional vector similarity search for meaning-based retrieval with cross-encoder reranking
Hybrid Search: Optional combined FTS + semantic search using Reciprocal Rank Fusion (RRF) with cross-encoder reranking
Cross-Encoder Reranking: Automatic result refinement using FlashRank cross-encoder models for improved search precision (enabled by default)
Multiple Database Backends: Choose between SQLite (default, zero-config) or PostgreSQL (high-concurrency, production-grade)
High Performance: WAL mode (SQLite) / MVCC (PostgreSQL), strategic indexing, and async operations
MCP Standard Compliance: Works with Claude Code, LangGraph, and any MCP-compatible client
Production Ready: Comprehensive test coverage, type safety, and robust error handling

Connecting to Your AI Assistant

The fastest way to connect the MCP Context Server to Claude Code is the one-command Docker bootstrap.

For step-by-step instructions, prerequisites, troubleshooting, and update/uninstall commands, see the Connecting to Your AI Assistant Guide.

Environment Configuration

The server is fully configured via environment variables, supporting core settings, transport, authentication, embedding providers, summary generation, search features, database tuning, and more. Variables can be set in your MCP client configuration, in a .env file, or directly in the shell.

For the complete reference of all environment variables with types, defaults, constraints, and descriptions, see the Environment Variables Reference.

Summary Generation

Summary generation automatically creates concise LLM-based summaries for each stored context entry. Summaries are returned in the summary field of all search tool results alongside truncated text_content, providing dense, informative summaries that help agents determine relevance without fetching full entries.

For detailed instructions including all providers (Ollama, OpenAI, Anthropic), model selection, and custom prompt configuration, see the Summary Generation Guide.

Semantic Search

For detailed instructions on enabling optional semantic search with multiple embedding providers (Ollama, OpenAI, Azure, HuggingFace, Voyage), see the Semantic Search Guide.

Full-Text Search

For full-text search with linguistic processing, stemming, ranking, and boolean queries, see the Full-Text Search Guide.

Hybrid Search

For combined FTS + semantic search using Reciprocal Rank Fusion (RRF), see the Hybrid Search Guide.

Metadata Filtering

For comprehensive metadata filtering including 16 operators, nested JSON paths, and performance optimization, see the Metadata Guide.

Database Backends

The server supports multiple database backends, selectable via the STORAGE_BACKEND environment variable. SQLite (default) provides zero-configuration local storage perfect for single-user deployments. PostgreSQL offers high-performance capabilities with 10x+ write throughput for multi-user and high-traffic deployments.

For detailed configuration instructions including PostgreSQL setup with Docker, Supabase integration, connection methods, and troubleshooting, see the Database Backends Guide.

API Reference

The MCP Context Server exposes 13 MCP tools for context management:

Core Operations: store_context, search_context, get_context_by_ids, delete_context, update_context, list_threads, get_statistics

Search Tools: semantic_search_context, fts_search_context, hybrid_search_context

Batch Operations: store_context_batch, update_context_batch, delete_context_batch

For complete tool documentation including parameters, return values, filtering options, and examples, see the API Reference.

Docker Deployment

For production deployments with HTTP transport and container orchestration, Docker Compose configurations are available for SQLite, PostgreSQL, and external PostgreSQL (Supabase). See the Docker Deployment Guide for setup instructions and client connection details.

Kubernetes Deployment

For Kubernetes deployments, a Helm chart is provided with configurable values for different environments. See the Helm Deployment Guide for installation instructions, or the Kubernetes Deployment Guide for general Kubernetes concepts.

Authentication

For HTTP transport deployments requiring authentication, see the Authentication Guide for bearer token configuration.

Getting Help

Bug reports: Report a bug
Feature requests: Suggest a feature
Documentation issues: Report a docs issue
Questions: Ask a question

Featured

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

Configuration

LOG_LEVEL

Log level

STORAGE_BACKEND

Storage backend type: sqlite (default) or postgresql

MAX_IMAGE_SIZE_MB

Maximum individual image size in megabytes

MAX_TOTAL_SIZE_MB

Maximum total request size in megabytes

DB_PATH

Custom database file location path

POOL_MAX_READERS

Maximum number of concurrent read connections in the pool

POOL_MAX_WRITERS

Maximum number of concurrent write connections in the pool

POOL_CONNECTION_TIMEOUT_S

Connection timeout in seconds

POOL_IDLE_TIMEOUT_S

Idle connection timeout in seconds

POOL_HEALTH_CHECK_INTERVAL_S

Connection health check interval in seconds

RETRY_MAX_RETRIES

Maximum number of retry attempts for failed operations

RETRY_BASE_DELAY_S

Base delay in seconds between retry attempts

RETRY_MAX_DELAY_S

Maximum delay in seconds between retry attempts

RETRY_JITTER

Enable random jitter in retry delays

RETRY_BACKOFF_FACTOR

Exponential backoff multiplication factor for retries

SQLITE_FOREIGN_KEYS

Enable SQLite foreign key constraints

SQLITE_JOURNAL_MODE

SQLite journal mode (e.g., WAL, DELETE)

SQLITE_SYNCHRONOUS

SQLite synchronous mode (e.g., NORMAL, FULL, OFF)

SQLITE_TEMP_STORE

SQLite temporary storage location (e.g., MEMORY, FILE)

SQLITE_MMAP_SIZE

SQLite memory-mapped I/O size in bytes

SQLITE_CACHE_SIZE

SQLite cache size (negative value for KB, positive for pages)

SQLITE_PAGE_SIZE

SQLite page size in bytes

SQLITE_WAL_AUTOCHECKPOINT

SQLite WAL autocheckpoint threshold in pages

SQLITE_BUSY_TIMEOUT_MS

SQLite busy timeout in milliseconds

Mcp Context Server

MCP Context Server

Key Features

Connecting to Your AI Assistant

Environment Configuration

Summary Generation

Semantic Search

Full-Text Search

Hybrid Search

Metadata Filtering

Database Backends

API Reference

Docker Deployment

Kubernetes Deployment

Authentication

Getting Help

Configuration

Mcp Context Server

MCP Context Server

Key Features

Connecting to Your AI Assistant

Environment Configuration

Summary Generation

Semantic Search

Full-Text Search

Hybrid Search

Metadata Filtering

Database Backends

API Reference

Docker Deployment

Kubernetes Deployment

Authentication

Getting Help

Configuration

Related AI & LLM Tools MCP Servers

Related AI & LLM Tools MCP Servers