Keyphrases Mcp

1STDIOregistry active

Summary

This server gives Claude access to BERT-based keyphrase extraction when you need more accuracy than asking an LLM to pull out key terms. It exposes a single tool that reads local files from allowable directories and returns ranked keyphrases without sending the full content to the LLM. Under the hood it runs KeyBERT with spaCy preprocessing and paraphrase-multilingual-MiniLM-L12-v2 embeddings, using MMR to keep results diverse. Reach for this when you're processing documents where bidirectional context matters or you want to save tokens by extracting semantic tags locally before feeding them into your workflow. You can specify stop words and control how many phrases come back.

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

🔤 Keyphrases-MCP

Empowering LLMs with authentic keyphrase Extraction

Built with the following tools and technologies:

Overview

This Keyphrases MCP Server is a natural language interface designed for agentic applications to extract keyphrasess from provided text. It integrates seamlessly with MCP (Model Content Protocol) clients, enabling AI-driven workflows to extract keyphrases more accurately and with higher relevance using the BERT machine learning model. It works directly with your local files in the allowed directories saving the context tokens for the LLM. The application ensures secure document processing by exposing only extracted keyphrases to the MCP client, not the original file content.

Using this MCP Server, you can ask the following questions:

"Extract 7 keyphrases from the file. [ABSOLUTE_FILE_PATH]"
"Extract 3 keyphrases from the given file ignoring the stop words. Stop words: former, due, amount, [OTHER_STOP_WORDS]. File: [ABSOLUTE_FILE_PATH]"

Keyphrases help users quickly grasp the main topics and themes of a document without reading it in full and enable the following applications:

tags or metadata for documents, improving organization and discoverability in digital libraries
emerging trends, sentiment, identified from customer reviews, social media, or news articles
features or inputs for other tasks, such as text classification, clustering

Reasoning for keyphrases-mcp

Autoregressive LLM models such as in Claude or ChatGPT process text sequentially, which—not only limits their ability to fully contextualize keyphrases across the entire document—but also suffers from context degradation as the input length increases, causing earlier keyphrases to receive diluted attention.

Bidirectional models like BERT, by considering both left and right context and maintaining more consistent attention across the sequence, generally extract existing keyphrases from texts more accurately and with higher relevance especially when no domain-specific fine-tuning is applied.

However, as autoregressive models adopt longer context windows and techniques such as input chunking, their performance in keyphrase extraction is improving, narrowing the gap with BERT. And domain-specific fine-tuning can make autoregressive LLM model to outperform the BERT solution.

This MCP server combines BERT for keyphrase extraction with an autoregressive LLM for text generation or refinement, enabling seamless text processing.

How it works

The server uses a KeyBERT framework for the multi-step extraction pipeline combining spaCy NLP preprocessing with BERT embeddings:

Candidate Generation: KeyphraseCountVectorizer identifies meaningful keyphrase candidates using spaCy's en_core_web_trf model and discarding stop words
Semantic Encoding: Candidates and document are embedded using paraphrase-multilingual-MiniLM-L12-v2 sentence transformer
Relevance Ranking: KeyBERT calculates cosine similarity between candidate keyphrase and document embeddings
Diversity Selection: Maximal Marginal Relevance (MMR) ensures diverse, non-redundant keyphrases
Final Output: Top N most relevant and diverse keyphrases are selected and sorted alphabetically

See configuration document for details.

Documentation

🚀 Integration - How to integrate server with your MCP client or LLM
🔧 Configuration - Environment variables and settings
🛠️ MCP Tools - All available tools
🪵 Development and testing - Guide on how to contribute to the project

License

This project is licensed under the MIT License.

Contact

For questions or support, reach out via GitHub Issues.

Featured

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

Keyphrases Mcp

🔤 Keyphrases-MCP

Overview

Reasoning for keyphrases-mcp

How it works

Documentation

License

Contact

Keyphrases Mcp

🔤 Keyphrases-MCP

Overview

Reasoning for keyphrases-mcp

How it works

Documentation

License

Contact

Related Search & Web Crawling MCP Servers

Related Search & Web Crawling MCP Servers