CAT
/MCP
SkillsMCPMarketplacesDigestToolsAdvertise

This week in Claude

Every Monday: Claude Code, Agent SDK, MCP, and the Anthropic platform moves worth your time.

Skills by Category
Frontend DevelopmentBackend & APIsTesting & QASecurityDevOps & CI/CDGit & Pull RequestsDocumentationCode Review & QualityAI & Agent BuildingSkill Development
MCP Servers by Category
Sales & MarketingWeb & Browser AutomationDatabasesAI & LLM ToolsCloud & InfrastructureCommunication & MessagingDeveloper ToolsDesign & CreativeDocuments & KnowledgeSearch & Web Crawling
Marketplaces by Category
AI Agents & OrchestrationLLM IntegrationDevelopment ToolsFrontend & UIBackend & APIsDatabasesTesting & Code QualityDevOps & CloudSecurity & ComplianceGit & Version Control

Cross AI Tools

Discover Claude Code plugins, extensions, and tools. Automatically updated directory of Anthropic Claude AI marketplaces with development tools, productivity plugins, and integrations.

Resources

  • Browse Skills
  • Browse MCP Servers
  • Browse Marketplaces
  • Plugins Reference

Community

  • About
  • Tools
  • Feedback
  • Privacy Policy
  • Advertise

Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic

Scholar

mlintangmz2765/scholar-mcp
1 toolsauthSTDIOregistry active
Summary

This is a full-stack academic research server that connects Claude to Scopus, OpenAlex, Semantic Scholar, and Unpaywall APIs for literature search and metadata retrieval. It exposes 23 tools covering paper discovery with Boolean queries, author disambiguation with H-index metrics, citation lineage tracking, and PDF extraction. The standout feature is multimodal content handling: it can render PDF pages as PNGs for vision analysis of charts and equations, extract layout-aware text with PyMuPDF, and fall back to Sci-Hub when papers are paywalled. It also includes book search through Google Books and Library Genesis with smart caching for full-text extraction. Reach for this when you need systematic literature reviews, need to analyze figures in papers, or want to map citation networks without leaving the conversation.

CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →

Tools

Public tool metadata for what this MCP can expose to an agent.

1 tools
search_papersSearch Google Scholar for academic papers, books, theses, and citations. Returns organic results with title, authors, snippet, citation count, and PDF links.8 params

Search Google Scholar for academic papers, books, theses, and citations. Returns organic results with title, authors, snippet, citation count, and PDF links.

Parameters* required
qstring
Search query (paper title, author, topic, keywords).
hlstring
Language code (default 'en').
numinteger
Results per page (1-20, default 10).
startinteger
Result offset for pagination (multiples of num).
as_sdtstring
Search filter. '0,5' = articles excluding patents (default). '0,33' = case law. '4' = patents only.
as_yhiinteger
Latest publication year (inclusive).
as_ylointeger
Earliest publication year (inclusive). E.g. 2020.
scisbdinteger
Sort. 1 = include only abstracts; 2 = sort by date (most recent).one of 1 · 2

Scholar MCP Server

Python Version Model Context Protocol License: MIT Registry

A Model Context Protocol (MCP) server providing structured access to scientific literature databases. It serves as a unified interface for Scopus, OpenAlex, Semantic Scholar, and Unpaywall, enabling AI agents to perform systematic paper discovery, author disambiguation, citation lineage tracking, and multimodal Content extraction.

Core Capabilities

  • Unified Literature Search

    • Semantic Scholar Integration — High-relevance search and detailed metadata, including AI-generated TLDRs (requires API key).
    • Scopus Integration — Targeted metadata retrieval via advanced Boolean syntax (requires API key).
    • OpenAlex Integration — Broad search across 250M+ works with abstract reconstruction.
    • Unpaywall Resolution — DOI-to-PDF cross-referencing across global Open Access repositories.
    • Sci-Hub Fallback (⚠️ Use with Caution) — Automatic mirror resolution and parsing for paywall bypassing.
  • Book Search & Extraction

    • Google Books & Open Library — Integrated search for book metadata, editions, and descriptions without API keys.
    • Library Genesis (Libgen) — Search and extract full text from books directly using PyMuPDF. Features smart caching and token-saving strategies (TOC reading, targeted keyword searching, and page range extraction).
  • Author Identification & Metrics

    • Instant author disambiguation and ID resolution via OpenAlex autocomplete.
    • Comprehensive profiles: H-index, i10-index, institutional affiliation history, and ORCID linkage.
    • Precision metrics from Elsevier (Scopus) for verified publication counts.
  • Citation Lineage Tracking

    • Map research evolution through forward citations (citing works) and backward references (cited works).
  • Structured & Multimodal Extraction

    • Text Extraction — Layout-aware parsing of OA PDFs using PyMuPDF.
    • Vision Rendering — Page-by-page PNG rendering for LLM-based analysis of charts, tables, and equations.
    • HTML Fallbacks — Extraction from web-based research resources via BeautifulSoup.
  • Topic Mapping & Field Analysis

    • Concepts and domain hierarchy discovery to map research landscapes.
    • Batch metadata retrieval for high-throughput literature processing (up to 50 DOIs/request).
  • Access Management & Fallbacks

    • Automated detection of closed-access content with human-in-the-loop instructions for manual uploads.

Architecture

graph TD
    A[LLM Agent] -->|MCP Protocol| B(Scholar MCP Server)
    B --> C{Database Router}
    C -->|Primary| D[Scopus API]
    C -->|Fallback| E[OpenAlex API]
    C -->|DOI Resolver| F[Unpaywall API]
    C -->|Citations| P[CrossRef API]
    
    D --> G{Access Check}
    E --> G
    F --> G
    P --> G
    
    G -->|Open Access| H[PDF Buffer Download]
    G -->|Closed Access| I[Human-in-the-Loop Prompt]
    H --> J[PyMuPDF Text Extractor]
    H --> K[PyMuPDF Vision Renderer]
    J --> L[Return Context to LLM]
    K --> L
    I --> L
    B --> M{Author Router}
    M -->|Profile| N[OpenAlex Authors API]
    M -->|Metrics| O[Scopus Author API]
    N --> L
    O --> L

Installation

Quick Start (via PyPI)

The fastest way to use the server is directly via PyPI:

pip install scholar-academic-mcp

Manual Setup (for Development)

# Clone the repository
git clone https://github.com/mlintangmz2765/Scholar-MCP.git
cd Scholar-MCP

# Setup virtual environment
python -m venv venv
.\venv\Scripts\activate  # Windows
source venv/bin/activate # Unix

# Install in editable mode
pip install -e .

Environment Variables

VariableRequiredDescription
SCOPUS_API_KEYYesElsevier API key for Scopus search and author retrieval.
S2_API_KEYNoSemantic Scholar API key for TLDRs and S2 graph access.
SCIHUB_MIRRORSNoComma-separated list of active Sci-Hub mirrors for PDF fallback.
LIBGEN_MIRRORSNoComma-separated list of active Library Genesis mirrors.
SCOPUS_INST_TOKENNoInstitutional token for full abstract access via Scopus.
CONTACT_EMAILYesEmail for OpenAlex/Unpaywall polite-pool API routing.

Configuration

Claude Desktop / Cursor

Add the following to your configuration file (e.g., claude_desktop_config.json):

{
  "mcpServers": {
    "scholar-academic-mcp": {
      "command": "scholar-academic-mcp",
      "env": {
        "SCOPUS_API_KEY": "your_scopus_api_key",
        "S2_API_KEY": "your_s2_api_key",
        "SCIHUB_MIRRORS": "https://sci-hub.ru,https://sci-hub.st",
        "LIBGEN_MIRRORS": "https://libgen.la,http://libgen.li",
        "SCOPUS_INST_TOKEN": "your_optional_inst_token",
        "CONTACT_EMAIL": "your_email@domain.com"
      }
    }
  }
}

Quick Start & Examples

Once configured, your AI agent can perform complex research workflows. Below are representative examples of tool inputs and structured outputs.

1. Literature Discovery (Scopus)

Prompt: "Find recent papers about 'Transformer architectures' published after 2022 using Scopus."

Tool Call: search_papers_tool(query="TITLE-ABS-KEY(Transformer architectures) AND PUBYEAR > 2022", limit=3)

Output:

Found 3 papers via Scopus:
- [SCOPUS_ID:85184...] Attention is All You Need? A Survey of Transformer Variants
  Authors: Smith, J., Doe, A.
  Date: 2024-01-15 | DOI: 10.1016/j.artint.2023.104012

2. Multimodal Content Analysis

Prompt: "I need to see the diagram for the neural network architecture on page 3 of this URL."

Tool Call: get_full_text_visual_tool(url="https://arxiv.org/pdf/1706.03762.pdf", max_pages=3)

Output:

  • [Text] "Successfully rendered 3 pages visually..."
  • [Image] (PNG data of page 1)
  • [Image] (PNG data of page 2)
  • [Image] (PNG data of page 3 - containing the architecture diagram)

3. Research Topic Mapping

Prompt: "Help me understand the subfields and domains related to 'Generative AI'."

Tool Call: search_topics_tool(query="Generative AI")

Output:

Found 1 topics for 'Generative AI':
- Artificial Intelligence
  Hierarchy: Computer Science → Artificial Intelligence → Machine Learning
  Works: 12,450 | Citations: 450,210
  Description: A field of computer science that focuses on creating systems capable of generating...

Tools

The server registers 23 tools across 7 categories:

Paper Discovery

ToolSignatureDescription
search_papers_tool(query, limit=5, use_scopus=True, sort_by="relevance")Search papers via Scopus (Boolean syntax) or OpenAlex. Sort by cited_by_count or publication_year.
search_papers_s2_tool(query, limit=5)Search papers via Semantic Scholar. Note: strictly rate-limited to 1 request/sec.
get_paper_details_tool(paper_id)Fetch full metadata and abstract by Scopus ID, DOI, or OpenAlex ID (with automatic routing).
get_paper_details_s2_tool(paper_id)Fetch full metadata from Semantic Scholar, including AI-generated TLDRs. Accepts S2 ID or DOI.
search_titles_unpaywall_tool(query, is_oa=None)Search Unpaywall's database directly by title. Set is_oa=True for strictly OA results.
get_related_works_tool(paper_id, limit=10)Find related/similar papers using OpenAlex's bibliographic coupling.

Book Discovery & Extraction

ToolSignatureDescription
search_books_tool(query, limit=5, source="googlebooks")Search for book metadata via Google Books or Open Library.
get_book_details_tool(book_id, source="googlebooks")Fetch complete book details, descriptions, and ISBNs.
search_libgen_tool(query, limit=5)Search Library Genesis for books to retrieve their download MD5 hashes.
interact_with_book_tool(md5, action, keyword, start_page, end_page)Smart extraction from Libgen. Actions: toc (Table of Contents), search (keywords), pages (range).

Author Analytics

ToolSignatureDescription
autocomplete_authors_tool(name, limit=5)Rapidly disambiguate author names and resolve OpenAlex Author IDs.
search_authors_tool(name, institution=None, limit=5)Detailed bibliometric profiles: H-index, i10-index, ORCID, and research concepts.
search_author_by_orcid_tool(orcid)Look up an author directly by ORCID (raw or URL format).
retrieve_author_works_tool(author_id, limit=15)Chronologically sorted publications for a given OpenAlex author.
get_author_profile_scopus_tool(author_id)Fetch precise Scopus-sourced h-index, citation counts, and affiliation.
get_author_profile_s2_tool(author_id)Fetch Semantic Scholar author profile (H-index, paper count, citations).

Citation Tracking

ToolSignatureDescription
get_citations_tool(paper_id, direction="references")Retrieve forward citations or backward references via OpenAlex.

Full-Text & PDF

ToolSignatureDescription
get_full_text_tool(url, start_page=None, end_page=None)Extract text from an OA PDF or HTML page. Supports page range selection.
get_full_text_visual_tool(url, max_pages=3)Render PDF pages as images for Vision-capable LLMs.
fetch_pdf_text_unpaywall_tool(doi)All-in-one: resolve DOI via Unpaywall → download PDF → extract text.
get_scihub_link_tool(doi)Attempts to resolve a strict paywalled DOI to a free direct PDF link using Sci-Hub.
fetch_pdf_text_scihub_tool(doi)All-in-one bypass: resolve DOI via Sci-Hub → download PDF → extract text.

Citation & Writing

ToolSignatureDescription
get_bibtex_tool(doi)Generate a BibTeX entry for LaTeX via CrossRef content negotiation.
format_citation_tool(doi, style="apa")Format citation in APA, IEEE, Chicago, Harvard, Vancouver, MLA, or Turabian.

Open Access Resolution

ToolSignatureDescription
get_unpaywall_link_tool(doi)Resolve a DOI to all available OA locations via Unpaywall.

Topic Mapping & Batch Analysis

ToolSignatureDescription
search_topics_tool(query, limit=10)Browse research topics/concepts. Returns fields, domains, and publication volume.
batch_lookup_tool(dois: list[str])Batch-fetch metadata for multiple DOIs in a single call (max 50).

Technical Design & Reliability

Scholar MCP is engineered for precision and fault tolerance in high-stakes research environments, utilizing several layers of protection to ensure data integrity:

  • Strict Data Contracts (Pydantic)

    • All upstream API responses are validated against Pydantic models before being returned to the agent.
    • Ensures a predictable, type-safe interface even if upstream database schemas change.
  • Fault-Tolerant Networking (Tenacity)

    • Integrated Exponential Backoff using tenacity for transient HTTP errors (429, 5xx).
    • Configurable rate-limit awareness for Elsevier and OpenAlex "polite pool" routing.
  • Resource Safety & Concurrency

    • Context-Managed Extractors: Automatic cleanup of PDF buffers and file descriptors.
    • Isolated Concurrency: Batch operations utilize asyncio.gather with localized exception handling to prevent session-wide failures.
  • System Observability

    • Structured standard-error (stderr) logging provides execution visibility during the tool lifecycle without interfering with the MCP JSON-RPC protocol.
  • Automated Verification

    • Comprehensive test suite leveraging respx for deterministic API mocking, ensuring 100% coverage of edge cases without network externalites.

Project Structure

Scholar-MCP/
├── .github/workflows/ # GitHub Actions (CI & Releases)
├── scripts/           # Automation & Validation scripts
├── tests/             # Pytest suite (respx mocked)
├── server.py          # FastMCP tool entry point
├── api.py             # API Clients (Scopus, OpenAlex, Unpaywall, CrossRef)
├── extractor.py       # PDF/HTML Extraction & Rendering
├── models.py          # Pydantic Data Validation
├── server.json        # MCP Registry Manifest
├── pyproject.toml     # Python packaging configuration
├── requirements.txt   # Dependencies
├── VERSION            # Version tracking (v1.0.0)
├── LICENSE            # MIT License
├── README.md          # Documentation
├── .env.example       # Template for API keys
└── .gitignore         # Git exclusion rules

Troubleshooting

SymptomCauseResolution
HTTP 401 from ScopusStandard API keys lack META_ABS view access.Set SCOPUS_INST_TOKEN or use OpenAlex as fallback.
HTTP 403 on PDF downloadPublisher anti-bot protection (Cloudflare, DataDome).Provide the PDF manually to the LLM.
Empty Unpaywall resultsPaper is behind a strict paywall with no OA copies.Request the PDF from the author via ResearchGate or institutional access.
SCOPUS_API_KEY is not setMissing environment variable.Ensure .env is configured or pass via MCP client env block.

Contributing

  1. Fork the repository.
  2. Create a feature branch (git checkout -b feature/my-feature).
  3. Commit your changes (git commit -m 'feat: add new capability').
  4. Push to the branch (git push origin feature/my-feature).
  5. Open a Pull Request.

Please ensure all code follows PEP 8 conventions.

License

MIT License. See LICENSE for details.


Disclaimer: Automated querying of publisher APIs must comply with the respective Terms of Service of Elsevier, OpenAlex, and Unpaywall. Do not distribute API keys. Adhere to all applicable rate limits.

mcp-name: io.github.mlintangmz2765/scholar

Featured
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →

Configuration

SCOPUS_API_KEY*secret

Your Elsevier Scopus API Key

CONTACT_EMAIL*

Email for OpenAlex polite pool and Unpaywall identification

SCOPUS_INST_TOKENsecret

Optional Elsevier Institutional Token

Categories
Documents & Knowledge
Registryactive
Packagescholar-academic-mcp
TransportSTDIO
AuthRequired
UpdatedApr 15, 2026
View on GitHub

Related Documents & Knowledge MCP Servers

View all →
Pdf Document Mcp

csoai-org/pdf-document-mcp

pdf-document-mcp MCP server by MEOK AI Labs
Mcp Document Converter

xt765/mcp-document-converter

Convert PDF, DOCX, HTML, Markdown, and Text for AI assistant context injection.
10
Markdown Formatter

io.github.xjtlumedia/markdown-formatter

AI Answer Copier — Convert Markdown to PDF, DOCX, HTML, LaTeX, CSV, JSON, XML, XLSX, RTF, PNG
3
Better Notion

io.github.ai-aviate/better-notion

Operate Notion with a single Markdown document — read, create, and update pages in one call.
2
Notion

suekou/mcp-notion-server

Notion MCP Server enables LLMs to access Notion workspaces with optional Markdown conversion to save tokens.
892
Docx

meterlong/mcp-doc

A powerful Word document processing service based on FastMCP, enabling AI assistants to create, edit, and manage docx files with full formatting support. Preserves original styles when editing content. 基于FastMCP的强大Word文档处理服务,使AI助手能够创建、编辑和管理docx文件,支持完整的格式设置功能。在编辑内容时能够保留原始样式和格式,实现精确的文档操作。
185