CAT
/MCP
SkillsMCPMarketplacesDigestToolsAdvertise

This week in Claude

Every Monday: Claude Code, Agent SDK, MCP, and the Anthropic platform moves worth your time.

Skills by Category
Frontend DevelopmentBackend & APIsTesting & QASecurityDevOps & CI/CDGit & Pull RequestsDocumentationCode Review & QualityAI & Agent BuildingSkill Development
MCP Servers by Category
Sales & MarketingWeb & Browser AutomationDatabasesAI & LLM ToolsCloud & InfrastructureCommunication & MessagingDeveloper ToolsDesign & CreativeDocuments & KnowledgeSearch & Web Crawling
Marketplaces by Category
AI Agents & OrchestrationLLM IntegrationDevelopment ToolsFrontend & UIBackend & APIsDatabasesTesting & Code QualityDevOps & CloudSecurity & ComplianceGit & Version Control

Cross AI Tools

Discover Claude Code plugins, extensions, and tools. Automatically updated directory of Anthropic Claude AI marketplaces with development tools, productivity plugins, and integrations.

Resources

  • Browse Skills
  • Browse MCP Servers
  • Browse Marketplaces
  • Plugins Reference

Community

  • About
  • Tools
  • Feedback
  • Privacy Policy
  • Advertise

Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic

Thoth

ahmedeid1/thoth
1HTTPregistry active
Summary

Connects Claude to your Thoth systematic literature review workspace in read-only mode. Exposes five tools: list your reviews with their critic and citation faithfulness scores, pull the full markdown draft of a completed review, retrieve the per-claim citation audit report that flags unsupported references, and browse discovered papers with their screening status. The citation audit is the centerpiece here. Every claim in a Thoth review gets verified against its source PDF before you see it, and this MCP lets Claude surface those verdicts directly. Useful when you're iterating on research questions across multiple reviews or need an AI to help you spot patterns in citation failures without opening the web UI.

CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →
Thoth — sacred ibis logo

Thoth

Agentic systematic literature reviews — with every citation checked against the source.

Named for Thoth, ancient Egypt's ibis-headed god of writing and scribes.

Live demo Public evals MCP Registry Tests App release Deploy cost License: MIT

Try the live demo · See a sample review · Public eval dashboard · Connect via MCP

Browsing a completed Thoth review — draft, critic score, and per-claim citation audit

What is Thoth?

Systematic literature reviews are slow to write — and when you ask an LLM to write one, it confidently invents citations and statistics that aren't in any paper.

Thoth does both halves and checks its own work. Give it a research question and it discovers relevant papers, reads them, drafts an evidence-grounded review — then runs a verification pass (cite_check) that compares every cited claim against the source paper and flags anything unsupported before you read the draft. The result is a review with a critic score, a citation-faithfulness percentage, and a per-claim audit you can trust.

It runs as a polished web app, a public eval dashboard, and an authenticated MCP server your AI assistant can call directly.

See it work

Claude.ai catches 6 fabricated citations in a real draft — using Thoth's audit:

Claude.ai connected to Thoth via MCP, using get_citation_audit to identify 6 unsupported claims

Connected to Thoth via the official MCP Registry, Claude calls get_citation_audit on one deliberately-weak review (faithfulness 0.13 for that single review) and identifies all 6 unsupported claims — every one citing the same paper, with invented percentages that aren't in the source. This is cite_check doing its job: it's a single-review audit sample, not the golden-set aggregate (see /evals).

Every claim, scored against its source — the /showcase review (no login needed). The figures on this card (critic 4.2/5, faithfulness 75%, 8/8 citations checked, 2 unsupported) are this one review's scores — a worked example, not the aggregate:

A completed Thoth review: critic 4.2/5, citation faithfulness 75%, 8/8 citations checked with 2 unsupported — scores for this single sample review

Evaluated in public — /evals tracks citation recall / precision / faithfulness / coverage over an 18-question versioned golden set (7 of 18 populated at this commit), regenerated in CI and published with the last-run date, so a regression is a public, falsifiable signal:

Thoth's public eval dashboard — citation recall, precision, faithfulness, and coverage per golden question

You approve every step — three human-in-the-loop gates (review plan → review discovered papers → approve included papers); nothing runs unattended:

Thoth's three human-in-the-loop approval gates: review plan, review discovered papers, approve included papers

Key features

  • 🔎 cite_check — verifiable citations. Every [paper_id] in the draft is scored against the cited paper and labelled supported / unsupported / unclear, so the LLM can't quietly hallucinate a citation. On the public golden set, the citations it does surface are accurate — citation precision 97%, recall 74% — and the verdict report is published per claim, not summarised away. This is the core differentiator: the citations are measured, not asserted.
  • 🌐 Outbound web search (v2 — under active evaluation). An outbound discoverer → fetcher → screener path is wired across OpenAlex, arXiv, and Exa: it fetches open-access PDFs, OCRs them, and screens each against your plan, so you can run uploaded-only, hybrid, or fully autonomous discovery. The discovery and screening axes are v2 and still being calibrated — they're tracked openly on /evals (both currently at 0%) rather than shipped as a silent claim.
  • 🔌 Authenticated, registered MCP server. OAuth 2.1 + PKCE + Dynamic Client Registration via Clerk, SHA-256 audit logging, rate limits — listed in the official MCP Registry. Most public MCP servers ship with no auth; this one doesn't.
  • 📊 Public eval dashboard. Recall / precision / faithfulness / coverage over a versioned golden set, regenerated in CI and stamped with the last-run date, rendered at /evals — an eval regression is a public signal, not a hidden one.
  • 💸 6 LLM providers, $0/mo by default. Swap providers with one env var; the Mistral free tier runs the whole thing, and the entire stack deploys on free tiers for $0/mo.

🚀 Quickstart

Try it now (nothing to install):

  • Open the live demo → and build a review, or browse a finished one →.

Connect it to your AI assistant — paste this into claude.ai (Pro/Max), Claude Desktop, Cursor, or any MCP client (OAuth runs in your browser; no token to copy):

https://thoth-slr.vercel.app/api/mcp/mcp
Read-only MCP tools (scoped to your account)
  • list_reviews — your reviews with critic + faithfulness scores
  • get_review_draft — the markdown draft of a completed review
  • get_citation_audit — the per-claim cite_check verdict report
  • list_discovered_papers (v2) — papers the discoverer surfaced, with fetch + screening status
  • get_search_queries (v2) — the queries the discoverer generated + per-provider errors

Full reference: docs/mcp/tools.md · auth + audit model: docs/mcp/security.md

Adding Thoth as a custom MCP connector in claude.ai — paste the URL, OAuth via Clerk + Dynamic Client Registration
Adding Thoth as a custom connector in claude.ai — OAuth runs in your browser (Clerk + DCR), no token to copy.

Run it locally:

git clone https://github.com/ahmedEid1/thoth.git && cd thoth
cp .env.example .env        # Clerk + Trigger.dev keys + MISTRAL_API_KEY
docker compose up -d        # postgres, minio, langfuse
pnpm install && pnpm prisma migrate dev
pnpm dev                    # Next.js on :3000
pnpm dev:trigger            # Trigger.dev worker (separate terminal)

Full setup, the agent pipeline, and the v2 flow: docs/architecture.md.

Proof

Live appthoth-slr.vercel.app (Clerk sign-in) · sample review at /showcase
Public evals/evals — citation precision 97%, recall 74% on a versioned 18-question golden set (7 of 18 populated at this commit; faithfulness 38% / coverage 32% tracked in the open as the set fills out; discovery/screening v2 under calibration). Regenerated in CI, published with the last-run date — a regression is a public signal.
MCP Registryio.github.ahmedEid1/thoth — status: active
Tests676 unit/integration + 22 live e2e against the deployed instance (MCP transport, real-browser, authenticated walkthroughs, full agent runs) — all green; tsc + lint clean
Audit logEvery MCP call recorded with a SHA-256 input hash; no raw input stored
Deploy cost$0/mo — Vercel + Neon + Cloudflare R2 + Langfuse + Trigger.dev, all free tiers (self-host option)

For engineers

Thoth is a LangGraph StateGraph driven by a Trigger.dev worker, with durable human-in-the-loop gates, a per-run cost cap, and exactly-once gate delivery. Next.js 16

  • TypeScript (strict), Postgres + Prisma, Clerk auth (web + OAuth 2.1 for MCP), S3-compatible storage, Mistral OCR, Langfuse tracing.
  • Architecture — the agent pipeline, full stack, v2 flow, tests
  • LLM providers — the 6-provider matrix + resilience knobs
  • MCP tools · MCP security
  • Security & privacy — data inventory, jurisdictions, deletion paths
  • Self-host — one VM on Oracle Cloud Always Free
  • Changelog · Releasing

Credits

Ibis icon by Delapouite under CC BY 3.0, via game-icons.net.

License

MIT © 2026 Ahmed Hobeishy

Featured
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →
Categories
AI & LLM Tools
Registryactive
TransportHTTP
UpdatedMay 24, 2026
View on GitHub

Related AI & LLM Tools MCP Servers

View all →
SkillFM LLM Cost Optimizer

io.github.ericm1018/skillfm-llm-cost-optimizer-openai-anthropic-usage

LLM cost optimizer for OpenAI, Anthropic, token usage, BYOK, and SkillFM Beacon audits.
Llm Orchestration Agent

io.github.mikerawsonnz/llm-orchestration-agent

Run a prompt through a LangChain (system + human) chain over Gemini on Vertex AI; optional LangSmith
Authenticated Llm Agent

io.github.mikerawsonnz/authenticated-llm-agent

JWT-gated LLM gateway: authenticate (bcrypt/JWT), then run a LangChain-on-Vertex Gemini completion.
Copilot Memory MCP

labforgedev/copilot-memory-mcp

Persistent semantic memory for AI agents using local ChromaDB vector search. No cloud required.
1
Agent Prompt Injection Firewall Mcp

csoai-org/agent-prompt-injection-firewall-mcp

The WAF for agents. Pattern-based + heuristic firewall scans prompts, RAG documents, tool argume...
Authenticated Multi Llm Agent

io.github.mikerawsonnz/authenticated-multi-llm-agent

Google-OAuth-gated LLM gateway: verify a Google ID token, then run a Gemini (Vertex AI) completion f