llm-oracle

weiseer/llm-oracle
STDIO · registry active
Summary

Pulls fresh LLM pricing and availability from a continuously updated catalog so your agent can route requests to the cheapest or most appropriate model at runtime. Exposes five MCP tools: list_models for browsing with optional filters, get_model for full details on a specific model, find_cheapest to rank options by estimated cost given token counts, compare_models for side-by-side analysis, and check_availability for current status. Every price claim links back to the provider's official docs. Covers Anthropic, OpenAI, Google, DeepSeek, and Mistral with daily price updates. Reach for this when you're building multi-model routing logic and don't want to maintain your own pricing table or scrape provider pages yourself.


llm-oracle

A continuously updated catalog and query API for LLM provider availability and pricing.

Built by weiseer.

What it does

Given a question like "for this prompt of N input tokens plus M expected output tokens, which currently-available model offers the lowest cost?" the oracle answers in milliseconds, with the source URL it used for the pricing claim.

Use it as:

  • an MCP server for AI agents that need to make routing decisions at inference time
  • an HTTP JSON API for any tool that wants up-to-date LLM pricing/availability
  • a raw catalog.json you can audit, vendor, or contribute fixes to

Why

LLM pricing changes monthly. Every multi-model app rebuilds the same lookup table. We maintain it so you don't.

  • Cited: every pricing row links to the provider's official documentation
  • Versioned: as_of timestamp on every response so you know how fresh the data is
  • Open: catalog is public JSON, MIT licensed; client/server code is Apache-2.0
  • Cross-checked: source data verified against the open-source LiteLLM price file where overlap exists

Coverage (v0)

5 providers, ~10 models. Growing as we observe demand.

  • Anthropic: Claude Opus 4.7, Sonnet 4.6, Haiku 4.5
  • OpenAI: GPT-5, GPT-4o
  • Google: Gemini 2.5 Pro, Gemini 2.5 Flash
  • DeepSeek: Reasoner, Chat
  • Mistral: Large

Quickstart

As an MCP server (Claude Desktop, Cursor, Continue, Cline, etc.)

git clone https://github.com/weiseer/llm-oracle
cd llm-oracle

Add to your MCP client config:

{
  "mcpServers": {
    "llm-oracle": {
      "command": "python",
      "args": ["/absolute/path/to/llm-oracle/mcp_server.py"]
    }
  }
}
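If your client already has a config file with other servers in it, the entry above has to be merged rather than pasted over the top. A minimal Python sketch of that merge follows; the helper name `add_llm_oracle` and the example path are hypothetical, and where your client keeps its config file is not covered here.

```python
import json
from pathlib import Path


def add_llm_oracle(config: dict, script_path: str) -> dict:
    """Return a copy of an MCP client config with the llm-oracle entry added.

    Hypothetical helper: it only mirrors the JSON shape shown above.
    """
    path = Path(script_path)
    if not path.is_absolute():
        # The config shown above expects an absolute path to mcp_server.py.
        raise ValueError("script_path must be an absolute path")
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))  # keep any existing servers
    servers["llm-oracle"] = {"command": "python", "args": [str(path)]}
    merged["mcpServers"] = servers
    return merged


if __name__ == "__main__":
    # Example path is a placeholder; substitute your own clone location.
    print(json.dumps(add_llm_oracle({}, "/opt/llm-oracle/mcp_server.py"), indent=2))
```

The function copies the input instead of mutating it, so you can diff the result against the original before writing anything back to disk.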

The agent now has tools:

  • list_models(provider?, capability?) → list models, optionally filtered
  • get_model(model_id) → full record for one model
  • find_cheapest(input_tokens, output_tokens, required_capabilities?) → ranked by estimated cost
  • compare_models(model_ids[], input_tokens?, output_tokens?) → side-by-side
  • check_availability(model_id) → current status with cited source
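To make the ranking behind find_cheapest concrete, here is a rough Python sketch of one way such a ranking could work. It is not the project's implementation: it assumes prices are quoted in USD per one million tokens, ignores cached-input pricing, and uses invented placeholder models and prices.

```python
from dataclasses import dataclass, field


@dataclass
class Model:
    model_id: str
    input_price: float   # assumed unit: USD per 1M input tokens
    output_price: float  # assumed unit: USD per 1M output tokens
    capabilities: set = field(default_factory=set)


def estimate_cost(model: Model, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request under the per-million-token assumption."""
    total = input_tokens * model.input_price + output_tokens * model.output_price
    return total / 1_000_000


def find_cheapest(models, input_tokens, output_tokens, required_capabilities=()):
    """Return (model_id, cost) pairs, cheapest first.

    Models missing any required capability are dropped before ranking.
    """
    required = set(required_capabilities)
    eligible = [m for m in models if required <= m.capabilities]
    costed = [(m.model_id, estimate_cost(m, input_tokens, output_tokens)) for m in eligible]
    return sorted(costed, key=lambda pair: pair[1])


# Placeholder models and prices, invented purely for illustration.
EXAMPLE_MODELS = [
    Model("example-large", 5.0, 20.0, {"text", "vision"}),
    Model("example-small", 1.0, 4.0, {"text"}),
]
```

Calling `find_cheapest(EXAMPLE_MODELS, 2000, 500)` ranks `example-small` first; adding `required_capabilities=["vision"]` leaves only `example-large`.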

As an HTTP API

# Free tier: 1,000 calls/day per IP, no key
curl https://oracle.weiseer.com/catalog.json
curl 'https://oracle.weiseer.com/cheapest?input_tokens=2000&output_tokens=500'
curl https://oracle.weiseer.com/models/claude-sonnet-4-6
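The same requests can be issued from Python with only the standard library. The sketch below builds the URLs shown in the curl examples; the helper names are hypothetical, and because the response schema is not documented here, `fetch_json` simply returns whatever JSON the service sends.

```python
import json
from urllib.parse import quote, urlencode
from urllib.request import urlopen

BASE_URL = "https://oracle.weiseer.com"


def cheapest_url(input_tokens: int, output_tokens: int, base_url: str = BASE_URL) -> str:
    """Build the /cheapest query URL for the given token counts."""
    query = urlencode({"input_tokens": input_tokens, "output_tokens": output_tokens})
    return f"{base_url}/cheapest?{query}"


def model_url(model_id: str, base_url: str = BASE_URL) -> str:
    """Build the /models/<model_id> URL, percent-encoding the identifier."""
    return f"{base_url}/models/{quote(model_id, safe='')}"


def fetch_json(url: str, timeout: float = 10.0):
    """GET a URL and decode the body as JSON (performs a network call)."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

For example, `fetch_json(cheapest_url(2000, 500))` issues the second curl request above. Keep the free tier's per-IP daily limit in mind if you call this in a loop.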

As raw JSON

curl https://oracle.weiseer.com/catalog.json > my-local-catalog.json

Pricing

Tier         Calls             Cost
Free         1,000/day         $0
Pro          100,000/month     $5 USDC/mo
Scale        1,000,000/month   $20 USDC/mo
Raw catalog  unlimited         $0 (always free)

Paid tiers settle in USDC. Email wei@weiseer.com to subscribe; we issue an API key and confirm receipt. (We'll automate this when the first three paid customers exist.)

Schema

See catalog.json for the canonical schema. Per-model fields include model_id, provider, family, context_window, max_output_tokens, input_price, output_price, cached_input_price, capabilities, availability_status, source URLs, and timestamps.
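If you vendor the catalog, a light structural check can catch records that drift from the fields named above. The Python sketch below is illustrative only: the field names come from this section, but the types are guesses, and the source URL and timestamp fields are left out because their exact names are not given here. Treat catalog.json as the authority.

```python
from typing import List, Optional, TypedDict


class ModelRecord(TypedDict, total=False):
    # Field names are from the README; every type below is an assumption.
    model_id: str
    provider: str
    family: str
    context_window: int
    max_output_tokens: int
    input_price: float
    output_price: float
    cached_input_price: Optional[float]
    capabilities: List[str]
    availability_status: str


# The per-model field names this README mentions explicitly.
NAMED_FIELDS = (
    "model_id", "provider", "family", "context_window", "max_output_tokens",
    "input_price", "output_price", "cached_input_price", "capabilities",
    "availability_status",
)


def missing_fields(record: dict) -> list:
    """Return the named fields absent from a record, in README order."""
    return [name for name in NAMED_FIELDS if name not in record]
```

An empty list from `missing_fields(record)` means every named field is present; it says nothing about whether the values are well-formed.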

Update cadence

  • Pricing: daily cron job, plus a manual update whenever a price change is publicly announced
  • Availability: 5-minute poll of provider status pages where available; daily smoke-test otherwise
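Given the daily pricing cadence, a client may want to reject data whose as_of timestamp is older than about a day. A minimal Python sketch of that check follows; it assumes as_of is an ISO 8601 string (the actual format is not specified here), and the 24-hour threshold is only a suggested default.

```python
from datetime import datetime, timedelta, timezone


def is_stale(as_of: str, now: datetime, max_age: timedelta = timedelta(hours=24)) -> bool:
    """Return True if the as_of timestamp is older than max_age relative to now.

    Assumes ISO 8601 input; a trailing "Z" is treated as UTC, and a timestamp
    without an offset is also assumed to be UTC.
    """
    stamp = datetime.fromisoformat(as_of.replace("Z", "+00:00"))
    if stamp.tzinfo is None:
        stamp = stamp.replace(tzinfo=timezone.utc)
    return now - stamp > max_age
```

Pass `datetime.now(timezone.utc)` as `now` in real use; taking it as a parameter keeps the function easy to test.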

Errors and disagreements

Found a stale price? An incorrect capability flag? A missing model? Open an issue — please include the source URL we should be tracking.

Telemetry

The hosted service logs each request as a single event (caller, query summary, latency, paid status) to weiseer's append-only audit ledger. The MCP server you self-host logs nothing.

Building on this

The catalog format is MIT licensed. Vendor it, mirror it, build on top of it. We ask only that you cite weiseer/llm-oracle somewhere visible if you redistribute the data.

The server and MCP code are Apache-2.0.

Roadmap (probe-pulled, not pre-planned)

This is P-001 in weiseer's strategic backlog: a market sensor for organism's architecture v2. We expand it where users ask, and we will retire it if four weeks of public availability bring fewer than 100 calls. Don't expect a polished commercial product; expect honest experimentation with daily updates.

What we might add (in observed-demand order):

  • More providers (Cohere, xAI, AI21, Together, Replicate hosted, Bedrock, Azure pricing variants)
  • x402 micropayments for high-volume agent buyers (deferred — see arXiv 2605.11781 on protocol security)
  • Historical pricing time-series
  • SLA + uptime aggregates per provider
  • Per-region availability where providers vary
  • Self-hosted model cost estimators (with caveats)

Contact

wei@weiseer.com · github.com/weiseer · weiseer.com


Configuration

LLM_ORACLE_URL

Override the remote catalog URL (default https://oracle.weiseer.com/catalog.json)

LLM_ORACLE_LOCAL_ONLY

Set to any value to skip remote fetch and use bundled catalog only
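As a rough illustration of how these two variables might interact, the Python sketch below resolves which catalog source to use. It is an assumption about the precedence, not the server's actual code: it treats any non-empty LLM_ORACLE_LOCAL_ONLY as "set" and lets it win over LLM_ORACLE_URL.

```python
DEFAULT_CATALOG_URL = "https://oracle.weiseer.com/catalog.json"


def resolve_catalog_source(env) -> tuple:
    """Decide where to load the catalog from, given an environment mapping.

    Returns ("bundled", None) when local-only mode is requested, otherwise
    ("remote", url). Precedence here is assumed, not taken from the server.
    """
    if env.get("LLM_ORACLE_LOCAL_ONLY"):
        return ("bundled", None)
    url = env.get("LLM_ORACLE_URL") or DEFAULT_CATALOG_URL
    return ("remote", url)
```

In practice you would call `resolve_catalog_source(os.environ)`; accepting any mapping makes the behaviour easy to check without touching the real environment.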

Categories: AI & LLM Tools
Registry: active
Package: @weiseer/llm-oracle-mcp
Transport: STDIO
Updated: May 30, 2026
