**Editor's Note: This service has been shut down and the repository archived. The listing remains for historical reference only.** Token Oracle was a hosted MCP server that estimated LLM API costs before execution using a deterministic pricing algorithm across 100+ models from Anthropic, OpenAI, Google, Groq, and others. It exposed tools like estimate_cost, compare_models, and budget_check to help agents route tasks to cheaper models while staying within monthly budgets. The service offered a free tier with 1,000 calls per day and could be accessed via streamable HTTP or through an npm bridge package that proxied stdio clients to the remote endpoint. Designed for agent swarms that needed programmatic cost control without calling an LLM for budget decisions.
THIS REPO IS ARCHIVED AND THE SERVICE HAS BEEN SHUTDOWN
Token Oracle is a Model Context Protocol (MCP) server that estimates, compares, and controls LLM API costs before agents spend tokens. It exposes nine tools, four read-only Resources, and a cost_analysis_workflow Prompt template. It uses a proprietary pricing algorithm without a backing LLM to ensure deterministic budget workflows.
Designed to work with agent swarms backing one or zero employee companies, Token Oracle acts as a tiny CFO within your OpenClaw swarm keeping spend down and making suggestions to improve promptings.
Save them tokens, call Token Oracle today!
MCP tools exposed:
estimate_cost — Estimates the USD cost of a single LLM API call before execution. Input: task_description, prompt_text, task_type, or explicit token_count. Output: cost_usd, recommended_model, confidence, will_fit_context, pricing_updated. Annotations: readOnlyHint:true, idempotentHint:true, openWorldHint:false.estimate_cost_batch — Prices up to 100 LLM tasks in a single call. Returns per-task breakdown, total_cost_usd, and cheapest_model_for_all. Use before starting any multi-step pipeline.compare_models — Ranks LLM pricing across all supported providers for a given task. Returns models sorted by cost with speed_tier and quality_tier. Supports filtering by min_quality, max_cost_usd, and provider. Input: task_type, token_count, or prompt_text.budget_check — Checks whether a planned task fits within a monthly budget. Returns can_proceed (boolean), remaining_budget_usd, budget_consumed_pct, and cheaper_alternatives with savings_pct. Input: monthly_budget_usd, current_spend_usd, and task description.find_cheapest_for_budget — Inverse of budget_check. Given a budget_usd cap and task, returns the best model/quality combination within budget plus all alternatives ranked by quality then cost.MCP Resources exposed:
token-oracle://meta — Machine-readable server capability document (version, model_count, pricing metadata)token-oracle://models — Model IDs with metadata for discovery and validationtoken-oracle://heuristics — Task-type token heuristics and keyword classifier patterns (auditable)token-oracle://pricing/changelog — Append-only log of pricing changes; use to detect pricing updates over timeMCP Prompt exposed:
cost_analysis_workflow — Guided three-step prompt template: estimate_cost → compare_models → budget_check. Arguments: task_description, monthly_budget_usd (optional), quality_threshold (optional).Supported providers: Anthropic, OpenAI, Google, Groq, Together AI, Mistral, DeepSeek, Cohere (100+ models).
Canonical agent workflow example: Goal: Run 500 product description generation tasks. Budget $50/mo, current spend $43.
Pricing:
Contact: info@guffeyholdings.com
https://mcp.guffeyholdings.com/TokenOraclecom.guffeyholdings/token-oracleDirect remote configuration:
{
"mcpServers": {
"token-oracle": {
"type": "streamable-http",
"url": "https://mcp.guffeyholdings.com/TokenOracle",
"headers": {
"X-API-Key": "${TOKEN_ORACLE_API_KEY}"
}
}
}
}
For local clients that still expect an npm-installed stdio server, use token-oracle-mcp.
Zero-input trial flow:
Start the bridge with no API key and, when the hosted service has trial auth enabled, it will automatically fetch and store a metered trial credential on first launch.
{
"mcpServers": {
"token-oracle": {
"command": "npx",
"args": ["-y", "token-oracle-mcp"]
}
}
}
One-time explicit login flow:
npx -y token-oracle-mcp login
With --api-key, that validates and stores a paid hosted API key. Without --api-key, it requests and stores a hosted trial credential instead. After either flow, the MCP config does not need to inject TOKEN_ORACLE_API_KEY.
{
"mcpServers": {
"token-oracle": {
"command": "npx",
"args": ["-y", "token-oracle-mcp"]
}
}
}
If you prefer stateless setup, keep passing TOKEN_ORACLE_API_KEY as an environment variable instead.
Hosted trial behavior:
Optional bridge environment variables:
TOKEN_ORACLE_API_KEY: optional hosted API key; overrides any stored credentialTOKEN_ORACLE_BASE_URL: override for the remote endpoint; defaults to https://mcp.guffeyholdings.com/TokenOracleTOKEN_ORACLE_SUBJECT: optional end-user subject forwarded as X-Token-Oracle-SubjectAdditional bridge commands:
npx -y token-oracle-mcp login: accept --api-key for paid auth, or fetch a hosted trial credential when no key is suppliednpx -y token-oracle-mcp logout: remove locally stored credentialsTools:
estimate_costestimate_cost_batchcompare_modelsbudget_checkfind_cheapest_for_budgetget_budget_statuslist_request_activityget_usage_summaryget_usage_leaderboardResources:
token-oracle://metatoken-oracle://modelstoken-oracle://heuristicstoken-oracle://pricing/changelogPrompts:
cost_analysis_workflow1.0.61.0.6TOKEN_ORACLE_API_KEY*secretHosted API key for TokenOracle
TOKEN_ORACLE_BASE_URLdefault: https://mcp.guffeyholdings.com/TokenOracleOptional override for the hosted TokenOracle MCP endpoint
TOKEN_ORACLE_SUBJECTOptional end-user subject forwarded to the hosted service
io.github.ericm1018/skillfm-llm-cost-optimizer-openai-anthropic-usage
io.github.mikerawsonnz/llm-orchestration-agent
io.github.mikerawsonnz/authenticated-llm-agent
labforgedev/copilot-memory-mcp
csoai-org/agent-prompt-injection-firewall-mcp
io.github.mikerawsonnz/authenticated-multi-llm-agent