This server connects Claude to WhichModel's model recommendation API, which helps autonomous agents pick the right LLM for each task based on live pricing data. You describe the task, and it returns a cost-optimized recommendation along with a cost estimate, its reasoning, and alternatives. It is useful when you're building agents that route requests across multiple models and want to minimize API costs without manually tracking pricing changes. The service limits free queries, so you'll likely need an API key for production use. It exposes a streamable HTTP transport, so it works with Claude Desktop and other MCP clients that support remote connections.
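As a rough illustration of what an MCP client sends when it asks for a recommendation, the sketch below builds a JSON-RPC 2.0 `tools/call` message for the `recommend_model` tool. The parameter names and allowed values are taken from the tool metadata on this page; the helper name `build_recommend_request` is hypothetical, and treating `task_type` as the only required field is an assumption, since the listing does not say which parameters are required. Transport setup, session initialization, and API-key handling are not shown.

```python
import json

# Allowed values copied from the tool metadata on this page.
TASK_TYPES = {
    "chat", "code_generation", "code_review",
    "summarisation", "translation", "data_extraction",
}
COMPLEXITIES = {"low", "medium", "high"}

# Optional recommend_model parameters listed in the metadata.
OPTIONAL_FIELDS = {
    "requirements", "budget_per_call",
    "estimated_input_tokens", "estimated_output_tokens",
}


def build_recommend_request(task_type, complexity="medium", request_id=1, **optional):
    """Build a JSON-RPC 2.0 `tools/call` message for recommend_model.

    Hypothetical helper: it only assembles and validates the payload.
    Assumes task_type is required and everything else is optional.
    """
    if task_type not in TASK_TYPES:
        raise ValueError(f"unknown task_type: {task_type!r}")
    if complexity not in COMPLEXITIES:
        raise ValueError(f"unknown complexity: {complexity!r}")

    arguments = {"task_type": task_type, "complexity": complexity}
    for key, value in optional.items():
        if key not in OPTIONAL_FIELDS:
            raise ValueError(f"unknown parameter: {key!r}")
        if value is not None:  # send only the fields that were actually set
            arguments[key] = value

    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "recommend_model", "arguments": arguments},
    }


if __name__ == "__main__":
    request = build_recommend_request(
        "code_review",
        complexity="high",
        estimated_input_tokens=4000,
        estimated_output_tokens=800,
    )
    print(json.dumps(request, indent=2))
```

In practice an MCP client library would normally build and send this message for you; the point here is only to show how the documented parameters map onto a tool call.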
Public tool metadata for what this MCP can expose to an agent.
recommend_model (6 params)
Get a cost-optimised model recommendation for a specific task. Describe what you need to do and get back the best model with cost estimate, reasoning, and alternatives.
    task_type (string): chat · code_generation · code_review · summarisation · translation · data_extraction
    complexity (string): low · medium · high (default: medium)
    requirements (object)
    budget_per_call (number)
    estimated_input_tokens (integer)
    estimated_output_tokens (integer)

compare_models (3 params)
Head-to-head comparison of 2-5 specific models. Compare pricing, capabilities, quality tiers, and optionally project costs based on expected usage volume.
    models (array)
    volume (object)
    task_type (string): chat · code_generation · code_review · summarisation · translation · data_extraction

get_pricing (7 params)
Raw pricing data lookup for LLM models. Filter by model, provider, price ceiling, and capabilities. Returns structured pricing data for programmatic consumption.
    limit (integer)
    model_id (string)
    provider (string)
    capabilities (array)
    max_input_price (number)
    include_deprecated (boolean)
    min_context_window (integer)

check_price_changes (3 params)
Check what model pricing has changed since a given date. Useful for monitoring cost changes and spotting new models or deprecations.
    since (string)
    model_id (string)
    provider (string)

estimate_cost (4 params)
Estimate the cost of a specific workload for a given model. Returns cost per call, daily and monthly projections, and a comparison to the cheapest alternative with equivalent capabilities.
    model_id (string)
    input_tokens (integer)
    calls_per_day (integer)
    output_tokens (integer)

find_cheapest_capable (3 params)
Find the cheapest models that meet specific capability requirements. Useful when you have hard constraints (e.g. must support tool_calling + vision) and want the most cost-effective option.
    quality_floor (string): low · medium · high · frontier
    min_context_window (integer)
    required_capabilities (array)

io.github.ericm1018/skillfm-llm-cost-optimizer-openai-anthropic-usage
io.github.mikerawsonnz/llm-orchestration-agent
io.github.mikerawsonnz/authenticated-llm-agent
labforgedev/copilot-memory-mcp
csoai-org/agent-prompt-injection-firewall-mcp
io.github.mikerawsonnz/authenticated-multi-llm-agent