WhichModel

6 tools · HTTP · registry active

Summary

This server connects Claude to WhichModel's model recommendation API, which helps autonomous agents pick the right LLM for each task based on live pricing data. You describe the task, and it returns a cost-optimised recommendation along with a cost estimate, reasoning, and alternatives. It is useful when you're building agents that route requests across multiple models and want to minimize API costs without manually tracking pricing changes. The service limits free queries, so you'll likely need an API key for production use. It exposes a streamable HTTP transport, so it works with Claude Desktop and other MCP clients that support remote connections.

Tools

Public tool metadata for what this MCP can expose to an agent.

6 tools

recommend_model (6 params)

Get a cost-optimised model recommendation for a specific task. Describe what you need to do and get back the best model with cost estimate, reasoning, and alternatives.

Parameters (* required)

task_type (string)

The type of task you need a model for. One of: chat · code_generation · code_review · summarisation · translation · data_extraction

complexity (string)

Task complexity. One of: low · medium · high. Default: medium

requirements (object)

Additional requirements for the model

budget_per_call (number)

Maximum spend in USD for this single call

estimated_input_tokens (integer)

Estimated input size in tokens

estimated_output_tokens (integer)

Estimated output size in tokens
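
As a rough illustration of how an MCP client might invoke this tool, the sketch below builds a JSON-RPC 2.0 `tools/call` request for recommend_model and checks the enumerated values locally before sending. The helper name `build_recommend_call` and the sample values are hypothetical; the `requirements` object is left out because its shape is not documented here. No endpoint URL is shown, since the listing does not give one.

```python
import json

# Allowed values copied from the parameter list above.
TASK_TYPES = {"chat", "code_generation", "code_review",
              "summarisation", "translation", "data_extraction"}
COMPLEXITIES = {"low", "medium", "high"}


def build_recommend_call(task_type, complexity="medium", budget_per_call=None,
                         estimated_input_tokens=None, estimated_output_tokens=None,
                         request_id=1):
    """Build a JSON-RPC 2.0 `tools/call` request for recommend_model.

    Hypothetical helper: it only assembles the payload, it does not send it.
    """
    if task_type not in TASK_TYPES:
        raise ValueError(f"unknown task_type: {task_type!r}")
    if complexity not in COMPLEXITIES:
        raise ValueError(f"unknown complexity: {complexity!r}")

    arguments = {"task_type": task_type, "complexity": complexity}
    optional = {
        "budget_per_call": budget_per_call,
        "estimated_input_tokens": estimated_input_tokens,
        "estimated_output_tokens": estimated_output_tokens,
    }
    # Only include the optional hints the caller actually supplied.
    arguments.update({k: v for k, v in optional.items() if v is not None})

    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "recommend_model", "arguments": arguments},
    }


request = build_recommend_call(
    "code_review",
    complexity="high",
    budget_per_call=0.05,         # illustrative value
    estimated_input_tokens=4000,  # illustrative value
)
print(json.dumps(request, indent=2))
```

Validating the enumerated values on the client side is a design choice rather than a requirement: it surfaces typos such as `summarization` before a query is spent against the free-tier limit.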

compare_models (3 params)

Head-to-head comparison of 2-5 specific models. Compare pricing, capabilities, quality tiers, and optionally project costs based on expected usage volume.

Parameters (* required)

models (array)

Model IDs to compare, e.g. ["anthropic/claude-sonnet-4", "openai/gpt-4.1"]

volume (object)

Expected usage volume for cost projections

task_type (string)

Task type for context-aware comparison. One of: chat · code_generation · code_review · summarisation · translation · data_extraction
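
The 2-5 model limit can be enforced before the call is made. The following is a minimal sketch of an argument builder for compare_models; the function name `compare_models_arguments` is hypothetical, and the `volume` object is omitted because its fields are not listed in the metadata above.

```python
def compare_models_arguments(models, task_type=None):
    """Assemble the `arguments` object for a compare_models call.

    Hypothetical helper; it enforces the documented 2-5 model range.
    """
    # Drop duplicate IDs while keeping the caller's order.
    unique = list(dict.fromkeys(models))
    if not 2 <= len(unique) <= 5:
        raise ValueError(f"compare_models takes 2-5 distinct models, got {len(unique)}")

    arguments = {"models": unique}
    if task_type is not None:
        arguments["task_type"] = task_type
    return arguments


args = compare_models_arguments(
    ["anthropic/claude-sonnet-4", "openai/gpt-4.1"],  # IDs from the example above
    task_type="code_generation",
)
print(args)
```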

get_pricing (7 params)

Raw pricing data lookup for LLM models. Filter by model, provider, price ceiling, and capabilities. Returns structured pricing data for programmatic consumption.

Parameters (* required)

limit (integer)

Maximum number of results to return (1-100). Default: 20

model_id (string)

Specific model ID, e.g. "anthropic/claude-sonnet-4"

provider (string)

Filter to models from this provider, e.g. "anthropic"

capabilities (array)

Required capabilities to filter by

max_input_price (number)

Maximum input price per million tokens in USD

include_deprecated (boolean)

Include deprecated and sunset models in results. Default: false

min_context_window (integer)

Minimum context window size in tokens
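
Because every get_pricing filter is optional, a client typically sends only the ones it cares about. The sketch below shows one way to assemble such a filter set while enforcing the documented 1-100 range on `limit`; the helper name `get_pricing_arguments` and the sample filter values are hypothetical.

```python
# Optional filter names copied from the parameter list above.
ALLOWED_FILTERS = {"model_id", "provider", "capabilities",
                   "max_input_price", "min_context_window"}


def get_pricing_arguments(limit=20, include_deprecated=False, **filters):
    """Assemble the `arguments` object for a get_pricing call (hypothetical helper)."""
    unknown = set(filters) - ALLOWED_FILTERS
    if unknown:
        raise TypeError(f"unsupported filters: {sorted(unknown)}")
    if not 1 <= limit <= 100:
        raise ValueError("limit must be between 1 and 100")

    arguments = {"limit": limit, "include_deprecated": include_deprecated}
    # Skip filters explicitly passed as None so they are not sent at all.
    arguments.update({k: v for k, v in filters.items() if v is not None})
    return arguments


args = get_pricing_arguments(
    limit=10,
    provider="anthropic",
    max_input_price=5.0,        # USD per million input tokens, illustrative
    min_context_window=128000,  # illustrative
)
print(args)
```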

check_price_changes (3 params)

Check what model pricing has changed since a given date. Useful for monitoring cost changes and spotting new models or deprecations.

Parameters (* required)

since (string)

ISO date to check changes from, e.g. "2026-04-01"

model_id (string)

Filter to a specific model

provider (string)

Filter to a specific provider
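
The `since` parameter expects an ISO date string, so it is worth normalising whatever the caller has on hand. Below is a small sketch that accepts either a string or a `datetime.date` and rejects malformed input; the helper name `price_change_arguments` is hypothetical.

```python
from datetime import date


def price_change_arguments(since, model_id=None, provider=None):
    """Assemble the `arguments` object for check_price_changes (hypothetical helper)."""
    if isinstance(since, date):
        since_text = since.isoformat()
    else:
        # date.fromisoformat raises ValueError on malformed input such as "04/01/2026".
        since_text = date.fromisoformat(since).isoformat()

    arguments = {"since": since_text}
    if model_id is not None:
        arguments["model_id"] = model_id
    if provider is not None:
        arguments["provider"] = provider
    return arguments


args = price_change_arguments("2026-04-01", provider="anthropic")
print(args)
```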

estimate_cost (4 params)

Estimate the cost of a specific workload for a given model. Returns cost per call, daily and monthly projections, and a comparison to the cheapest alternative with equivalent capabilities.

Parameters (* required)

model_id (string)

Model ID to estimate cost for, e.g. "anthropic/claude-sonnet-4"

input_tokens (integer)

Number of input tokens per call

calls_per_day (integer)

Expected number of calls per day (for daily/monthly projections)

output_tokens (integer)

Number of output tokens per call
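
To make the projection concrete, here is a sketch of the arithmetic such an estimate plausibly involves, assuming prices are quoted per million tokens (as in get_pricing) and a 30-day month. The prices used are placeholders, not real quotes for any model, and the server's actual formula may differ.

```python
def estimate_cost(input_tokens, output_tokens, calls_per_day,
                  input_price_per_mtok, output_price_per_mtok,
                  days_per_month=30):
    """Project per-call, daily, and monthly cost in USD.

    Assumes per-million-token pricing and a fixed-length month; both are
    assumptions of this sketch, not documented server behaviour.
    """
    per_call = (input_tokens * input_price_per_mtok
                + output_tokens * output_price_per_mtok) / 1_000_000
    daily = per_call * calls_per_day
    return {"per_call": per_call, "daily": daily, "monthly": daily * days_per_month}


# Placeholder prices: 3.00 USD per million input tokens, 15.00 per million output tokens.
est = estimate_cost(input_tokens=2000, output_tokens=500, calls_per_day=1000,
                    input_price_per_mtok=3.0, output_price_per_mtok=15.0)
print(f"per call: ${est['per_call']:.4f}")  # per call: $0.0135
print(f"daily:    ${est['daily']:.2f}")     # daily:    $13.50
print(f"monthly:  ${est['monthly']:.2f}")   # monthly:  $405.00
```

With these placeholder numbers, output tokens account for more than half of the per-call cost despite being a quarter of the input volume, which is why the tool asks for input and output sizes separately.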

find_cheapest_capable (3 params)

Find the cheapest models that meet specific capability requirements. Useful when you have hard constraints (e.g. must support tool_calling + vision) and want the most cost-effective option.

Parameters (* required)

quality_floor (string)

Minimum quality tier: "low" (budget+), "medium" (standard+), "high" (premium+), "frontier" (frontier only). One of: low · medium · high · frontier

min_context_window (integer)

Minimum context window size in tokens, e.g. 128000

required_capabilities (array)

Capabilities the model must support, e.g. ["tool_calling", "json_output", "vision"]
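
The selection logic described above (hard constraints first, then cheapest) can be sketched locally as a filter followed by a sort. Everything in this example is an assumption for illustration: the `Model` record, the made-up catalog entries, the tier ordering inferred from the quality_floor description, and ranking by input price alone. The server's real ranking is not documented here.

```python
from dataclasses import dataclass

# Tier ordering inferred from the quality_floor description above.
TIER_ORDER = ["budget", "standard", "premium", "frontier"]
FLOOR_TO_TIER = {"low": "budget", "medium": "standard",
                 "high": "premium", "frontier": "frontier"}


@dataclass(frozen=True)
class Model:
    model_id: str
    tier: str
    context_window: int
    capabilities: frozenset
    input_price: float  # USD per million input tokens


def find_cheapest_capable(catalog, required_capabilities,
                          quality_floor="low", min_context_window=0):
    """Return models meeting every hard constraint, cheapest first."""
    min_rank = TIER_ORDER.index(FLOOR_TO_TIER[quality_floor])
    required = set(required_capabilities)
    matches = [
        m for m in catalog
        if TIER_ORDER.index(m.tier) >= min_rank
        and m.context_window >= min_context_window
        and required <= m.capabilities  # subset test: all required capabilities present
    ]
    return sorted(matches, key=lambda m: m.input_price)


# Made-up catalog entries; not real models or prices.
catalog = [
    Model("example/large", "premium", 200000,
          frozenset({"tool_calling", "json_output", "vision"}), 5.0),
    Model("example/small", "budget", 32000, frozenset({"tool_calling"}), 0.2),
    Model("example/mid", "standard", 128000,
          frozenset({"tool_calling", "json_output", "vision"}), 1.0),
]

result = find_cheapest_capable(catalog, ["tool_calling", "vision"],
                               quality_floor="medium", min_context_window=128000)
print([m.model_id for m in result])  # ['example/mid', 'example/large']
```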
