Brings ModelWatch's continuous drift monitoring into Claude Desktop so you can define behavioral specs, check drift scores, and pull alert history without leaving your editor. You can create new specs with input prompts and expectation rules (refusal checks, format constraints, semantic similarity thresholds), query recent drift reports across your endpoints, and see which scheduled checks fired alerts. Useful when you're debugging a silent model update from OpenAI or Anthropic and want to cross reference what changed in your production behavior baselines. The server talks to ModelWatch's API using your mw_ key, so you need an active account. If you're already running hourly checks on your LLM outputs and want Claude to help triage when drift spikes, this connects the dots.
MODELWATCH_API_KEY*secretAPI key from https://modelwatch.app (free tier: 5 specs, 500 runs/mo).
MODELWATCH_API_BASEdefault: https://api.modelwatch.appOverride the API base URL for self-hosted deployments.
io.github.ericm1018/skillfm-llm-cost-optimizer-openai-anthropic-usage
io.github.mikerawsonnz/llm-orchestration-agent
io.github.mikerawsonnz/authenticated-llm-agent
labforgedev/copilot-memory-mcp
csoai-org/agent-prompt-injection-firewall-mcp
io.github.mikerawsonnz/authenticated-multi-llm-agent