A paid remote endpoint for running AI model benchmarks and collecting audit-ready evidence. Exposes four tools: run_benchmark_gate for CI-style pass/fail checks, compare_model_scores for head-to-head evaluations, read_benchmark_report for pulling structured results, and issue_benchmark_receipt for generating usage logs. Requires a bearer token from the product site and works over streamable HTTP transport. Reach for this when you need reproducible benchmark verdicts with receipts, especially if you're tracking model performance across releases or need compliance-friendly documentation of your evaluation runs. The server card and public MCP endpoint are live, but all production calls are metered and authenticated.
Hosted MCP for AI SDK benchmark dashboard.
EvalScope Benchmark MCP is a paid remote MCP endpoint for AI SDK benchmark dashboard. It exposes structured JSON tools, a public server card, token-based access, usage receipts, and audit-ready workflow evidence for AI agents and coding teams.
com.clauxel.evalscopebench/evalscopebench-mcpThis is a paid hosted remote MCP. Production calls require a bearer token issued from the product website.
Authorization: Bearer <token>
Unauthenticated browser visits to /mcp return a clear JSON error instead of internal details.
run_benchmark_gatecompare_model_scoresread_benchmark_reportissue_benchmark_receiptThis repository is a public documentation and directory-submission reference for the hosted service. It does not contain the private production source code.
com.mcparmory/google-sheets
domdomegg/google-sheets-mcp
henilcalagiya/google-sheets-mcp
cct15/war-dashboard-data
moooonad/mcp-google-sheets-full
io.github.br0ski777/csv-to-json