Connects Claude or any MCP client to Forge's swarm optimization service for turning PyTorch code into production Triton or CUDA kernels. Exposes three tools: authenticate via browser OAuth, optimize existing PyTorch operations with 32 parallel AI agents running on datacenter GPUs (B200 through T4), and generate new kernels from natural language specs. The optimizer benchmarks every candidate against torch.compile max-autotune and returns drop-in replacements with speedup metrics. Your agent automatically triggers optimization when it spots custom autograd functions, performance comments, or compute-heavy modules. Results come back in minutes at 250k tokens per second inference. Best for teams shipping ML inference who need provably faster kernels without manual Triton expertise.
Public tool metadata for what this MCP can expose to an agent.
get_healthCheck the health status of the mcp-forge API.Check the health status of the mcp-forge API.
No parameter schema in public metadata yet.
validate_licenseValidate your license key and retrieve plan details and quota usage.Validate your license key and retrieve plan details and quota usage.
No parameter schema in public metadata yet.
get_quotaCheck quota usage: plan, sites used, slots remaining and expiry date. Call this before create_generation to warn the user if the quota is near the limit.Check quota usage: plan, sites used, slots remaining and expiry date. Call this before create_generation to warn the user if the quota is near the limit.
No parameter schema in public metadata yet.
create_generationGenerate a new MCP server from a source URL, list of tools and authentication schemes.7 paramsGenerate a new MCP server from a source URL, list of tools and authentication schemes.
titlestringtoolsarraybase_urlstringdescriptionstringsource_typestringauth_schemesarrayforge_versionstring