Gpu Bridge

5 tools · auth · STDIO · registry active

Summary

Connects Claude to 30 GPU-powered AI services through a unified inference API. You get LLM calls, FLUX and Stable Diffusion image generation, Whisper transcription, text-to-speech, embeddings, reranking, and video generation all through a single `gpu_run` tool. Supports both traditional API key auth and x402 protocol for autonomous agent payments with USDC on Base L2. Pricing starts at fractions of a cent per request with volume discounts. Useful when you need Claude to generate images mid-conversation, transcribe audio files, or chain together multiple AI capabilities without switching between providers. Install via npx with your GPU-Bridge API key in the config.

Tools

Public tool metadata for what this MCP can expose to an agent.

5 tools

gpu_run (3 params)

Run any GPU-Bridge AI service. 30 services available: LLM inference (sub-second), image generation (FLUX, SD3.5), video generation, video enhancement (up to 4K), speech-to-text (Whisper, <1s), TTS (40+ voices), music generation, voice cloning, embeddings, document reranking (J...

Parameters (* = required)

input (object)

Service-specific input. Examples: LLM {"prompt":"...","max_tokens":512}, Image {"prompt":"..."}, Whisper {"audio_url":"https://..."}

service (string)

Service key. Common ones: llm-4090 (text), image-4090 (image), whisper-l4 (speech-to-text), tts-l4 (text-to-speech), embedding-l4 (embeddings), rerank (document reranking), pdf-parse (document parsing), nsfw-detect (content moderation), video-enhance (video upscaling)

priority (string)

Routing priority. "fast" = lowest latency (default), "cheap" = lowest cost. One of: fast, cheap.
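As a rough illustration of the three parameters above, the following Python sketch assembles and checks a `gpu_run` argument object before it is handed to an MCP client. The helper name `build_gpu_run_args` is hypothetical, and the checks reflect only what the public metadata states (a string service key, an object input, and a priority of `fast` or `cheap`); the server may enforce more.

```python
from typing import Any

ALLOWED_PRIORITIES = {"fast", "cheap"}  # the two values listed in the metadata


def build_gpu_run_args(
    service: str, service_input: dict[str, Any], priority: str = "fast"
) -> dict[str, Any]:
    """Hypothetical helper: assemble and sanity-check gpu_run arguments."""
    if not isinstance(service, str) or not service:
        raise ValueError("service must be a non-empty string")
    if not isinstance(service_input, dict):
        raise ValueError("input must be an object (dict)")
    if priority not in ALLOWED_PRIORITIES:
        raise ValueError(f"priority must be one of {sorted(ALLOWED_PRIORITIES)}")
    return {"service": service, "input": service_input, "priority": priority}


# LLM input shape taken from the example in the parameter description.
args = build_gpu_run_args("llm-4090", {"prompt": "Summarize this paragraph.", "max_tokens": 512})
print(args)
```

Validating locally like this only catches obvious mistakes early; the set of valid service keys is best taken from `gpu_catalog` rather than hard-coded.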

gpu_catalog

List all available GPU-Bridge services with pricing and model info.

No parameter schema in public metadata yet.

gpu_status (1 param)

Check the status of a GPU-Bridge job and retrieve results.

Parameters (* = required)

job_id (string)

The job ID returned by gpu_run
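Since `gpu_status` takes the job ID returned by `gpu_run`, a caller would typically poll it until the job finishes. The sketch below shows one way to structure that loop. The response shape is not documented in the public metadata, so the `status` field and its `pending` and `completed` values are assumptions, and `fetch_status` is a hypothetical stand-in for the actual tool call.

```python
import time
from typing import Any, Callable


def wait_for_job(
    job_id: str,
    fetch_status: Callable[[str], dict[str, Any]],
    interval_s: float = 1.0,
    max_attempts: int = 30,
) -> dict[str, Any]:
    """Poll until the job leaves the assumed 'pending' state or attempts run out."""
    for _ in range(max_attempts):
        result = fetch_status(job_id)  # stands in for a gpu_status tool call
        if result.get("status") != "pending":  # assumed field name and value
            return result
        time.sleep(interval_s)
    raise TimeoutError(f"job {job_id} still pending after {max_attempts} checks")


# Stub that reports 'pending' twice, then a finished job (made-up response shape).
_replies = iter(
    [{"status": "pending"}, {"status": "pending"}, {"status": "completed", "output": "ok"}]
)
final = wait_for_job("job-123", lambda _job_id: next(_replies), interval_s=0.0)
print(final)
```

Capping the number of attempts keeps an agent from waiting indefinitely on a job that never completes.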

gpu_balance

Check GPU-Bridge credit balance, daily spend, and volume discount tier.

No parameter schema in public metadata yet.

gpu_estimate (2 params)

Estimate the cost of a GPU-Bridge service before running it.

Parameters (* = required)

seconds (number)

Estimated runtime in seconds (optional)

service (string)

Service key (e.g. llm-4090, image-4090)

GPU-Bridge MCP Server

30 GPU-powered AI services as MCP tools: LLMs, image generation, audio, video, embeddings, reranking, PDF parsing, NSFW detection & more. x402 native for autonomous AI agents: pay per request on-chain with USDC on Base L2. With x402, no API keys and no accounts are needed.

What is GPU-Bridge?

GPU-Bridge is a unified GPU inference API with native x402 support — the open payment protocol that allows AI agents to autonomously pay for compute with USDC on Base L2. No API keys, no accounts, no human intervention required.

This MCP server exposes all 30 GPU-Bridge services as Model Context Protocol tools, giving Claude (and any MCP-compatible AI) direct access to GPU inference.

Install in Claude Desktop (2 minutes)

1. Get your API key (or use x402 for autonomous agents)

Visit gpubridge.io and grab a free API key, or use the x402 protocol for keyless agent payments.

2. Add to `claude_desktop_config.json`

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "gpu-bridge": {
      "command": "npx",
      "args": ["-y", "@gpu-bridge/mcp-server"],
      "env": {
        "GPUBRIDGE_API_KEY": "your_api_key_here"
      }
    }
  }
}
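If the config file already lists other MCP servers, the entry above has to be merged in rather than pasted over them. The Python sketch below shows one possible way to do that. The function name `add_gpu_bridge_server` is hypothetical, and the demo writes to a temporary directory instead of the real Claude Desktop path.

```python
import json
import tempfile
from pathlib import Path


def add_gpu_bridge_server(config_path: Path, api_key: str) -> dict:
    """Merge the gpu-bridge entry into an existing (or new) config file."""
    config = json.loads(config_path.read_text()) if config_path.exists() else {}
    servers = config.setdefault("mcpServers", {})
    servers["gpu-bridge"] = {
        "command": "npx",
        "args": ["-y", "@gpu-bridge/mcp-server"],
        "env": {"GPUBRIDGE_API_KEY": api_key},
    }
    config_path.write_text(json.dumps(config, indent=2))
    return config


# Demo against a throwaway file that already contains another (made-up) server.
with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "claude_desktop_config.json"
    path.write_text(json.dumps({"mcpServers": {"other": {"command": "node"}}}))
    merged = add_gpu_bridge_server(path, "your_api_key_here")

print(sorted(merged["mcpServers"]))
```

Existing entries under `mcpServers` are left untouched; only the `gpu-bridge` key is added or overwritten.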

3. Restart Claude Desktop

That's it. Claude now has access to 30 GPU-powered AI services.

MCP Tools

`gpu_run`

Run any GPU-Bridge service. The primary tool for executing AI tasks.

Parameters:
  service  (string)  — Service key (e.g., "llm-4090", "flux-schnell", "whisper-l4")
  input    (object)  — Service-specific input parameters
  priority (string)  — Optional: "fast" (lowest latency) or "cheap" (lowest cost)
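Under the Model Context Protocol, a client invokes a tool by sending a JSON-RPC 2.0 `tools/call` request. The sketch below builds such a message for `gpu_run` with the three parameters listed above. MCP clients such as Claude Desktop normally construct this for you; the request `id`, the prompt, and the helper name `tools_call_request` are illustration values only.

```python
import json


def tools_call_request(request_id: int, tool: str, arguments: dict) -> str:
    """Serialize an MCP tools/call request as a JSON-RPC 2.0 message."""
    message = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }
    return json.dumps(message)


request = tools_call_request(
    1,
    "gpu_run",
    {
        "service": "flux-schnell",
        "input": {"prompt": "A robot painting on a canvas"},
        "priority": "cheap",
    },
)
print(request)
```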

`gpu_catalog`

Get the full catalog of available services with pricing and capabilities.

`gpu_estimate`

Estimate cost before running a service. No authentication required.

`gpu_status`

Check the status of a job and retrieve results.

`gpu_balance`

Check your current balance, daily spend, and volume discount tier.

30 Available Services

Language Models (LLMs)

Service ID	Description	Notes
`llm-4090`	General purpose LLM	Sub-second via Groq
`llm-a100`	Maximum capability LLM	Largest models
`llm-l4`	Ultra-fast, low cost LLM	Budget option
`code-4090`	Code generation	Optimized for code
`llm-stream`	Streaming LLM responses	Real-time output

Image Generation

Service ID	Description	Notes
`flux-schnell`	FLUX.1 Schnell	Fast, 4-step generation
`flux-dev`	FLUX.1 Dev	High quality
`sdxl-4090`	Stable Diffusion XL	Versatile
`sd35-l4`	Stable Diffusion 3.5	Latest SD model
`img2img-4090`	Image-to-image	Style transfer, editing

Vision & Image Analysis

Service ID	Description	Notes
`llava-4090`	Visual Q&A	Image understanding
`ocr-l4`	Text extraction (OCR)	Multi-language
`rembg-l4`	Background removal	Instant
`caption-4090`	Image captioning	Auto-describe images
`nsfw-detect`	Content moderation	NSFW classification

Speech-to-Text

Service ID	Description	Notes
`whisper-l4`	Fast transcription	Sub-second
`whisper-a100`	High accuracy transcription	Large files
`diarize-l4`	Speaker diarization	Who said what

Text-to-Speech

Service ID	Description	Notes
`tts-l4`	Voice cloning TTS	40+ voices
`tts-fast`	Ultra-fast TTS	Lowest latency
`bark-4090`	Expressive TTS	Emotion, laughter

Audio Generation

Service ID	Description	Notes
`musicgen-l4`	Music generation	Text-to-music
`audiogen-l4`	Sound effects	Text-to-SFX

Embeddings & Search

Service ID	Description	Notes
`embed-l4`	Text embeddings	Multilingual
`embed-code`	Code embeddings	For code search
`rerank`	Document reranking	Jina, sub-second

Video

Service ID	Description	Notes
`animatediff`	Text-to-video	AnimateDiff
`video-enhance`	Video upscaling	Up to 4K

Utilities

Service ID	Description	Notes
`pdf-parse`	Document parsing	PDF/DOCX to text

x402: For Autonomous AI Agents

GPU-Bridge supports the x402 payment protocol, enabling truly autonomous AI agents to pay for compute without human intervention.

Agent Request → GPU-Bridge returns HTTP 402 Payment Required
      ↓
Agent pays USDC on Base L2 (gas < $0.01, settles in 2s)
      ↓
Agent retries with payment proof → GPU-Bridge executes and returns result
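The three steps above amount to a request, pay, retry loop. The following Python sketch simulates that control flow entirely in memory, with no network calls and no real payments. `fake_server`, `fake_pay`, the `amount_usdc` field, and the proof string are all made up for illustration; the sketch does not implement the actual x402 headers or signing.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Response:
    status: int
    body: dict


def fake_server(payload: dict, payment_proof: Optional[str]) -> Response:
    """Stand-in for the API: demands payment first, then runs the job."""
    if payment_proof is None:
        return Response(402, {"amount_usdc": "0.01"})  # made-up payment terms
    return Response(200, {"service": payload["service"], "result": "done"})


def request_with_payment(
    payload: dict,
    send: Callable[[dict, Optional[str]], Response],
    pay: Callable[[dict], str],
) -> Response:
    """Send once; on 402, pay and retry a single time with the proof."""
    first = send(payload, None)
    if first.status != 402:
        return first
    proof = pay(first.body)  # in a real agent: a USDC transfer on Base L2
    return send(payload, proof)


payments = []


def fake_pay(terms: dict) -> str:
    """Record the requested terms and return a placeholder proof."""
    payments.append(terms)
    return "proof-abc"


response = request_with_payment(
    {"service": "flux-schnell", "input": {"prompt": "test"}}, fake_server, fake_pay
)
print(response.status, response.body, len(payments))
```

Retrying only once after paying keeps a misbehaving endpoint from draining the agent's wallet through repeated 402 responses.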

Python Example with x402

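from x402.client import PaymentClient

# The private key signs USDC payments on Base L2; load it from a secret store rather than hard-coding it.
client = PaymentClient(private_key="0x...", chain="base")

# Request an image from the flux-schnell service; payment follows the 402 flow shown above.
response = client.request(
    "POST",
    "https://api.gpubridge.io/v1/run",
    json={
        "service": "flux-schnell",
        "input": {"prompt": "A robot painting on a canvas", "steps": 4}
    }
)
print(response.json())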

Pricing

Category	Starting From
LLMs	$0.003/1K tokens
Image Generation	$0.01/image
Speech-to-Text	$0.005/minute
Text-to-Speech	$0.005/1K chars
Embeddings	$0.0001/1K tokens
Reranking	$0.001/query
PDF Parsing	$0.005/document

All prices in USD. x402 payments in USDC on Base L2.
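To get a feel for the numbers, the sketch below computes a lower-bound cost for a mixed workload from three of the "starting from" prices in the table. The workload figures are made up, and actual charges depend on the specific service chosen and any volume discount, so treat the result as a floor rather than a quote (use `gpu_estimate` for per-service figures).

```python
from decimal import Decimal

# "Starting from" prices copied from the pricing table, in USD.
PRICE_PER_1K_LLM_TOKENS = Decimal("0.003")
PRICE_PER_IMAGE = Decimal("0.01")
PRICE_PER_AUDIO_MINUTE = Decimal("0.005")


def minimum_cost(llm_tokens: int, images: int, audio_minutes: int) -> Decimal:
    """Lower-bound cost for a mixed workload at the listed starting prices."""
    return (
        PRICE_PER_1K_LLM_TOKENS * Decimal(llm_tokens) / Decimal(1000)
        + PRICE_PER_IMAGE * images
        + PRICE_PER_AUDIO_MINUTE * audio_minutes
    )


# 200K LLM tokens, 50 images, 30 minutes of transcription (made-up workload).
total = minimum_cost(200_000, 50, 30)
print(f"${total}")
```

For this workload the floor is $0.60 for tokens, $0.50 for images, and $0.15 for transcription, or $1.25 in total. `Decimal` is used instead of floats so that sub-cent prices add up exactly.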

License

MIT © Healthtech Capital LLC

Configuration

GPUBRIDGE_API_KEY (secret)

GPU-Bridge API key (get one at https://gpubridge.io). Starts with gpub_. Optional if using x402 micropayments.
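Because the key has a known prefix and is optional when x402 is used, a launcher script can catch a mis-pasted value before starting the server. The Python sketch below is one hypothetical way to do that. `check_api_key` and the mode labels it returns are invented for illustration, and the example key is not a real credential.

```python
from typing import Optional


def check_api_key(value: Optional[str]) -> str:
    """Return 'x402' if no key is set, 'api_key' if the key has the expected prefix."""
    if not value:
        return "x402"  # no key configured: the key is optional with x402 micropayments
    if not value.startswith("gpub_"):
        raise ValueError("GPUBRIDGE_API_KEY should start with 'gpub_'")
    return "api_key"


# In practice the value would come from os.environ.get("GPUBRIDGE_API_KEY").
print(check_api_key("gpub_example123"))
print(check_api_key(None))
```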
