Connects Claude to OpenAI DALL-E, Google Gemini image models, and local Stable Diffusion (via SD WebUI) through a single interface. Exposes `generate_image` with auto-routing based on prompt keywords, plus on-demand image transforms (resize, crop, format conversion) through `image://` URIs. Background task support for long SD generations, persistent event store for session resumability over HTTP, and a built-in gallery viewer. Ships with Docker, systemd packages, and .mcpb bundles for Claude Desktop. Reach for this when you want Claude to generate production images without writing custom API clients for each provider, or when you need to test prompts across multiple models from one conversation.
Public tool metadata for what this MCP can expose to an agent.
image-generationGenerate an image from a given text10 paramsGenerate an image from a given text
urlstringstepsnumberwidthnumberheightnumberpromptstringreturn_typestringurl · binary · base64aspect_ratiostring1:1 · 16:9 · 21:9 · 3:2 · 2:3 · 4:5output_formatstringpng · svgadvance_configobjectfile_store_keystringMulti-provider image generation MCP server built on FastMCP. Generate images from Claude Desktop, Claude Code, or any MCP client using OpenAI, Google Gemini, Stable Diffusion (SD WebUI), or a zero-cost placeholder provider.
Documentation | Config wizard | PyPI | Docker
gpt-image-1.5, gpt-image-1, dall-e-3), Google Gemini (gemini-2.5-flash-image, gemini-3.x previews), SD WebUI (Stable Diffusion / Forge / reForge), and a zero-cost placeholder for testing.style_profile (strengths, prompt grammar, lifecycle); list_providers includes a top-level warnings array for deprecated models. See Model Catalog.provider="auto" routes by prompt content (text/logo → OpenAI, photoreal/anime → SD WebUI, draft → placeholder).image://{id}/view?format=webp&width=512&crop_x=... resizes / re-encodes / crops on demand without re-generating.task=True (poll for status); short OpenAI calls stream progress in the foreground.app: resources..deb/.rpm with hardened systemd, OIDC + bearer auth, persistent EventStore for HTTP session resumability.With this server mounted in an MCP client, you can ask:
gpt-image-1.5 for typography-aware photorealism.generate_image with provider="sd_webui" and a stylised checkpoint like dreamshaperXL.image://{id}/view?width=512&height=512&crop_x=... resource transforms.image://list resource and the MCP Apps gallery viewer.pip install image-generation-mcp
If you add optional extras via the PROJECT-EXTRAS-START / PROJECT-EXTRAS-END sentinels in pyproject.toml, document them below:
| Extra | Includes | Use when |
|---|---|---|
mcp | fastmcp[tasks]>=3.0,<4 | Background-task support (task=True) — required for long SD generations. |
openai | openai>=1.0 | Enables the OpenAI provider. |
google-genai | google-genai>=1.0 | Enables the Gemini provider. |
all | fastmcp[tasks] + openai + google-genai | Everything except SD WebUI (which is HTTP-only — no extra needed). |
Example: pip install image-generation-mcp[all].
git clone https://github.com/pvliesdonk/image-generation-mcp.git
cd image-generation-mcp
uv sync --all-extras --all-groups
docker pull ghcr.io/pvliesdonk/image-generation-mcp:latest
A compose.yml ships at the repo root as a starting point — copy .env.example to .env, edit, and docker compose up -d.
To attach a remote Python debugger (development only — the protocol is unauthenticated), see Remote debugging.
Download .deb or .rpm packages from the GitHub Releases page. Both install a hardened systemd unit; env configuration is sourced from /etc/image-generation-mcp/env (copy from the shipped /etc/image-generation-mcp/env.example).
Download the .mcpb bundle from the GitHub Releases page and double-click to install, or run:
mcpb install image-generation-mcp-<version>.mcpb
Claude Desktop prompts for required env vars via a GUI wizard — no manual JSON editing needed.
For manual Claude Desktop configuration and setup options, see Claude Desktop deployment.
image-generation-mcp serve # stdio transport
image-generation-mcp serve --transport http --port 8000 # streamable HTTP
For library usage (embedding the domain logic without the MCP transport), import from the image_generation_mcp package directly — see the project's domain modules under src/image_generation_mcp/ for entry points.
The server registers a built-in get_server_info tool (via fastmcp_pvl_core.register_server_info_tool) so operators can confirm the deployed version with a single MCP call. The default response carries server_name, server_version, and core_version. Servers that talk to a remote upstream wire upstream version reporting inside the DOMAIN-UPSTREAM-START / DOMAIN-UPSTREAM-END sentinel in src/image_generation_mcp/server.py — see CLAUDE.md for the wiring pattern.
Core environment variables shared across all fastmcp-pvl-core-based services:
| Variable | Default | Description |
|---|---|---|
FASTMCP_LOG_LEVEL | INFO | Log level for FastMCP internals and app loggers (DEBUG / INFO / WARNING / ERROR). The -v CLI flag overrides to DEBUG. |
FASTMCP_ENABLE_RICH_LOGGING | true | Set to false for plain / structured JSON log output. |
IMAGE_GENERATION_MCP_KV_STORE_URL | file:///data/state | Persistent-state backend URL for pvl-core subsystems — file:///path (survives restarts), memory:// (dev/ephemeral). |
Domain-specific variables go below under Domain configuration.
Callers authenticate via a bearer token or OIDC (mutually exclusive). See the Authentication guide for setup, mapped multi-subject tokens, OIDC, and troubleshooting.
After copier copy and gh repo create --push:
CLAUDE.md.uv sync --all-extras --all-groups.uv run pre-commit install.uv run pytest -x -q && uv run ruff check --fix . && uv run ruff format . && uv run mypy src/ tests/.CI workflows reference three repository secrets. Configure them via Settings → Secrets and variables → Actions or with gh secret set:
| Secret | Used by | How to generate |
|---|---|---|
RELEASE_TOKEN | release.yml, copier-update.yml | Fine-grained PAT at https://github.com/settings/personal-access-tokens/new with contents: write and pull_requests: write (the copier-update cron opens PRs). Scoped to this repo. |
CODECOV_TOKEN | ci.yml | https://codecov.io — sign in with GitHub, add the repo, copy the upload token from the repo settings page. |
CLAUDE_CODE_OAUTH_TOKEN | claude.yml, claude-code-review.yml | Run claude setup-token locally and paste the result. |
gh secret set RELEASE_TOKEN
gh secret set CODECOV_TOKEN
gh secret set CLAUDE_CODE_OAUTH_TOKEN
GITHUB_TOKEN is auto-provided — no action needed.
The PR gate (matches CI):
uv run pytest -x -q # tests
uv run ruff check --fix . && uv run ruff format . # lint + format
uv run mypy src/ tests/ # type-check
Pre-commit runs a subset of the gate on each commit; see .pre-commit-config.yaml for details, or CLAUDE.md for the full Hard PR Acceptance Gates.
uv sync creates .venv/bin/* scripts with absolute shebangs pointing at the venv Python. If you move the repo after scaffolding (mv /old/path /new/path), uv run pytest fails with ModuleNotFoundError: No module named 'fastmcp' because the stale shebang resolves to a different interpreter than the venv's site-packages.
Fix:
rm -rf .venv
uv sync --all-extras --all-groups
uv run python -m pytest also works as a one-shot workaround (bypasses the stale entry-script shim).
uv.lock refresh after copier updateWhen copier update introduces new dependencies (e.g. a new extra added to pyproject.toml.jinja), CI runs uv sync --frozen which fails against a stale lockfile. Run uv lock locally and commit the refreshed uv.lock alongside accepting the copier-update PR.
All domain environment variables use the IMAGE_GENERATION_MCP_ prefix.
| Variable | Default | Required | Description |
|---|---|---|---|
IMAGE_GENERATION_MCP_SCRATCH_DIR | ~/.image-generation-mcp/images/ | No | Directory for saved generated images. |
IMAGE_GENERATION_MCP_READ_ONLY | true | No | Hide write-tagged tools (generate_image). Set to false to enable generation. |
IMAGE_GENERATION_MCP_DEFAULT_PROVIDER | auto | No | Default provider: auto, openai, gemini, sd_webui, placeholder. |
| Variable | Default | Required | Description |
|---|---|---|---|
IMAGE_GENERATION_MCP_OPENAI_API_KEY | — | No | OpenAI API key; enables OpenAI provider when set. |
IMAGE_GENERATION_MCP_GOOGLE_API_KEY | — | No | Google API key with Gemini access; enables Gemini provider when set. |
IMAGE_GENERATION_MCP_SD_WEBUI_HOST | — | No | SD WebUI URL (e.g. http://localhost:7860); enables SD WebUI provider when set. Deprecated alias: A1111_HOST. |
IMAGE_GENERATION_MCP_SD_WEBUI_MODEL | — | No | SD WebUI checkpoint name for preset detection and override. Deprecated alias: A1111_MODEL. |
| Variable | Default | Required | Description |
|---|---|---|---|
IMAGE_GENERATION_MCP_BEARER_TOKEN | — | No | Static bearer token; enables bearer auth when set. |
IMAGE_GENERATION_MCP_BASE_URL | — | No | Public base URL for OIDC and MCP File Exchange downloads (e.g. https://mcp.example.com). |
IMAGE_GENERATION_MCP_OIDC_CONFIG_URL | — | No | OIDC discovery endpoint URL. |
IMAGE_GENERATION_MCP_OIDC_CLIENT_ID | — | No | OIDC client ID. |
IMAGE_GENERATION_MCP_OIDC_CLIENT_SECRET | — | No | OIDC client secret. |
IMAGE_GENERATION_MCP_OIDC_JWT_SIGNING_KEY | ephemeral | Yes on Linux/Docker | JWT signing key. |
IMAGE_GENERATION_MCP_OIDC_AUDIENCE | — | No | Expected JWT audience claim. |
IMAGE_GENERATION_MCP_OIDC_REQUIRED_SCOPES | openid | No | Comma-separated required scopes. |
IMAGE_GENERATION_MCP_OIDC_VERIFY_ACCESS_TOKEN | false | No | Verify access token as JWT instead of id token. |
| Variable | Default | Required | Description |
|---|---|---|---|
IMAGE_GENERATION_MCP_PAID_PROVIDERS | openai,gemini | No | Comma-separated paid provider names. Triggers elicitation confirmation on capable clients. Set to empty to disable. |
IMAGE_GENERATION_MCP_TRANSFORM_CACHE_SIZE | 64 | No | Max cached transforms. Set to 0 to disable caching. |
| Variable | Default | Required | Description |
|---|---|---|---|
IMAGE_GENERATION_MCP_FILE_EXCHANGE_ENABLED | true on http/sse, false on stdio | No | Master switch for the file-exchange producer. Set false to suppress all file_ref publishing. |
IMAGE_GENERATION_MCP_FILE_EXCHANGE_TTL | 3600 | No | Default and maximum TTL (seconds) for published files and download URLs. create_download_link's ttl_seconds is clamped to this. |
IMAGE_GENERATION_MCP_FILE_EXCHANGE_CONSUME | true | Recommended false | Master switch for the consumer side. This server is producer-only; set false to silence the upstream "consume on, no consumer_sink wired" startup warning. |
| Variable | Default | Required | Description |
|---|---|---|---|
IMAGE_GENERATION_MCP_SERVER_NAME | image-generation-mcp | No | Server name shown to MCP clients. |
IMAGE_GENERATION_MCP_INSTRUCTIONS | (dynamic) | No | System instructions for LLM context. |
IMAGE_GENERATION_MCP_HTTP_PATH | /mcp | No | HTTP endpoint mount path. |
IMAGE_GENERATION_MCP_APP_DOMAIN | (auto) | No | MCP Apps widget sandbox domain. Auto-computed from BASE_URL for Claude; override for other hosts. |
Domain-config fields are composed inside src/image_generation_mcp/config.py between the CONFIG-FIELDS-START / CONFIG-FIELDS-END sentinels; env reads go through fastmcp_pvl_core.env(_ENV_PREFIX, "SUFFIX", default) so naming stays consistent.
For the full MCP tool / resource / prompt surface and per-provider setup notes, see the documentation site.
discover_capabilities() reports its actual supported aspect ratios / qualities / formats / negative-prompt support at startup; routing logic asks the capability surface, not a hard-coded enum. New providers slot in by implementing the protocol — no router edits needed. (See docs/decisions/0001-…, 0002-…, 0007-….)style_profile metadata, surfaced via list_providers. Closed-list providers (OpenAI, Gemini, placeholder) use exact-key lookup; SD WebUI uses a regex-ordered pattern table. Profiles include lifecycle flags (current / legacy / deprecated) and feed an auto-built top-level warnings array. (See docs/decisions/0009-….)check_generation_status polling — clients pick the mode via task=True. (See docs/decisions/0005-….)image://{id}/view?format=webp&width=512&crop_x=… resources do format conversion / resize / crop on demand without re-generating. Transforms are cached. (See docs/decisions/0006-….)style_profile: style library is the brief; style_profile describes the model. (See docs/decisions/0008-… and 0009-… for disambiguation.)fastmcp_pvl_core.ServerConfig, never inherits. Domain config goes between CONFIG-FIELDS-START / CONFIG-FIELDS-END sentinels; env reads route through fastmcp_pvl_core.env(...) to keep prefix naming consistent.IMAGE_GENERATION_MCP_READ_ONLYdefault: trueWhen true, write-tagged tools (generate_image) are hidden. Set to false to enable image generation.
IMAGE_GENERATION_MCP_SCRATCH_DIRDirectory for saved generated images. Created automatically on first use.
IMAGE_GENERATION_MCP_DEFAULT_PROVIDERdefault: autoDefault provider selection: auto (keyword-based), openai, sd_webui, or placeholder. Deprecated alias 'a1111' is also accepted.
IMAGE_GENERATION_MCP_OPENAI_API_KEYsecretOpenAI API key. Enables the OpenAI provider (gpt-image-1, dall-e-3) when set.
IMAGE_GENERATION_MCP_SD_WEBUI_HOSTSD WebUI base URL (e.g. http://localhost:7860). Enables the SD WebUI provider when set.
IMAGE_GENERATION_MCP_SD_WEBUI_MODELSD WebUI checkpoint name for model-aware preset detection (SD 1.5 vs SDXL vs Lightning vs Flux).
IMAGE_GENERATION_MCP_A1111_HOSTDeprecated — use SD_WEBUI_HOST instead. Accepted as fallback.
IMAGE_GENERATION_MCP_A1111_MODELDeprecated — use SD_WEBUI_MODEL instead. Accepted as fallback.
FASTMCP_LOG_LEVELdefault: INFOLog level for FastMCP internals; app loggers default to INFO, -v overrides both to DEBUG.
IMAGE_GENERATION_MCP_SERVER_NAMEdefault: image-generation-mcpServer name shown to MCP clients in the initialization response.
IMAGE_GENERATION_MCP_INSTRUCTIONSSystem-level instructions injected into LLM context.
IMAGE_GENERATION_MCP_BEARER_TOKENsecretStatic bearer token for HTTP authentication.
IMAGE_GENERATION_MCP_AUTH_MODEOIDC auth mode: 'remote' (JWKS validation) or 'oidc-proxy' (OAuth proxy). Auto-detected if not set.
IMAGE_GENERATION_MCP_BASE_URLPublic base URL for OIDC redirects (e.g. https://mcp.example.com). Required for OIDC.
IMAGE_GENERATION_MCP_OIDC_CONFIG_URLOIDC discovery endpoint URL. Required for OIDC.
IMAGE_GENERATION_MCP_OIDC_CLIENT_IDOIDC client ID. Required for OIDC.
IMAGE_GENERATION_MCP_OIDC_CLIENT_SECRETsecretOIDC client secret. Required for OIDC.
IMAGE_GENERATION_MCP_OIDC_JWT_SIGNING_KEYsecretSigning key for OIDC session JWTs (critical on Linux/Docker).
IMAGE_GENERATION_MCP_OIDC_AUDIENCEExpected JWT audience claim. Leave unset if your provider does not set one.
IMAGE_GENERATION_MCP_OIDC_REQUIRED_SCOPESComma-separated required OIDC scopes (default: openid for oidc-proxy mode, none for remote mode).
IMAGE_GENERATION_MCP_OIDC_VERIFY_ACCESS_TOKENdefault: falseVerify access token JWT instead of id_token.
IMAGE_GENERATION_MCP_HTTP_PATHdefault: /mcpHTTP endpoint mount path for streamable-HTTP transport.
IMAGE_GENERATION_MCP_EVENT_STORE_URLdefault: file:///data/state/eventsEventStore backend for SSE session resumability. file:///path (file-backed, survives restarts) or memory:// (in-process only, dev mode).
PUIDdefault: 1000Run as this UID (Docker entrypoint).
io.github.ericm1018/skillfm-llm-cost-optimizer-openai-anthropic-usage
io.github.mikerawsonnz/llm-orchestration-agent
io.github.mikerawsonnz/authenticated-llm-agent
labforgedev/copilot-memory-mcp
csoai-org/agent-prompt-injection-firewall-mcp
io.github.mikerawsonnz/authenticated-multi-llm-agent