CAT
/MCP
SkillsMCPMarketplacesDigestToolsAdvertise

This week in Claude

Every Monday: Claude Code, Agent SDK, MCP, and the Anthropic platform moves worth your time.

Skills by Category
Frontend DevelopmentBackend & APIsTesting & QASecurityDevOps & CI/CDGit & Pull RequestsDocumentationCode Review & QualityAI & Agent BuildingSkill Development
MCP Servers by Category
Sales & MarketingWeb & Browser AutomationDatabasesAI & LLM ToolsCloud & InfrastructureCommunication & MessagingDeveloper ToolsDesign & CreativeDocuments & KnowledgeSearch & Web Crawling
Marketplaces by Category
AI Agents & OrchestrationLLM IntegrationDevelopment ToolsFrontend & UIBackend & APIsDatabasesTesting & Code QualityDevOps & CloudSecurity & ComplianceGit & Version Control

Cross AI Tools

Discover Claude Code plugins, extensions, and tools. Automatically updated directory of Anthropic Claude AI marketplaces with development tools, productivity plugins, and integrations.

Resources

  • Browse Skills
  • Browse MCP Servers
  • Browse Marketplaces
  • Plugins Reference

Community

  • About
  • Tools
  • Feedback
  • Privacy Policy
  • Advertise

Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic

Image Generation MCP Server

pvliesdonk/image-generation-mcp
11 toolsauthSTDIO, HTTPregistry active
Summary

Connects Claude to OpenAI DALL-E, Google Gemini image models, and local Stable Diffusion (via SD WebUI) through a single interface. Exposes `generate_image` with auto-routing based on prompt keywords, plus on-demand image transforms (resize, crop, format conversion) through `image://` URIs. Background task support for long SD generations, persistent event store for session resumability over HTTP, and a built-in gallery viewer. Ships with Docker, systemd packages, and .mcpb bundles for Claude Desktop. Reach for this when you want Claude to generate production images without writing custom API clients for each provider, or when you need to test prompts across multiple models from one conversation.

CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →

Tools

Public tool metadata for what this MCP can expose to an agent.

1 tools
image-generationGenerate an image from a given text10 params

Generate an image from a given text

Parameters* required
urlstring
Optional URL for image input.
stepsnumber
The number of steps for image generation.
widthnumber
The width of the image in pixels.
heightnumber
The height of the image in pixels.
promptstring
The prompt to generate the image.
return_typestring
The return type for the image.one of url · binary · base64
aspect_ratiostring
The aspect ratio for the image.one of 1:1 · 16:9 · 21:9 · 3:2 · 2:3 · 4:5
output_formatstring
The output format of the image.one of png · svg
advance_configobject
Advanced configuration options.
file_store_keystring
Optional file store key.

Image Generation MCP

CI codecov PyPI Python License Docker Docs llms.txt Template

Multi-provider image generation MCP server built on FastMCP. Generate images from Claude Desktop, Claude Code, or any MCP client using OpenAI, Google Gemini, Stable Diffusion (SD WebUI), or a zero-cost placeholder provider.

Documentation | Config wizard | PyPI | Docker

Features

  • Multi-provider — OpenAI (gpt-image-1.5, gpt-image-1, dall-e-3), Google Gemini (gemini-2.5-flash-image, gemini-3.x previews), SD WebUI (Stable Diffusion / Forge / reForge), and a zero-cost placeholder for testing.
  • Per-model style metadata — every model carries a style_profile (strengths, prompt grammar, lifecycle); list_providers includes a top-level warnings array for deprecated models. See Model Catalog.
  • Keyword-based auto-selection — provider="auto" routes by prompt content (text/logo → OpenAI, photoreal/anime → SD WebUI, draft → placeholder).
  • CDN-style image transforms — image://{id}/view?format=webp&width=512&crop_x=... resizes / re-encodes / crops on demand without re-generating.
  • Hybrid background tasks — long-running SD generations run with task=True (poll for status); short OpenAI calls stream progress in the foreground.
  • MCP Apps gallery + viewer — interactive UI surfaces (browse generated images, edit / crop / rotate) for clients that support app: resources.
  • Production deployment — Docker (multi-arch), .deb/.rpm with hardened systemd, OIDC + bearer auth, persistent EventStore for HTTP session resumability.

What you can do with it

With this server mounted in an MCP client, you can ask:

  • "Generate a coffee mug product photo on a worn oak table, 16:9, no text." Routes to gpt-image-1.5 for typography-aware photorealism.
  • "Create three concept-art variations of a cyberpunk alley at dusk." Composes generate_image with provider="sd_webui" and a stylised checkpoint like dreamshaperXL.
  • "Crop this image to a 1:1 square centred on the subject and resize to 512px." Uses image://{id}/view?width=512&height=512&crop_x=... resource transforms.
  • "Show me my recent generations." Browses the gallery via the image://list resource and the MCP Apps gallery viewer.
  • "Save this style as 'cyberpunk-night' so I can apply it to future requests." Uses the style library — markdown briefs the LLM interprets per-provider.

Installation

From PyPI

pip install image-generation-mcp

If you add optional extras via the PROJECT-EXTRAS-START / PROJECT-EXTRAS-END sentinels in pyproject.toml, document them below:

ExtraIncludesUse when
mcpfastmcp[tasks]>=3.0,<4Background-task support (task=True) — required for long SD generations.
openaiopenai>=1.0Enables the OpenAI provider.
google-genaigoogle-genai>=1.0Enables the Gemini provider.
allfastmcp[tasks] + openai + google-genaiEverything except SD WebUI (which is HTTP-only — no extra needed).

Example: pip install image-generation-mcp[all].

From source

git clone https://github.com/pvliesdonk/image-generation-mcp.git
cd image-generation-mcp
uv sync --all-extras --all-groups

Docker

docker pull ghcr.io/pvliesdonk/image-generation-mcp:latest

A compose.yml ships at the repo root as a starting point — copy .env.example to .env, edit, and docker compose up -d.

To attach a remote Python debugger (development only — the protocol is unauthenticated), see Remote debugging.

Linux packages (.deb / .rpm)

Download .deb or .rpm packages from the GitHub Releases page. Both install a hardened systemd unit; env configuration is sourced from /etc/image-generation-mcp/env (copy from the shipped /etc/image-generation-mcp/env.example).

Claude Desktop (.mcpb bundle)

Download the .mcpb bundle from the GitHub Releases page and double-click to install, or run:

mcpb install image-generation-mcp-<version>.mcpb

Claude Desktop prompts for required env vars via a GUI wizard — no manual JSON editing needed.

For manual Claude Desktop configuration and setup options, see Claude Desktop deployment.

Quick start

image-generation-mcp serve                                # stdio transport
image-generation-mcp serve --transport http --port 8000   # streamable HTTP

For library usage (embedding the domain logic without the MCP transport), import from the image_generation_mcp package directly — see the project's domain modules under src/image_generation_mcp/ for entry points.

Server info

The server registers a built-in get_server_info tool (via fastmcp_pvl_core.register_server_info_tool) so operators can confirm the deployed version with a single MCP call. The default response carries server_name, server_version, and core_version. Servers that talk to a remote upstream wire upstream version reporting inside the DOMAIN-UPSTREAM-START / DOMAIN-UPSTREAM-END sentinel in src/image_generation_mcp/server.py — see CLAUDE.md for the wiring pattern.

Configuration

Core environment variables shared across all fastmcp-pvl-core-based services:

VariableDefaultDescription
FASTMCP_LOG_LEVELINFOLog level for FastMCP internals and app loggers (DEBUG / INFO / WARNING / ERROR). The -v CLI flag overrides to DEBUG.
FASTMCP_ENABLE_RICH_LOGGINGtrueSet to false for plain / structured JSON log output.
IMAGE_GENERATION_MCP_KV_STORE_URLfile:///data/statePersistent-state backend URL for pvl-core subsystems — file:///path (survives restarts), memory:// (dev/ephemeral).

Domain-specific variables go below under Domain configuration.

Authentication

Callers authenticate via a bearer token or OIDC (mutually exclusive). See the Authentication guide for setup, mapped multi-subject tokens, OIDC, and troubleshooting.

Post-scaffold checklist

After copier copy and gh repo create --push:

  1. Fill in the DOMAIN blocks in this README (Features, What you can do with it, Domain configuration, Key design decisions) and in CLAUDE.md.
  2. Configure GitHub secrets — see below.
  3. Install dev + docs tooling: uv sync --all-extras --all-groups.
  4. Install pre-commit hooks: uv run pre-commit install.
  5. Run the gate locally: uv run pytest -x -q && uv run ruff check --fix . && uv run ruff format . && uv run mypy src/ tests/.
  6. Push the first commit — CI should be green.

GitHub secrets

CI workflows reference three repository secrets. Configure them via Settings → Secrets and variables → Actions or with gh secret set:

SecretUsed byHow to generate
RELEASE_TOKENrelease.yml, copier-update.ymlFine-grained PAT at https://github.com/settings/personal-access-tokens/new with contents: write and pull_requests: write (the copier-update cron opens PRs). Scoped to this repo.
CODECOV_TOKENci.ymlhttps://codecov.io — sign in with GitHub, add the repo, copy the upload token from the repo settings page.
CLAUDE_CODE_OAUTH_TOKENclaude.yml, claude-code-review.ymlRun claude setup-token locally and paste the result.
gh secret set RELEASE_TOKEN
gh secret set CODECOV_TOKEN
gh secret set CLAUDE_CODE_OAUTH_TOKEN

GITHUB_TOKEN is auto-provided — no action needed.

Local development

The PR gate (matches CI):

uv run pytest -x -q                                  # tests
uv run ruff check --fix . && uv run ruff format .    # lint + format
uv run mypy src/ tests/                              # type-check

Pre-commit runs a subset of the gate on each commit; see .pre-commit-config.yaml for details, or CLAUDE.md for the full Hard PR Acceptance Gates.

Troubleshooting

Moving a scaffolded project

uv sync creates .venv/bin/* scripts with absolute shebangs pointing at the venv Python. If you move the repo after scaffolding (mv /old/path /new/path), uv run pytest fails with ModuleNotFoundError: No module named 'fastmcp' because the stale shebang resolves to a different interpreter than the venv's site-packages.

Fix:

rm -rf .venv
uv sync --all-extras --all-groups

uv run python -m pytest also works as a one-shot workaround (bypasses the stale entry-script shim).

uv.lock refresh after copier update

When copier update introduces new dependencies (e.g. a new extra added to pyproject.toml.jinja), CI runs uv sync --frozen which fails against a stale lockfile. Run uv lock locally and commit the refreshed uv.lock alongside accepting the copier-update PR.

Links

  • Documentation
  • llms.txt
  • FastMCP
  • fastmcp-pvl-core

Domain configuration

All domain environment variables use the IMAGE_GENERATION_MCP_ prefix.

Core

VariableDefaultRequiredDescription
IMAGE_GENERATION_MCP_SCRATCH_DIR~/.image-generation-mcp/images/NoDirectory for saved generated images.
IMAGE_GENERATION_MCP_READ_ONLYtrueNoHide write-tagged tools (generate_image). Set to false to enable generation.
IMAGE_GENERATION_MCP_DEFAULT_PROVIDERautoNoDefault provider: auto, openai, gemini, sd_webui, placeholder.

Providers

VariableDefaultRequiredDescription
IMAGE_GENERATION_MCP_OPENAI_API_KEY—NoOpenAI API key; enables OpenAI provider when set.
IMAGE_GENERATION_MCP_GOOGLE_API_KEY—NoGoogle API key with Gemini access; enables Gemini provider when set.
IMAGE_GENERATION_MCP_SD_WEBUI_HOST—NoSD WebUI URL (e.g. http://localhost:7860); enables SD WebUI provider when set. Deprecated alias: A1111_HOST.
IMAGE_GENERATION_MCP_SD_WEBUI_MODEL—NoSD WebUI checkpoint name for preset detection and override. Deprecated alias: A1111_MODEL.

Authentication

VariableDefaultRequiredDescription
IMAGE_GENERATION_MCP_BEARER_TOKEN—NoStatic bearer token; enables bearer auth when set.
IMAGE_GENERATION_MCP_BASE_URL—NoPublic base URL for OIDC and MCP File Exchange downloads (e.g. https://mcp.example.com).
IMAGE_GENERATION_MCP_OIDC_CONFIG_URL—NoOIDC discovery endpoint URL.
IMAGE_GENERATION_MCP_OIDC_CLIENT_ID—NoOIDC client ID.
IMAGE_GENERATION_MCP_OIDC_CLIENT_SECRET—NoOIDC client secret.
IMAGE_GENERATION_MCP_OIDC_JWT_SIGNING_KEYephemeralYes on Linux/DockerJWT signing key.
IMAGE_GENERATION_MCP_OIDC_AUDIENCE—NoExpected JWT audience claim.
IMAGE_GENERATION_MCP_OIDC_REQUIRED_SCOPESopenidNoComma-separated required scopes.
IMAGE_GENERATION_MCP_OIDC_VERIFY_ACCESS_TOKENfalseNoVerify access token as JWT instead of id token.

Cost control & performance

VariableDefaultRequiredDescription
IMAGE_GENERATION_MCP_PAID_PROVIDERSopenai,geminiNoComma-separated paid provider names. Triggers elicitation confirmation on capable clients. Set to empty to disable.
IMAGE_GENERATION_MCP_TRANSFORM_CACHE_SIZE64NoMax cached transforms. Set to 0 to disable caching.

File Exchange (MCP downloads)

VariableDefaultRequiredDescription
IMAGE_GENERATION_MCP_FILE_EXCHANGE_ENABLEDtrue on http/sse, false on stdioNoMaster switch for the file-exchange producer. Set false to suppress all file_ref publishing.
IMAGE_GENERATION_MCP_FILE_EXCHANGE_TTL3600NoDefault and maximum TTL (seconds) for published files and download URLs. create_download_link's ttl_seconds is clamped to this.
IMAGE_GENERATION_MCP_FILE_EXCHANGE_CONSUMEtrueRecommended falseMaster switch for the consumer side. This server is producer-only; set false to silence the upstream "consume on, no consumer_sink wired" startup warning.

Server identity

VariableDefaultRequiredDescription
IMAGE_GENERATION_MCP_SERVER_NAMEimage-generation-mcpNoServer name shown to MCP clients.
IMAGE_GENERATION_MCP_INSTRUCTIONS(dynamic)NoSystem instructions for LLM context.
IMAGE_GENERATION_MCP_HTTP_PATH/mcpNoHTTP endpoint mount path.
IMAGE_GENERATION_MCP_APP_DOMAIN(auto)NoMCP Apps widget sandbox domain. Auto-computed from BASE_URL for Claude; override for other hosts.

Domain-config fields are composed inside src/image_generation_mcp/config.py between the CONFIG-FIELDS-START / CONFIG-FIELDS-END sentinels; env reads go through fastmcp_pvl_core.env(_ENV_PREFIX, "SUFFIX", default) so naming stays consistent.

For the full MCP tool / resource / prompt surface and per-provider setup notes, see the documentation site.

Key design decisions

  • Multi-provider with capability discovery, not feature flags. Each provider's discover_capabilities() reports its actual supported aspect ratios / qualities / formats / negative-prompt support at startup; routing logic asks the capability surface, not a hard-coded enum. New providers slot in by implementing the protocol — no router edits needed. (See docs/decisions/0001-…, 0002-…, 0007-….)
  • Per-model style_profile metadata, surfaced via list_providers. Closed-list providers (OpenAI, Gemini, placeholder) use exact-key lookup; SD WebUI uses a regex-ordered pattern table. Profiles include lifecycle flags (current / legacy / deprecated) and feed an auto-built top-level warnings array. (See docs/decisions/0009-….)
  • Hybrid background tasks. Short calls (OpenAI ~5 s) stream progress in-line; long calls (SD WebUI 30-180 s) run as background tasks with check_generation_status polling — clients pick the mode via task=True. (See docs/decisions/0005-….)
  • Image asset model: content-addressed registry + sidecar JSON metadata + on-demand transforms. Generated images keep their full-resolution original; image://{id}/view?format=webp&width=512&crop_x=… resources do format conversion / resize / crop on demand without re-generating. Transforms are cached. (See docs/decisions/0006-….)
  • Style library. User-saved markdown briefs (with YAML frontmatter for tags / aspect ratio / quality) that the LLM interprets per-provider — not copy-pasted verbatim. Distinct from per-model style_profile: style library is the brief; style_profile describes the model. (See docs/decisions/0008-… and 0009-… for disambiguation.)
  • Composes fastmcp_pvl_core.ServerConfig, never inherits. Domain config goes between CONFIG-FIELDS-START / CONFIG-FIELDS-END sentinels; env reads route through fastmcp_pvl_core.env(...) to keep prefix naming consistent.
Featured
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →

Configuration

IMAGE_GENERATION_MCP_READ_ONLYdefault: true

When true, write-tagged tools (generate_image) are hidden. Set to false to enable image generation.

IMAGE_GENERATION_MCP_SCRATCH_DIR

Directory for saved generated images. Created automatically on first use.

IMAGE_GENERATION_MCP_DEFAULT_PROVIDERdefault: auto

Default provider selection: auto (keyword-based), openai, sd_webui, or placeholder. Deprecated alias 'a1111' is also accepted.

IMAGE_GENERATION_MCP_OPENAI_API_KEYsecret

OpenAI API key. Enables the OpenAI provider (gpt-image-1, dall-e-3) when set.

IMAGE_GENERATION_MCP_SD_WEBUI_HOST

SD WebUI base URL (e.g. http://localhost:7860). Enables the SD WebUI provider when set.

IMAGE_GENERATION_MCP_SD_WEBUI_MODEL

SD WebUI checkpoint name for model-aware preset detection (SD 1.5 vs SDXL vs Lightning vs Flux).

IMAGE_GENERATION_MCP_A1111_HOST

Deprecated — use SD_WEBUI_HOST instead. Accepted as fallback.

IMAGE_GENERATION_MCP_A1111_MODEL

Deprecated — use SD_WEBUI_MODEL instead. Accepted as fallback.

FASTMCP_LOG_LEVELdefault: INFO

Log level for FastMCP internals; app loggers default to INFO, -v overrides both to DEBUG.

IMAGE_GENERATION_MCP_SERVER_NAMEdefault: image-generation-mcp

Server name shown to MCP clients in the initialization response.

IMAGE_GENERATION_MCP_INSTRUCTIONS

System-level instructions injected into LLM context.

IMAGE_GENERATION_MCP_BEARER_TOKENsecret

Static bearer token for HTTP authentication.

IMAGE_GENERATION_MCP_AUTH_MODE

OIDC auth mode: 'remote' (JWKS validation) or 'oidc-proxy' (OAuth proxy). Auto-detected if not set.

IMAGE_GENERATION_MCP_BASE_URL

Public base URL for OIDC redirects (e.g. https://mcp.example.com). Required for OIDC.

IMAGE_GENERATION_MCP_OIDC_CONFIG_URL

OIDC discovery endpoint URL. Required for OIDC.

IMAGE_GENERATION_MCP_OIDC_CLIENT_ID

OIDC client ID. Required for OIDC.

IMAGE_GENERATION_MCP_OIDC_CLIENT_SECRETsecret

OIDC client secret. Required for OIDC.

IMAGE_GENERATION_MCP_OIDC_JWT_SIGNING_KEYsecret

Signing key for OIDC session JWTs (critical on Linux/Docker).

IMAGE_GENERATION_MCP_OIDC_AUDIENCE

Expected JWT audience claim. Leave unset if your provider does not set one.

IMAGE_GENERATION_MCP_OIDC_REQUIRED_SCOPES

Comma-separated required OIDC scopes (default: openid for oidc-proxy mode, none for remote mode).

IMAGE_GENERATION_MCP_OIDC_VERIFY_ACCESS_TOKENdefault: false

Verify access token JWT instead of id_token.

IMAGE_GENERATION_MCP_HTTP_PATHdefault: /mcp

HTTP endpoint mount path for streamable-HTTP transport.

IMAGE_GENERATION_MCP_EVENT_STORE_URLdefault: file:///data/state/events

EventStore backend for SSE session resumability. file:///path (file-backed, survives restarts) or memory:// (in-process only, dev mode).

PUIDdefault: 1000

Run as this UID (Docker entrypoint).

Categories
AI & LLM Tools
Registryactive
Packageimage-generation-mcp
TransportSTDIO, HTTP
AuthRequired
UpdatedApr 29, 2026
View on GitHub

Related AI & LLM Tools MCP Servers

View all →
SkillFM LLM Cost Optimizer

io.github.ericm1018/skillfm-llm-cost-optimizer-openai-anthropic-usage

LLM cost optimizer for OpenAI, Anthropic, token usage, BYOK, and SkillFM Beacon audits.
Llm Orchestration Agent

io.github.mikerawsonnz/llm-orchestration-agent

Run a prompt through a LangChain (system + human) chain over Gemini on Vertex AI; optional LangSmith
Authenticated Llm Agent

io.github.mikerawsonnz/authenticated-llm-agent

JWT-gated LLM gateway: authenticate (bcrypt/JWT), then run a LangChain-on-Vertex Gemini completion.
Copilot Memory MCP

labforgedev/copilot-memory-mcp

Persistent semantic memory for AI agents using local ChromaDB vector search. No cloud required.
1
Agent Prompt Injection Firewall Mcp

csoai-org/agent-prompt-injection-firewall-mcp

The WAF for agents. Pattern-based + heuristic firewall scans prompts, RAG documents, tool argume...
Authenticated Multi Llm Agent

io.github.mikerawsonnz/authenticated-multi-llm-agent

Google-OAuth-gated LLM gateway: verify a Google ID token, then run a Gemini (Vertex AI) completion f