Plugs Claude directly into Google's Gemini image generation API, specifically the Nano Banana and Nano Banana Pro models. You get five tools over stdio: list available models, generate images from text prompts, edit or compose existing images with instructions, batch generate consistent image sets from a master prompt, and run multi-turn refinement sessions via the Interactions API. Images land on disk by default with paths returned, or you can pull them inline as base64. Requires a Gemini API key and runs through the generativelanguage.googleapis.com v1beta endpoint. Reach for this when you need Claude to generate or iterate on images without leaving the MCP context.
MCP server for Google Gemini image generation and editing. Exposes four tools to Claude over stdio: list available models, generate images from text prompts, edit or compose images with text instructions, and generate a consistent set of images from a master prompt. Images are written to disk by default (path returned) or returned inline as base64. Built on the Gemini v1beta API (generativelanguage.googleapis.com) using the Nano Banana / Nano Banana Pro model family.
Developed and maintained by AI (Claude Code).
| Variable | Required | Description |
|---|---|---|
GEMINI_API_KEY | Yes | Your Google Gemini API key (aistudio.google.com/apikey) |
GEMINI_IMAGE_MODEL | No | Override the default image model (default: gemini-3-pro-image) |
GEMINI_OUTPUT_DIR | No | Default directory for generated images (default: current working directory) |
GEMINI_INPUT_DIR | No | Directory to resolve bare input-image filenames against (so images: ["foo.jpg"] works) |
| Tool | Description |
|---|---|
gemini_list_models | List available Gemini image models and the current default |
gemini_generate_image | Generate image(s) from a text prompt |
gemini_edit_image | Edit or compose one or more input images with a text instruction |
gemini_generate_set | Generate a master image plus N consistent images referencing it |
gemini_interact | Multi-turn generation/editing via the Interactions API (Beta); returns an interaction_id to refine the same image conversationally |
{
"mcpServers": {
"gemini": {
"command": "npx",
"args": ["-y", "@chrischall/gemini-mcp"],
"env": {
"GEMINI_API_KEY": "your-api-key-here"
}
}
}
}
See SKILL.md for full usage documentation.
GEMINI_API_KEY*secretYour Google Gemini API key (aistudio.google.com/apikey)
GEMINI_IMAGE_MODELOverride the default image model (default: gemini-3-pro-image)
GEMINI_OUTPUT_DIRDefault directory for generated images (default: current working directory)
GEMINI_INPUT_DIRDirectory to resolve bare input-image filenames against
io.github.ericm1018/skillfm-llm-cost-optimizer-openai-anthropic-usage
io.github.mikerawsonnz/llm-orchestration-agent
io.github.mikerawsonnz/authenticated-llm-agent
labforgedev/copilot-memory-mcp
csoai-org/agent-prompt-injection-firewall-mcp
io.github.mikerawsonnz/authenticated-multi-llm-agent