MCP Gemini

8 toolsSTDIOregistry active

Summary

Connects Claude to Google's Gemini AI with 39 tools spanning text generation, image creation via Imagen 4, video generation with Veo, document analysis, YouTube summarization, text-to-speech, and multi-step web research. You get structured output extraction, content caching for repeated queries, code execution, and Google Search grounding for queries. Notable for image editing sessions where you can iteratively refine generated images across multiple turns. Runs locally via stdio so your API key stays on your machine. Reach for this when you want Claude to tap Gemini's multimodal capabilities, especially for tasks like analyzing PDFs, generating videos, or running deep research operations that span multiple web sources.

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

Tools

Public tool metadata for what this MCP can expose to an agent.

8 tools

GEMINI_COUNT_TOKENSCounts the number of tokens in text using Gemini tokenization. Useful for estimating costs, checking input limits, and optimizing prompts before making API calls.2 params

Counts the number of tokens in text using Gemini tokenization. Useful for estimating costs, checking input limits, and optimizing prompts before making API calls.

Parameters* required

textstring

Text to count tokens for

modelstring

Model to use for token counting. Examples: 'gemini-1.5-flash', 'gemini-1.5-pro'default: gemini-1.5-flash

GEMINI_EMBED_CONTENTGenerates text embeddings using Gemini embedding models. Converts text into numerical vectors for semantic search, similarity comparison, clustering, and classification tasks.4 params

Generates text embeddings using Gemini embedding models. Converts text into numerical vectors for semantic search, similarity comparison, clustering, and classification tasks.

Parameters* required

textstring

Text to generate embeddings for

modelstring

Embedding model to use. Examples: 'text-embedding-004', 'embedding-001'default: text-embedding-004

titlestring

Optional title for the content (for document embeddings)

task_typestring

Task type: 'RETRIEVAL_QUERY', 'RETRIEVAL_DOCUMENT', 'SEMANTIC_SIMILARITY', 'CLASSIFICATION', 'CLUSTERING'

GEMINI_GENERATE_CONTENTGenerates text content from prompts using Gemini models. Supports various models like Gemini Flash and Pro with configurable temperature, token limits, and safety settings for diverse text generation tasks.9 params

Generates text content from prompts using Gemini models. Supports various models like Gemini Flash and Pro with configurable temperature, token limits, and safety settings for diverse text generation tasks.

Parameters* required

modelstring

Model to use. Examples: 'gemini-1.5-flash', 'gemini-1.5-pro', 'gemini-2.0-flash-exp'default: gemini-1.5-flash

top_kinteger

Top-k sampling parameter

top_pnumber

Nucleus sampling parameter (0.0 to 1.0)

promptstring

Text prompt for content generation

temperaturenumber

Controls randomness (0.0 to 2.0)

stop_sequencesarray

Sequences where generation should stop

safety_settingsarray

Safety filter settings

max_output_tokensinteger

Maximum number of tokens to generate

system_instructionstring

System instruction to guide the model's behavior

GEMINI_GENERATE_IMAGEGenerates images from text prompts using Gemini 2.5 Flash Image Preview model (Nano Banana). Supports creative image generation with customizable parameters like aspect ratio, safety settings, and optional local file saving. Generated images are automatically uploaded to S3 an...9 params

Generates images from text prompts using Gemini 2.5 Flash Image Preview model (Nano Banana). Supports creative image generation with customizable parameters like aspect ratio, safety settings, and optional local file saving. Generated images are automatically uploaded to S3 an...

Parameters* required

modelstring

Model to use. Use 'gemini-2.5-flash-image-preview' for image generationdefault: gemini-2.5-flash-image-preview

top_kinteger

Top-k sampling parameter

top_pnumber

Nucleus sampling parameter (0.0 to 1.0)

promptstring

Text prompt for image generation

save_pathstring

Optional local path to save the generated image

temperaturenumber

Controls randomness (0.0 to 2.0)

safety_settingsarray

Safety filter settings

max_output_tokensinteger

Maximum number of tokens to generate (max 32,768)

system_instructionstring

System instruction to guide image generation behavior

GEMINI_GENERATE_VIDEOSGenerates videos from text prompts using Google's Veo models. Creates high-quality video content. Returns operation ID for tracking progress. After this, call GEMINI_WAIT_FOR_VIDEO to download the video using the operation ID.4 params

Generates videos from text prompts using Google's Veo models. Creates high-quality video content. Returns operation ID for tracking progress. After this, call GEMINI_WAIT_FOR_VIDEO to download the video using the operation ID.

Parameters* required

modelstring

Model to use. Examples: 'veo-3.0-generate-preview', 'veo-3.0-fast-generate-preview', 'veo-2.0-generate-001'default: veo-3.0-generate-preview

extrasobject

Additional parameters passed through to API

promptstring

Text prompt for Veo video generation

person_generationstring

Controls person generation in videos. Values: 'allow_adult' or 'dont_allow'. IMPORTANT: Veo 3 models in EU/UK/CH/MENA regions ONLY support 'allow_adult'. Veo 2 models support both values in all regions.

GEMINI_GET_VIDEOS_OPERATIONChecks the status of a Veo video generation operation. Use the operation name from GenerateVideos to track progress and get the download URL when complete.1 params

Checks the status of a Veo video generation operation. Use the operation name from GenerateVideos to track progress and get the download URL when complete.

Parameters* required

operation_namestring

Operation resource name returned by predictLongRunning

GEMINI_LIST_MODELSLists available Gemini and Veo models with their capabilities and limits. Useful for discovering supported models and their features before making generation requests.1 params

Lists available Gemini and Veo models with their capabilities and limits. Useful for discovering supported models and their features before making generation requests.

Parameters* required

filter_prefixstring

Filter models by name prefix (client-side). Leave empty to get all models.default:

GEMINI_WAIT_FOR_VIDEOPolls a Veo video generation operation until completion, then downloads and returns the video as a FileDownloadable with public URL.1 params

Polls a Veo video generation operation until completion, then downloads and returns the video as a FileDownloadable with public URL.

Parameters* required

operation_namestring

The operation name from video generation (e.g., 'models/...')

MCP Gemini CrunchTools

A secure MCP (Model Context Protocol) server for Google Gemini AI - text, image, video, research, and more.

Overview

This MCP server is designed to be:

Secure by default - Comprehensive threat modeling, input validation, and API key protection
No third-party services - Runs locally via stdio, your API key never leaves your machine
Cross-platform - Works on Linux, macOS, and Windows
Automatically updated - GitHub Actions monitor for CVEs and update dependencies
Containerized - Available at quay.io/crunchtools/mcp-gemini built on Hummingbird Python base image

Naming Convention

Component	Name
GitHub repo	crunchtools/mcp-gemini
Container	`quay.io/crunchtools/mcp-gemini`
Python package (PyPI)	`mcp-gemini-crunchtools`
CLI command	`mcp-gemini-crunchtools`
Module import	`mcp_gemini_crunchtools`

Why Hummingbird?

The container image is built on the Hummingbird Python base image from Project Hummingbird, which provides:

Minimal CVE exposure - Hummingbird images are built with a minimal package set, dramatically reducing the attack surface compared to general-purpose images
Regular updates - Security patches are applied promptly, keeping CVE counts low
Optimized for Python - Pre-configured Python environment with uv package manager for fast, reproducible builds
Production-ready - Designed for production workloads with proper signal handling and non-root user defaults

This means your MCP server runs in a hardened environment with fewer vulnerabilities than typical Python container images.

Features

Query Tools (5 tools)

gemini_query - Query Gemini with optional Google Search grounding
gemini_brainstorm - Generate creative ideas on a topic
gemini_analyze_code - Analyze code for security, performance, bugs
gemini_analyze_text - Analyze text for sentiment, tone, content
gemini_summarize - Summarize content in various formats

Image Generation (4 tools)

gemini_generate_image - Generate images from text prompts (native Gemini)
gemini_generate_image_with_input - Edit/modify existing images
gemini_image_prompt - Craft effective image generation prompts
gemini_imagen_generate - Generate images using Google Imagen 4 models

Image Editing (4 tools)

gemini_start_image_edit - Start a multi-turn image editing session
gemini_continue_image_edit - Continue editing in an active session
gemini_end_image_edit - End an image editing session
gemini_list_image_sessions - List all active editing sessions

Image Analysis (1 tool)

gemini_analyze_image - Analyze and describe local image files

Search Tools (1 tool)

gemini_search - Web search using Gemini with Google Search grounding

Document Tools (3 tools)

gemini_analyze_document - Analyze PDFs, DOCX, TXT, etc.
gemini_summarize_pdf - Summarize PDF documents
gemini_extract_tables - Extract tables from documents

URL Tools (3 tools)

gemini_analyze_url - Analyze one or more URLs
gemini_compare_urls - Compare two URLs
gemini_extract_from_url - Extract specific data from a URL

Video Tools (2 tools)

gemini_generate_video - Generate videos using Veo
gemini_check_video - Check video generation status

YouTube Tools (2 tools)

gemini_youtube - Analyze YouTube videos
gemini_youtube_summary - Summarize YouTube videos

Voice Tools (3 tools)

gemini_speak - Convert text to speech
gemini_dialogue - Generate multi-voice dialogue audio
gemini_list_voices - List available voices

Research Tools (3 tools)

gemini_deep_research - Perform multi-step web research
gemini_check_research - Check research operation status
gemini_research_followup - Ask follow-up questions

Cache Tools (4 tools)

gemini_create_cache - Create content cache for repeated queries
gemini_query_cache - Query cached content
gemini_list_caches - List all active caches
gemini_delete_cache - Delete a cache

Structured Output Tools (2 tools)

gemini_structured - Get structured JSON output
gemini_extract - Extract structured data from text

Token Tools (1 tool)

gemini_count_tokens - Count tokens in content

Code Execution Tools (1 tool)

gemini_run_code - Execute Python code via Gemini

Total: 39 tools

Installation

With uvx (Recommended)

uvx mcp-gemini-crunchtools

With pip

pip install mcp-gemini-crunchtools

With Container

# Create a shared output directory (required before first run)
mkdir -p ~/.local/share/mcp-uploads-downloads

podman run -v ~/.local/share/mcp-uploads-downloads:/output:z \
    -e GEMINI_API_KEY=your_key \
    -e GEMINI_OUTPUT_DIR=/output \
    quay.io/crunchtools/mcp-gemini

SELinux note: Use :z (lowercase, shared) instead of :Z (uppercase, private). MCP servers run as long-lived stdio processes. With :Z, files copied into the directory after container start won't have the container's private MCS label and will be invisible inside the container. The :z flag sets a shared container_file_t context that all containers and the host can read/write.

Tip: Use the same shared directory (~/.local/share/mcp-uploads-downloads/) across multiple MCP container servers (e.g., mcp-gemini and mcp-wordpress) so generated images are immediately available for WordPress upload without copying.

Configuration

Creating a Google Gemini API Key

Navigate to Google AI Studio
- Go to https://aistudio.google.com/apikey
- Sign in with your Google account
Create API Key
- Click "Get API key" or "Create API key"
- Select a Google Cloud project or create a new one
- Click "Create API key in new project" (or select existing project)
Copy Your API Key
- IMPORTANT: Copy the API key immediately - store it securely!
- The key starts with AI... (e.g., AIzaSy...)

Add to Claude Code

claude mcp add mcp-gemini-crunchtools \
    --env GEMINI_API_KEY=your_api_key_here \
    -- uvx mcp-gemini-crunchtools

Or for the container version:

# Create a shared output directory (required before first run)
mkdir -p ~/.local/share/mcp-uploads-downloads

claude mcp add mcp-gemini-crunchtools \
    --env GEMINI_API_KEY=your_api_key_here \
    --env GEMINI_OUTPUT_DIR=/output \
    -- podman run -i --rm \
        -v ~/.local/share/mcp-uploads-downloads:/output:z \
        -e GEMINI_API_KEY \
        -e GEMINI_OUTPUT_DIR=/output \
        quay.io/crunchtools/mcp-gemini

Optional: Set Output Directory (non-container)

For generated images, audio, and videos when running without a container:

claude mcp add mcp-gemini-crunchtools \
    --env GEMINI_API_KEY=your_api_key_here \
    --env GEMINI_OUTPUT_DIR=$HOME/.local/share/mcp-uploads-downloads \
    -- uvx mcp-gemini-crunchtools

Usage Examples

Query with Google Search

User: What are the latest developments in quantum computing?
Assistant: [calls gemini_query with use_google_search=true]

Generate an Image

User: Generate a photorealistic image of a sunset over mountains
Assistant: [calls gemini_generate_image with prompt and style]

Analyze a PDF Document

User: Analyze this research paper at /path/to/paper.pdf
Assistant: [calls gemini_analyze_document with file_path]

Summarize a YouTube Video

User: Summarize this YouTube video: https://youtube.com/watch?v=...
Assistant: [calls gemini_youtube_summary with url]

Deep Research

User: Research the environmental impact of electric vehicles
Assistant: [calls gemini_deep_research then gemini_check_research]

Code Analysis

User: Analyze this Python code for security issues
Assistant: [calls gemini_analyze_code with focus="security"]

Security

This server was designed with security as a primary concern. See SECURITY.md for:

Threat model and attack vectors
Defense in depth architecture
API key handling best practices
Input validation rules
Audit logging

Key Security Features

API Key Protection
- Stored as SecretStr (never accidentally logged)
- Environment variable only (never in files or args)
- Sanitized from all error messages
Input Validation
- Pydantic models for all inputs
- File path validation
- URL validation
- Strict format validation
API Hardening
- Hardcoded API base URL (prevents SSRF)
- TLS certificate validation
- Request timeouts
- Response size limits
Automated CVE Scanning
- GitHub Actions scan dependencies weekly
- Automatic PRs for security updates
- Dependabot alerts enabled

Development

Setup

git clone https://github.com/crunchtools/mcp-gemini.git
cd mcp-gemini
uv sync

Run Tests

uv run pytest

Lint and Type Check

uv run ruff check src tests
uv run mypy src

Build Container

podman build -t mcp-gemini .

License

AGPL-3.0-or-later

Contributing

Contributions welcome! Please read SECURITY.md before submitting security-related changes.

Links

Featured

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

MCP Gemini CrunchTools

A secure MCP (Model Context Protocol) server for Google Gemini AI - text, image, video, research, and more.

Overview

This MCP server is designed to be:

Secure by default - Comprehensive threat modeling, input validation, and API key protection
No third-party services - Runs locally via stdio, your API key never leaves your machine
Cross-platform - Works on Linux, macOS, and Windows
Automatically updated - GitHub Actions monitor for CVEs and update dependencies
Containerized - Available at quay.io/crunchtools/mcp-gemini built on Hummingbird Python base image

Naming Convention

Component	Name
GitHub repo	crunchtools/mcp-gemini
Container	`quay.io/crunchtools/mcp-gemini`
Python package (PyPI)	`mcp-gemini-crunchtools`
CLI command	`mcp-gemini-crunchtools`
Module import	`mcp_gemini_crunchtools`

Why Hummingbird?

The container image is built on the Hummingbird Python base image from Project Hummingbird, which provides:

Minimal CVE exposure - Hummingbird images are built with a minimal package set, dramatically reducing the attack surface compared to general-purpose images
Regular updates - Security patches are applied promptly, keeping CVE counts low
Optimized for Python - Pre-configured Python environment with uv package manager for fast, reproducible builds
Production-ready - Designed for production workloads with proper signal handling and non-root user defaults

This means your MCP server runs in a hardened environment with fewer vulnerabilities than typical Python container images.

Features

Query Tools (5 tools)

gemini_query - Query Gemini with optional Google Search grounding
gemini_brainstorm - Generate creative ideas on a topic
gemini_analyze_code - Analyze code for security, performance, bugs
gemini_analyze_text - Analyze text for sentiment, tone, content
gemini_summarize - Summarize content in various formats

Image Generation (4 tools)

gemini_generate_image - Generate images from text prompts (native Gemini)
gemini_generate_image_with_input - Edit/modify existing images
gemini_image_prompt - Craft effective image generation prompts
gemini_imagen_generate - Generate images using Google Imagen 4 models

Image Editing (4 tools)

gemini_start_image_edit - Start a multi-turn image editing session
gemini_continue_image_edit - Continue editing in an active session
gemini_end_image_edit - End an image editing session
gemini_list_image_sessions - List all active editing sessions

Image Analysis (1 tool)

gemini_analyze_image - Analyze and describe local image files

Search Tools (1 tool)

gemini_search - Web search using Gemini with Google Search grounding

Document Tools (3 tools)

gemini_analyze_document - Analyze PDFs, DOCX, TXT, etc.
gemini_summarize_pdf - Summarize PDF documents
gemini_extract_tables - Extract tables from documents

URL Tools (3 tools)

gemini_analyze_url - Analyze one or more URLs
gemini_compare_urls - Compare two URLs
gemini_extract_from_url - Extract specific data from a URL

Video Tools (2 tools)

gemini_generate_video - Generate videos using Veo
gemini_check_video - Check video generation status

YouTube Tools (2 tools)

gemini_youtube - Analyze YouTube videos
gemini_youtube_summary - Summarize YouTube videos

Voice Tools (3 tools)

gemini_speak - Convert text to speech
gemini_dialogue - Generate multi-voice dialogue audio
gemini_list_voices - List available voices

Research Tools (3 tools)

gemini_deep_research - Perform multi-step web research
gemini_check_research - Check research operation status
gemini_research_followup - Ask follow-up questions

Cache Tools (4 tools)

gemini_create_cache - Create content cache for repeated queries
gemini_query_cache - Query cached content
gemini_list_caches - List all active caches
gemini_delete_cache - Delete a cache

Structured Output Tools (2 tools)

gemini_structured - Get structured JSON output
gemini_extract - Extract structured data from text

Token Tools (1 tool)

gemini_count_tokens - Count tokens in content

Code Execution Tools (1 tool)

gemini_run_code - Execute Python code via Gemini

Total: 39 tools

Installation

With uvx (Recommended)

uvx mcp-gemini-crunchtools

With pip

pip install mcp-gemini-crunchtools

With Container

# Create a shared output directory (required before first run)
mkdir -p ~/.local/share/mcp-uploads-downloads

podman run -v ~/.local/share/mcp-uploads-downloads:/output:z \
    -e GEMINI_API_KEY=your_key \
    -e GEMINI_OUTPUT_DIR=/output \
    quay.io/crunchtools/mcp-gemini

SELinux note: Use :z (lowercase, shared) instead of :Z (uppercase, private). MCP servers run as long-lived stdio processes. With :Z, files copied into the directory after container start won't have the container's private MCS label and will be invisible inside the container. The :z flag sets a shared container_file_t context that all containers and the host can read/write.

Tip: Use the same shared directory (~/.local/share/mcp-uploads-downloads/) across multiple MCP container servers (e.g., mcp-gemini and mcp-wordpress) so generated images are immediately available for WordPress upload without copying.

Configuration

Creating a Google Gemini API Key

Navigate to Google AI Studio
- Go to https://aistudio.google.com/apikey
- Sign in with your Google account
Create API Key
- Click "Get API key" or "Create API key"
- Select a Google Cloud project or create a new one
- Click "Create API key in new project" (or select existing project)
Copy Your API Key
- IMPORTANT: Copy the API key immediately - store it securely!
- The key starts with AI... (e.g., AIzaSy...)

Add to Claude Code

claude mcp add mcp-gemini-crunchtools \
    --env GEMINI_API_KEY=your_api_key_here \
    -- uvx mcp-gemini-crunchtools

Or for the container version:

# Create a shared output directory (required before first run)
mkdir -p ~/.local/share/mcp-uploads-downloads

claude mcp add mcp-gemini-crunchtools \
    --env GEMINI_API_KEY=your_api_key_here \
    --env GEMINI_OUTPUT_DIR=/output \
    -- podman run -i --rm \
        -v ~/.local/share/mcp-uploads-downloads:/output:z \
        -e GEMINI_API_KEY \
        -e GEMINI_OUTPUT_DIR=/output \
        quay.io/crunchtools/mcp-gemini

Optional: Set Output Directory (non-container)

For generated images, audio, and videos when running without a container:

claude mcp add mcp-gemini-crunchtools \
    --env GEMINI_API_KEY=your_api_key_here \
    --env GEMINI_OUTPUT_DIR=$HOME/.local/share/mcp-uploads-downloads \
    -- uvx mcp-gemini-crunchtools

Usage Examples

Query with Google Search

User: What are the latest developments in quantum computing?
Assistant: [calls gemini_query with use_google_search=true]

Generate an Image

User: Generate a photorealistic image of a sunset over mountains
Assistant: [calls gemini_generate_image with prompt and style]

Analyze a PDF Document

User: Analyze this research paper at /path/to/paper.pdf
Assistant: [calls gemini_analyze_document with file_path]

Summarize a YouTube Video

User: Summarize this YouTube video: https://youtube.com/watch?v=...
Assistant: [calls gemini_youtube_summary with url]

Deep Research

User: Research the environmental impact of electric vehicles
Assistant: [calls gemini_deep_research then gemini_check_research]

Code Analysis

User: Analyze this Python code for security issues
Assistant: [calls gemini_analyze_code with focus="security"]

Security

This server was designed with security as a primary concern. See SECURITY.md for:

Threat model and attack vectors
Defense in depth architecture
API key handling best practices
Input validation rules
Audit logging

Key Security Features

API Key Protection
- Stored as SecretStr (never accidentally logged)
- Environment variable only (never in files or args)
- Sanitized from all error messages
Input Validation
- Pydantic models for all inputs
- File path validation
- URL validation
- Strict format validation
API Hardening
- Hardcoded API base URL (prevents SSRF)
- TLS certificate validation
- Request timeouts
- Response size limits
Automated CVE Scanning
- GitHub Actions scan dependencies weekly
- Automatic PRs for security updates
- Dependabot alerts enabled

Development

Setup

git clone https://github.com/crunchtools/mcp-gemini.git
cd mcp-gemini
uv sync

Run Tests

uv run pytest

Lint and Type Check

uv run ruff check src tests
uv run mypy src

Build Container

podman build -t mcp-gemini .

License

AGPL-3.0-or-later

Contributing

Contributions welcome! Please read SECURITY.md before submitting security-related changes.