This server gives Claude the ability to execute Python code in throwaway sandboxes that vanish after each run. Scripts declare dependencies inline using PEP 723 metadata blocks, uv resolves them ephemerally, and nothing touches your host Python installation. It exposes execute_python for running code with configurable timeouts, validate_script for checking metadata without execution, and check_environment for inspecting the runtime. On Linux it uses bubblewrap for namespace isolation, elsewhere it spins up Docker containers. You can target Python 3.13 through 3.15 regardless of what's on your host machine. Reach for this when you need Claude to prototype with third-party packages or analyze data without polluting your system with accumulating virtualenvs and package conflicts.
Sandboxed Python execution for AI agents. Scripts run in ephemeral, isolated environments with inline dependencies (PEP 723) -- zero host pollution, zero leftover venvs, zero package conflicts.
Every coding agent can already run Python on your host. The problem is what happens next: packages accumulate, venvs sprawl, and a rogue pip install breaks your system. mcp-python-exec-sandbox eliminates this:
uv# /// script blocksAll setups require:
uvx for running the server without installing it globally.Additional requirements depend on your chosen sandbox backend:
| Setup | Additional requirements | Install |
|---|---|---|
| Native sandbox (Linux) | bubblewrap | sudo apt install bubblewrap |
| Docker sandbox (macOS, any) | Docker Engine | See Docker docs |
| No sandbox | None | -- |
Host Python vs. execution Python: These are independent. Python 3.13+ is needed to run the server process itself. The
--python-versionflag controls which Python version your scripts execute on -- uv downloads the target version automatically. You do not need to install Python 3.14 or 3.15 on your host to run scripts on those versions.
claude mcp add python-sandbox -- uvx mcp-python-exec-sandbox
claude mcp add python-sandbox -- uvx mcp-python-exec-sandbox
The Docker sandbox image is pulled automatically from GHCR on first use. No manual build required.
claude mcp add python-sandbox -- uvx mcp-python-exec-sandbox --sandbox-backend none
Add to .cursor/mcp.json (project-level) or ~/.cursor/mcp.json (global):
{
"mcpServers": {
"python-sandbox": {
"command": "uvx",
"args": ["mcp-python-exec-sandbox"]
}
}
}
codex mcp add python-sandbox -- uvx mcp-python-exec-sandbox
Or add to .codex/config.toml:
[mcp_servers.python-sandbox]
command = "uvx"
args = ["mcp-python-exec-sandbox"]
Any client that supports the MCP stdio transport can use this server:
{
"mcpServers": {
"python-sandbox": {
"command": "uvx",
"args": ["mcp-python-exec-sandbox"]
}
}
}
Use --python-version to target a specific Python version. uv downloads it automatically -- no manual install needed.
# Python 3.13 (default)
uvx mcp-python-exec-sandbox --python-version 3.13
# Python 3.14
uvx mcp-python-exec-sandbox --python-version 3.14
# Python 3.15
uvx mcp-python-exec-sandbox --python-version 3.15
This works across all sandbox backends. The Docker sandbox uses uv inside the container to manage Python versions, so the same --python-version flag applies.
execute_pythonExecute a Python script with automatic dependency management.
| Parameter | Type | Default | Description |
|---|---|---|---|
script | str | required | Python source code, may include PEP 723 inline metadata |
dependencies | list[str] | [] | Extra PEP 508 dependency specifiers to merge |
timeout_seconds | int | 30 | Maximum execution time (1--300) |
# Simple script
execute_python(script="print('hello world')")
# Script with dependencies
execute_python(
script="import requests; print(requests.get('https://httpbin.org/get').status_code)",
dependencies=["requests"]
)
# Script with inline PEP 723 metadata
execute_python(script="""
# /// script
# dependencies = ["pandas", "matplotlib"]
# ///
import pandas as pd
print(pd.DataFrame({'a': [1,2,3]}).describe())
""")
check_environmentReturns information about the execution environment: Python version, uv version, platform, sandbox status, and configuration.
validate_scriptValidates a script's PEP 723 metadata and dependencies without executing it.
| Parameter | Type | Default | Description |
|---|---|---|---|
script | str | required | Python source code to validate |
dependencies | list[str] | [] | Extra dependency specifiers to validate |
| Backend | Platform | Tool | Notes |
|---|---|---|---|
native | Linux | bubblewrap | Namespace isolation, network allowed |
docker | Any | Docker | Container isolation, resource limits |
none | Any | -- | No sandboxing (not recommended) |
The default backend is native (bubblewrap) on Linux and docker on macOS/other platforms. Specifying --sandbox-backend native on macOS automatically redirects to Docker. If the sandbox tool is unavailable, the server falls back to none with a warning.
The Docker sandbox image is published to GHCR and pulled automatically when the server starts. No manual setup is needed.
To build locally for development:
docker build -t ghcr.io/lu-zhengda/mcp-python-exec-sandbox profiles/
mcp-python-exec-sandbox [OPTIONS]
Options:
--python-version TEXT Python version for execution (default: 3.13)
--sandbox-backend TEXT native | docker | none (default: native on Linux, docker on macOS)
--max-timeout INT Maximum allowed timeout in seconds (default: 300)
--default-timeout INT Default timeout in seconds (default: 30)
--max-output-bytes INT Maximum output size in bytes (default: 102400)
--no-warm-cache Skip cache warming on startup
--uv-path TEXT Path to uv binary (default: uv)
git clone https://github.com/lu-zhengda/mcp-python-exec-sandbox.git
cd mcp-python-exec-sandbox
uv sync --dev
src/mcp_python_exec_sandbox/ # Package source
server.py # FastMCP server + tool definitions
executor.py # uv subprocess orchestration
script.py # PEP 723 metadata parsing/merging
sandbox.py # Sandbox ABC + factory
sandbox_linux.py # bubblewrap sandbox (Linux)
sandbox_docker.py # Docker sandbox (macOS/any)
config.py, cache.py, output.py, errors.py
tests/ # Unit + integration tests (mocked or local uv)
e2e_tests/ # End-to-end tests (require uv + network)
profiles/ # Dockerfile, warmup packages
.devcontainer/ # Devcontainer for Linux sandbox testing from macOS
Unit and integration tests -- fast, run everywhere:
uv run pytest tests/ -v
E2E tests -- require uv and network access. These exercise real script execution, package installation, MCP protocol flow, and sandbox enforcement:
uv run pytest e2e_tests/ -v
The Docker E2E tests (e2e_tests/test_docker_sandbox.py) verify execution, dependency installation, read-only filesystem enforcement, host isolation, and timeout handling through the Docker backend.
Prerequisites:
docker build -t ghcr.io/lu-zhengda/mcp-python-exec-sandbox profiles/
Then run:
uv run pytest e2e_tests/test_docker_sandbox.py -v
These tests are automatically skipped if Docker is unavailable or the image hasn't been built.
The Linux sandbox tests (e2e_tests/test_sandbox_enforcement.py::test_linux_sandbox_blocks_etc_shadow) use bubblewrap (bwrap) for namespace isolation. They are skipped on macOS because bwrap is Linux-only.
To run them from macOS, use the included devcontainer which provides Ubuntu 24.04 with bwrap pre-installed:
VS Code:
uv run pytest e2e_tests/test_sandbox_enforcement.py -v
CLI:
# Install the devcontainer CLI (once)
npm install -g @devcontainers/cli
# Build and start the container
devcontainer up --workspace-folder .
# Run the Linux sandbox tests inside the container
devcontainer exec --workspace-folder . uv run pytest e2e_tests/test_sandbox_enforcement.py -v
| Test suite | Command | Requirements |
|---|---|---|
| Unit tests | uv run pytest tests/ -v | uv |
| Integration tests | uv run pytest tests/test_integration.py -v | uv |
| E2E (general) | uv run pytest e2e_tests/ -v | uv, network |
| E2E (Docker sandbox) | uv run pytest e2e_tests/test_docker_sandbox.py -v | uv, Docker, sandbox image |
| E2E (Linux/bwrap sandbox) | uv run pytest e2e_tests/test_sandbox_enforcement.py -v | uv, Linux with bwrap (or devcontainer) |
uv run pytest tests/ -v before committing -- all tests must pass.tests/, E2E in e2e_tests/ if it needs real execution.server.py are user-facing MCP tool descriptions. Write them for an LLM audience.NoopSandbox with a warning.MIT