Pentest Mcp Server

1STDIO, HTTPregistry active

Summary

Built for authorized penetration testing workflows, offering six offline tools that span methodology lookup, response analysis, MITRE ATT&CK technique mapping, and payload template generation. The pentest_guide tool returns step-by-step playbooks for fifteen attack vectors (XSS, SQLi, SSRF, JWT attacks, etc.), each with detection indicators and mitigations. You can paste raw HTTP responses into pentest_analyze_response to extract version leaks, stack traces, and technology fingerprints, then feed those fingerprints to pentest_map_techniques to get ranked ATT&CK techniques scored against your target profile. The payload generator returns annotated templates with bypass rationale and encoding chains, not live exploits. Everything runs locally via stdio or against a public hosted endpoint at pentest.caseyjhand.com/mcp. Designed for red teams, CTF players, and blue teamers building detection coverage.

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

@cyanheads/pentest-mcp-server

Offline methodology engine and payload workshop for authorized penetration testing, CTF, security research, and education via MCP. STDIO or Streamable HTTP.

7 Tools

Public Hosted Server: https://pentest.caseyjhand.com/mcp

Authorized use only. This server is designed for penetration testers, red teamers, CTF players, security researchers, and students working on systems they own or have explicit written authorization to test. Users are solely responsible for ensuring their testing is lawful and appropriately scoped. Unauthorized access to computer systems is illegal — this server does not and cannot enforce authorization on your behalf.

Dual-audience design. Every offensive technique is paired with detection indicators and mitigations. Blue teamers, developers, and anyone building detection coverage will find the methodology and ATT&CK data as useful as the red team workflows.

Tools

Seven tools spanning the full authorized-testing workflow — from initial scoping through response analysis and payload generation:

Tool	Description
`pentest_guide`	Returns a step-by-step methodology playbook for the given attack vector, scoped to authorized testing. Each phase covers what to look for, tools, detection indicators for defenders, and mitigations.
`pentest_analyze_response`	Analyzes a raw server response (headers + body) from authorized probing for information leakage, fingerprinting signals, and exploitation opportunities — each finding paired with remediation.
`pentest_lookup_technique`	Looks up a MITRE ATT&CK technique by ID or keyword. Returns description, tactics, detection data sources, behavioral indicators, mitigations, and real-world procedure examples.
`pentest_lookup_group`	Looks up a MITRE ATT&CK threat group or software entry by ID or name. Returns aliases, type (group vs. software), description, and the techniques it uses with procedure context.
`pentest_map_techniques`	Given a target profile (stack, services, auth type, OS), returns ranked ATT&CK techniques and OWASP test cases most relevant to that authorized engagement.
`pentest_generate_payloads`	Generates annotated payload templates for authorized testing. Each template includes why it works in the injection context, detection signatures, and mitigations.
`pentest_encode`	Applies an encoding chain to a payload string (URL, double-URL, HTML entity, Unicode, hex, Base64, and more). Returns step-by-step decode explanation and bypass rationale.

`pentest_guide`

Instruction tool. Returns a structured attack methodology playbook for the given vector and optional target context.

Fifteen attack vectors via a single vector enum: auth_bypass, idor, ssrf, xss, sqli, xxe, path_traversal, cors, csrf, open_redirect, deserialization, race_condition, ssti, command_injection, jwt_attack
Optional target_context (stack, waf, recon_notes) narrows the playbook to stack-specific techniques and WAF-bypass-aware variants
Phase filtering: all, recon, enumeration, exploitation, post_exploitation
Every technique entry includes detection indicators and recommended mitigations — usable as a blue team planning aid
nextToolSuggestions pre-filled with payload generator and ATT&CK lookup calls from the methodology context
authorized_use_reminder field rendered as the first line of every response so the framing reaches all clients
OWASP Testing Guide test case IDs and ATT&CK technique IDs included for cross-referencing

`pentest_analyze_response`

Bridge tool. Paste raw HTTP output from authorized probing; get structured findings.

Accepts response_headers (raw HTTP headers), response_body (up to 10,000 chars), status_code, and freeform context
Detects: version disclosure, stack traces, internal paths, debug headers, technology fingerprints, auth patterns, CORS misconfigurations, missing security headers, interesting fields, error messages
Each finding carries: category, severity (info/low/medium/high), what was detected, why it matters, how a defender would detect exploitation, and remediation
Technology fingerprinting summary (server_software, framework, language, database, cloud_provider) ready for use as target_context in pentest_guide or pentest_map_techniques
nextToolSuggestions pre-filled from fingerprints and findings

`pentest_lookup_technique`

Single-record ATT&CK lookup. Accepts exact IDs (T1190, T1059.001) or keyword search.

Full technique record: name, tactics, description, target platforms
Detection context: summary, ATT&CK data sources (log sources, sensors), concrete behavioral indicators
Mitigations: ATT&CK mitigation IDs, names, and descriptions
Real-world procedure examples from public threat intelligence reporting
Sub-technique inclusion toggle (include_subtechniques, default true)
ATT&CK dataset version string in every response so callers know the data vintage

`pentest_lookup_group`

ATT&CK threat group and software lookup. Accepts exact IDs (G0007, S0002) or name/keyword search (APT28, Mimikatz).

Covers both intrusion sets (threat groups, G-prefix) and software entries (malware and tools, S-prefix)
Returns: name, type (group or software), aliases, description, and up to 20 techniques used with procedure-level context
Technique entries link directly to pentest_lookup_technique for full detection and mitigation context
Equally useful for defenders building detection coverage around specific adversary tradecraft

`pentest_map_techniques`

Discovery and ranking tool. Takes a target profile, returns prioritized testing scope.

Profile inputs: stack (array of components), services (exposed interfaces), auth_type (jwt/session_cookie/api_key/oauth2/basic_auth/ntlm/kerberos/none/unknown), os (linux/windows/macos/unknown)
Transparent relevance scoring: 1 point per matching platform, 2 per matching service, 2 for auth type match — criteria documented in each result row so rankings are verifiable
Each ranked technique includes: relevance rationale, detection opportunity, mitigation summary, and the pentest_guide vector for follow-up
OWASP test cases mapped to the profile alongside ATT&CK techniques
Configurable result count (1–50, default 15)

`pentest_generate_payloads`

Context-aware payload template generator for authorized testing. Returns annotated templates — not live weaponized strings.

Fourteen payload categories: xss, sqli, ssrf, xxe, path_traversal, ssti, command_injection, open_redirect, csrf, deserialization, jwt, ldap_injection, nosql_injection, http_header
Sixteen injection contexts for precision targeting: html_attribute, html_body, js_string, js_template, js_script_block, url_parameter, url_path, sql_where, sql_integer, xml_element, xml_attribute, http_header, json_value, cookie_value, file_name, generic
WAF profile variants: cloudflare, aws_waf, modsecurity_crs, imperva, akamai, f5_bigip_asm, nginx_modsecurity, fortinet_fortiwaf, none, unknown — bypass variants reference public research per WAF product
Optional encoding chain applied at generation time; up to 20 variants per request
Each template annotated with: vulnerability class, detection signature, mitigation, and WAF bypass notes
authorized_use_reminder rendered as the first line of every response

`pentest_encode`

Pure transformation utility. Applies an ordered encoding chain to a payload string.

Ten encoding types: url, double_url, html_entity, unicode, hex, base64, js_escape, null_byte, mixed_case, comment_break
Chains up to 6 steps applied left-to-right; intermediate values returned for tracing
Optional step-by-step decode explanation (explain, default true): how a WAF or server would reverse each layer
Bypass rationale: why the encoding combination might evade common filter patterns
detection_note included in every response — how defenders detect encoded payload variants — preserving the dual-audience framing
No live probing — mathematically deterministic transforms only

Features

Built on @cyanheads/mcp-ts-core:

Declarative tool definitions — one file per tool, framework handles registration and validation
Unified error handling — handlers throw, framework catches, classifies, and formats
Structured logging with optional OpenTelemetry tracing
STDIO and Streamable HTTP transports

Pentest-specific:

Fully offline at runtime — no external API calls, no credentials required. All data loaded at startup from embedded modules; zero I/O during request handling
MITRE ATT&CK Enterprise embedded at build time via scripts/refresh-attack.ts; in-memory indexed by ID and keyword at startup. Fast-fails with a clear message if the data file is missing
OWASP Testing Guide methodology curated as structured TypeScript modules covering reconnaissance through post-exploitation phases
Payload template library curated and annotated TypeScript modules, one file per vulnerability category, adapted to injection context
WAF bypass knowledge keyed by WAF product and attack vector, referencing public research
Encoding chain engine — pure TypeScript transforms with full decode-path tracing
All tools annotated readOnlyHint: true, openWorldHint: false — deterministic output from a bounded embedded dataset

Agent-friendly output:

authorized_use_reminder rendered as the first line of content[] on every guide/payload/encoding response — consistent framing across all MCP clients regardless of which surface (structured or text) the client forwards
detection_note and mitigation fields required on every technique, finding, and payload — never optional — so defenders always receive usable context alongside offense technique
Transparent relevance scoring in pentest_map_techniques — documented criteria, no opaque composite scores
nextToolSuggestions pre-filled with arguments from the current methodology context — reduces agent planning overhead
ATT&CK dataset version string on every technique result — callers can reason about data vintage

Build-time data step

The server embeds MITRE ATT&CK Enterprise data (~20 MB JSON) fetched by a one-time script into a gitignored path. Self-hosters and Docker builders must run this step before the server will start:

bun run scripts/refresh-attack.ts

This downloads the latest ATT&CK Enterprise JSON from the MITRE GitHub release endpoint, writes it to src/data/attack/enterprise.json (gitignored), and updates src/data/attack/version.ts with a version string such as Enterprise v16.1. The version file is committed; the JSON is not (too large for git history).

The Dockerfile handles this automatically — the build stage runs scripts/refresh-attack.ts before the TypeScript compile, so docker build produces a self-contained image.

If you clone the repo and skip this step, attack-service will fail fast at startup with an actionable error message pointing to scripts/refresh-attack.ts.

Run the script quarterly (or before each release) to pull the latest ATT&CK version.

Getting started

Public Hosted Instance

A public instance is available at https://pentest.caseyjhand.com/mcp — no installation required. Point any MCP client at it via Streamable HTTP:

{
  "mcpServers": {
    "pentest-mcp-server": {
      "type": "streamable-http",
      "url": "https://pentest.caseyjhand.com/mcp"
    }
  }
}

Self-Hosted / Local

Add the following to your MCP client configuration file.

{
  "mcpServers": {
    "pentest-mcp-server": {
      "type": "stdio",
      "command": "bunx",
      "args": ["@cyanheads/pentest-mcp-server@latest"],
      "env": {
        "MCP_TRANSPORT_TYPE": "stdio",
        "MCP_LOG_LEVEL": "info"
      }
    }
  }
}

Or with npx (no Bun required):

{
  "mcpServers": {
    "pentest-mcp-server": {
      "type": "stdio",
      "command": "npx",
      "args": ["-y", "@cyanheads/pentest-mcp-server@latest"],
      "env": {
        "MCP_TRANSPORT_TYPE": "stdio",
        "MCP_LOG_LEVEL": "info"
      }
    }
  }
}

Or with Docker:

{
  "mcpServers": {
    "pentest-mcp-server": {
      "type": "stdio",
      "command": "docker",
      "args": [
        "run", "-i", "--rm",
        "-e", "MCP_TRANSPORT_TYPE=stdio",
        "ghcr.io/cyanheads/pentest-mcp-server:latest"
      ]
    }
  }
}

For Streamable HTTP, set the transport and start the server:

MCP_TRANSPORT_TYPE=http MCP_HTTP_PORT=3010 bun run start:http
# Server listens at http://localhost:3010/mcp

Prerequisites

Bun v1.3.0 or higher (or Node.js v24+).
ATT&CK data seeded — run bun run scripts/refresh-attack.ts once after cloning (Docker builds handle this automatically).

Installation

Clone the repository:

git clone https://github.com/cyanheads/pentest-mcp-server.git

Navigate into the directory:

cd pentest-mcp-server

Install dependencies:

bun install

Seed the ATT&CK data:

bun run scripts/refresh-attack.ts

Configure environment:

cp .env.example .env
# edit .env if needed — no required vars beyond transport defaults

Configuration

No API keys required. The server is fully offline at runtime.

Variable	Description	Default
`MCP_TRANSPORT_TYPE`	Transport: `stdio` or `http`.	`stdio`
`MCP_HTTP_PORT`	Port for HTTP server.	`3010`
`MCP_AUTH_MODE`	Auth mode: `none`, `jwt`, or `oauth`.	`none`
`MCP_LOG_LEVEL`	Log level: `debug`, `info`, `notice`, `warning`, `error`.	`info`
`LOGS_DIR`	Directory for log files (Node.js only).	`<project-root>/logs`
`OTEL_ENABLED`	Enable OpenTelemetry instrumentation (spans, metrics, completion logs).	`false`

See .env.example for the full list of optional overrides.

Running the server

Local development

Build and run:

# Seed ATT&CK data (first time, or to update)
bun run scripts/refresh-attack.ts

# Build
bun run rebuild

# Run
bun run start:stdio
# or
bun run start:http

Run checks and tests:

bun run devcheck   # Lint, format, typecheck, security audit
bun run test       # Vitest test suite
bun run lint:mcp   # Validate MCP definitions

Docker

# Build — ATT&CK data is fetched during the build stage
docker build -t pentest-mcp-server .
docker run --rm -p 3010:3010 pentest-mcp-server

The Dockerfile defaults to HTTP transport, stateless session mode, and logs to /var/log/pentest-mcp-server. OpenTelemetry peer dependencies are installed by default — build with --build-arg OTEL_ENABLED=false to omit them. The ATT&CK data refresh runs automatically in the build stage.

Project structure

Directory / File	Purpose
`src/index.ts`	`createApp()` entry point — registers tools and initializes services.
`src/services/attack/`	MITRE ATT&CK service — loads and indexes the embedded enterprise JSON at startup.
`src/services/methodology/`	OWASP Testing Guide methodology service — vector branches for `pentest_guide`.
`src/services/payload/`	Payload template service — keyed by category and injection context.
`src/services/encoding/`	Encoding chain executor — pure TypeScript transforms.
`src/services/response-analysis/`	Pattern library for information leakage and fingerprinting detection.
`src/mcp-server/tools/definitions/`	Tool definitions (`*.tool.ts`) — one file per tool.
`src/data/attack/`	`enterprise.json` (gitignored, fetched by `scripts/refresh-attack.ts`) + committed `version.ts`.
`src/data/owasp/`	Curated OWASP TG v4.2 methodology content as TypeScript modules.
`src/data/payloads/`	Annotated payload templates by vulnerability category.
`src/data/waf-bypass/`	WAF bypass variants keyed by product and attack vector.
`src/data/encodings/`	Encoding transform functions.
`src/data/patterns/`	Regex patterns and metadata for response leakage detection.
`scripts/refresh-attack.ts`	Downloads ATT&CK Enterprise JSON and updates the version string. Run once after cloning, then quarterly.
`tests/`	Unit and integration tests mirroring `src/`.
`docs/design.md`	Design document — tool surface, data strategy, and architectural decisions.

Development guide

See CLAUDE.md for development guidelines and architectural rules. The short version:

Handlers throw, framework catches — no try/catch in tool logic
Use ctx.log for request-scoped logging, ctx.state for tenant-scoped storage
Register new tools via the barrel in src/mcp-server/tools/definitions/index.ts
authorized_use_reminder is a required output field on every tool that produces methodology or payload content — render it as the first line of every content[] response in format()
Every technique, finding, and payload object has required (non-optional) detection and mitigation fields — this is a schema contract, not documentation guidance

Contributing

Issues and pull requests are welcome. Run checks and tests before submitting:

bun run devcheck
bun run test

License

Apache-2.0 — see LICENSE for details.

Featured

CodeRabbit

AI writes the code. CodeRabbit catches the slop.

Try For Free →

Keep your Mac awake

Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.

One time payment $9 →

Context.dev

Integrate web data into your AI product. One API to scrape website & brand data.

Get API Key Now →

Make your agent a DeFi expert

Agent, run crypto. Access onchain data & trade routes via 1inch.

Install now →

Make money from your Skills

On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.

Start earning →

AppSignal

Monitor with ease. Code with confidence.

Start Free Trial →

Configuration

MCP_LOG_LEVELdefault: info

Sets the minimum log level for output (e.g., 'debug', 'info', 'warn').

MCP_HTTP_HOSTdefault: 127.0.0.1

The hostname for the HTTP server.

MCP_HTTP_PORTdefault: 3010

The port to run the HTTP server on.

MCP_HTTP_ENDPOINT_PATHdefault: /mcp

The endpoint path for the MCP server.

MCP_AUTH_MODEdefault: none

Authentication mode to use: 'none', 'jwt', or 'oauth'.

@cyanheads/pentest-mcp-server

Offline methodology engine and payload workshop for authorized penetration testing, CTF, security research, and education via MCP. STDIO or Streamable HTTP.

7 Tools

Public Hosted Server: https://pentest.caseyjhand.com/mcp

Authorized use only. This server is designed for penetration testers, red teamers, CTF players, security researchers, and students working on systems they own or have explicit written authorization to test. Users are solely responsible for ensuring their testing is lawful and appropriately scoped. Unauthorized access to computer systems is illegal — this server does not and cannot enforce authorization on your behalf.

Dual-audience design. Every offensive technique is paired with detection indicators and mitigations. Blue teamers, developers, and anyone building detection coverage will find the methodology and ATT&CK data as useful as the red team workflows.

Tools

Seven tools spanning the full authorized-testing workflow — from initial scoping through response analysis and payload generation:

Tool	Description
`pentest_guide`	Returns a step-by-step methodology playbook for the given attack vector, scoped to authorized testing. Each phase covers what to look for, tools, detection indicators for defenders, and mitigations.
`pentest_analyze_response`	Analyzes a raw server response (headers + body) from authorized probing for information leakage, fingerprinting signals, and exploitation opportunities — each finding paired with remediation.
`pentest_lookup_technique`	Looks up a MITRE ATT&CK technique by ID or keyword. Returns description, tactics, detection data sources, behavioral indicators, mitigations, and real-world procedure examples.
`pentest_lookup_group`	Looks up a MITRE ATT&CK threat group or software entry by ID or name. Returns aliases, type (group vs. software), description, and the techniques it uses with procedure context.
`pentest_map_techniques`	Given a target profile (stack, services, auth type, OS), returns ranked ATT&CK techniques and OWASP test cases most relevant to that authorized engagement.
`pentest_generate_payloads`	Generates annotated payload templates for authorized testing. Each template includes why it works in the injection context, detection signatures, and mitigations.
`pentest_encode`	Applies an encoding chain to a payload string (URL, double-URL, HTML entity, Unicode, hex, Base64, and more). Returns step-by-step decode explanation and bypass rationale.

`pentest_guide`

Instruction tool. Returns a structured attack methodology playbook for the given vector and optional target context.

Fifteen attack vectors via a single vector enum: auth_bypass, idor, ssrf, xss, sqli, xxe, path_traversal, cors, csrf, open_redirect, deserialization, race_condition, ssti, command_injection, jwt_attack
Optional target_context (stack, waf, recon_notes) narrows the playbook to stack-specific techniques and WAF-bypass-aware variants
Phase filtering: all, recon, enumeration, exploitation, post_exploitation
Every technique entry includes detection indicators and recommended mitigations — usable as a blue team planning aid
nextToolSuggestions pre-filled with payload generator and ATT&CK lookup calls from the methodology context
authorized_use_reminder field rendered as the first line of every response so the framing reaches all clients
OWASP Testing Guide test case IDs and ATT&CK technique IDs included for cross-referencing

`pentest_analyze_response`

Bridge tool. Paste raw HTTP output from authorized probing; get structured findings.

Accepts response_headers (raw HTTP headers), response_body (up to 10,000 chars), status_code, and freeform context
Detects: version disclosure, stack traces, internal paths, debug headers, technology fingerprints, auth patterns, CORS misconfigurations, missing security headers, interesting fields, error messages
Each finding carries: category, severity (info/low/medium/high), what was detected, why it matters, how a defender would detect exploitation, and remediation
Technology fingerprinting summary (server_software, framework, language, database, cloud_provider) ready for use as target_context in pentest_guide or pentest_map_techniques
nextToolSuggestions pre-filled from fingerprints and findings

`pentest_lookup_technique`

Single-record ATT&CK lookup. Accepts exact IDs (T1190, T1059.001) or keyword search.

Full technique record: name, tactics, description, target platforms
Detection context: summary, ATT&CK data sources (log sources, sensors), concrete behavioral indicators
Mitigations: ATT&CK mitigation IDs, names, and descriptions
Real-world procedure examples from public threat intelligence reporting
Sub-technique inclusion toggle (include_subtechniques, default true)
ATT&CK dataset version string in every response so callers know the data vintage

`pentest_lookup_group`

ATT&CK threat group and software lookup. Accepts exact IDs (G0007, S0002) or name/keyword search (APT28, Mimikatz).

Covers both intrusion sets (threat groups, G-prefix) and software entries (malware and tools, S-prefix)
Returns: name, type (group or software), aliases, description, and up to 20 techniques used with procedure-level context
Technique entries link directly to pentest_lookup_technique for full detection and mitigation context
Equally useful for defenders building detection coverage around specific adversary tradecraft

`pentest_map_techniques`

Discovery and ranking tool. Takes a target profile, returns prioritized testing scope.

Profile inputs: stack (array of components), services (exposed interfaces), auth_type (jwt/session_cookie/api_key/oauth2/basic_auth/ntlm/kerberos/none/unknown), os (linux/windows/macos/unknown)
Transparent relevance scoring: 1 point per matching platform, 2 per matching service, 2 for auth type match — criteria documented in each result row so rankings are verifiable
Each ranked technique includes: relevance rationale, detection opportunity, mitigation summary, and the pentest_guide vector for follow-up
OWASP test cases mapped to the profile alongside ATT&CK techniques
Configurable result count (1–50, default 15)

`pentest_generate_payloads`

Context-aware payload template generator for authorized testing. Returns annotated templates — not live weaponized strings.

Fourteen payload categories: xss, sqli, ssrf, xxe, path_traversal, ssti, command_injection, open_redirect, csrf, deserialization, jwt, ldap_injection, nosql_injection, http_header
Sixteen injection contexts for precision targeting: html_attribute, html_body, js_string, js_template, js_script_block, url_parameter, url_path, sql_where, sql_integer, xml_element, xml_attribute, http_header, json_value, cookie_value, file_name, generic
WAF profile variants: cloudflare, aws_waf, modsecurity_crs, imperva, akamai, f5_bigip_asm, nginx_modsecurity, fortinet_fortiwaf, none, unknown — bypass variants reference public research per WAF product
Optional encoding chain applied at generation time; up to 20 variants per request
Each template annotated with: vulnerability class, detection signature, mitigation, and WAF bypass notes
authorized_use_reminder rendered as the first line of every response

`pentest_encode`

Pure transformation utility. Applies an ordered encoding chain to a payload string.

Ten encoding types: url, double_url, html_entity, unicode, hex, base64, js_escape, null_byte, mixed_case, comment_break
Chains up to 6 steps applied left-to-right; intermediate values returned for tracing
Optional step-by-step decode explanation (explain, default true): how a WAF or server would reverse each layer
Bypass rationale: why the encoding combination might evade common filter patterns
detection_note included in every response — how defenders detect encoded payload variants — preserving the dual-audience framing
No live probing — mathematically deterministic transforms only

Features

Built on @cyanheads/mcp-ts-core:

Declarative tool definitions — one file per tool, framework handles registration and validation
Unified error handling — handlers throw, framework catches, classifies, and formats
Structured logging with optional OpenTelemetry tracing
STDIO and Streamable HTTP transports

Pentest-specific:

Fully offline at runtime — no external API calls, no credentials required. All data loaded at startup from embedded modules; zero I/O during request handling
MITRE ATT&CK Enterprise embedded at build time via scripts/refresh-attack.ts; in-memory indexed by ID and keyword at startup. Fast-fails with a clear message if the data file is missing
OWASP Testing Guide methodology curated as structured TypeScript modules covering reconnaissance through post-exploitation phases
Payload template library curated and annotated TypeScript modules, one file per vulnerability category, adapted to injection context
WAF bypass knowledge keyed by WAF product and attack vector, referencing public research
Encoding chain engine — pure TypeScript transforms with full decode-path tracing
All tools annotated readOnlyHint: true, openWorldHint: false — deterministic output from a bounded embedded dataset

Agent-friendly output:

authorized_use_reminder rendered as the first line of content[] on every guide/payload/encoding response — consistent framing across all MCP clients regardless of which surface (structured or text) the client forwards
detection_note and mitigation fields required on every technique, finding, and payload — never optional — so defenders always receive usable context alongside offense technique
Transparent relevance scoring in pentest_map_techniques — documented criteria, no opaque composite scores
nextToolSuggestions pre-filled with arguments from the current methodology context — reduces agent planning overhead
ATT&CK dataset version string on every technique result — callers can reason about data vintage

Build-time data step

The server embeds MITRE ATT&CK Enterprise data (~20 MB JSON) fetched by a one-time script into a gitignored path. Self-hosters and Docker builders must run this step before the server will start:

bun run scripts/refresh-attack.ts

The Dockerfile handles this automatically — the build stage runs scripts/refresh-attack.ts before the TypeScript compile, so docker build produces a self-contained image.

If you clone the repo and skip this step, attack-service will fail fast at startup with an actionable error message pointing to scripts/refresh-attack.ts.

Run the script quarterly (or before each release) to pull the latest ATT&CK version.

Getting started

Public Hosted Instance

A public instance is available at https://pentest.caseyjhand.com/mcp — no installation required. Point any MCP client at it via Streamable HTTP:

{
  "mcpServers": {
    "pentest-mcp-server": {
      "type": "streamable-http",
      "url": "https://pentest.caseyjhand.com/mcp"
    }
  }
}

Self-Hosted / Local

Add the following to your MCP client configuration file.

{
  "mcpServers": {
    "pentest-mcp-server": {
      "type": "stdio",
      "command": "bunx",
      "args": ["@cyanheads/pentest-mcp-server@latest"],
      "env": {
        "MCP_TRANSPORT_TYPE": "stdio",
        "MCP_LOG_LEVEL": "info"
      }
    }
  }
}

Or with npx (no Bun required):

{
  "mcpServers": {
    "pentest-mcp-server": {
      "type": "stdio",
      "command": "npx",
      "args": ["-y", "@cyanheads/pentest-mcp-server@latest"],
      "env": {
        "MCP_TRANSPORT_TYPE": "stdio",
        "MCP_LOG_LEVEL": "info"
      }
    }
  }
}

Or with Docker:

{
  "mcpServers": {
    "pentest-mcp-server": {
      "type": "stdio",
      "command": "docker",
      "args": [
        "run", "-i", "--rm",
        "-e", "MCP_TRANSPORT_TYPE=stdio",
        "ghcr.io/cyanheads/pentest-mcp-server:latest"
      ]
    }
  }
}

For Streamable HTTP, set the transport and start the server:

MCP_TRANSPORT_TYPE=http MCP_HTTP_PORT=3010 bun run start:http
# Server listens at http://localhost:3010/mcp

Prerequisites

Bun v1.3.0 or higher (or Node.js v24+).
ATT&CK data seeded — run bun run scripts/refresh-attack.ts once after cloning (Docker builds handle this automatically).

Installation

Clone the repository:

git clone https://github.com/cyanheads/pentest-mcp-server.git

Navigate into the directory:

cd pentest-mcp-server

Install dependencies:

bun install

Seed the ATT&CK data:

bun run scripts/refresh-attack.ts

Configure environment:

cp .env.example .env
# edit .env if needed — no required vars beyond transport defaults

Configuration

No API keys required. The server is fully offline at runtime.

Variable	Description	Default
`MCP_TRANSPORT_TYPE`	Transport: `stdio` or `http`.	`stdio`
`MCP_HTTP_PORT`	Port for HTTP server.	`3010`
`MCP_AUTH_MODE`	Auth mode: `none`, `jwt`, or `oauth`.	`none`
`MCP_LOG_LEVEL`	Log level: `debug`, `info`, `notice`, `warning`, `error`.	`info`
`LOGS_DIR`	Directory for log files (Node.js only).	`<project-root>/logs`
`OTEL_ENABLED`	Enable OpenTelemetry instrumentation (spans, metrics, completion logs).	`false`

See .env.example for the full list of optional overrides.

Running the server

Local development

Build and run:

# Seed ATT&CK data (first time, or to update)
bun run scripts/refresh-attack.ts

# Build
bun run rebuild

# Run
bun run start:stdio
# or
bun run start:http

Run checks and tests:

bun run devcheck   # Lint, format, typecheck, security audit
bun run test       # Vitest test suite
bun run lint:mcp   # Validate MCP definitions

Docker

# Build — ATT&CK data is fetched during the build stage
docker build -t pentest-mcp-server .
docker run --rm -p 3010:3010 pentest-mcp-server

Project structure

Directory / File	Purpose
`src/index.ts`	`createApp()` entry point — registers tools and initializes services.
`src/services/attack/`	MITRE ATT&CK service — loads and indexes the embedded enterprise JSON at startup.
`src/services/methodology/`	OWASP Testing Guide methodology service — vector branches for `pentest_guide`.
`src/services/payload/`	Payload template service — keyed by category and injection context.
`src/services/encoding/`	Encoding chain executor — pure TypeScript transforms.
`src/services/response-analysis/`	Pattern library for information leakage and fingerprinting detection.
`src/mcp-server/tools/definitions/`	Tool definitions (`*.tool.ts`) — one file per tool.
`src/data/attack/`	`enterprise.json` (gitignored, fetched by `scripts/refresh-attack.ts`) + committed `version.ts`.
`src/data/owasp/`	Curated OWASP TG v4.2 methodology content as TypeScript modules.
`src/data/payloads/`	Annotated payload templates by vulnerability category.
`src/data/waf-bypass/`	WAF bypass variants keyed by product and attack vector.
`src/data/encodings/`	Encoding transform functions.
`src/data/patterns/`	Regex patterns and metadata for response leakage detection.
`scripts/refresh-attack.ts`	Downloads ATT&CK Enterprise JSON and updates the version string. Run once after cloning, then quarterly.
`tests/`	Unit and integration tests mirroring `src/`.
`docs/design.md`	Design document — tool surface, data strategy, and architectural decisions.

Development guide

See CLAUDE.md for development guidelines and architectural rules. The short version:

Handlers throw, framework catches — no try/catch in tool logic
Use ctx.log for request-scoped logging, ctx.state for tenant-scoped storage
Register new tools via the barrel in src/mcp-server/tools/definitions/index.ts
authorized_use_reminder is a required output field on every tool that produces methodology or payload content — render it as the first line of every content[] response in format()
Every technique, finding, and payload object has required (non-optional) detection and mitigation fields — this is a schema contract, not documentation guidance

Contributing

Issues and pull requests are welcome. Run checks and tests before submitting:

bun run devcheck
bun run test

License

Apache-2.0 — see LICENSE for details.

Pentest Mcp Server

@cyanheads/pentest-mcp-server

Tools

pentest_guide

pentest_analyze_response

pentest_lookup_technique

pentest_lookup_group

pentest_map_techniques

pentest_generate_payloads

pentest_encode

Features

Build-time data step

Getting started

Public Hosted Instance

Self-Hosted / Local

Prerequisites

Installation

Configuration

Running the server

Local development

Docker

Project structure

Development guide

Contributing

License

Configuration

Pentest Mcp Server

@cyanheads/pentest-mcp-server

Tools

pentest_guide

pentest_analyze_response

pentest_lookup_technique

pentest_lookup_group

pentest_map_techniques

pentest_generate_payloads

pentest_encode

Features

Build-time data step

Getting started

Public Hosted Instance

Self-Hosted / Local

Prerequisites

Installation

Configuration

Running the server

Local development

Docker

Project structure

Development guide

Contributing

License

Configuration

Related Security & Pentesting MCP Servers

Related Security & Pentesting MCP Servers

`pentest_guide`

`pentest_analyze_response`

`pentest_lookup_technique`

`pentest_lookup_group`

`pentest_map_techniques`

`pentest_generate_payloads`

`pentest_encode`

`pentest_guide`

`pentest_analyze_response`

`pentest_lookup_technique`

`pentest_lookup_group`

`pentest_map_techniques`

`pentest_generate_payloads`

`pentest_encode`