CAT
/MCP
SkillsMCPMarketplacesDigestToolsAdvertise

This week in Claude

Every Monday: Claude Code, Agent SDK, MCP, and the Anthropic platform moves worth your time.

Skills by Category
Frontend DevelopmentBackend & APIsTesting & QASecurityDevOps & CI/CDGit & Pull RequestsDocumentationCode Review & QualityAI & Agent BuildingSkill Development
MCP Servers by Category
Sales & MarketingWeb & Browser AutomationDatabasesAI & LLM ToolsCloud & InfrastructureCommunication & MessagingDeveloper ToolsDesign & CreativeDocuments & KnowledgeSearch & Web Crawling
Marketplaces by Category
AI Agents & OrchestrationLLM IntegrationDevelopment ToolsFrontend & UIBackend & APIsDatabasesTesting & Code QualityDevOps & CloudSecurity & ComplianceGit & Version Control

Cross AI Tools

Discover Claude Code plugins, extensions, and tools. Automatically updated directory of Anthropic Claude AI marketplaces with development tools, productivity plugins, and integrations.

Resources

  • Browse Skills
  • Browse MCP Servers
  • Browse Marketplaces
  • Plugins Reference

Community

  • About
  • Tools
  • Feedback
  • Privacy Policy
  • Advertise

Built for the Claude Code community with Claude Code by @mertduzgun

Independent project, not affiliated with Anthropic

signet-eval

jmcentire/signet-eval
2registry active
Summary

Deterministic policy enforcement that sits between Claude and tool execution. Every tool call passes through user-defined YAML rules before running,no LLM in the authorization path, 25ms end-to-end. Ships with locked self-protection rules that prevent agents from disabling their own guardrails. Supports spending limits via encrypted vault ledger, conditional blocks based on parameters or patterns, and probabilistic advisory injection for nudges. Integrates via PreToolUse hooks in Claude Code or Codex, plus an MCP management interface for conversational policy updates. Conditions include spend tracking, credential checks, regex matching, and recent action history. Rules can be locked to prevent agent modification, and the vault uses three-tier encryption with Argon2id key derivation.

CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →

signet-eval

Deterministic policy enforcement for AI agent tool calls. Every action an agent proposes passes through user-defined rules before execution. No LLM in the authorization path. Advisory nudges are separate from authorization. 25ms end-to-end.

Install

# crates.io
cargo install signet-eval

# from source
git clone https://github.com/jmcentire/signet-eval
cd signet-eval
cargo install --path .

There is no npm or PyPI package for signet-eval. The public distribution path is crates.io plus source install from GitHub. The MCP Registry listing points at the repository metadata; the runtime is the local signet-eval serve stdio server.

Quick Start

1. Hook into Claude Code — add to ~/.claude/settings.json:

{
  "hooks": {
    "PreToolUse": [{
      "matcher": "",
      "hooks": [{"type": "command", "command": "signet-eval", "timeout": 2000}]
    }]
  }
}

For Codex, enable hooks in ~/.codex/config.toml or <repo>/.codex/config.toml:

[features]
codex_hooks = true

Then add ~/.codex/hooks.json or <repo>/.codex/hooks.json:

{
  "hooks": {
    "PreToolUse": [{
      "matcher": "*",
      "hooks": [{
        "type": "command",
        "command": "signet-eval --adapter codex",
        "timeout": 30000,
        "statusMessage": "Checking Signet policy"
      }]
    }],
    "PermissionRequest": [{
      "matcher": "*",
      "hooks": [{
        "type": "command",
        "command": "signet-eval --adapter codex-permission",
        "timeout": 30000,
        "statusMessage": "Checking Signet approval policy"
      }]
    }]
  }
}

2. Done. Every tool call now passes through policy evaluation. The default policy blocks destructive operations, protects its own configuration, and allows everything else.

3. (Optional) Customize — talk to Claude with the MCP server:

claude mcp add --scope user --transport stdio signet -- signet-eval serve

Then say: "Add a $50 limit for amazon orders" or "Block all rm commands".

Default Policy

Self-protection rules are locked — they cannot be removed, edited, or reordered by the AI agent, even through the MCP management server. This prevents the agent from disabling its own guardrails.

ActionDecisionLocked
Write/Edit/Bash touching .signet/denyyes
Write/Edit/Bash touching signet-eval binarydenyyes
Write/Edit settings.json / settings.local.jsonaskyes
Bash kill/pkill/killall + signetdenyyes
Edit/Write/NotebookEdit without recent planask
Edit/Write on core/DSL/schema pathsask
rm, rmdirdeny
git push --forceask
mkfs, format, dd if=deny
curl | sh, wget | shdeny
Everything elseallow

Custom Policy

signet-eval init       # write default policy to ~/.signet/policy.yaml
signet-eval validate   # check policy for errors
signet-eval rules      # show current rules

Edit ~/.signet/policy.yaml:

version: 1
default_action: ALLOW
rules:
  - name: block_rm
    tool_pattern: ".*"
    conditions: ["contains(parameters, 'rm ')"]
    action: DENY
    reason: "File deletion blocked"

  - name: books_limit
    tool_pattern: ".*purchase.*"
    conditions:
      - "param_eq(category, 'books')"
      - "spend_plus_amount_gt('books', amount, 200)"
    action: DENY
    reason: "Books spending limit ($200) exceeded"

  - name: protect_my_config
    tool_pattern: ".*"
    conditions: ["contains(parameters, '/etc/')"]
    action: ASK
    locked: true
    reason: "System config changes require confirmation"

Rules are evaluated in order — first match wins. Multiple conditions on a rule are AND'd. Rules with locked: true cannot be modified through the MCP management server.

Advisory Injection

INJECT rules probabilistically add advisory context near the tool call that triggered them. They are nudges, not authorization: the normal ALLOW/DENY/ASK/GATE/ENSURE pass remains first-match-wins and deterministic. Injection runs afterward and only emits context when a matching inject rule fires.

rules:
  - name: maybe_remind_kindex_on_git
    tool_pattern: "^Bash$"
    conditions: ["contains(parameters, 'git ')"]
    action: INJECT
    inject:
      trigger:
        mode: exponential
        peak: 0.35
        cooldown_seconds: 300
        peak_after_seconds: 1800
        max_per_session: 3
      payload:
        text: "Before committing, check whether project `.kin` files should be included."

Trigger modes:

ModeBehavior
constant / stepFixed probability after cooldown
linearRamps from 0 to peak over peak_after_seconds
exponentialApproaches peak with exponential decay

Payload sources:

SourceNotes
textInline literal text
text_fileBare filename under ~/.signet/injections/
from_commandHMAC-signed allowlist entry from ~/.signet/inject_commands.yaml; direct exec, no shell

Template substitutions are enabled by default: {tool_name}, {cwd}, {date}, and {matched_param.X}. See examples/inject_examples.yaml.

Condition Functions

FunctionDescriptionExample
contains(parameters, 'X')Tool input contains stringcontains(parameters, 'rm ')
any_of(parameters, 'X', 'Y')Any string presentany_of(parameters, 'mkfs', 'format')
param_eq(field, 'value')Field equals valueparam_eq(category, 'books')
param_ne(field, 'value')Field not equalparam_ne(role, 'admin')
param_gt(field, N)Field > numberparam_gt(amount, 100)
param_lt(field, N)Field < numberparam_lt(amount, 5)
param_contains(field, 'X')Field contains substringparam_contains(command, 'sudo')
matches(field, 'regex')Field matches regexmatches(file_path, '\\.env$')
has_credential('name')Credential exists in vaulthas_credential('cc_visa')
spend_gt('cat', N)Session spend > limitspend_gt('books', 200)
spend_plus_amount_gt('cat', field, N)Spend + this amount > limitspend_plus_amount_gt('books', amount, 200)
not(condition)Negate conditionnot(param_eq(format, 'json'))
or(A || B)Either conditionor(contains(parameters, '-f') || contains(parameters, '--force'))
has_recent_action('search', N)Recent allowed action matches in tool name or detail; pipe-delimited ORhas_recent_action('EnterPlanMode|TaskCreate', 500)
true / falseLiteraltrue

Encrypted Vault

Three-tier encrypted storage with passphrase-derived key hierarchy (Argon2id + AES-256-GCM):

TierEncryptionContents
1NoneAction log, spending ledger
2Session keySession state
3Compartment keyCC numbers, API tokens, secrets
signet-eval setup                      # create vault with passphrase
signet-eval store cc_visa 4111...      # store Tier 3 credential
signet-eval status                     # vault status and spending
signet-eval log                        # recent action log
signet-eval unlock                     # refresh session after timeout

Credentials support scoped access via request_capability: domain restrictions, purpose constraints, per-use amount caps, and one-time tokens that auto-invalidate after a single use.

Spending limits use the vault ledger — each tool call that spends money is logged, and spend_plus_amount_gt() checks cumulative totals before allowing the next purchase.

Self-Protection

signet-eval ships with locked rules that prevent an AI agent from disabling its own policy enforcement:

  1. protect_signet_dir — Denies any Write, Edit, or Bash command touching .signet/ (policy files, vault, HMAC)
  2. protect_signet_binary — Denies tampering with the signet-eval binary itself
  3. protect_hook_config — Requires user confirmation before modifying settings.json (where the hook is configured)
  4. protect_signet_process — Denies kill/pkill/killall commands targeting signet processes

These rules are:

  • Locked — MCP tools refuse to remove, edit, or reorder them
  • Position-protected — Unlocked rules cannot be reordered above locked rules (first-match-wins)
  • Hardcoded in defaults — If the policy file is corrupted or missing, the binary falls back to hardcoded defaults that include self-protection
  • HMAC-backed — Direct file edits break the policy signature, triggering fallback to safe defaults

MCP Management Server

Manage policies conversationally through Claude:

claude mcp add --scope user --transport stdio signet -- signet-eval serve
ToolPurpose
signet_list_rulesShow all rules with locked status
signet_add_ruleAdd a new rule (appended after locked rules)
signet_remove_ruleRemove a rule (refuses on locked rules)
signet_edit_ruleModify rule properties (refuses on locked rules)
signet_reorder_ruleMove a rule (refuses on locked, prevents placing above locked)
signet_set_limitSet a spending limit for a category
signet_testTest a tool call against the current policy
signet_validateCheck policy for errors
signet_condition_helpShow available condition functions
signet_statusVault status, spending totals, credential count
signet_recent_actionsShow recent action log
signet_store_credentialStore a Tier 3 credential
signet_use_credentialRequest a credential through capability constraints
signet_list_credentialsList credential names
signet_delete_credentialDelete a credential
signet_sign_policyHMAC-sign the policy file
signet_reset_sessionClear spending counters

All mutating operations auto-sign the policy when the vault is available.

MCP Proxy

Wrap upstream MCP servers with policy enforcement. The agent connects to the proxy, never directly to servers. Policy is hot-reloaded on every call.

# Configure upstream servers
cat > ~/.signet/proxy.yaml << 'YAML'
servers:
  linear:
    command: npx
    args: ["-y", "mcp-linear"]
    env:
      LINEAR_API_KEY: "your-key"
YAML

# Register proxy with Claude Code
claude mcp add --scope user --transport stdio signet-proxy -- signet-eval proxy

All Commands

CommandPurpose
signet-evalHook evaluation (default, 25ms)
signet-eval --adapter codexCodex PreToolUse hook evaluation
signet-eval --adapter codex-permissionCodex PermissionRequest hook evaluation
signet-eval initWrite default policy with locked self-protection rules
signet-eval rulesShow current policy rules (locked rules tagged)
signet-eval validateCheck policy for errors
signet-eval test '<json>'Test a tool call against policy
signet-eval setupCreate encrypted vault
signet-eval unlockRefresh vault session
signet-eval statusVault status and spending
signet-eval store <name> <value>Store Tier 3 credential
signet-eval delete <name>Delete a credential
signet-eval logRecent action log
signet-eval reset-sessionClear spending counters
signet-eval signHMAC-sign policy file
signet-eval injectionsShow recent inject rule fires
signet-eval inject-test <rule>Force-fire one inject rule for testing
signet-eval serveMCP management server (17 tools)
signet-eval proxyMCP proxy for upstream servers

Performance

MetricValue
Hook eval (end-to-end)25ms — process spawn, stdin, JSON parse, policy load, eval, response
In-process policy eval14–63μs — 14μs deny, 21μs ask, 63μs spending check
CLI validate / rules8ms
Binary size6.2MB (stripped, LTO)

Architecture

signet-eval is the enforcement layer of the Signet personal sovereign agent stack. The core principle: the authorization layer must not be an LLM. It processes structured data only — regex, comparisons, and vault queries. No natural language, no context window, no persuasion surface. A rule either matches or it doesn't.

Agent proposes action  ->  signet-eval evaluates policy  ->  allow / deny / ask
                           (deterministic, 25ms, no NLP)

Security Model

signet-eval is a seatbelt, not a cage. Understanding what it does and doesn't protect against is critical.

What it protects against

A cooperative agent making mistakes. Claude Code and Codex follow hook protocols — they call signet-eval around tool use and respect the supported responses. Within those protocols, signet-eval reliably enforces policy:

  • Blocks destructive commands (rm, mkfs, piped remote execution)
  • Enforces spending limits across tool calls
  • Gates credential access with domain, purpose, and amount constraints
  • Requires user confirmation for sensitive operations (git push --force)
  • Prevents casual policy tampering via locked rules and MCP guards

This is the normal operating mode. The agent isn't trying to escape — it's trying to be helpful and the policy keeps it in bounds.

Codex support has one important semantic difference: PreToolUse is currently deny-only for enforcement, so Signet ALLOW emits no output and Signet ASK is converted to a deny at that stage. The PermissionRequest adapter explicitly allows or denies requests Codex was already going to send through approval; Signet ASK defers to Codex's normal approval prompt.

What it does NOT protect against

An adversarial agent with shell access. signet-eval runs at the same privilege level as the agent it constrains. Any process with the same UID can:

  • Use shell indirection to evade string matching (alias, eval, base64 -d | sh, python3 -c "...")
  • Create symlinks to protected paths under innocent names
  • Read the session key file and forge HMAC signatures
  • Replace the signet-eval binary itself
  • Modify the hook configuration in settings.json
  • Use LD_PRELOAD, ptrace, or process injection

No amount of pattern matching fixes this. Adding rules for ln invites alias. Blocking alias invites eval. Blocking eval invites python3 -c. This is an infinite regress — the fundamental issue is privilege level, not pattern coverage.

Where real containment lives

Protecting against a truly adversarial agent requires OS-level controls that operate above the agent's privilege level:

  • Separate user account — run the agent as a restricted user with no write access to policy files
  • Container/sandbox — Docker, Flatpak, or bubblewrap with a read-only mount for ~/.signet/
  • Stripped $PATH — remove ln, alias, and other indirection tools from the agent's environment
  • seccomp / AppArmor / SELinux — mandatory access controls that the agent cannot override
  • Virtual environment — run Claude Code in a constrained environment with limited filesystem access

signet-eval is the policy layer within such a setup. It handles the "what should this agent be allowed to do" question with clear, auditable rules. The OS handles the "can this agent circumvent the policy" question. Neither replaces the other.

Defense in depth

The layers work together:

LayerProtects againstMechanism
String matchingObvious mistakes, clear UXRegex, substring, word-boundary conditions
Locked rulesCasual MCP-based policy tamperingImmutable rules, position protection
HMAC signingOut-of-band file modificationCryptographic integrity verification
OS controlsPrivilege escalation, shell indirectionSandboxing, RBAC, separate users

Without OS controls, signet-eval is a speed bump, not a wall. With them, it's the policy engine inside a secure perimeter.

License

MIT

Featured
CodeRabbit
CodeRabbit
AI writes the code. CodeRabbit catches the slop.
Try For Free →
Keep your Mac awake
Keep your Mac awake
Keep your Mac awake while Claude Code and 40+ AI agents run. Sleeps when they're idle.
One time payment $9 →
Context.devContext.dev
Context.dev
Integrate web data into your AI product. One API to scrape website & brand data.
Get API Key Now →
Make your agent a DeFi expert
Make your agent a DeFi expert
Agent, run crypto. Access onchain data & trade routes via 1inch.
Install now →
Make money from your Skills
Make money from your Skills
On Capafy, your Skill runs online 24/7 as an agent product, and you get paid every time someone uses it.
Start earning →
AppSignal
AppSignal
Monitor with ease. Code with confidence.
Start Free Trial →
Categories
AI & LLM Tools
Registryactive
UpdatedMay 16, 2026
View on GitHub

Related AI & LLM Tools MCP Servers

View all →
SkillFM LLM Cost Optimizer

io.github.ericm1018/skillfm-llm-cost-optimizer-openai-anthropic-usage

LLM cost optimizer for OpenAI, Anthropic, token usage, BYOK, and SkillFM Beacon audits.
Llm Orchestration Agent

io.github.mikerawsonnz/llm-orchestration-agent

Run a prompt through a LangChain (system + human) chain over Gemini on Vertex AI; optional LangSmith
Authenticated Llm Agent

io.github.mikerawsonnz/authenticated-llm-agent

JWT-gated LLM gateway: authenticate (bcrypt/JWT), then run a LangChain-on-Vertex Gemini completion.
Copilot Memory MCP

labforgedev/copilot-memory-mcp

Persistent semantic memory for AI agents using local ChromaDB vector search. No cloud required.
1
Agent Prompt Injection Firewall Mcp

csoai-org/agent-prompt-injection-firewall-mcp

The WAF for agents. Pattern-based + heuristic firewall scans prompts, RAG documents, tool argume...
Authenticated Multi Llm Agent

io.github.mikerawsonnz/authenticated-multi-llm-agent

Google-OAuth-gated LLM gateway: verify a Google ID token, then run a Gemini (Vertex AI) completion f