This scans your agent instructions and system prompts for vulnerabilities where a user could slip in conflicting commands and hijack the agent's behavior. You'd run it when building new skills or before deploying agents that take untrusted input. It focuses specifically on instruction-level attacks, not code vulnerabilities or credential leaks. The output is a report pointing to risky sections with suggested fixes. One honest take: this addresses a real problem that's easy to overlook when you're heads-down building agents, especially since prompt injection patterns keep evolving. Worth running periodically on production skills, not just once at launch.
npx -y skills add jorgealves/agent_skills --skill prompt-injection-scanner --agent claude-codeInstalls into .claude/skills of the current project.
The prompt-injection-scanner is a security tool specifically for the AI agent era. It identifies weak points in agent instructions where a malicious user could potentially "hijack" the agent's behavior by inserting conflicting instructions into input fields.
capabilities or instructions for an agent skill.secret-leak-detector for credentials; this is specifically for "instruction-level" security.skill_path: "./agent-skills/data-processor/SKILL.md"
A structured report highlighting parts of the instructions that are susceptible to prompt hijacking, along with concrete mitigation strategies.
hoodini/ai-agents-skills
addyosmani/agent-skills
giuseppe-trisciuoglio/developer-kit
agamm/claude-code-owasp