Agent skill

oe-security-prompt-injection

Maintain and extend prompt-injection defenses. Use when adding new user-input surfaces, changing prompt templates, or when a new injection pattern is observed; run the security regression suite and add a minimal new test case.

Stars 163
Forks 31

Install this agent skill to your Project

npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/development/oe-security-prompt-injection

SKILL.md

oe-security-prompt-injection

Run the regression suite

  • pytest backend/tests/regression/test_security_prompt_injection.py -v

Add a new attack case (when needed)

  1. Add the new payload to the parametrized attack list in backend/tests/regression/test_security_prompt_injection.py.
  2. Assert both:
    • the input is flagged as suspicious, and
    • the matched pattern/category is the expected one (so we catch drift).

Guardrails

  • Do not weaken detection to “make a test pass”; prefer tightening allowlists for safe inputs and adding targeted patterns for new attacks.

Didn't find tool you were looking for?

Be as detailed as possible for better results