Skip to content

Add prompt injection false positive tests#26

Merged
nagasatish007 merged 1 commit into
agentguard-ai:mainfrom
lleonardo-franco:issue-85-prompt-injection-negative-tests
May 30, 2026
Merged

Add prompt injection false positive tests#26
nagasatish007 merged 1 commit into
agentguard-ai:mainfrom
lleonardo-franco:issue-85-prompt-injection-negative-tests

Conversation

@lleonardo-franco
Copy link
Copy Markdown
Contributor

Summary

  • Add negative PromptInjectionGuardrail coverage for benign inputs that contain suspicious words such as ignore, system, and prompt injection.
  • Cover code comments, prompt engineering documentation, user formatting instructions, technical system logs, and version-specific guidance.
  • Assert each benign case passes with no detections.

Validation

  • npm test -- src/guardrails/__tests__/built-in-guardrails.test.ts --runInBand
  • npm run build:tsc
  • git diff --check

Resolves agentguard-ai/tealtiger#85

@nagasatish007 nagasatish007 merged commit 1500910 into agentguard-ai:main May 30, 2026
0 of 10 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Good First Issue] Add negative tests for prompt injection guardrail

2 participants