Skip to main content

Security Audit

jailbreak-override

github.com/cisco-ai-defense/skill-scanner
AI SkillCommit de9371289c23
12
CRITICAL
Scanned 8 days ago
3
Critical
Immediate action required
0
High
Priority fixes suggested
0
Medium
Best practices review
0
Low
Acknowledged / Tracked

Trust Assessment

jailbreak-override received a trust score of 12/100, placing it in the Untrusted category. This skill has significant security findings that require attention before use in production.

SkillShield's automated analysis identified 3 findings: 3 critical, 0 high, 0 medium, and 0 low severity. Key findings include System prompt override / policy bypass, Prompt Injection: Attempt to override safety and content policies.

The analysis covered 4 layers: llm_behavioral_safety, manifest_analysis, static_code_analysis, dependency_graph. The manifest_analysis layer scored lowest at 40/100, indicating areas for improvement.

Last analyzed on February 12, 2026 (commit de937128). SkillShield performs automated 4-layer security analysis on AI skills and MCP servers.

Layer Breakdown

Manifest Analysis
40%
Static Code Analysis
100%
Dependency Graph
100%
LLM Behavioral Safety
70%

Behavioral Risk Signals

Dynamic Code
1 finding

Security Findings3

SeverityFindingLayerLocation

Scan History

Embed Code

[![SkillShield](https://skillshield.io/api/v1/badge/ee54bb3928aaf700.svg)](https://skillshield.io/report/ee54bb3928aaf700)
SkillShield Badge

Powered by SkillShield