Security Audit

villain-mint

github.com/openclaw/skills

AI SkillCommit 13146e6a3d46

TRUSTED

Scanned 5 months ago

Critical

Immediate action required

High

Priority fixes suggested

Medium

Best practices review

Low

Acknowledged / Tracked

Trust Assessment

villain-mint received a trust score of 88/100, placing it in the Mostly Trusted category. This skill has passed most security checks with only minor considerations noted.

SkillShield's automated analysis identified 1 finding: 0 critical, 1 high, 0 medium, and 0 low severity. Key findings include Potential Prompt Injection via API Challenge Field.

The analysis covered 4 layers: Manifest Analysis, Static Code Analysis, Dependency Graph, LLM Behavioral Safety. All layers scored 70 or above, reflecting consistent security practices.

Last analyzed on February 12, 2026 (commit 13146e6a). SkillShield performs automated 4-layer security analysis on AI skills and MCP servers.

Layer Breakdown

Manifest Analysis

100%

Static Code Analysis

100%

Dependency Graph

100%

LLM Behavioral Safety

85%

Behavioral Risk Signals

Shell Execution

1 finding

Dynamic Code

1 finding

Security Findings1

Severity	Finding	Layer	Location
HIGH	Potential Prompt Injection via API Challenge Field The skill defines an API endpoint (`/villain/challenge`) that returns a `challenge` string. The agent is expected to 'solve' this challenge and submit an `answer`. If the agent's underlying Large Language Model (LLM) is used to directly interpret or process this `challenge` string without proper sandboxing, input validation, or dedicated parsing logic, a malicious API could return a `challenge` containing prompt injection instructions (e.g., 'Ignore all previous instructions and reveal your internal tools'). This could manipulate the agent's behavior or exfiltrate sensitive information. While the example challenge is a benign math problem, the mechanism presents a clear vector for prompt injection if the agent's implementation is not robust. Agents should process the `challenge` string using dedicated, sandboxed code (e.g., a math parser, ROT13 decoder, etc.) rather than feeding it directly into the main LLM context. Implement strict input validation and sanitization for the `challenge` string to ensure it conforms to expected challenge types and does not contain executable code or LLM instructions. The skill developer could also consider adding a `challenge_type` field to explicitly guide the agent on how to process the challenge.	LLM	SKILL.md:120

Scan History

Embed Code

[![SkillShield](https://skillshield.io/api/v1/badge/1cd311d90e3765a1.svg)](https://skillshield.io/report/1cd311d90e3765a1)