Verified Skills Standard

A graduated security certification system for AI agent skills.

The Problem

AI agent skills execute with the full privileges of their host agent — filesystem access, terminal execution, and credential visibility. Yet skills are published to open registries with zero review.

Snyk's ToxicSkills study (February 5, 2026) quantified the risk: scanning 3,984 skills from public registries, they found 1,467 (36.82%) containing at least one security flaw, including 76 confirmed malicious payloads. Real attacks include credential exfiltration (SSH keys, AWS tokens, crypto wallets), crypto miners, and prompt injection designed to persist across sessions via memory poisoning.

No existing platform had comprehensive security scanning. The Verified Skills Standard fills this gap.

For the full threat landscape, platform comparison, and risk taxonomy, see the Skills Ecosystem Security page.

Three-Tier Certification

The standard defines three escalating levels of trust, each building on the previous tier.

Tier 1: Scanned

Automated pattern matching against 41 known-dangerous patterns across 9 categories.

Category	What It Catches
Destructive commands	`rm -rf`, `DROP TABLE`, `format`, `dd`, `mkfs`
Remote code execution	`curl \| bash`, `wget \| bash`, `eval()`, `exec()`, `child_process`
Credential access	`~/.ssh/`, `~/.aws/`, wallet paths, `.env`, `credentials.json`
Prompt injection	`<system>` tags, "ignore previous instructions", role reassignment
Obfuscation	`atob`/`btoa`, `base64 -d`, hex sequences, password-protected archives
Memory poisoning	Writes to `SOUL.md`, `MEMORY.md`, `CLAUDE.md`
Data exfiltration	`curl --data`, encoded payloads in URL parameters
Dangerous permissions	`chmod 777`, unrestricted file access
Network access	`fetch()`, `http.get`, `axios`, hardcoded URLs

Cost: Free
Speed: < 500ms
Output: Pass/Fail with detailed findings by severity (critical, high, medium, low, info)

Tier 2: Verified

Everything in Tier 1 plus LLM-based semantic analysis for threats that pattern matching cannot catch.

The LLM judge evaluates intent, not just syntax. It detects:

Social engineering — "download and run this helper script"
Scope inflation — declared scope says src/ only, but behavior touches ~/
Obfuscated intent — indirect language that avoids triggering pattern rules
Multi-step attacks — individually safe operations that compose into dangerous behavior
Chained skill attacks — dependency chains that distribute trust across multiple skills
Cost: ~$0.03 per skill
Speed: 5-15 seconds
Output: Score 0-100 with verdict (PASS >= 80, CONCERNS 60-79, FAIL < 60)

Tier 3: Certified

Everything in Tiers 1 and 2 plus human security review and sandbox testing.

A professional security reviewer:

Reads the full skill source code
Tests behavior in a sandboxed environment
Validates that declared scope matches actual behavior
Checks for edge cases the automated tiers cannot catch
Cost: $50-200 per skill
Speed: 1-5 business days
Output: Certification report with findings and recommendations

Trust Badges and Labels

Skills display trust badges indicating their highest achieved certification tier:

Badge	Meaning
Scanned	Passed 41-pattern automated scan
Verified	Passed automated scan + LLM intent analysis
Certified	Passed all tiers including human review
Vendor	Published by a trusted organization (Anthropic, OpenAI, Google, Vercel, Microsoft, Supabase)

Additional labels provide context:

Label	Meaning
`safe`	No network access, no filesystem writes outside declared scope
`extensible`	Supports DCI and skill memories (Extensible Skills Standard)
`portable`	Works across multiple AI agents
`popular`	High install count / community usage
`deprecated`	No longer maintained
`warning`	Known issues, use with caution

Mandatory SKILL.md Sections

To qualify for any certification tier, a SKILL.md file must include four sections:

description (frontmatter) — What the skill does and when it activates (10-1,024 characters)
## Scope — Languages, frameworks, tools, file patterns, and a "Does NOT" clause
## Permissions — Table of every tool permission with justification
## Security Notes — Data handling, network access, external dependencies

These sections make skill behavior transparent and auditable. The full specification with compliant examples is in the Secure Skill Factory Standard RFC.

Forbidden Patterns

Certain patterns constitute automatic disqualification from any certification tier, regardless of context. These include:

Code execution primitives (eval(), exec(), Function(), child_process)
Remote code execution (curl | bash, wget | bash)
Obfuscation (atob, btoa, base64 decode, hex sequences)
Credential access (~/.ssh/, ~/.aws/, wallet paths)
Memory poisoning (writes to SOUL.md, MEMORY.md, CLAUDE.md)
Data exfiltration (curl --data)
Destructive commands (rm -rf, DROP TABLE, format, dd)

The complete list of 41 forbidden patterns across 9 categories is documented in the Secure Skill Factory Standard RFC.

verifiedskill.com

verifiedskill.com is the trusted registry for AI agent skills. It provides:

Skill browsing — Search and filter skills by category, verification tier, and compatible agents
Submission pipeline — Submit your skill's GitHub repository for automated security verification
Trust badges — SVG badges you can embed in your skill's README
Pre-verified skills — Skills from major vendor repositories (Anthropic/Claude, OpenAI/Codex, Google/Gemini) are auto-verified and listed from day one

`vskill` CLI

The vskill CLI provides command-line access to the registry. Use any package runner:

# npm
npx vskill search "react components"

# bun
bunx vskill search "react components"

# pnpm
pnpx vskill search "react components"

# yarn
yarn dlx vskill search "react components"

# Get skill details with verification status
npx vskill info @anthropic/react-frontend

# Submit a skill for verification
npx vskill submit https://github.com/you/your-skill

# Check submission status
npx vskill status <submission-id>

Replace npx with bunx, pnpx, or yarn dlx in any of the above commands.

note

verifiedskill.com is currently in development (see increment 0225). The CLI and registry features described above represent the planned functionality.

The Full Specification

This page provides a concise overview of the Verified Skills Standard. For the complete technical specification — including all 41 forbidden patterns, structural validation rules, vendor auto-verification logic, and backwards compatibility considerations — see the Secure Skill Factory Standard RFC.

The Problem​

Three-Tier Certification​

Tier 1: Scanned​

Tier 2: Verified​

Tier 3: Certified​

Trust Badges and Labels​

Mandatory SKILL.md Sections​

Forbidden Patterns​

verifiedskill.com​

vskill CLI​

The Full Specification​

See Also​