Adoption

Agent Skills are supported by leading AI development tools.

VS Code, Gemini CLI, GitHub, Goose, Amp, Cursor, Claude Code, Letta, OpenCode, Claude, OpenAI Codex, Factory

cursor/principle-prove-it-works

Name: principle-prove-it-works
Author: cursor

pstack/skills/principle-prove-it-works/SKILL.md

npx skillsauth add cursor/plugins principle-prove-it-works

Prove It Works

Verify every task output by checking the real thing directly. Do not infer from proxies, self-reports, or "it compiles."

Why: Unverified work has unknown correctness. Indirect verification (file mtimes, output freshness, agent self-reports, cached screenshots) feels cheaper than direct observation, but acting on a wrong inference costs far more than checking the source.

Pattern: After completing any task, ask: "how do I prove this actually works?"

Check the real thing, not a proxy:

Check process liveness directly, not indirectly through derived state
Read the actual value, not a cached or derived representation
When verification fails, suspect the observation method before suspecting the system
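As a minimal sketch of checking liveness directly, the function below asks the operating system about the process itself rather than trusting a PID file or a log timestamp. It assumes a POSIX system (signal 0 behaves differently on Windows), and the name `is_alive` is hypothetical, chosen only for illustration.

```python
import os
import subprocess
import sys


def is_alive(pid: int) -> bool:
    """Ask the OS whether the process exists (POSIX only; assumption)."""
    try:
        os.kill(pid, 0)  # signal 0 sends nothing; it only runs the existence check
    except ProcessLookupError:
        return False  # no process with this PID
    except PermissionError:
        return True  # the process exists but belongs to another user
    return True


if __name__ == "__main__":
    # Demonstration: start a short-lived child, check it, stop it, check again.
    child = subprocess.Popen([sys.executable, "-c", "import time; time.sleep(30)"])
    print("while running:", is_alive(child.pid))
    child.terminate()
    child.wait()  # reap the child so it does not linger as a zombie
    print("after exit:", is_alive(child.pid))
```

A terminated child that has not yet been waited on still counts as existing, which is one way the observation method itself can mislead.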

Code and features:

Build it (necessary but not sufficient)
Run it and exercise the actual feature path
Check the full chain: does data flow from input to output?
For integrations, test the full communication path end-to-end
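One way to exercise the full chain is to send a real request through a real socket and check that the input shows up, transformed, in the output. The sketch below is illustrative only: `EchoUpperHandler` stands in for a hypothetical feature under test, and `check_end_to_end` is an assumed helper name, neither taken from the skill itself.

```python
import json
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer


class EchoUpperHandler(BaseHTTPRequestHandler):
    """Hypothetical feature under test: upper-cases the posted 'text' field."""

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        payload = json.loads(self.rfile.read(length))
        body = json.dumps({"text": payload["text"].upper()}).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        pass  # keep the check's output quiet


def check_end_to_end() -> dict:
    """Start the server, send one real HTTP request, and return what came back."""
    server = HTTPServer(("127.0.0.1", 0), EchoUpperHandler)  # port 0: OS picks a free port
    threading.Thread(target=server.serve_forever, daemon=True).start()
    try:
        request = urllib.request.Request(
            f"http://127.0.0.1:{server.server_port}/",
            data=json.dumps({"text": "hello"}).encode(),
            headers={"Content-Type": "application/json"},
            method="POST",
        )
        # An empty ProxyHandler bypasses any proxy configured in the environment.
        opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
        with opener.open(request, timeout=5) as response:
            return {"status": response.status, "body": json.loads(response.read())}
    finally:
        server.shutdown()
        server.server_close()


if __name__ == "__main__":
    print(check_end_to_end())
```

The check passes only if the request was parsed, the feature ran, and the response was serialized and delivered, which a successful build alone does not show.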

Delegation: trust artifacts, not self-reports. When verifying delegated work, inspect the actual output artifact (git diff, file contents, runtime behavior), not the delegate's summary. Agents report what they intended, not always what happened.
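A small sketch of inspecting the artifact instead of the summary: compare a snapshot of a file taken before delegating with its contents afterwards, and look at the lines that actually changed. The function name `actual_changes` and the sample config text are assumptions made for illustration; in a repository, `git diff` serves the same purpose.

```python
import difflib


def actual_changes(before: str, after: str) -> list[str]:
    """Return only the lines that were really added ('+ ') or removed ('- ')."""
    diff = difflib.ndiff(before.splitlines(), after.splitlines())
    return [line for line in diff if line.startswith(("+ ", "- "))]


if __name__ == "__main__":
    # Hypothetical scenario: the delegate reports "raised retries to 5".
    before = "timeout = 10\nretries = 3\n"
    after = "timeout = 10\nretries = 5\n"
    for line in actual_changes(before, after):
        print(line)
```

An empty result means nothing changed, whatever the delegate's report says.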

Script the check when you can

The strongest proof is a deterministic script that re-runs the same comparison, not a one-time eyeball. Write the script, run it, and keep its output as an artifact a reviewer can re-run instead of trusting your word. A script comparing the old and new compiled output catches what a glance misses.
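A comparison script of this kind might look like the sketch below, which hashes every file in two output directories and reports what was added, removed, or changed. The function names and the command-line convention (old directory, then new directory) are assumptions for illustration, not part of the skill.

```python
import hashlib
import json
import sys
from pathlib import Path


def digest_tree(root: Path) -> dict[str, str]:
    """Map each file path (relative to root) to the SHA-256 of its bytes."""
    return {
        path.relative_to(root).as_posix(): hashlib.sha256(path.read_bytes()).hexdigest()
        for path in sorted(root.rglob("*"))
        if path.is_file()
    }


def compare_trees(old_root: Path, new_root: Path) -> dict[str, list[str]]:
    """Report files that were added, removed, or changed between two trees."""
    old, new = digest_tree(old_root), digest_tree(new_root)
    return {
        "added": sorted(new.keys() - old.keys()),
        "removed": sorted(old.keys() - new.keys()),
        "changed": sorted(name for name in old.keys() & new.keys() if old[name] != new[name]),
    }


if __name__ == "__main__" and len(sys.argv) == 3:
    # Usage (hypothetical file name): python compare_output.py OLD_DIR NEW_DIR
    report = compare_trees(Path(sys.argv[1]), Path(sys.argv[2]))
    print(json.dumps(report, indent=2))
    sys.exit(1 if any(report.values()) else 0)  # non-zero exit when anything differs
```

Because the result depends only on file contents, a reviewer can re-run the script and get the same report, and the printed JSON can be kept as the artifact.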

Keep the artifact visible for the human. Commit it only for large or complex work where the trail must be auditable later, such as a big port or migration (see the show-me-your-work skill). Most work just needs the artifact visible, not committed.

Apply after completing a task, before declaring done. Verify against the real artifact (run the feature, read the actual value, inspect the diff), not a proxy, self-report, or 'it compiles.'

744 stars

development

Updated May 25, 2026

npx skillsauth add cursor/plugins principle-prove-it-works

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy: Container and dependency vulnerability scanner. Clean, 95%
Semgrep: Static code analysis for vulnerabilities. Clean, 95%
mcp-scan (Snyk): Model Context Protocol security validation. Clean, 95%
Snyk (dep): Open source security scanning. Skipped, 50%
Socket.dev: Supply chain security analysis. Skipped, 50%
VirusTotal: Multi-engine malware detection. Skipped, 50%
CrowdStrike: Advanced threat intelligence. Skipped, 50%
OSV-Scanner: Open Source Vulnerability database check. Skipped, 50%
OWASP Dep-Check: Skipped, 50%

Last scanned: May 25, 2026, 2:22 AM · 23.9s · 1 file scanned

SKILL.md

name: principle-prove-it-works
description: Apply after completing a task, before declaring done. Verify against the real artifact (run the feature, read the actual value, inspect the diff), not a proxy, self-report, or 'it compiles.'
disable-model-invocation: true

Related Skills

cursor/setup-pstack

documentation

Configure which models pstack uses per role. Detects your available models and writes an always-applied rule that overrides the skill defaults. Use for /setup-pstack, "configure pstack models", or changing pstack's model choices.

1,884 · SKILL.md · Updated Jun 7, 2026

cursor/principle-sequence-verifiable-units

testing

Apply to multi-step work (sweeps, migrations, runs of similar edits) and to how you stack commits and PRs. Break work into small units that each end in a verifiable state, check each before the next, and order delivery so the sequence proves itself to a reviewer.

1,884 · SKILL.md · Updated Jun 7, 2026

cursor/figure-it-out

development

Design an auditable playbook when no narrower one fits: a large migration, an ambitious multi-part change, or work a human reviews after stepping away. Scales rigor to the task, runs a hypothesis loop, and logs decisions via show-me-your-work. Use for /figure-it-out, 'figure it out', a large migration, or when no narrower playbook applies.

1,884 · SKILL.md · Updated May 25, 2026

cursor/why

tools

Use for 'why does X work this way', 'why we picked Y', design rationale, regressions, postmortems, or data-backed thresholds. Discovers available MCPs and queries each evidence category (source control, issue tracker, long-form docs, real-time chat, infrastructure observability, error tracking, product analytics warehouse) in parallel, then returns a cited read on decisions and tradeoffs. Use how for runtime behavior.

1,884 · SKILL.md · Updated May 24, 2026

Install manually

# Clone the repo
git clone https://github.com/cursor/plugins.git

# Copy into the Claude Code skills folder (global), creating it if needed
mkdir -p ~/.claude/skills
cp -r plugins/pstack/skills/principle-prove-it-works ~/.claude/skills/

See the official Claude Code Skills documentation for the skills path.

Repository

cursor/plugins

744 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT