petekp/formal-verify

Name: formal-verify
Author: petekp

skills/formal-verify/SKILL.md

npx skillsauth add petekp/claude-skills formal-verify


formal-verify

Use this skill when architectural intent matters more than "it compiles."

This skill runs a three-layer verification loop:

Layer 1: structural verification over extracted AST facts and declarative rules
Layer 2: behavioral verification over Z3Py protocol specs and TLA+/Apalache state-machine specs
Layer 3: elegance auditing over complexity, consistency, and craft heuristics

The layers are intentionally tiered:

every edit: Layer 1 only, fast enough for continuous feedback
slice checkpoint: Layers 1 and 2
pre-commit and manual /verify: all three layers
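The tiering above can be sketched as a small lookup table. This is only an illustration: the trigger names and the `layers_for` helper are hypothetical and not taken from the skill's implementation.

```python
# Hypothetical trigger names; the layer assignments mirror the tiers listed above.
LAYERS_BY_TRIGGER = {
    "edit": (1,),                # fast structural feedback only
    "slice-checkpoint": (1, 2),  # structural + behavioral
    "pre-commit": (1, 2, 3),     # all three layers
    "manual": (1, 2, 3),         # a manual /verify run
}


def layers_for(trigger: str) -> tuple:
    """Return the verification layers to run for a given trigger."""
    try:
        return LAYERS_BY_TRIGGER[trigger]
    except KeyError:
        raise ValueError(f"unknown trigger: {trigger!r}") from None
```

Keeping the mapping in one table makes the cost trade-off explicit: the cheaper a trigger fires, the fewer layers it runs.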

Quick Start

Bootstrap a target project with:

/verify --bootstrap

Bootstrap runs four phases:

Install dependencies and create .verifier/
Discover architectural rules from docs and code shape
Interview the user in plain English about ambiguities
Validate the initial rules against the current codebase

Commands

/verify: Runs all layers in verbose mode and prints a unified report.
/verify --bootstrap: Installs dependencies, creates .verifier/, and scaffolds the first rule set.
/verify --evolve: Checks for drift between architectural docs and existing verification specs.
/verify --grade: Runs Layer 3 only and reports the current elegance grade.

How Verification Runs

Layer 1: Structural

The runner extracts facts from Rust and Swift source files, then checks structural.yaml rules such as:

only module X may cross boundary Y
modules matching pattern Z must implement interface W
no module may reference legacy identifiers

Structural checks are the default PostToolUse hook because they are the fastest.
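As a rough sketch of what checking a rule like "only module X may cross boundary Y" over extracted facts could look like, assume each fact records one module referencing another. The `Reference` type, the `::`-based boundary convention, and the function names are all hypothetical; the real structural.yaml schema and fact format are defined in the references, not here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Reference:
    """One extracted fact: module `source` references module `target`."""
    source: str
    target: str


def boundary_of(module: str) -> str:
    """Hypothetical convention: the first path segment names the boundary."""
    return module.split("::", 1)[0]


def check_only_may_cross(facts, allowed_module: str, boundary: str):
    """Return every fact that crosses into `boundary` from outside it,
    unless the crossing originates from `allowed_module`."""
    return [
        fact
        for fact in facts
        if boundary_of(fact.target) == boundary
        and boundary_of(fact.source) != boundary
        and fact.source != allowed_module
    ]


facts = [
    Reference("app::gateway", "storage::db"),    # allowed crossing
    Reference("app::ui", "storage::db"),         # violation
    Reference("storage::cache", "storage::db"),  # stays inside the boundary
]
violations = check_only_may_cross(facts, "app::gateway", "storage")
```

Here `violations` holds only the `app::ui` reference, which is the kind of counterexample a structural check would report.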

Layer 2: Behavioral

Behavioral verification covers state transitions and protocol contracts:

TLA+/Apalache for temporal properties, liveness, and interleavings
Z3Py spec files for contracts, invariants, and cross-boundary data guarantees

Use this layer at slice checkpoints, before risky merges, and whenever a change touches coordination logic or cross-language contracts.
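To give a feel for the kind of question this layer answers, here is a plain-Python stand-in: an explicit-state breadth-first search that checks a safety invariant on a tiny two-worker lock protocol and returns a counterexample trace when the invariant fails. It does not use TLA+, Apalache, or Z3, and it covers only safety (not liveness); the protocol and all names are hypothetical.

```python
from collections import deque


def successors(state):
    """Two workers share one lock. state = (pc0, pc1, lock_held)."""
    pcs, lock = list(state[:2]), state[2]
    for i in (0, 1):
        nxt = pcs.copy()
        if pcs[i] == "idle" and not lock:
            nxt[i] = "critical"              # acquire the lock
            yield (nxt[0], nxt[1], True)
        elif pcs[i] == "critical":
            nxt[i] = "idle"                  # release the lock
            yield (nxt[0], nxt[1], False)


def buggy_successors(state):
    """Same protocol, but workers enter without checking the lock."""
    pcs = list(state[:2])
    for i in (0, 1):
        nxt = pcs.copy()
        nxt[i] = "critical" if pcs[i] == "idle" else "idle"
        yield (nxt[0], nxt[1], state[2])


def find_counterexample(initial, step, invariant):
    """Search reachable states breadth-first; return the shortest trace
    to a state that violates `invariant`, or None if none is reachable."""
    parent = {initial: None}
    queue = deque([initial])
    while queue:
        state = queue.popleft()
        if not invariant(state):
            trace = []
            while state is not None:
                trace.append(state)
                state = parent[state]
            return trace[::-1]
        for nxt in step(state):
            if nxt not in parent:
                parent[nxt] = state
                queue.append(nxt)
    return None


def mutual_exclusion(state):
    """Safety invariant: both workers are never critical at once."""
    return not (state[0] == "critical" and state[1] == "critical")


INITIAL = ("idle", "idle", False)
```

Calling `find_counterexample(INITIAL, successors, mutual_exclusion)` finds no violation, while the buggy variant yields a three-state trace ending with both workers in the critical section. A model checker such as Apalache explores interleavings in the same spirit, with far richer specifications.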

Layer 3: Elegance

Elegance auditing scores code for:

complexity
consistency
craft

It produces a grade and line-level deductions so the agent can clean up code, not just make it technically correct.
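One way to picture "a grade and line-level deductions" is a score that starts at 100 and loses points per finding. The sketch below assumes exactly that; the point values, grade floors, and the `Deduction` type are hypothetical, since the actual thresholds and grade policy live in elegance.yaml.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Deduction:
    path: str
    line: int
    family: str   # "complexity", "consistency", or "craft"
    points: int
    reason: str


# Hypothetical grade floors; the real policy is configured in elegance.yaml.
GRADE_FLOORS = [(90, "A"), (80, "B"), (70, "C"), (60, "D")]


def grade(deductions, start: int = 100):
    """Subtract all deductions from `start` and map the score to a letter."""
    score = max(0, start - sum(d.points for d in deductions))
    for floor, letter in GRADE_FLOORS:
        if score >= floor:
            return score, letter
    return score, "F"


def report_lines(deductions):
    """Render deductions sorted by location so fixes can be made in order."""
    ordered = sorted(deductions, key=lambda d: (d.path, d.line))
    return [
        f"{d.path}:{d.line} [{d.family}] -{d.points} {d.reason}"
        for d in ordered
    ]
```

Tying each deduction to a file and line is what lets an agent act on the grade instead of merely reading it.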

Violation Handling

When a violation is found, tailor the output to the audience:

agent output: counterexample, diagnosis, concrete fix suggestion
human output: counterexample and diagnosis only

If the agent fails to resolve the same violation three times, stop the fix loop and escalate with:

the original rule
the counterexample
the three attempted fixes
what still appears to block a correct repair
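The audience split and the three-attempt escalation described above can be sketched as follows. This is a minimal illustration under stated assumptions: the `Violation` type, the callback signatures, and the returned dictionary shape are hypothetical and not the skill's actual interface.

```python
from dataclasses import dataclass

MAX_ATTEMPTS = 3  # matches the three-attempt limit described above


@dataclass(frozen=True)
class Violation:
    rule: str
    counterexample: str
    diagnosis: str
    suggested_fix: str


def render(violation: Violation, audience: str) -> str:
    """Agents get a concrete fix suggestion; humans get only the evidence."""
    if audience not in ("agent", "human"):
        raise ValueError(f"unknown audience: {audience!r}")
    lines = [
        f"counterexample: {violation.counterexample}",
        f"diagnosis: {violation.diagnosis}",
    ]
    if audience == "agent":
        lines.append(f"suggested fix: {violation.suggested_fix}")
    return "\n".join(lines)


def fix_loop(violation, attempt_fix, still_violated, describe_blocker):
    """Try up to MAX_ATTEMPTS fixes, then escalate with the full history.

    attempt_fix(violation, n) applies a fix and returns a short description,
    still_violated() re-runs the check, and describe_blocker() summarizes
    what appears to prevent a correct repair. All three are caller-supplied.
    """
    attempts = []
    for n in range(1, MAX_ATTEMPTS + 1):
        attempts.append(attempt_fix(violation, n))
        if not still_violated():
            return {"status": "resolved", "attempted_fixes": attempts}
    return {
        "status": "escalated",
        "rule": violation.rule,
        "counterexample": violation.counterexample,
        "attempted_fixes": attempts,
        "blocker": describe_blocker(),
    }
```

The escalation payload carries the same four items listed above, so a human picking up the problem does not have to reconstruct what the agent already tried.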

Project Structure Created In The Target Repo

Bootstrap creates and maintains:

.verifier/
├── structural.yaml
├── elegance.yaml
├── specs/
├── facts/
└── reports/

structural.yaml stores declarative Layer 1 rules
elegance.yaml stores thresholds and grade policy
specs/ stores Z3Py and TLA+ behavioral specs
facts/ caches extracted AST facts
reports/ stores the most recent verification outputs

facts/ and reports/ should be gitignored in the target project.
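For orientation, the layout above and the gitignore advice could be reproduced by hand with a few lines of Python. This is not the skill's bootstrap code; the `scaffold` function and its behavior (empty placeholder files, idempotent .gitignore update) are assumptions made for illustration.

```python
from pathlib import Path

VERIFIER_FILES = ("structural.yaml", "elegance.yaml")
VERIFIER_DIRS = ("specs", "facts", "reports")
# Cached facts and generated reports are the two paths to keep out of git.
IGNORED = (".verifier/facts/", ".verifier/reports/")


def scaffold(repo_root: Path) -> Path:
    """Create the .verifier/ layout and gitignore its generated directories.

    Safe to call repeatedly: existing files are left untouched and
    .gitignore entries are appended only when missing.
    """
    root = repo_root / ".verifier"
    root.mkdir(exist_ok=True)
    for name in VERIFIER_FILES:
        (root / name).touch(exist_ok=True)
    for name in VERIFIER_DIRS:
        (root / name).mkdir(exist_ok=True)

    gitignore = repo_root / ".gitignore"
    text = gitignore.read_text() if gitignore.exists() else ""
    missing = [entry for entry in IGNORED if entry not in text.splitlines()]
    if missing:
        if text and not text.endswith("\n"):
            text += "\n"
        gitignore.write_text(text + "\n".join(missing) + "\n")
    return root
```

Calling `scaffold(Path("."))` from a repository root would create the tree shown above; in practice `/verify --bootstrap` is the supported way to set it up.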

Operating Guidance

Run /verify before claiming a migration is complete.
Run /verify --grade when the code is correct but still feels rough.
Prefer updating rules and specs over weakening them when the architecture evolves intentionally.
Keep SKILL.md focused on orchestration; pull detailed mechanics from the references below.

References

@references/layer1-structural.md: Fact extraction, Z3 encoding, reachability, and incremental invalidation.
@references/layer2-behavioral.md: When to use TLA+/Apalache versus Z3Py, plus spec execution contracts.
@references/layer3-elegance.md: Metric families, grading, thresholds, and the Layer 3 sub-module layout.
@references/constraint-yaml-spec.md: Structural rule schema, selectors, assertions, and fact pattern operators.
@references/bootstrap-process.md: The install, discover, interview, validate bootstrap workflow.
@references/agent-feedback-loop.md: Hook integration, violation injection, retries, and escalation policy.
@references/spec-authoring-guide.md: Translating plain-English architectural intent into formal specs.

Continuous formal verification of architectural constraints and code quality. Use when asked to verify, audit, or validate codebase integrity. Runs automatically via hooks on every edit (structural) and pre-commit (full). Catches ownership violations, boundary crossings, state machine bugs, and code smells that grep ratchets miss. Triggers: "verify", "formal verify", "check architecture", "audit code quality", "run verification", "/verify", "/verify --bootstrap", "/verify --grade".

25 stars

development

Updated Apr 22, 2026

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

npx skillsauth add petekp/claude-skills formal-verify

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy (container and dependency vulnerability scanner): Clean, 95%
Semgrep (static code analysis for vulnerabilities): Clean, 95%
mcp-scan (Snyk) (Model Context Protocol security validation): Clean, 95%
Snyk (dep) (open source security scanning): Skipped, 50%
Socket.dev (supply chain security analysis): Skipped, 50%
VirusTotal (multi-engine malware detection): Skipped, 50%
CrowdStrike (advanced threat intelligence): Skipped, 50%
OSV-Scanner (Open Source Vulnerability database check): Skipped, 50%
OWASP Dep-Check: Skipped, 50%

Last scanned: Apr 25, 2026, 10:58 PM; 392.1s; 1 file scanned

SKILL.md

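name: formal-verify
description: >
  Continuous formal verification of architectural constraints and code quality.
  Use when asked to verify, audit, or validate codebase integrity. Runs
  automatically via hooks on every edit (structural) and pre-commit (full).
  Catches ownership violations, boundary crossings, state machine bugs,
  and code smells that grep ratchets miss. Triggers: "verify", "formal verify",
  "check architecture", "audit code quality", "run verification", "/verify",
  "/verify --bootstrap", "/verify --grade".
license: MIT
author: petekp
version: 0.1.0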

Install manually

# Clone the repo
git clone https://github.com/petekp/claude-skills.git

# Copy into Claude Code skills folder (global)
cp -r claude-skills/skills/formal-verify ~/.claude/skills/

Repository: petekp/claude-skills

Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT