Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

petekp/formal-verify

Name: formal-verify
Author: petekp

skills/formal-verify/SKILL.md

npx skillsauth add petekp/agent-skills formal-verify

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

formal-verify

Use this skill when architectural intent matters more than "it compiles."

This skill runs a three-layer verification loop:

Layer 1: structural verification over extracted AST facts and declarative rules
Layer 2: behavioral verification over Z3Py protocol specs and TLA+/Apalache state-machine specs
Layer 3: elegance auditing over complexity, consistency, and craft heuristics

The layers are intentionally tiered:

every edit: Layer 1 only, fast enough for continuous feedback
slice checkpoint: Layers 1 and 2
pre-commit and manual /verify: all three layers

Quick Start

Bootstrap a target project with:

/verify --bootstrap

Bootstrap runs four phases:

Install dependencies and create .verifier/
Discover architectural rules from docs and code shape
Interview the user in plain English about ambiguities
Validate the initial rules against the current codebase

Commands

/verify Runs all layers in verbose mode and prints a unified report.
/verify --bootstrap Installs dependencies, creates .verifier/, and scaffolds the first rule set.
/verify --evolve Checks for drift between architectural docs and existing verification specs.
/verify --grade Runs Layer 3 only and reports the current elegance grade.

How Verification Runs

Layer 1: Structural

The runner extracts facts from Rust and Swift source files, then checks structural.yaml rules such as:

only module X may cross boundary Y
modules matching pattern Z must implement interface W
all modules must not reference legacy identifiers

Structural checks are the default PostToolUse hook because they are the fastest.

Layer 2: Behavioral

Behavioral verification covers state transitions and protocol contracts:

TLA+/Apalache for temporal properties, liveness, and interleavings
Z3Py spec files for contracts, invariants, and cross-boundary data guarantees

Use this layer at slice checkpoints, before risky merges, and whenever a change touches coordination logic or cross-language contracts.

Layer 3: Elegance

Elegance auditing scores code for:

complexity
consistency
craft

It produces a grade and line-level deductions so the agent can clean up code, not just make it technically correct.

Violation Handling

When a violation is found, tailor the output to the audience:

agent output: counterexample, diagnosis, concrete fix suggestion
human output: counterexample and diagnosis only

If the agent fails to resolve the same violation three times, stop the fix loop and escalate with:

the original rule
the counterexample
the three attempted fixes
what still appears to block a correct repair

Project Structure Created In The Target Repo

Bootstrap creates and maintains:

.verifier/
├── structural.yaml
├── elegance.yaml
├── specs/
├── facts/
└── reports/

structural.yaml stores declarative Layer 1 rules
elegance.yaml stores thresholds and grade policy
specs/ stores Z3Py and TLA+ behavioral specs
facts/ caches extracted AST facts
reports/ stores the most recent verification outputs

facts/ and reports/ should be gitignored in the target project.

Operating Guidance

Run /verify before claiming a migration is complete.
Run /verify --grade when the code is correct but still feels rough.
Prefer updating rules and specs over weakening them when the architecture evolves intentionally.
Keep SKILL.md focused on orchestration; pull detailed mechanics from the references below.

References

@references/layer1-structural.md Fact extraction, Z3 encoding, reachability, and incremental invalidation.
@references/layer2-behavioral.md When to use TLA+/Apalache versus Z3Py, plus spec execution contracts.
@references/layer3-elegance.md Metric families, grading, thresholds, and the Layer 3 sub-module layout.
@references/constraint-yaml-spec.md Structural rule schema, selectors, assertions, and fact pattern operators.
@references/bootstrap-process.md The install, discover, interview, validate bootstrap workflow.
@references/agent-feedback-loop.md Hook integration, violation injection, retries, and escalation policy.
@references/spec-authoring-guide.md Translating plain-English architectural intent into formal specs.

petekp/formal-verify

skills/formal-verify/SKILL.md

Continuous formal verification of architectural constraints and code quality. Use when asked to verify, audit, or validate codebase integrity. Runs automatically via hooks on every edit (structural) and pre-commit (full). Catches ownership violations, boundary crossings, state machine bugs, and code smells that grep ratchets miss. Triggers: "verify", "formal verify", "check architecture", "audit code quality", "run verification", "/verify", "/verify --bootstrap", "/verify --grade".

4 stars

development

Updated May 15, 2026

$ install --global

skillsauth

npx skillsauth add petekp/agent-skills formal-verify

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 15, 2026, 8:33 AM3.8s1 file scanned

SKILL.md

name:: formal-verify
description:: >
and code smells that grep ratchets miss. Triggers:: verify", "formal verify",
license:: MIT
author:: petekp
version:: 0.1.0

formal-verify

Use this skill when architectural intent matters more than "it compiles."

This skill runs a three-layer verification loop:

Layer 1: structural verification over extracted AST facts and declarative rules
Layer 2: behavioral verification over Z3Py protocol specs and TLA+/Apalache state-machine specs
Layer 3: elegance auditing over complexity, consistency, and craft heuristics

The layers are intentionally tiered:

every edit: Layer 1 only, fast enough for continuous feedback
slice checkpoint: Layers 1 and 2
pre-commit and manual /verify: all three layers

Quick Start

Bootstrap a target project with:

/verify --bootstrap

Bootstrap runs four phases:

Install dependencies and create .verifier/
Discover architectural rules from docs and code shape
Interview the user in plain English about ambiguities
Validate the initial rules against the current codebase

Commands

/verify Runs all layers in verbose mode and prints a unified report.
/verify --bootstrap Installs dependencies, creates .verifier/, and scaffolds the first rule set.
/verify --evolve Checks for drift between architectural docs and existing verification specs.
/verify --grade Runs Layer 3 only and reports the current elegance grade.

How Verification Runs

Layer 1: Structural

The runner extracts facts from Rust and Swift source files, then checks structural.yaml rules such as:

only module X may cross boundary Y
modules matching pattern Z must implement interface W
all modules must not reference legacy identifiers

Structural checks are the default PostToolUse hook because they are the fastest.

Layer 2: Behavioral

Behavioral verification covers state transitions and protocol contracts:

TLA+/Apalache for temporal properties, liveness, and interleavings
Z3Py spec files for contracts, invariants, and cross-boundary data guarantees

Use this layer at slice checkpoints, before risky merges, and whenever a change touches coordination logic or cross-language contracts.

Layer 3: Elegance

Elegance auditing scores code for:

complexity
consistency
craft

It produces a grade and line-level deductions so the agent can clean up code, not just make it technically correct.

Violation Handling

When a violation is found, tailor the output to the audience:

agent output: counterexample, diagnosis, concrete fix suggestion
human output: counterexample and diagnosis only

If the agent fails to resolve the same violation three times, stop the fix loop and escalate with:

the original rule
the counterexample
the three attempted fixes
what still appears to block a correct repair

Project Structure Created In The Target Repo

Bootstrap creates and maintains:

.verifier/
├── structural.yaml
├── elegance.yaml
├── specs/
├── facts/
└── reports/

structural.yaml stores declarative Layer 1 rules
elegance.yaml stores thresholds and grade policy
specs/ stores Z3Py and TLA+ behavioral specs
facts/ caches extracted AST facts
reports/ stores the most recent verification outputs

facts/ and reports/ should be gitignored in the target project.

Operating Guidance

Run /verify before claiming a migration is complete.
Run /verify --grade when the code is correct but still feels rough.
Prefer updating rules and specs over weakening them when the architecture evolves intentionally.
Keep SKILL.md focused on orchestration; pull detailed mechanics from the references below.

References

@references/layer1-structural.md Fact extraction, Z3 encoding, reachability, and incremental invalidation.
@references/layer2-behavioral.md When to use TLA+/Apalache versus Z3Py, plus spec execution contracts.
@references/layer3-elegance.md Metric families, grading, thresholds, and the Layer 3 sub-module layout.
@references/constraint-yaml-spec.md Structural rule schema, selectors, assertions, and fact pattern operators.
@references/bootstrap-process.md The install, discover, interview, validate bootstrap workflow.
@references/agent-feedback-loop.md Hook integration, violation injection, retries, and escalation policy.
@references/spec-authoring-guide.md Translating plain-English architectural intent into formal specs.

Related Skills

petekp/write-goal

development

VerifiedTrustedCommunity

Compile a plain-language task into a concise, auditable Codex or Claude Code `/goal`, or explain why a normal prompt fits better. Use when the user asks to draft, formulate, rewrite, tighten, or create a goal for multi-step work that needs a durable objective, transcript-visible proof, constraints, bounded stop conditions, host-aware operation, and risk-based review depth.

4SKILL.mdUpdated May 19, 2026

petekp/unix-macos-engineer

tools

VerifiedTrustedCommunity

Expert Unix and macOS systems engineer for shell scripting, system administration, command-line tools, launchd, Homebrew, networking, and low-level system tasks. Use when the user asks about Unix commands, shell scripts, macOS system configuration, process management, or troubleshooting system issues.

4SKILL.mdUpdated May 15, 2026

petekp/unix-macos-engineer

petekp/typography

testing

VerifiedTrustedCommunity

Apply professional typography principles to create readable, hierarchical, and aesthetically refined interfaces. Use when setting type scales, choosing fonts, adjusting spacing, designing text-heavy layouts, implementing dark mode typography, or when asked about readability, font pairing, line height, measure, typographic hierarchy, variable fonts, font loading, or OpenType features.

4SKILL.mdUpdated May 15, 2026

petekp/tuning-panel

development

VerifiedTrustedCommunity

Create visual parameter tuning panels for iterative adjustment of animations, layouts, colors, typography, physics, or any numeric/visual values. Use when the user asks to "create a tuning panel", "add parameter controls", "build a debug panel", "tweak parameters visually", "fine-tune values", "dial in the settings", or "adjust parameters interactively". Also triggers on mentions of "leva", "dat.GUI", or "tweakpane".

4SKILL.mdUpdated May 15, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/petekp/agent-skills.git

# Copy into Claude Code skills folder (global)
cp -r agent-skills/skills/formal-verify ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

petekp/agent-skills

4 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT