
Composite: security audit -> production upgrade -> self-evaluation. Use when user says 'audit', 'check the codebase', 'find and fix issues', or 'is this production-ready'.
Nth-iteration agent swarm — spawns parallel agent waves, evaluates strictly per wave, re-swarms gaps until 100% coverage and 10/10 quality. Can invoke any ProductionOS skill or command within waves.
Idea exploration before building — understand the problem, propose approaches, present design, get approval. HARD-GATE: no implementation until design is approved.
Headless browser for QA testing, site inspection, and interaction verification. Navigate, screenshot, click, fill forms, capture snapshots.
ProductionOS smart router — single entry point that routes to the right pipeline based on intent. The ONLY command new users need to know.
8-phase autonomous research pipeline with multi-source discovery, 4-layer citation verification, hypothesis generation, and PIVOT/REFINE/PROCEED decision loops. Confidence-gated — loops until 95%+ confidence.
Full UI/UX redesign pipeline — audits design, creates design systems, generates interactive HTML mockups, launches local browser for user interaction. Fuses /production-upgrade rigor with design agency methodology.
Full-stack frontend upgrade pipeline — fuses /production-upgrade iterative audit with /plan-ceo-review vision and /plan-eng-review rigor. Deploys parallel auto-swarm agents for iterative audit and execution. Enriched with /deep-research for competitive parity.
Interface Craft by ProductionOS Design — a toolkit for building polished, animated interfaces in React. Includes Motion System (human-readable animation DSL with stage-driven sequencing), DialKit (live control panels for tuning animation values), and Design Evaluator (systematic UI review based on ProductionOS Design's methodology). Triggers on: animate, animation, transition, storyboard, entrance, motion, spring, easing, timing, finetune-control, sliders, controls, tune, tweak, critique, review, feedback, audit, improve, polish, refine, redesign.
Interactive code tutor — breaks down codebase logic, explains complexities, translates technical concepts for the user. Ideal after /btw commands. Teaches the WHY behind the code, not just the WHAT.
Business idea -> production-ready plan pipeline. User provides an idea or business plan, agent researches market, competitors, existing solutions, challenges assumptions, identifies flaws, and builds a comprehensive execution plan with auto-document population.
Nuclear-scale autonomous research — deploys 500-1000 agents in ONE massive simultaneous wave for exhaustive topic saturation. Deep-research methodology x auto-swarm scale = maximum parallel intelligence. WARNING: Extreme resource consumption.
ProductionOS flagship — 13-step orchestrative pipeline with tri-tiered evaluation, recursive convergence, CEO/Eng/Design review chain, CLEAR framework evaluation, multi-model judge tribunal, and autonomous PIVOT/REFINE/PROCEED decisions. Targets 100% production-ready output.
CEO/founder-mode plan review — rethink the problem, find the 10-star product, challenge premises. Four modes: SCOPE EXPANSION, SELECTIVE EXPANSION, HOLD SCOPE, SCOPE REDUCTION.
Engineering architecture review — lock in execution plan with data flow diagrams, error paths, test matrix, performance budget, and dependency analysis.
Ads composite — audit campaigns, create ad copy, optimize bids, and report performance with persistent campaign memory. Replaces 20 fragmented ads skills.
Content composite — strategy, writing, audit, and refresh with brand voice memory. Replaces 10 fragmented content skills.
Debug composite — reproduce, hypothesize, test, fix with bug pattern memory. Replaces 4 debug skills.
Frontend composite — UI audit, design system upgrade, and UX analysis with design token memory. Replaces 10 frontend skills.
GitHub composite — PR management, issue triage, release automation, and workflow management with project memory. Replaces 8 GitHub skills.
Planning composite — CEO vision review, engineering architecture review, design review, and brainstorming with decision memory. Replaces 8 fragmented planning skills.
Research composite — quick lookup, deep investigation, and exhaustive multi-source research with citation memory. Replaces 5 research skills.
Code review composite — PR review, architecture review, and diff analysis with persistent review patterns. Replaces 7 fragmented review skills.
Security composite — OWASP audit, dependency scan, secret detection, and hardening with persistent vulnerability memory. Replaces 9 fragmented security skills.
Ship composite — PR creation, deployment, canary monitoring, and rollback with release memory. Replaces 4 ship skills.
Niche-agnostic agentic evaluator using CLEAR v2.0 framework — 6-domain assessment, 8 analysis dimensions, 6-tier source prioritization, evidence strength ratings, and decision trees. Evaluates any plan, codebase, or research output.
Autonomous recursive improvement loop for a single target. Runs gap analysis, recursive refinement, evaluation, and convergence checks until the target reaches quality threshold or converges.
Idea-to-running-code lifecycle orchestration. 10-phase pipeline with 5 hard decision gates, wave-based parallelism, and STATE.json resumability. Composes /deep-research, /auto-swarm-nth, /production-upgrade, /security-audit, and /ship into a single end-to-end flow.
Self-improving agent optimization — generates challenger variants of any agent/command, benchmarks against baseline, promotes winners, logs learnings to instincts. Inspired by Karpathy's autoresearch pattern.
Idea exploration before building — understand the problem, propose approaches, present design, get approval. HARD-GATE: no implementation until design is approved.
Headless browser for QA testing, site inspection, and interaction verification. Navigate, screenshot, click, fill forms, capture snapshots.
ProductionOS smart router — single entry point that routes to the right pipeline based on intent. The ONLY command new users need to know.
Full UI/UX redesign pipeline — audits design, creates design systems, generates interactive HTML mockups, launches local browser for user interaction. Fuses /production-upgrade rigor with design agency methodology.
Post-ship documentation update — reads all project docs, cross-references the diff, updates README/ARCHITECTURE/CONTRIBUTING/CLAUDE.md to match what shipped.
Show how to use ProductionOS — explains commands, recommended workflows, best flows to run, and usage guidelines.
Business idea → production-ready plan pipeline. User provides an idea or business plan, agent researches market, competitors, existing solutions, challenges assumptions, identifies flaws, and builds a comprehensive execution plan with auto-document population.
ProductionOS flagship — 13-step orchestrative pipeline with tri-tiered evaluation, recursive convergence, CEO/Eng/Design review chain, CLEAR framework evaluation, multi-model judge tribunal, and autonomous PIVOT/REFINE/PROCEED decisions. Targets 100% production-ready output.
Nth-iteration omni-plan — recursive orchestration that chains ALL ProductionOS skills and agents, evaluates strictly per iteration, and loops until 10/10 is achieved. Each iteration can invoke any command or skill in the system.
Save current pipeline state for later resumption. Creates a checkpoint at .productionos/CHECKPOINT.json with all active context.
ProductionOS — dual-target AI engineering operating system for repo-wide audits, upgrade plans, code reviews, strategic product reviews, security sweeps, UX audits, and recursive quality improvement.
Save current pipeline state for later resumption. Creates a checkpoint at .productionos/CHECKPOINT.json with all active context.
Display ProductionOS system statistics — agent count, command count, hook count, test count, version, instinct count, and session history.
Run the full product upgrade pipeline — 55-agent iterative review with CEO/Engineering/UX/QA parallel loops
Systematic QA testing with health scoring — tests web app, finds bugs, fixes them iteratively. Regression mode for re-testing known issues.
Report-only QA testing — produces structured report with health score, screenshots, and repro steps. No fixes applied.
Review and refine flagged outputs, using critique and focused iteration to improve weak results.
Resume a paused pipeline from .productionos/CHECKPOINT.json. Restores context and routes to the correct step.
7-domain security hardening audit — OWASP Top 10 2025, MITRE ATT&CK mapping, NIST CSF 2.0 alignment, secret detection, supply chain audit, container security, DevSecOps pipeline. Grounded in 734 cybersecurity skills.
Run self-evaluation on recent work — questions quality, necessity, correctness, dependencies, completeness, learning, and honesty. Enabled by default in all flows. Standalone invocation for on-demand evaluation.
End-of-session self-training — captures session metrics, extracts patterns via metaclaw-learner, updates instincts, and generates optimization hypotheses for the next run.
Display ProductionOS system statistics — agent count, command count, hook count, test count, version, instinct count, and session history.
Test-driven development workflow that writes failing tests first, implements minimally, and refactors safely.
Update ProductionOS plugin to the latest version from GitHub
Implementation planning workflow that turns approved ideas into dependency-aware execution plans.
Run the full product upgrade pipeline — 55-agent iterative review with CEO/Engineering/UX/QA parallel loops
Systematic QA testing with health scoring — tests web app, finds bugs, fixes them iteratively. Regression mode for re-testing known issues.
Review and refine flagged outputs, using critique and focused iteration to improve weak results.
Pre-landing code review — analyzes diff for SQL safety, LLM trust boundaries, conditional side effects, missing tests, dependency risks, and security issues.
Enforces code review quality before commits and pushes across ALL projects. 6-gate sequence: diff size, PII/secrets, conventions, cross-project boundaries, completeness, self-review reminder. Only PII gate blocks; rest are advisory. Triggers on: "review before push", "pre-commit review", "quality gate", "/review-gate".
Run self-evaluation on recent work — questions quality, necessity, correctness, dependencies, completeness, learning, and honesty. Enabled by default in all flows. Standalone invocation for on-demand evaluation.
SEO composite — technical audit, content creation, keyword research, and rank monitoring with persistent domain memory. Replaces 9 fragmented SEO skills.
End-of-session self-training — captures session metrics, extracts patterns via metaclaw-learner, updates instincts, and generates optimization hypotheses for the next run.
Scaffold and wire a persistent SecondBrain (Obsidian vault + LLM wiki) for cross-session knowledge management. Creates PARA structure, wiki domains/entities/concepts, cross-project references, and RAG integration. Runs once per user, then the wiki compounds over time. Triggers on: "setup secondbrain", "create knowledge base", "setup wiki", "persistent memory", "second brain", "/setup-secondbrain".
Ship workflow — detect base branch, merge, run tests, review diff, bump VERSION, update CHANGELOG, commit, push, create PR.
Composite: self-eval -> review -> ship. Use when user says 'ship', 'deploy', 'push', 'merge', or 'create PR'. Ensures quality before shipping.
Test-driven development workflow that writes failing tests first, implements minimally, and refactors safely.
UX improvement pipeline — creates user stories from UI guidelines, maps user journeys, identifies friction, dispatches fix agents. The user-experience equivalent of /production-upgrade.
Local RAG and Graph RAG over the SecondBrain wiki vault. Progressive context loading (hot cache -> index -> domain -> entity). Graph traversal via wikilink resolution. Use when agents need cross-project context, when answering questions that span multiple domains, or when building context for planning tasks. Triggers on: "wiki context", "cross-project context", "what do we know about", "check the wiki", "graph context", "/wiki-rag".
Show how to use ProductionOS — explains commands, recommended workflows, best flows to run, and usage guidelines.
Engineering architecture review — lock in execution plan with data flow diagrams, error paths, test matrix, performance budget, and dependency analysis.
CEO/founder-mode plan review — rethink the problem, find the 10-star product, challenge premises. Four modes: SCOPE EXPANSION, SELECTIVE EXPANSION, HOLD SCOPE, SCOPE REDUCTION.
ProductionOS Mission Control — launch Claude DevTools, show session dashboard with eval convergence, agent dispatches, cost tracking, and hot file intelligence.
Systematic debugging with hypothesis tracking — reproduce, hypothesize, test, narrow, fix. Never guess-and-check.
Composite: deep research -> CEO review -> eng review. Use when user says 'research', 'plan', 'design', 'architect', or 'spec out'.
Report-only QA testing — produces structured report with health score, screenshots, and repro steps. No fixes applied.
Systematic debugging with hypothesis tracking — reproduce, hypothesize, test, narrow, fix. Never guess-and-check.
ProductionOS Mission Control — launch Claude DevTools, show session dashboard with eval convergence, agent dispatches, cost tracking, and hot file intelligence.
Idea-to-running-code lifecycle orchestration. 10-phase pipeline with 5 hard decision gates, wave-based parallelism, and STATE.json resumability. Composes /deep-research, /auto-swarm-nth, /production-upgrade, /security-audit, and /ship into a single end-to-end flow.
Retrospective workflow that summarizes what shipped, what broke, and what should improve next.
Resume a paused pipeline from .productionos/CHECKPOINT.json. Restores context and routes to the correct step.
ProductionOS — dual-target AI engineering operating system for repo-wide audits, upgrade plans, code reviews, strategic product reviews, security sweeps, UX audits, and recursive quality improvement.
Nth-iteration agent swarm — spawns parallel agent waves, evaluates strictly per wave, re-swarms gaps until 100% coverage and 10/10 quality. Can invoke any ProductionOS skill or command within waves.
Distributed agent swarm orchestrator — spawns parallel subagent clusters for any task with configurable depth, swarm size, and convergence criteria
8-phase autonomous research pipeline with multi-source discovery, 4-layer citation verification, hypothesis generation, and PIVOT/REFINE/PROCEED decision loops. Confidence-gated — loops until 95%+ confidence.
Full-stack frontend upgrade pipeline — fuses /production-upgrade iterative audit with /plan-ceo-review vision and /plan-eng-review rigor. Deploys parallel auto-swarm agents for iterative audit and execution. Enriched with /deep-research for competitive parity.
Nuclear-scale autonomous research — deploys 500-1000 agents in ONE massive simultaneous wave for exhaustive topic saturation. Deep-research methodology × auto-swarm scale = maximum parallel intelligence. WARNING: Extreme resource consumption.
Pre-landing code review — analyzes diff for SQL safety, LLM trust boundaries, conditional side effects, missing tests, dependency risks, and security issues.
Niche-agnostic agentic evaluator using CLEAR v2.0 framework — 6-domain assessment, 8 analysis dimensions, 6-tier source prioritization, evidence strength ratings, and decision trees. Evaluates any plan, codebase, or research output.
Ship workflow — detect base branch, merge, run tests, review diff, bump VERSION, update CHANGELOG, commit, push, create PR.
Distributed agent swarm orchestrator — spawns parallel subagent clusters for any task with configurable depth, swarm size, and convergence criteria
QA composite — browser testing, health scoring, and evaluation with test result memory. Replaces 5 QA skills.
n8n composite — workflow creation, import, debugging, and monitoring with workflow memory. Replaces 33 n8n skills.
7-domain security hardening audit — OWASP Top 10 2025, MITRE ATT&CK mapping, NIST CSF 2.0 alignment, secret detection, supply chain audit, container security, DevSecOps pipeline. Grounded in 734 cybersecurity skills.
Context engineering agent — researches context window optimization from arxiv, builds token-efficient context packages for downstream agents, manages cross-session persistence via MetaClaw.
Interactive code tutor — breaks down codebase logic, explains complexities, translates technical concepts for the user. Ideal after /btw commands. Teaches the WHY behind the code, not just the WHAT.
Update ProductionOS plugin to the latest version from GitHub
Retrospective workflow that summarizes what shipped, what broke, and what should improve next.
Self-improving agent optimization — generates challenger variants of any agent/command, benchmarks against baseline, promotes winners, logs learnings to instincts. Inspired by Karpathy's autoresearch pattern.
UX improvement pipeline — creates user stories from UI guidelines, maps user journeys, identifies friction, dispatches fix agents. The user-experience equivalent of /production-upgrade.
Autonomous recursive improvement loop for a single target. Runs gap analysis, recursive refinement, evaluation, and convergence checks until the target reaches quality threshold or converges.
Context engineering agent — researches context window optimization from arxiv, builds token-efficient context packages for downstream agents, manages cross-session persistence via MetaClaw.
Post-ship documentation update — reads all project docs, cross-references the diff, updates README/ARCHITECTURE/CONTRIBUTING/CLAUDE.md to match what shipped.
Nth-iteration omni-plan — recursive orchestration that chains ALL ProductionOS skills and agents, evaluates strictly per iteration, and loops until 10/10 is achieved. Each iteration can invoke any command or skill in the system.
Composite: audit -> upgrade -> research -> plan -> swarm fix -> eval -> ship. The complete ProductionOS pipeline. Use when user says 'do everything', 'full cycle', 'end to end', or 'make it production-ready'.
Composite: SEO -> content -> ads -> analytics audit for marketing and growth. Use when user mentions 'marketing', 'SEO', 'growth', 'ads', 'conversion', or 'traffic'.
Build composite — brainstorm, plan, implement, and test with project memory. Replaces 6 fragmented build skills.
Implementation planning workflow that turns approved ideas into dependency-aware execution plans.