ShaheerKhawaja — AI Agent Skills on SkillsAuth

audit-and-fix

Composite: security audit -> production upgrade -> self-evaluation. Use when user says 'audit', 'check the codebase', 'find and fix issues', or 'is this production-ready'.

development7

auto-swarm-nth

Nth-iteration agent swarm — spawns parallel agent waves, evaluates strictly per wave, re-swarms gaps until 100% coverage and 10/10 quality. Can invoke any ProductionOS skill or command within waves.

testing7

brainstorming

Idea exploration before building — understand the problem, propose approaches, present design, get approval. HARD-GATE: no implementation until design is approved.

development7

browse

Headless browser for QA testing, site inspection, and interaction verification. Navigate, screenshot, click, fill forms, capture snapshots.

tools7

build-productionos

ProductionOS smart router — single entry point that routes to the right pipeline based on intent. The ONLY command new users need to know.

development7

8-phase autonomous research pipeline with multi-source discovery, 4-layer citation verification, hypothesis generation, and PIVOT/REFINE/PROCEED decision loops. Confidence-gated — loops until 95%+ confidence.

testing7

designer-upgrade

Full UI/UX redesign pipeline — audits design, creates design systems, generates interactive HTML mockups, launches local browser for user interaction. Fuses /production-upgrade rigor with design agency methodology.

development7

frontend-upgrade

Full-stack frontend upgrade pipeline — fuses /production-upgrade iterative audit with /plan-ceo-review vision and /plan-eng-review rigor. Deploys parallel auto-swarm agents for iterative audit and execution. Enriched with /deep-research for competitive parity.

development7

interface-craft

Interface Craft by ProductionOS Design — a toolkit for building polished, animated interfaces in React. Includes Motion System (human-readable animation DSL with stage-driven sequencing), DialKit (live control panels for tuning animation values), and Design Evaluator (systematic UI review based on ProductionOS Design's methodology). Triggers on: animate, animation, transition, storyboard, entrance, motion, spring, easing, timing, finetune-control, sliders, controls, tune, tweak, critique, review, feedback, audit, improve, polish, refine, redesign.

tools7

learn-mode

Interactive code tutor — breaks down codebase logic, explains complexities, translates technical concepts for the user. Ideal after /btw commands. Teaches the WHY behind the code, not just the WHAT.

development7

logic-mode

Business idea -> production-ready plan pipeline. User provides an idea or business plan, agent researches market, competitors, existing solutions, challenges assumptions, identifies flaws, and builds a comprehensive execution plan with auto-document population.

development7

max-research

Nuclear-scale autonomous research — deploys 500-1000 agents in ONE massive simultaneous wave for exhaustive topic saturation. Deep-research methodology x auto-swarm scale = maximum parallel intelligence. WARNING: Extreme resource consumption.

devops7

omni-plan

ProductionOS flagship — 13-step orchestrative pipeline with tri-tiered evaluation, recursive convergence, CEO/Eng/Design review chain, CLEAR framework evaluation, multi-model judge tribunal, and autonomous PIVOT/REFINE/PROCEED decisions. Targets 100% production-ready output.

development7

plan-ceo-review

CEO/founder-mode plan review — rethink the problem, find the 10-star product, challenge premises. Four modes: SCOPE EXPANSION, SELECTIVE EXPANSION, HOLD SCOPE, SCOPE REDUCTION.

tools7

plan-eng-review

Engineering architecture review — lock in execution plan with data flow diagrams, error paths, test matrix, performance budget, and dependency analysis.

testing7

pos-ads

Ads composite — audit campaigns, create ad copy, optimize bids, and report performance with persistent campaign memory. Replaces 20 fragmented ads skills.

testing7

pos-content

Content composite — strategy, writing, audit, and refresh with brand voice memory. Replaces 10 fragmented content skills.

testing7

pos-debug

Debug composite — reproduce, hypothesize, test, fix with bug pattern memory. Replaces 4 debug skills.

development7

pos-frontend

Frontend composite — UI audit, design system upgrade, and UX analysis with design token memory. Replaces 10 frontend skills.

development7

pos-github

GitHub composite — PR management, issue triage, release automation, and workflow management with project memory. Replaces 8 GitHub skills.

tools7

pos-plan

Planning composite — CEO vision review, engineering architecture review, design review, and brainstorming with decision memory. Replaces 8 fragmented planning skills.

testing7

pos-research

Research composite — quick lookup, deep investigation, and exhaustive multi-source research with citation memory. Replaces 5 research skills.

testing7

pos-review

Code review composite — PR review, architecture review, and diff analysis with persistent review patterns. Replaces 7 fragmented review skills.

development7

pos-security

Security composite — OWASP audit, dependency scan, secret detection, and hardening with persistent vulnerability memory. Replaces 9 fragmented security skills.

testing7

pos-ship

Ship composite — PR creation, deployment, canary monitoring, and rollback with release memory. Replaces 4 ship skills.

devops7

productionos-agentic-eval

Niche-agnostic agentic evaluator using CLEAR v2.0 framework — 6-domain assessment, 8 analysis dimensions, 6-tier source prioritization, evidence strength ratings, and decision trees. Evaluates any plan, codebase, or research output.

development7

productionos-autoloop

Autonomous recursive improvement loop for a single target. Runs gap analysis, recursive refinement, evaluation, and convergence checks until the target reaches quality threshold or converges.

testing7

productionos-auto-mode

Idea-to-running-code lifecycle orchestration. 10-phase pipeline with 5 hard decision gates, wave-based parallelism, and STATE.json resumability. Composes /deep-research, /auto-swarm-nth, /production-upgrade, /security-audit, and /ship into a single end-to-end flow.

development7

productionos-auto-optimize

Self-improving agent optimization — generates challenger variants of any agent/command, benchmarks against baseline, promotes winners, logs learnings to instincts. Inspired by Karpathy's autoresearch pattern.

data-ai7

productionos-brainstorming

Idea exploration before building — understand the problem, propose approaches, present design, get approval. HARD-GATE: no implementation until design is approved.

development7

productionos-browse

Headless browser for QA testing, site inspection, and interaction verification. Navigate, screenshot, click, fill forms, capture snapshots.

tools7

productionos-build-productionos

ProductionOS smart router — single entry point that routes to the right pipeline based on intent. The ONLY command new users need to know.

development7

productionos-designer-upgrade

Full UI/UX redesign pipeline — audits design, creates design systems, generates interactive HTML mockups, launches local browser for user interaction. Fuses /production-upgrade rigor with design agency methodology.

development7

productionos-document-release

Post-ship documentation update — reads all project docs, cross-references the diff, updates README/ARCHITECTURE/CONTRIBUTING/CLAUDE.md to match what shipped.

documentation7

productionos-help

Show how to use ProductionOS — explains commands, recommended workflows, best flows to run, and usage guidelines.

documentation7

productionos-logic-mode

Business idea → production-ready plan pipeline. User provides an idea or business plan, agent researches market, competitors, existing solutions, challenges assumptions, identifies flaws, and builds a comprehensive execution plan with auto-document population.

development7

productionos-omni-plan

ProductionOS flagship — 13-step orchestrative pipeline with tri-tiered evaluation, recursive convergence, CEO/Eng/Design review chain, CLEAR framework evaluation, multi-model judge tribunal, and autonomous PIVOT/REFINE/PROCEED decisions. Targets 100% production-ready output.

development7

productionos-omni-plan-nth

Nth-iteration omni-plan — recursive orchestration that chains ALL ProductionOS skills and agents, evaluates strictly per iteration, and loops until 10/10 is achieved. Each iteration can invoke any command or skill in the system.

data-ai7

productionos-pause

Save current pipeline state for later resumption. Creates a checkpoint at .productionos/CHECKPOINT.json with all active context.

testing7

productionos

ProductionOS — dual-target AI engineering operating system for repo-wide audits, upgrade plans, code reviews, strategic product reviews, security sweeps, UX audits, and recursive quality improvement.

development7

productionos-productionos-pause

Save current pipeline state for later resumption. Creates a checkpoint at .productionos/CHECKPOINT.json with all active context.

testing7

productionos-productionos-stats

Display ProductionOS system statistics — agent count, command count, hook count, test count, version, instinct count, and session history.

testing7

productionos-production-upgrade

Run the full product upgrade pipeline — 55-agent iterative review with CEO/Engineering/UX/QA parallel loops

testing7

productionos-qa

Systematic QA testing with health scoring — tests web app, finds bugs, fixes them iteratively. Regression mode for re-testing known issues.

development7

productionos-qa-only

Report-only QA testing — produces structured report with health score, screenshots, and repro steps. No fixes applied.

testing7

productionos-refine

Review and refine flagged outputs, using critique and focused iteration to improve weak results.

tools7

productionos-resume

Resume a paused pipeline from .productionos/CHECKPOINT.json. Restores context and routes to the correct step.

testing7

productionos-security-audit

7-domain security hardening audit — OWASP Top 10 2025, MITRE ATT&CK mapping, NIST CSF 2.0 alignment, secret detection, supply chain audit, container security, DevSecOps pipeline. Grounded in 734 cybersecurity skills.

development7

productionos-self-eval

Run self-evaluation on recent work — questions quality, necessity, correctness, dependencies, completeness, learning, and honesty. Enabled by default in all flows. Standalone invocation for on-demand evaluation.

testing7

productionos-session-validate

End-of-session self-training — captures session metrics, extracts patterns via metaclaw-learner, updates instincts, and generates optimization hypotheses for the next run.

testing7

productionos-stats

Display ProductionOS system statistics — agent count, command count, hook count, test count, version, instinct count, and session history.

testing7

productionos-tdd

Test-driven development workflow that writes failing tests first, implements minimally, and refactors safely.

development7

productionos-update

Update ProductionOS plugin to the latest version from GitHub

tools7

productionos-writing-plans

Implementation planning workflow that turns approved ideas into dependency-aware execution plans.

tools7

production-upgrade

Run the full product upgrade pipeline — 55-agent iterative review with CEO/Engineering/UX/QA parallel loops

testing7

qa

Systematic QA testing with health scoring — tests web app, finds bugs, fixes them iteratively. Regression mode for re-testing known issues.

development7

refine

Review and refine flagged outputs, using critique and focused iteration to improve weak results.

tools7

review

Pre-landing code review — analyzes diff for SQL safety, LLM trust boundaries, conditional side effects, missing tests, dependency risks, and security issues.

development7

review-gate

Enforces code review quality before commits and pushes across ALL projects. 6-gate sequence: diff size, PII/secrets, conventions, cross-project boundaries, completeness, self-review reminder. Only PII gate blocks; rest are advisory. Triggers on: "review before push", "pre-commit review", "quality gate", "/review-gate".

development7

self-eval

Run self-evaluation on recent work — questions quality, necessity, correctness, dependencies, completeness, learning, and honesty. Enabled by default in all flows. Standalone invocation for on-demand evaluation.

testing7

seo

SEO composite — technical audit, content creation, keyword research, and rank monitoring with persistent domain memory. Replaces 9 fragmented SEO skills.

testing7

session-validate

End-of-session self-training — captures session metrics, extracts patterns via metaclaw-learner, updates instincts, and generates optimization hypotheses for the next run.

testing7

setup-secondbrain

Scaffold and wire a persistent SecondBrain (Obsidian vault + LLM wiki) for cross-session knowledge management. Creates PARA structure, wiki domains/entities/concepts, cross-project references, and RAG integration. Runs once per user, then the wiki compounds over time. Triggers on: "setup secondbrain", "create knowledge base", "setup wiki", "persistent memory", "second brain", "/setup-secondbrain".

documentation7

ship

Ship workflow — detect base branch, merge, run tests, review diff, bump VERSION, update CHANGELOG, commit, push, create PR.

testing7

ship-safe

Composite: self-eval -> review -> ship. Use when user says 'ship', 'deploy', 'push', 'merge', or 'create PR'. Ensures quality before shipping.

testing7

tdd

Test-driven development workflow that writes failing tests first, implements minimally, and refactors safely.

development7

ux-genie

UX improvement pipeline — creates user stories from UI guidelines, maps user journeys, identifies friction, dispatches fix agents. The user-experience equivalent of /production-upgrade.

devops7

wiki-rag

Local RAG and Graph RAG over the SecondBrain wiki vault. Progressive context loading (hot cache -> index -> domain -> entity). Graph traversal via wikilink resolution. Use when agents need cross-project context, when answering questions that span multiple domains, or when building context for planning tasks. Triggers on: "wiki context", "cross-project context", "what do we know about", "check the wiki", "graph context", "/wiki-rag".

development7

productionos-productionos-help

Show how to use ProductionOS — explains commands, recommended workflows, best flows to run, and usage guidelines.

documentation7

productionos-plan-eng-review

Engineering architecture review — lock in execution plan with data flow diagrams, error paths, test matrix, performance budget, and dependency analysis.

testing7

productionos-plan-ceo-review

CEO/founder-mode plan review — rethink the problem, find the 10-star product, challenge premises. Four modes: SCOPE EXPANSION, SELECTIVE EXPANSION, HOLD SCOPE, SCOPE REDUCTION.

tools7

productionos-devtools

ProductionOS Mission Control — launch Claude DevTools, show session dashboard with eval convergence, agent dispatches, cost tracking, and hot file intelligence.

tools7

productionos-debug

Systematic debugging with hypothesis tracking — reproduce, hypothesize, test, narrow, fix. Never guess-and-check.

development7

research-and-plan

Composite: deep research -> CEO review -> eng review. Use when user says 'research', 'plan', 'design', 'architect', or 'spec out'.

content-media7

qa-only

Report-only QA testing — produces structured report with health score, screenshots, and repro steps. No fixes applied.

testing7

debug

Systematic debugging with hypothesis tracking — reproduce, hypothesize, test, narrow, fix. Never guess-and-check.

development7

devtools

ProductionOS Mission Control — launch Claude DevTools, show session dashboard with eval convergence, agent dispatches, cost tracking, and hot file intelligence.

tools7

auto-mode

Idea-to-running-code lifecycle orchestration. 10-phase pipeline with 5 hard decision gates, wave-based parallelism, and STATE.json resumability. Composes /deep-research, /auto-swarm-nth, /production-upgrade, /security-audit, and /ship into a single end-to-end flow.

development7

productionos-retro

Retrospective workflow that summarizes what shipped, what broke, and what should improve next.

tools7

productionos-productionos-resume

Resume a paused pipeline from .productionos/CHECKPOINT.json. Restores context and routes to the correct step.

testing7

productionos

ProductionOS — dual-target AI engineering operating system for repo-wide audits, upgrade plans, code reviews, strategic product reviews, security sweeps, UX audits, and recursive quality improvement.

development7

productionos-auto-swarm-nth

Nth-iteration agent swarm — spawns parallel agent waves, evaluates strictly per wave, re-swarms gaps until 100% coverage and 10/10 quality. Can invoke any ProductionOS skill or command within waves.

testing7

productionos-auto-swarm

Distributed agent swarm orchestrator — spawns parallel subagent clusters for any task with configurable depth, swarm size, and convergence criteria

data-ai7

productionos-deep-research

8-phase autonomous research pipeline with multi-source discovery, 4-layer citation verification, hypothesis generation, and PIVOT/REFINE/PROCEED decision loops. Confidence-gated — loops until 95%+ confidence.

testing7

productionos-frontend-upgrade

Full-stack frontend upgrade pipeline — fuses /production-upgrade iterative audit with /plan-ceo-review vision and /plan-eng-review rigor. Deploys parallel auto-swarm agents for iterative audit and execution. Enriched with /deep-research for competitive parity.

development7