.cursor/skills/playbook-skill-creation/SKILL.md
Create domain-expertise playbook skills for the bilbo repo. Routes to the right architectural pattern (router, linear pipeline, routed playbook, configured playbook with overlays) based on domain complexity, then provides a procedural build guide for each layer. Use when the user wants to create a new business-analysis or strategy skill, or improve the structure of an existing one.
npx skillsauth add alexwox/genesis-template playbook-skill-creationInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Build or improve domain-expertise skills that produce structured, actionable deliverables — not code utilities or simple lookup tools. This skill addresses content architecture: how to organize knowledge inside a skill so it's reusable across contexts, calibrated with benchmarks, and impossible to use in a purely declarative way.
For file mechanics (frontmatter, directory layout, description writing), defer to the create-skill skill. This skill starts where that one stops.
Slash command: /playbook-skill
Apply when the user asks things like:
Before choosing a pattern, gather:
| Input | Why It Matters | Default If Not Provided | |---|---|---| | Domain | Determines what knowledge to encode | — (must ask) | | Number of distinct contexts | Routes to pattern complexity | Infer from domain description | | Published benchmarks exist? | Determines whether to add a benchmark layer | Assume no; research if unclear | | Compliance or legal guardrails needed? | Determines whether to add a compliance layer | No | | Deliverable shape | Single artifact, multi-part spec, or routing decision | Infer from domain | | Existing assets to anchor on | Local files, books, frameworks to reference | None |
Route to exactly one architectural pattern. Each pattern has its own build guide, layer count, and target line range.
IF the skill's job is to select the right resource or sub-skill (not produce a deliverable):
→ Pattern 1: Router
ELIF the domain has one way to do the work regardless of context:
→ Pattern 2: Linear Pipeline
ELIF the domain has 2–5 distinct contexts that change the workflow:
→ Pattern 3: Routed Playbook
ELIF the domain spans 5+ contexts, has benchmarks, needs compliance, and produces multi-part deliverables:
→ Pattern 4: Configured Playbook with Overlays
Always default to the simplest pattern that covers the domain, because complexity
that isn't justified creates maintenance burden and cognitive load without
improving output quality.
What it is: Frame request → classify → point to right source → format answer. The skill IS the routing logic.
Existing example: consult-hormozi (273 lines) — classifies business issues into demand/supply/scaling buckets, routes to the right Hormozi asset, formats a source-anchored answer.
Target length: 100–300 lines.
Layers to build:
Build procedure:
What it is: Collect inputs → Step 1 → Step 2 → ... → Step N → output template. One path, no branching by business model. Variation is in inputs, not process.
Existing examples: lead-magnet-creation (145 lines), stakeholder-discovery (181 lines), deep-web-research (369 lines).
Target length: 100–400 lines.
Layers to build:
Build procedure:
What it is: Collect inputs → classify into one of N types → follow type-specific workflow → output template. Has a router, but branches are workflow variants, not deep parallel architectures.
Existing examples: brand-guide-creation (411 lines), high-cagr-market-discovery (280 lines), offer-pillar-discovery (195 lines).
Target length: 200–500 lines.
Layers to build:
Build procedure:
What it is: Collect inputs → select one of N configurations → optionally select one overlay → per-unit template → benchmarks by configuration → compliance → experiments → output templates. Governed by design principles with procedures and worked examples.
Existing examples: constructing-a-funnel (1088 lines), constructing-a-landing-page (1287 lines).
Target length: 500–1300 lines.
Layers to build (in order):
Build procedure:
Follow the build procedure for your selected pattern. Use the iteration protocol (Section 5) to sequence the work.
This is the single highest-leverage quality technique, because declarative rules produce generic output that a domain expert would immediately recognize as LLM-generated. A declarative rule tells the agent what to believe. A procedural rule tells it what to do. Skills that stay declarative produce generic, unactionable output.
For every principle or rule in the skill:
Step 1. Write the principle as a one-sentence declaration.
Example: "Each funnel stage should shift a specific belief."
Step 2. Turn it into a fill-in-the-blank sentence or table the agent must complete before proceeding.
Example: "Before writing this stage's spec, fill in: Entry belief: 'Right now, the prospect believes ___.' Exit belief: 'After this stage, the prospect believes ___.'"
Step 3. Write one worked example showing the blanks filled for a real scenario.
Example:
| Stage | Entry belief | Exit belief | |---|---|---| | Ad | "I've tried diets and they don't work long-term." | "There's a medical approach I haven't tried." | | Landing page | "Medical weight loss sounds expensive." | "This is simple, all-inclusive, real doctors." |
Step 4. Write one near-miss counter-example — a case that passes on most criteria but fails on exactly one, in a way that looks right at first glance.
Near-miss examples are dramatically more effective than obvious failures. Research on contrastive learning (Springer 2025, 40-study systematic review) shows each negative example can improve accuracy 10x more than each positive example during critical learning phases — but only when the negative is close to the decision boundary, not a random failure. Patrick Winston's foundational concept-learning work established that "near-misses" (cases that are almost right but wrong on one dimension) are the primary mechanism for learning category boundaries.
The near-miss counter-example procedure:
Good near-miss example (from the funnel skill): "A funnel that asks for email on the landing page (personal data), then shows a quiz with easy yes/no questions (lower commitment), then asks for payment. Everything else is correct — the offer is strong, the copy is clear, the checkout is frictionless. But the quiz de-escalates commitment after the email gate, breaking consistency momentum. Fix: move the email ask to after the quiz, when the prospect has more invested."
This works because: the funnel looks professional and well-designed; the failure is a sequencing error, not an obvious mistake; the consequence is named (broken consistency momentum); and the fix is a single reorder, not a rebuild.
Bad counter-example (too obvious): "A funnel with no landing page, no trust elements, and a broken checkout form." This teaches nothing — nobody would build this. It doesn't sharpen the boundary between good and almost-good.
Step 5. If you cannot write the worked example, the principle is too vague — sharpen it or cut it.
Red flags that a rule needs conversion:
Research is the difference between a skill that produces generic "fluff" and one that produces real, differentiated insight. Every layer of a skill benefits from targeted web research via deep-web-research. Research is not just for benchmarks — it's for discovering the ideas, frameworks, tactics, procedures, and real-world examples that make the skill worth using.
| Build stage | What to research | What it produces for the skill | |---|---|---| | Skeleton (Round 1) | Existing taxonomies and classification systems in the domain | Configuration branches for the router; categories you might have missed | | Data (Round 2) | Industry benchmarks, conversion rates, cost ranges, performance standards | Benchmark tables with named sources and confidence tags | | Components (Round 3) | Edge-case architectures, format variants, practitioner workflows | Overlays, sub-configurations, and variant-specific tactics | | Philosophy (Round 4) | Proven principles with empirical backing, real case studies, counter-examples | Procedural rules with worked examples grounded in evidence, not opinion | | Coherence (Round 5) | Updated numbers if any benchmarks are stale; competing frameworks that contradict your structure | Contradiction resolution; stronger confidence tags |
deep-web-research for any claim that needs to be decision-grade: benchmarks, conversion rates, compliance requirements, framework comparisons, and real case studies.create-skill conventions)Building a skill in one pass produces structure without depth. The funnel skill reached its current quality through six rounds. Codify the sequence:
Build the router + per-unit template + output template. Get the structure right. Don't add benchmarks or principles yet.
Research in this round: Search for existing taxonomies and classification systems in the domain. Are there established ways to categorize the contexts your router needs to handle? Use these to validate or expand your branches.
Exit test: Can you trace a sample input through the skill and see which configuration it hits, what template gets filled, and what the output shape looks like?
Research and add benchmarks, sources, and confidence tags. Fill in the per-unit template with real numbers.
Research in this round: This is the heaviest research round. Use deep-web-research to find industry benchmarks, performance standards, and cost ranges. Target 10–30 sources. Tag every number.
Exit test: Does every benchmark have a named source and year? Is every claim tagged Observed, Inferred, or Uncertain?
Add overlays, variants, edge cases, and compliance layers. Cover the configurations you missed in round 1.
Research in this round: Search for edge-case architectures and format variants that practitioners use but that your skill doesn't yet cover. The funnel skill's overlay system (VSL, webinar, quiz, tripwire, etc.) came entirely from researching what practitioner taxonomies named as distinct funnel types.
Exit test: Take the three most different possible inputs. Does the skill handle all three without falling through to a generic path?
Write design principles — but only after the structure exists. You need to see the skill's shape before you can write guardrails for it. Convert every principle from declarative to procedural (Section 4).
Research in this round: Find the empirical backing for each principle. Search for real case studies that demonstrate the principle in action (worked examples) and real failures that demonstrate what happens when it's violated (counter-examples). The funnel skill's Principle 7 (one bottleneck at a time) is powerful because it cites a real $270K→$540K case, not because the principle itself is novel.
Exit test: Does every principle have a fill-in-the-blank procedure AND a worked example grounded in a real (researched) case?
Read the skill top to bottom. Check:
Ask: "If I used this skill for [extreme case], would it produce a good result?" Try:
Skills degrade silently. 91% of ML models experience degradation over time, yet only 5% have mature monitoring (Agent Patterns 2025). Agent skills degrade through four mechanisms: API/URL changes, model updates, prompt drift, and world changes (new regulations, new tools, new benchmarks). A skill's output quality can drop significantly with no code changes and no visible errors.
Don't schedule calendar-based reviews uniformly — skills decay at different rates. Instead, use staleness triggers: specific signals that a skill needs review.
Review a skill when any of these are true:
git log --since="6 months ago" -- .cursor/skills/[skill-name]/ returns nothing and the domain is fast-movingDesigned to be completed in one session:
deep-web-research on the 3 most time-sensitive claims in the skill (benchmarks, compliance rules, tool references)curl -sI [url] | head -1 for any that might have gone stalelast-reviewed frontmatter field (see below)Skills in this repo are git-tracked markdown in .cursor/skills/. Use that to your advantage:
last-reviewed: YYYY-MM-DD to the YAML frontmatter of any skill built with this meta-skill. This gives both the agent and the user a quick way to see when the content was last verified. Update it only after completing the review procedure above — not on cosmetic edits.git log -1 --format="%ci" -- .cursor/skills/[skill-name]/SKILL.md shows the last modification date. If it's older than 6 months in a fast-moving domain, that's a trigger.Do not ship a skill unless all applicable gates pass:
create-skill requirements)Avoid:
create-skilldeep-web-research to source themconsult-hormozi for framework alignmentconstructing-a-landing-page or constructing-a-funnel as structural models| Skill | Pattern | Lines | Key structural feature |
|---|---|---|---|
| consult-hormozi | Router | 273 | Classification taxonomy + asset routing |
| lead-magnet-creation | Linear Pipeline | 145 | 4/4 meta gate + awareness-stage matching |
| stakeholder-discovery | Linear Pipeline | 181 | Interview playbook + outreach templates |
| deep-web-research | Linear Pipeline | 369 | Hypothesis-driven search + evidence scoring |
| offer-pillar-discovery | Routed Playbook | 195 | Phased discovery + quality filter |
| high-cagr-market-discovery | Routed Playbook | 280 | 4-question filter + macro/micro frameworks |
| brand-guide-creation | Routed Playbook | 411 | Job classification + external template file |
| constructing-a-funnel | Configured + Overlays | 1088 | 6 configs + 7 overlays + design principles |
| constructing-a-landing-page | Configured + Overlays | 1287 | Config router + per-section spec + benchmarks |
| constructing-onboarding | Configured + Overlays | 1462 | 7 configs + 7 touch-model overlays + screen build specs + flow state machine + email sequences |
tools
Translate role-based organizations into workflow-based organizations by decomposing roles into scored tasks, extracting dark playbooks (proprietary tacit knowledge), formalizing workflows, calculating automation ROI, and producing a sequenced automation roadmap. Use when a company wants to identify what work can be automated, extract undocumented expert knowledge, or build an automation strategy.
development
Review UI code for Web Interface Guidelines compliance. Use when asked to "review my UI", "check accessibility", "audit design", "review UX", or "check my site against best practices".
development
Builds stakeholder-friendly project status updates from markdown sources. Use when asked for progress reports, implementation status, future plans, UI/UX flow summaries, infrastructure/data-flow summaries, risks, code smells, or scout-principle improvement notes.
development
Repeatable playbook for finding and interviewing key stakeholders to validate an offer pillar hypothesis. Produces a pain proximity map, target list, outreach plan, interview protocol, and structured synthesis of findings. Use when a hypothesis needs human validation before building.