skills/skill-architect/SKILL.md
Design, create, evaluate, and improve Claude Skills with structure matched to the task. Use when creating a new skill, auditing or refactoring an existing skill, deciding how a skill should be structured, choosing between Simple/Complex/Lightweight archetypes, adding evals or validation patterns, optimizing a skill description, or packaging a skill for reuse. Covers discovery interviews, archetype selection, structural patterns, evaluation loops, benchmarking, and trigger-tuning.
npx skillsauth add mgajewskik/opencode-config skill-architectInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Design skills with structure matched to task requirements, then iterate until they are actually good.
Keep the moat behind progressive disclosure. Do not load every reference file up front. Detect the user's stage, load only what that stage needs, and keep the core SKILL.md focused on routing, architecture, and high-leverage decisions.
Good Skill = Expert Knowledge - What Claude Already Knows
Only add what Claude does not already have: decision trees, trade-offs, edge cases, anti-patterns, trigger boundaries, and domain-specific workflows. Delete tutorials, basic explanations, and routine tooling instructions.
Move the user through the full skill lifecycle:
Before doing anything else, classify the request into one or more stages:
| Stage | Typical user intent | What you should optimize for | |------|----------------------|------------------------------| | Discover | "Help me create a skill", "What should this skill be?" | Scope, trigger boundaries, archetype | | Architect | "How should I structure it?" | Archetype, patterns, references, scripts | | Draft | "Write the skill" | Compact core instructions + on-demand moat | | Evaluate | "Test this skill", "Is it better?" | Realistic evals, baselines, assertions | | Iterate | "Improve based on feedback" | Generalization, de-overfitting, reusable tools | | Optimize Triggering | "Improve the description" | Trigger precision/recall | | Package | "Ship this skill" | Validation, portability, distributable output |
If the request spans multiple stages, work in that order unless the user explicitly wants a later stage only.
Use these loading triggers aggressively. Read the listed files before acting in that area.
MANDATORY: Read these files when creating a new skill, restructuring an existing one, or deciding what shape a skill should take:
references/discovery-questions.mdreferences/archetypes.mdreferences/patterns.mdreferences/frontmatter-spec.mdThese files contain the core moat for choosing the right architecture, not just writing instructions.
MANDATORY: Read these files when the user wants to test, compare, benchmark, or improve a skill based on evidence:
references/testing-methodology.mdreferences/skill-lifecycle.mdreferences/schemas.mdLoad agents/grader.md, agents/comparator.md, and agents/analyzer.md only when you are running graded evals or blind comparisons.
MANDATORY: Read this file when the user wants to improve triggering accuracy, reduce undertriggering, or compare description variants:
references/description-optimization.mdUse the scripts in scripts/ for repeatable trigger-evaluation work.
Read this file when tool availability matters or the user is on Claude.ai, OpenCode, Cowork, or a headless environment:
references/environment-modes.mdRead this file only when the skill orchestrates MCP tools or multi-service workflows:
references/mcp-integration.mdreferences/mcp-integration.md for non-MCP skills.skill-name/
├── SKILL.md # Required: routing + core instructions
├── references/ # Optional: loaded on-demand
├── scripts/ # Optional: deterministic operations
├── agents/ # Optional: specialist grading/comparison guidance
└── assets/ # Optional: templates or review artifacts
Three-layer loading:
Preferred organization:
SKILL.md = routing + architecture + stage detectionreferences/ = deep guidance, schemas, workflows, examplesscripts/ = deterministic utilities and reusable automationagents/ = specialist grading/comparison/analyzer instructionsassets/ = templates or review artifacts only when truly usefulSkills follow the open Agent Skills standard (https://agentskills.io/specification).
Required:
---
name: skill-name
description: What + when + trigger boundaries
---
Optional:
license: Apache-2.0
compatibility: Requires git, docker
metadata:
author: example-org
version: "1.0"
allowed-tools: Bash(git:*) Read
NOT in spec (avoid for portability):
version at root level -> use metadata.versiontools -> implementation-specificcategory, color, displayName -> UI-specificRead references/frontmatter-spec.md before authoring or refactoring frontmatter.
Use discovery questions to determine the archetype.
| Archetype | When to Use | Key Indicators | |-----------|-------------|----------------| | Simple | Transform existing content | Linear workflow, existing input, subjective quality | | Complex | Create from scratch via discovery | Multi-phase, no content, high variation | | Lightweight | Binary execution tasks | Linear, pass/fail quality, low variation |
Decision tree:
Multi-phase workflow? -> Complex
Binary quality + low variation? -> Lightweight
Existing content + transformation? -> Simple
Default -> Complex
Read references/archetypes.md for the full logic and justification templates.
| Trigger | Add Element | |---------|-------------| | Subjective quality | 8-criteria rubric with scoring | | High variation | Two-pass diagnostic/reconstruction | | Binary quality | Pass/fail checklist only | | Transformation task | Fact preservation rules | | Multi-phase | Question bank + approval gates | | Repeat use | Memory/configuration pattern | | High stakes | Enhanced failure modes + disclaimers | | 3+ components | Decomposition-first step |
Read references/patterns.md for implementation patterns.
| Extension | When to Use |
|-----------|-------------|
| references/testing-methodology.md | Validate triggers, function, consistency, and iteration signals |
| references/skill-lifecycle.md | Run the full draft/test/review/improve loop |
| references/description-optimization.md | Tune the description using trigger evals |
| references/environment-modes.md | Adapt the workflow to different tool/runtime environments |
| references/mcp-integration.md | Design skills that orchestrate MCP tools |
| references/schemas.md | Use shared artifact formats for evals and benchmarks |
Match specificity to task fragility:
| Task Type | Freedom Level | Format | |-----------|---------------|--------| | Creative/design | High | Text-based principles | | Code review, analysis | Medium | Pseudocode, parameterized | | File operations, fragile | Low | Exact scripts, few parameters |
Rule: High consequence of mistakes -> Low freedom.
Split into two skills when:
Keep as one skill when:
Pattern: "interview about X, then generate Y" often becomes Skill 1 = gatherer (Complex) and Skill 2 = generator (Simple).
Before asking new questions, inspect the current conversation for:
Only ask for information that is still missing.
Ask:
Use references/discovery-questions.md to determine:
For each example, identify:
If multiple eval runs or revisions would recreate the same helper repeatedly, promote it into scripts/.
uv run scripts/init_skill.py <skill-name> --path <output-directory>
Start with the smallest structure that still matches the archetype.
Then write SKILL.md so it:
If the skill will be reused, do not stop at a pretty draft.
Read references/testing-methodology.md, references/skill-lifecycle.md, and references/description-optimization.md when working in this phase.
uv run scripts/package_skill.py <path/to/skill-folder>
Useful utilities in scripts/:
scripts/init_skill.py, scripts/quick_validate.py, scripts/package_skill.py, scripts/generate_report.py, scripts/aggregate_benchmark.pyscripts/run_eval.py, scripts/improve_description.py, scripts/run_loop.pySKILL.md when a reference file would doWhen discovery answers contradict, resolve them before recommending structure:
| Conflict | Ask | |----------|-----| | Binary quality + high variation | "Does quality vary subjectively, or is it truly pass/fail?" | | Existing content + multi-phase | "Is this transformation or creation from scratch?" | | Ready-to-publish + high variation | "Is output consistently final, or does it need iteration?" |
The skill is ready when all of these are true:
If progress flattens out, say so explicitly and recommend the smallest next experiment.
documentation
Create senior-level deep research dossiers and roadmap companions. Use when the user asks for a dossier, senior research, deep research, in-depth research, mental models for a topic, senior perspective on a topic, how something actually works, ramp up on a topic, architectural deep dive, tradeoffs, failure modes, or what a senior would notice. Produces current-directory research-* and roadmap-* markdown artifacts, not a tutorial or short summary.
development
Senior-level Knative and OpenShift Serverless guidance for Serving, Eventing, Functions, autoscaling, scale-to-zero, CloudEvents, RabbitMQ/Kafka sources, Lambda migration, Harbor/OCI images, debugging, operations, and production rollout. Use when working with Knative Service, Revision, Route, KPA, activator, queue-proxy, Broker, Trigger, Source, Sink, kn func, OpenShift Serverless, Kourier, eventing-rabbitmq, Knative Kafka, or serverless workloads on Kubernetes/OpenShift.
development
Senior-level RHEL-family Linux operations. Use when running, debugging, hardening, patching, installing, upgrading, or operating Red Hat Enterprise Linux, Rocky Linux, AlmaLinux, CentOS Stream, Fedora-as-upstream, or related enterprise Linux hosts: systemd, RPM/DNF, SELinux, NetworkManager, firewalld, storage, kernel/kdump, FIPS/STIG, Satellite, IdM, Podman, bootc, air-gapped fleets.
development
Senior-level Proxmox VE guidance for VM creation, templates, storage, ZFS, Ceph, networking, clusters, HA, PBS backups, debugging, upgrades, security, and production/homelab operations. Use when working with Proxmox, PVE, Proxmox VE, qm, pct, pvesm, pvecm, pmxcfs, HA manager, Proxmox Backup Server, VM migration, Proxmox incidents, or Ceph/ZFS/Corosync/VLAN bridges in a Proxmox VE context.