Agent Plugin Development

Build Claude Code plugins the modern way: skills as the primary orchestration layer, agents only for high-agency specialists, reference material aggressively externalized, and commands reduced to structural compatibility shims. This is the post-restructure model proven in devex-kit and the authstack consolidation.

This skill covers the full lifecycle of creating or refactoring effective plugins:

| Phase | What you do | Key question |
|-------|-------------|--------------|
| Model | Classify work as skill orchestration, agent specialist, or pure reference data | Does this belong in a lean SKILL.md, a spawnable agent, or a references/ file? |
| Craft | Write lean SKILL.md using devex-kit patterns + extract to references/ | Does the main file stay focused and delegate via load calls? |
| Specialize | Design and implement focused agents with rich example blocks | Is this truly narrow, high-agency, and self-contained enough to spawn independently? |
| Structure | Lay out the plugin with skills/ primary and intentionally empty commands/agents/ where appropriate | Does the tree signal the 5 principles at a glance? |
| Package | Add README, plugin.json, tile.json (if devex), examples, validation | Can another developer (or agent) land here and immediately understand the architecture? |

State the phase you are in, or describe the plugin or skill you are building — the skill routes accordingly.

The 5 Restructure Principles

These five points define current best practice. Every decision in this skill and every plugin you build with it must honor them.

For the full expanded explanation with authstack before/after, migration tactics, and anti-patterns, load references/restructure-best-practices.md.

Commands are de-emphasized (they exist structurally but stay empty or unused in restructured plugins).
Skills are the primary orchestration layer (lean SKILL.md + heavy refs in references/ or siblings).
Agents remain for high-agency specialists (spawnable focused workers, reasonably self-contained).
Reference material is aggressively externalized (references/, docs/, rules/, AUDIT-CHECKLIST.md, etc.).
scalekit-code-doctor is the model (short SKILL.md + orchestration, massive data in references/).

Internalize these before you write a single line of a new SKILL.md or agent.

The official Claude Code docs also provide a clear decision table for plugins vs standalone .claude/ configuration. Load the details (including when to start in .claude/ and when to convert) from references/plugin-structure.md.

Plugin Structure (Modern Skills-First)

Follow the layout used by restructured authstack plugins and the clean docs-engineering example.

my-plugin/
├── .claude-plugin/
│   └── plugin.json
├── skills/                 # The only place new logic belongs
│   └── core-skill/
│       ├── SKILL.md        # Lean, imperative, load references
│       ├── references/     # Or sibling .md files (AUDIT-CHECKLIST.md, *-reference.md)
│       └── examples/
├── agents/                 # 0–6 or intentionally empty
├── commands/               # Present but empty or .deleted shims only
├── docs/                   # Canonical durable content (link from skills with ../../)
├── references/             # Plugin-wide shared refs (optional)
├── rules/                  # Cross-cutting (optional)
├── README.md
└── tile.json               # When shipping via devex-kit

For the complete tree, plugin.json example, legacy-vs-restructured comparison table, rules about empty dirs, official plugin vs standalone guidance, local development with --plugin-dir, claude plugin init, complex components (LSP/monitors/settings), and migration steps, load references/plugin-structure.md.

Empty commands/ and agents/ directories are correct and intentional. They satisfy Claude Code layout expectations while signaling that logic lives in skills/.

Skill Development

Skills are the entry point users invoke (/agent-plugin-development, /sdk-craft, etc.). They must be lean and imperative, and they must delegate heavy content to references.

Follow these devex-kit conventions exactly:

Extended frontmatter with license: MIT and metadata block.
Description written in third person listing concrete trigger phrases.
Opening phase/mode table.
Imperative voice throughout ("State the phase. Load the reference. Produce the table.").
"> For expanded ... load references/xxx.md" blockquotes for every heavy section.
Phase gates before transitions.
Quality checklist with actionable checkboxes.
"Did this help?" feedback section with the GitHub issues link.
"When to switch skills" guidance.

For the exact frontmatter template, description best practices, full body organization, subagent handoff rules, common mistakes, and a ready-to-copy skeleton, load references/skill-development.md.

The target length for the SKILL.md body itself is 1,500–2,000 words. Everything beyond that lives in references/.
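The length target can be checked mechanically. The following is a minimal sketch, assuming SKILL.md opens with a `---`-delimited frontmatter block and that whitespace-separated tokens are an acceptable proxy for words. The function names and the short/ok/long labels are illustrative and are not part of devex-kit.

```python
import re

# Assumption: frontmatter is a leading block fenced by lines containing only "---".
FRONTMATTER = re.compile(r"\A---\r?\n.*?\r?\n---[ \t]*(?:\r?\n|\Z)", re.DOTALL)


def body_word_count(skill_md: str) -> int:
    """Count whitespace-separated words in the body, excluding frontmatter."""
    body = FRONTMATTER.sub("", skill_md, count=1)
    return len(body.split())


def length_verdict(skill_md: str, low: int = 1500, high: int = 2000) -> str:
    """Compare the body length against the 1,500-2,000 word target."""
    count = body_word_count(skill_md)
    if count < low:
        return "short"
    if count > high:
        return "long"
    return "ok"
```

A "long" verdict is the signal to move a table or checklist into references/; a "short" verdict is not a problem in itself.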

Agent Development

Agents are for narrow, high-agency, spawnable specialists. They are not the default.

Use the pr-review-toolkit pattern as the canonical illustration: a skill (or thin shim) classifies the request, then spawns only the relevant subset of agents (code-reviewer, silent-failure-hunter, etc.) via the Task tool. The old approach of one giant command is deprecated.
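The classify-then-spawn idea can be illustrated outside Claude Code. This is only a sketch: in a real plugin the skill's prose instructs the model to classify the request, and the keyword lists below are hypothetical. Only the two agent names come from the text above.

```python
import re

# Hypothetical routing table: agent name -> keywords that suggest it is relevant.
ROUTES = {
    "code-reviewer": ("review", "quality", "style"),
    "silent-failure-hunter": ("error", "exception", "catch", "failure"),
}


def select_agents(request: str) -> list[str]:
    """Return only the agents whose keywords appear in the request."""
    words = set(re.findall(r"[a-z]+", request.lower()))
    return [agent for agent, keywords in ROUTES.items() if words & set(keywords)]
```

A request that matches nothing returns an empty list, which corresponds to the skill handling the work itself rather than spawning anything.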

Agent files live under agents/ (preferred) or inside a skill. Each must have:

Rich description containing 2–4 concrete <example> blocks (Context / user / assistant / commentary).
Appropriate model (sonnet or opus).
Minimal tools list.
Distinct color.

For the complete agent frontmatter shape, when-to-use criteria, orchestration-from-skill guidance, model/tool advice, color assignments, and checklist, load references/agent-development.md.
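The example-block requirement is easy to lint. A rough sketch, assuming each example is wrapped in literal `<example>` and `</example>` tags somewhere in the agent file; the function names are illustrative.

```python
import re

# Non-greedy match so adjacent blocks are counted separately.
EXAMPLE_BLOCK = re.compile(r"<example>.*?</example>", re.DOTALL)


def count_example_blocks(agent_md: str) -> int:
    """Count complete <example>...</example> blocks in an agent file."""
    return len(EXAMPLE_BLOCK.findall(agent_md))


def has_enough_examples(agent_md: str, minimum: int = 2, maximum: int = 4) -> bool:
    """Check the 2-4 example-block guideline for agent descriptions."""
    return minimum <= count_example_blocks(agent_md) <= maximum
```

The check counts tags only. It does not verify that each block carries the Context / user / assistant / commentary parts.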

Many excellent plugins have zero agents (see docs-engineering). Add agents only when the work genuinely benefits from isolation and focused context.

Common Workflows

Starting a brand new plugin from scratch

Create the directory skeleton with skills/, empty commands/, empty agents/, docs/, references/, rules/ as appropriate.
Write the first lean SKILL.md using the skeleton from references/skill-development.md. Frontmatter first.
Extract the first large decision table or checklist into a sibling or references/ file and replace with a load call.
Populate a minimal README that names the 5 principles and points to the skill.
Add tile.json if this skill will live in devex-kit.
Test immediately using official tooling: run claude --plugin-dir ./your-plugin (or point at a zip), then invoke the namespaced skill (/your-plugin:your-skill-name). Use /reload-plugins after edits. See the full local testing workflow and --plugin-url / multi-plugin options in references/examples-and-packaging.md.
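The skeleton step above can be scripted. A minimal sketch, assuming the directory names from the layout shown earlier and a manifest that carries only a `name` field. The real plugin.json example lives in references/plugin-structure.md, and `scaffold_plugin` is a hypothetical helper, not an official tool.

```python
import json
from pathlib import Path


def scaffold_plugin(root: Path, plugin_name: str, skill_name: str = "core-skill") -> Path:
    """Create the skills-first directory skeleton with placeholder files."""
    skill_dir = root / "skills" / skill_name
    for directory in (
        root / ".claude-plugin",
        skill_dir / "references",
        skill_dir / "examples",
        root / "agents",    # intentionally empty
        root / "commands",  # intentionally empty
        root / "docs",
        root / "references",
        root / "rules",
    ):
        directory.mkdir(parents=True, exist_ok=True)

    # Assumption: a name-only manifest is enough to start; extend it from the references.
    manifest = root / ".claude-plugin" / "plugin.json"
    if not manifest.exists():
        manifest.write_text(json.dumps({"name": plugin_name}, indent=2) + "\n", encoding="utf-8")

    # Empty placeholders; fill SKILL.md from the skeleton in references/skill-development.md.
    for placeholder in (skill_dir / "SKILL.md", root / "README.md"):
        placeholder.touch(exist_ok=True)
    return root
```

Empty directories are not tracked by git, so a repository that needs commands/ and agents/ to persist will need a placeholder file in each; that convention is left out of this sketch.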

Refactoring an existing command-heavy plugin (pr-review-toolkit style)

Identify the command that contains real workflow logic.
Rename it to commands/review-foo.deleted.md (keep for history and compat).
Create skills/review-foo/SKILL.md that performs the classification and orchestration.
Promote each distinct concern inside the old command into its own agent under agents/.
Give every agent a rich description with 3+ <example> blocks drawn from the scenarios the old command handled.
Update README and any marketplace entries. Remove duplication.
Verify: the user can still say the old command name (via shim if needed) but the implementation now follows the 5 principles.
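The rename and skill-creation steps can be sketched as a small helper. This assumes the command lives at commands/<name>.md and that an empty SKILL.md placeholder is an acceptable starting point; `retire_command` is a hypothetical name, and the orchestration logic still has to be written by hand.

```python
from pathlib import Path


def retire_command(root: Path, name: str) -> Path:
    """Rename commands/<name>.md to a .deleted shim and create skills/<name>/SKILL.md."""
    source = root / "commands" / f"{name}.md"
    shim = root / "commands" / f"{name}.deleted.md"
    if not source.is_file():
        raise FileNotFoundError(source)
    if shim.exists():
        raise FileExistsError(shim)
    source.rename(shim)  # kept for history and compatibility

    skill_dir = root / "skills" / name
    skill_dir.mkdir(parents=True, exist_ok=True)
    (skill_dir / "SKILL.md").touch(exist_ok=True)
    return shim
```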

Adding a specialist agent to an existing skill

Write the agent .md first with narrow charter and examples.
In the skill, add a decision block: "If the task matches X, spawn the Y-agent using Task with focused diff or file list."
Tell the user explicitly what you are spawning and why.
After the agent returns, integrate its findings into the skill's final output (dedupe, prioritize, format consistently).
Document the handoff contract in the skill's "When to spawn agents" subsection.

Externalizing content from a bloated SKILL.md (scalekit-code-doctor style)

Identify the largest sections (method tables, 50-item checklists, full code samples for 6 languages).
Create references/REFERENCE.md or references/COMMON-MISTAKES.md (or equivalent) and move the data.
At the top of the skill body, after frontmatter and intro, insert the mandatory "Before doing anything else, load the references..." instruction.
Replace the inline content with a short summary + load blockquote.
The skill now stays small; the references can be consumed by other tools.
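Finding the largest sections can be done by word count. A rough sketch, assuming the SKILL.md uses `#`-style Markdown headings; it does not skip headings that appear inside fenced code blocks, so treat the ranking as a starting point.

```python
import re

HEADING = re.compile(r"^(#{1,6})\s+(.*)$")


def section_word_counts(markdown: str) -> list[tuple[str, int]]:
    """Return (heading, word count) pairs, largest section first."""
    sections: list[tuple[str, int]] = []
    title, words = "(preamble)", 0
    for line in markdown.splitlines():
        match = HEADING.match(line)
        if match:
            sections.append((title, words))
            title, words = match.group(2).strip(), 0
        else:
            words += len(line.split())
    sections.append((title, words))
    # Drop empty sections and sort so externalization candidates come first.
    return sorted((s for s in sections if s[1] > 0), key=lambda s: s[1], reverse=True)
```

The top one or two entries are usually the candidates to move into references/ and replace with a short summary plus a load blockquote.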

For more migration and externalization patterns drawn from the actual authstack restructure, load references/restructure-best-practices.md.

Packaging and Distribution

A plugin is not complete until it is self-documenting for both humans and future agents.

Required artifacts:

README.md at plugin root that explains the canonical content layer (docs/, skills/, rules/) vs the Claude adapters (commands/, agents/, hooks/), states the 5 principles, and gives real invocation examples.
tile.json (exact shape used by every skill in this devex-kit) when the skill is part of devex-kit.
At least one skill with examples/ demonstrating good usage.
Cross-links that use the load-references pattern or relative paths into docs/.

For the two primary study examples (pr-review-toolkit as agents+skill instead of command; docs-engineering as skills+refs+examples), the full authstack restructure trees, the packaging checklist, official quickstart, detailed local testing with --plugin-dir / /reload-plugins / --plugin-url, and migration from standalone .claude/, load references/examples-and-packaging.md.

Phase Gates

Model → Craft: Have you classified every piece of functionality using the 5 principles? Is the primary deliverable a lean skill rather than a command or a monolithic agent?

Craft → Specialize: Is SKILL.md written and already delegating via load-reference calls? Was heavy data extracted before the main file grew too large?

Specialize → Structure: Do all agents have rich descriptions with concrete <example> blocks? Are they narrow enough that a one-sentence handoff produces reliable output? Are the commands/ and agents/ dirs intentionally minimal?

Structure → Package: Does the plugin root have a README explaining the layers and the 5 principles? Are examples/ and references/ populated? Are empty dirs documented as intentional? Is tile.json present if this is a devex-kit skill?

Quality Checklist

Before declaring the plugin or skill complete:

[ ] commands/ dir exists and is empty or contains only .deleted shims (no logic)
[ ] agents/ dir exists and is empty or contains at most 6 focused specialists with rich example blocks in their descriptions
[ ] Every SKILL.md uses the exact devex-kit frontmatter (license + metadata block)
[ ] SKILL.md body is imperative, uses tables for decisions, contains phase gates + quality checklist, ends with "Did this help?", and delegates via "> For expanded ... load references/..." blockquotes
[ ] All long tables, exhaustive checklists, framework samples, anti-patterns, and reference data live in references/ or sibling .md files (scalekit-code-doctor pattern)
[ ] At least one skill demonstrates the examples/ sibling pattern
[ ] Plugin README documents the 5 principles and canonical vs adapter layers
[ ] Cross-references to pr-review-toolkit (agents + orchestration) and docs-engineering (skills + refs + examples) appear in the content
[ ] The plugin was tested by actually invoking the skill and any agents inside real Claude Code using the official local development workflow (claude --plugin-dir ./plugin, /reload-plugins, namespaced /plugin:skill invocations, and optionally --plugin-url or multiple --plugin-dir flags). Run claude plugin validate before distribution.
[ ] No duplication of concerns already covered by existing devex-kit skills (sdk-craft, mcp-server-craft, docs-writing-style, etc.)
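The structural items in this checklist can be linted before manual review. A minimal sketch, assuming shims are recognized by `.deleted` in the filename and agents are one `.md` file each directly under agents/; `check_layout` is a hypothetical helper and covers only the directory-level checks, not content quality.

```python
from pathlib import Path


def check_layout(root: Path, max_agents: int = 6) -> list[str]:
    """Return a list of layout problems; an empty list means the checks passed."""
    problems: list[str] = []

    commands = root / "commands"
    if not commands.is_dir():
        problems.append("commands/ is missing")
    else:
        # Anything that is not a dotfile or a .deleted shim counts as live logic.
        live = sorted(
            p.name for p in commands.iterdir()
            if p.is_file() and not p.name.startswith(".") and ".deleted" not in p.name
        )
        if live:
            problems.append(f"commands/ contains non-shim files: {', '.join(live)}")

    agents = root / "agents"
    if not agents.is_dir():
        problems.append("agents/ is missing")
    else:
        count = len(list(agents.glob("*.md")))
        if count > max_agents:
            problems.append(f"agents/ has {count} agents (limit {max_agents})")

    if not list(root.glob("skills/*/SKILL.md")):
        problems.append("no skills/*/SKILL.md found")
    return problems
```

This complements rather than replaces testing inside Claude Code, since it only inspects the file tree.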

Did this help?

At the end of every session, ask: "Did this solve what you were trying to do?"

If yes: done.
If the structure advice was incomplete, a reference was missing or stale, an example did not match current best practice, or the output led to a plugin that felt bloated: encourage the user to file an issue at https://github.com/saif-shines/devex-kit/issues. Offer to help draft it using their agent — include: what they were building, which phase they were in, what this skill produced, and what was missing or incorrect.

Adapt and Evolve

This skill itself follows the model it teaches. The authoritative patterns live in the references/ directory and in the live restructured plugins (authstack, skillkit examples). When official Claude Code plugin guidance (https://code.claude.com/docs/en/plugins, full index at https://code.claude.com/docs/llms.txt) or devex-kit conventions change, update the references first, then the orchestration in this SKILL.md.

When you create a new plugin using this skill, you are also contributing to the living catalog of good examples. Add it to the references when it demonstrates a new successful variation.

State the phase or the concrete task. Load the relevant reference. Build correctly.

