Implement Feature

Autonomous orchestrator that takes a feature from plan through to merge-ready PR. This is the primary workflow — it coordinates planning, implementation, verification, cleanup, and review.

Input

Accept either:

A plan file path (e.g., docs/plans/active/2026-03-07-user-auth.md)
A natural language feature description (e.g., "Add password reset flow with email verification")

If it is unclear which kind of input was given, or the input itself is ambiguous, ask the human to clarify.

Setup

Before anything else, read harness.yaml at the project root. Parse and hold in context:

commands — every command you run comes from here
verification.checks — the pre-commit gate checks
verification.max_retries — how many times to retry before escalating (default 3)
review.max_review_cycles — how many review cycles before escalating (default 3)
review.provider — the review agent (e.g., coderabbit)
docs — paths for plans, decisions, architecture
enforcement — rules to enforce
gardening — post-feature checks
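
A minimal sketch of how these values might be held once harness.yaml has been parsed into a dictionary. The function name resolve_settings and the nested layout (dotted names mapping to nested keys) are assumptions for illustration; only the key names and the defaults of 3 come from the list above.

```python
def resolve_settings(harness: dict) -> dict:
    """Pull the workflow settings out of a parsed harness.yaml mapping.

    Assumption: dotted names such as verification.max_retries map to nested
    keys, and a missing limit falls back to the documented default of 3.
    """
    verification = harness.get("verification") or {}
    review = harness.get("review") or {}
    return {
        "commands": harness.get("commands") or {},
        "checks": verification.get("checks") or [],
        "max_retries": verification.get("max_retries", 3),
        "max_review_cycles": review.get("max_review_cycles", 3),
        "review_provider": review.get("provider"),
    }
```

Resolving the defaults once at setup means later phases can read the limits without re-checking whether the keys exist.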

Also read AGENTS.md to understand project-specific agent rules.

If a memory/ directory exists at the project root, read its files to load cross-session learned patterns (common failures, fixes, user preferences). This is cheap context that prevents repeating past mistakes.

Determine the repo owner and name from git remote for use in gh api calls:

git remote get-url origin
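
One way to turn that command's output into an owner and repo name is sketched below. It assumes a GitHub-style remote in either HTTPS or SSH form; parse_remote is a hypothetical helper name, not part of the skill.

```python
import re


def parse_remote(url: str) -> tuple[str, str]:
    """Return (owner, repo) from a remote URL.

    Handles forms such as https://host/owner/repo.git and
    git@host:owner/repo.git; the trailing .git is optional.
    """
    match = re.search(r"[:/]([^/:]+)/([^/]+?)(?:\.git)?/?$", url.strip())
    if match is None:
        raise ValueError(f"unrecognized remote URL: {url!r}")
    return match.group(1), match.group(2)
```

The two values fill the {owner} and {repo} placeholders in the gh api paths used in Phase 4.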

Phase 1: Planning

If input is natural language (no plan file provided):

Invoke /create-plan with the feature description
Wait for the human to review and approve the plan
Do NOT proceed until the human explicitly approves

If input is a plan file:

Read the plan file
Extract the ordered steps from the ## Steps section
Confirm with the human: "I found N steps in the plan. Proceeding with implementation."
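
Step extraction can be sketched as follows. This assumes step headings take the form "### Step N: title" (the form used later in this document when a step is marked done) and that the section ends at the next "## " heading; extract_steps is a hypothetical name.

```python
import re

STEP_HEADING = re.compile(r"^### Step (\d+): (.+)$")


def extract_steps(plan_text: str) -> list[tuple[int, str]]:
    """Return (number, title) pairs found under the '## Steps' section."""
    steps = []
    in_steps = False
    for line in plan_text.splitlines():
        if line.startswith("## "):
            # Entering or leaving the Steps section.
            in_steps = line.strip() == "## Steps"
            continue
        if in_steps:
            match = STEP_HEADING.match(line.strip())
            if match:
                steps.append((int(match.group(1)), match.group(2)))
    return steps
```

The length of the returned list is the N reported to the human in the confirmation message.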

Create task list

Create a task list from the plan steps. Each step becomes a task. Also create completion tasks for post-implementation phases — these ensure cleanup, archival, and retrospective don't get skipped when the user sends ad-hoc requests late in the workflow.

Required tasks:

One task per implementation step (from the plan)
"Move plan to completed" (after all implementation tasks)
"Run retrospective" (after plan move)

The completion tasks act as visible reminders. Do NOT mark them complete until the actual work is done — even if the user asks for other things in between.
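
As an illustration of the required task list, the sketch below appends the two completion tasks after the implementation steps. The function name and the "Step N:" label format are assumptions; the completion task names come from the list above.

```python
COMPLETION_TASKS = ["Move plan to completed", "Run retrospective"]


def build_task_list(step_titles: list[str]) -> list[str]:
    """One task per plan step, followed by the completion tasks in order."""
    tasks = [f"Step {i}: {title}" for i, title in enumerate(step_titles, start=1)]
    return tasks + COMPLETION_TASKS
```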

Create a feature branch

git checkout -b feat/<short-topic-name>

Use a descriptive branch name derived from the plan title.
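
A possible way to derive the branch name from a plan title is sketched here. The four-word cap and the helper name branch_name are assumptions chosen for illustration; the skill only requires a short, descriptive name under feat/.

```python
import re


def branch_name(plan_title: str, max_words: int = 4) -> str:
    """Build feat/<short-topic-name> from the leading words of the title."""
    words = re.findall(r"[a-z0-9]+", plan_title.lower())
    if not words:
        raise ValueError("plan title has no usable words")
    return "feat/" + "-".join(words[:max_words])
```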

Phase 2: Implementation Loop

For each step in the plan, in order:

Context isolation note: For plans with 7+ steps, consider using sub-agents for each phase (Planning, Implementation, Cleanup, PR). Each sub-agent gets a fresh context window, preventing degradation from accumulated conversation history. Pass only: plan file path, branch name, harness.yaml path, and memory/ path. Sub-agents exist primarily to isolate context, not to divide roles.

File-based handoff (avoid the telephone game): When using sub-agents, have them write results to files rather than relying on conversation relay. Sub-agents should write decisions to the plan's Decision Log, phase results to scratch/phase-N-results.md, and issues to scratch/issues-found.md. The orchestrator reads these files directly — this preserves full fidelity instead of lossy paraphrasing.

2.1 — Re-anchor context (plan re-reading)

Re-read the current step from the plan file. During long sessions, the original plan drifts toward the middle of the context window where recall is lowest (10-40% lower than edges). Explicitly re-reading places the step requirements at a recent, high-attention position. This costs ~100-200 tokens per step but prevents drift from plan intent.

2.2 — Read the step requirements

From the step you just re-read, identify:

Which files to create or modify (listed under Files:)
What tests are needed (listed under Tests:)
What the step does (listed under What to do:)
How to verify it (listed under Verify:)

If the step references patterns from docs/conventions.md or docs/architecture.md, read those sections now.

2.3 — Write or update tests first (TDD)

Write the test(s) for this step before writing the implementation

Run the tests to confirm they fail as expected (red phase):

# Use commands.test or commands.test_unit from harness.yaml

If a test framework or helper is missing, create it as part of this step

2.4 — Implement the code

Write the implementation to make the tests pass
Follow patterns from docs/conventions.md
Keep changes focused — one logical change per step

2.5 — Run the step's verify command

If the step has a Verify: section with a specific command, run it now to confirm the step works before committing.

2.6 — Stage changes and commit

Stage the files changed in this step (prefer specific file paths over git add .):

git add <specific-files>
git commit -m "<descriptive message for this step>"

2.7 — Pre-commit verification gate

The pre-commit gate fires automatically on commit. Here is what it does and how to handle it:

Read harness.yaml -> verification.checks
Run each check in order (lint, test, enforcement — whatever is configured)
Capture verbose output to scratch/ — write the full output of each check to scratch/last-verification.log. Keep only a compact summary in conversation context: pass/fail status, error count, and the first 3 error messages. This prevents test and lint output from bloating the context window.
If all checks pass: the commit succeeds. Move on.
If any check fails:
- Read the error summary (or re-read scratch/last-verification.log for details)
- Identify the root cause
- Fix the issue
- Stage the fix and retry the commit
- Track the retry count
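
The output-capture rule above can be sketched as a small helper that appends the full output to the log and returns only the compact summary. Treating any line that contains "error" as an error line is an assumption made for this sketch; real checks may need tool-specific parsing. summarize_check is a hypothetical name.

```python
from pathlib import Path


def summarize_check(name: str, exit_code: int, output: str,
                    log_path: Path, max_errors: int = 3) -> dict:
    """Append the full check output to the log and return a compact summary."""
    log_path.parent.mkdir(parents=True, exist_ok=True)
    with log_path.open("a", encoding="utf-8") as log:
        log.write(f"=== {name} (exit {exit_code}) ===\n{output}\n")
    # Assumption: a line mentioning "error" (any case) counts as one error.
    errors = [ln.strip() for ln in output.splitlines() if "error" in ln.lower()]
    return {
        "check": name,
        "passed": exit_code == 0,
        "error_count": len(errors),
        "first_errors": errors[:max_errors],
    }
```

Only the returned dictionary needs to stay in conversation context; the log at scratch/last-verification.log holds the rest.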

After verification.max_retries consecutive failures (default 3): STOP IMMEDIATELY.

Do NOT keep trying

Print a clear error summary:

VERIFICATION FAILED after [N] retries.

Failing check: [check name]
Last error output:
[paste relevant error output]

What I tried:
[list each fix attempt]

I need your help to resolve this before continuing.

Wait for the human to help
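
The stop condition can be sketched as a bounded loop. The two callables are hypothetical stand-ins for "attempt the commit" and "apply a fix", and the sketch assumes every failed commit attempt, including the first, counts toward verification.max_retries.

```python
from typing import Callable


def commit_with_retries(attempt_commit: Callable[[], tuple[bool, str]],
                        apply_fix: Callable[[str], str],
                        max_retries: int = 3) -> dict:
    """Try to commit; stop and report after max_retries consecutive failures."""
    tried: list[str] = []
    last_error = ""
    for attempt in range(1, max_retries + 1):
        ok, error = attempt_commit()
        if ok:
            return {"status": "committed", "attempts": attempt, "tried": tried}
        last_error = error
        if attempt < max_retries:
            # Record what was tried so it can be listed when escalating.
            tried.append(apply_fix(error))
    return {"status": "escalate", "attempts": max_retries,
            "last_error": last_error, "tried": tried}
```

When the status is "escalate", the last_error and tried fields supply the "Last error output" and "What I tried" parts of the summary above.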

2.8 — Update the plan file

After a successful commit, update the plan file to mark the step as done by inserting [DONE] before the step title in its heading. The step coverage probe in 3.1 looks for this marker:

### Step 1: [DONE] Set up database models
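
A minimal sketch of that edit, assuming headings follow the "### Step N: title" form shown in the example. mark_step_done is a hypothetical helper; it leaves the text unchanged if the step is already marked or not found.

```python
import re


def mark_step_done(plan_text: str, step_number: int) -> str:
    """Insert [DONE] after 'Step N:' in the matching heading, at most once."""
    pattern = re.compile(
        rf"^(### Step {step_number}: )(?!\[DONE\] )(.*)$", re.MULTILINE
    )
    new_text, _count = pattern.subn(r"\1[DONE] \2", plan_text, count=1)
    return new_text
```

Because the colon is part of the match, marking step 1 does not touch step 10, and running the helper twice on the same step is harmless.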

2.9 — Context checkpoint (after step 4+)

Starting from step 4 onward (or after any retry cycle on the verification gate), write a structured state summary to scratch/session-state.md covering: current position, completed steps, files modified, decisions made, and next action. This serves as a cheap re-orientation point — re-reading this file (~200 tokens) is faster and more reliable than scanning the full conversation history. See the context management reference for the checkpoint format.
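
The checkpoint could be rendered along these lines. The section headings below are an assumed layout covering the five items named above, not the format from the context management reference, which is not reproduced here.

```python
def render_session_state(position: str, completed: list[str], files: list[str],
                         decisions: list[str], next_action: str) -> str:
    """Render the five checkpoint fields as a short plain-text summary."""
    def bullets(items: list[str]) -> str:
        return "\n".join(f"- {item}" for item in items) or "- (none)"

    return (
        "# Session state\n\n"
        f"## Current position\n{position}\n\n"
        f"## Completed steps\n{bullets(completed)}\n\n"
        f"## Files modified\n{bullets(files)}\n\n"
        f"## Decisions made\n{bullets(decisions)}\n\n"
        f"## Next action\n{next_action}\n"
    )
```

Writing the returned text to scratch/session-state.md after each step overwrites the previous checkpoint, so the file always reflects the latest position.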

Phase 3: Cleanup

After ALL steps in the plan are complete:

3.1 — Decision probes (plan adherence check)

Before running cleanup, verify that the implementation actually matches the plan. Re-read the plan file and check:

Step coverage — Is every step marked [DONE]? If any are unmarked, they were missed.
File paths — Do the files listed in the plan's Files: sections actually exist? Run ls on each.
Test paths — Do the test files listed in the plan's Tests: sections exist and pass?
Conventions — If the plan's Context and Orientation section referenced specific conventions, spot-check that they were followed (e.g., if it said "follow the repository pattern from conventions.md", verify the new code uses that pattern).
Open questions resolved — Check the plan's Open Questions section. Were they all answered? If any were left unresolved, flag them now.

If any probe fails, fix the gap before continuing. If the gap requires architectural judgment, escalate to the human.
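
The first two probes lend themselves to a mechanical check, sketched below. Both helper names are hypothetical, and the step heading format "### Step N: title" is assumed from the marking convention in 2.8.

```python
import re
from pathlib import Path

STEP_HEADING = re.compile(r"^### Step (\d+): (.*)$", re.MULTILINE)


def unmarked_steps(plan_text: str) -> list[int]:
    """Step numbers whose heading does not carry the [DONE] marker."""
    return [int(num) for num, title in STEP_HEADING.findall(plan_text)
            if not title.startswith("[DONE]")]


def missing_paths(paths: list[str], root: Path) -> list[str]:
    """Paths listed in the plan that do not exist under the project root."""
    return [p for p in paths if not (root / p).exists()]
```

An empty result from both functions means the step coverage and file path probes pass; the convention and open question probes still need a manual read.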

3.2 — Full codebase enforcement check

Run all verification checks from harness.yaml -> verification.checks against the full codebase (not just changed files). Fix any issues found.

3.3 — Deslop

Invoke /deslop to clean up generated code:

Remove unnecessary comments
Tighten verbose patterns
Clean up any slop introduced during implementation

3.4 — Garden

Invoke /garden for an entropy scan:

Doc freshness (do docs still match the code?)
Coverage gaps (any new code without tests?)
File sizes (anything exceeding enforcement.file_size_limit?)
Unused imports and exports

Address any findings before proceeding.

3.5 — Generate QA checklist

Invoke /create-qa to generate a manual QA checklist based on the changes made. This goes into the PR description or a linked QA doc.

3.6 — Observation masking cleanup

Before moving to Phase 4, ensure all verbose outputs from Phases 2 and 3 are in scratch/ files and not bloating the conversation context. If you are carrying large diffs, test outputs, or lint results in context, write them to scratch/ now and keep only summaries going forward.

Phase 4: PR and Review

4.1 — Create the PR

Invoke /create-pr to handle the standards gate (lint, test, enforcement) and PR creation. This skill will:

Run all verification checks (blocking gate — stops if any fail)
Push the branch
Create the PR with a properly formatted title and body

If /create-pr fails the standards gate, fix the issues and invoke it again.

4.2 — Wait for review

Wait 30 seconds for the review agent (e.g., CodeRabbit) to begin its review:

sleep 30

4.3 — Poll for review comments

Get the PR number from the gh pr create output reported by /create-pr (the PR URL ends in the number), then poll:

gh api repos/{owner}/{repo}/pulls/{pr_number}/reviews
gh api repos/{owner}/{repo}/pulls/{pr_number}/comments

4.4 — Address review comments

For each review comment:

If actionable (requests a code change):

Read the comment and the referenced code
Make the fix
Commit with a message referencing the review (e.g., fix: address review — <what was fixed>)
Push

If informational (suggestion, praise, question, or nitpick you disagree with):

Reply inline via gh api:

gh api repos/{owner}/{repo}/pulls/{pr_number}/comments/{comment_id}/replies -f body="<response>"

If unclear (you do not understand the comment):

Do NOT guess. Reply asking for clarification, and escalate to the human.

4.5 — Re-review loop

After pushing fixes:

Wait 30 seconds for re-review
Poll for new comments
Address new comments
Repeat

Loop up to review.max_review_cycles (from harness.yaml, default 3).
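
The loop and its cap can be sketched as below. The poll and address callables are hypothetical stand-ins for the gh api polling and the comment handling in 4.3 and 4.4; address is assumed to return False when a comment could not be resolved. As a simplification, the sketch does not wait for one more re-review after the final cycle.

```python
import time
from typing import Callable


def review_loop(poll: Callable[[], list[str]],
                address: Callable[[str], bool],
                max_cycles: int = 3,
                wait_seconds: float = 30.0,
                sleep: Callable[[float], None] = time.sleep) -> dict:
    """Wait, poll, and address comments for at most max_cycles rounds."""
    unresolved: list[str] = []
    cycles = 0
    while cycles < max_cycles:
        sleep(wait_seconds)
        comments = poll()
        if not comments:
            break  # no new comments: the review has settled
        cycles += 1
        unresolved.extend(c for c in comments if not address(c))
    status = "escalate" if unresolved else "ready"
    return {"status": status, "cycles": cycles, "unresolved": unresolved}
```

Passing sleep as a parameter keeps the 30-second wait out of tests; the default uses time.sleep.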

4.6 — Review retrospective

After all review cycles are complete (whether approved or escalated), run /retrospective to capture learnings from the review feedback. Review comments are the highest-signal input for memory — reviewers explicitly state team standards that may not be documented anywhere. Focus on question 1f (review-specific) and 1c (documentation gaps revealed by reviewer comments).

4.7 — Resolution

If the review agent approves or there are no new actionable comments:

PR is ready for merge: <PR URL>

Summary of changes:
- [bullet points]

Review cycles completed: [N]
All review comments addressed.

Please review and merge when ready.

If max review cycles reached with unresolved comments:

REVIEW ESCALATION after [N] cycles.

Unresolved comments:
1. [file:line] — [summary of comment] — [why it's unresolved]
2. ...

I need your guidance on these before continuing.

PR URL: <url>

Phase 5: Completion

After the human merges the PR:

Move the plan file from docs/plans/active/ to docs/plans/completed/:

git mv docs/plans/active/<plan-file>.md docs/plans/completed/<plan-file>.md
git commit -m "chore: move completed plan to archive"
git push

Run /retrospective to capture learnings from this implementation session. This writes structured findings to memory/ and optionally proposes improvements to docs or skills. If you prefer a quick finish, at minimum write back to memory any recurring issues you fixed during implementation.

Report final summary:

Feature complete: <feature name>

Commits: [N]
Files changed: [N]
Tests added: [N]
Review cycles: [N]
Plan: docs/plans/completed/<plan-file>.md

Critical Rules

NEVER hardcode any command. Every shell command for lint, test, build, format, etc. MUST come from harness.yaml -> commands or verification.checks. The only exceptions are git and gh commands.
Read harness.yaml at the start and reference it throughout. If the file changes mid-workflow (unlikely but possible), re-read it.
When stuck: STOP and escalate. Do not guess, do not loop endlessly, do not make assumptions about what the human wants. Print what you know and ask for help.
Keep the plan file updated as a living document. Mark steps done, record decisions in the Decision Log section.
Each commit should be atomic — one logical change. Do not bundle unrelated changes. Do not make giant commits.
Stage specific files when committing. Prefer git add <file1> <file2> over git add . or git add -A.
Never merge PRs. That is the human's decision.
Never dismiss review comments. Address them or escalate.
Never skip verification. The pre-commit gate exists for a reason.
Test first, implement second. Write failing tests before writing implementation code.
