Your task: execute ONE task end-to-end, producing output that meets all acceptance criteria. The task is either a new story (from /build-stories) or a fix (from /validate-execution findings).

Phase 1 — Load Context

Read the task file (your story) at the provided path
Determine the version from the path (e.g., specs/v0.1-core-push/001-... → v0.1-core-push)
Locate specs. If the orchestrator specified a specs repo path in your prompt, read specs from there. Otherwise, look for specs/ in CWD.
Read specs/<version>/context.md — shared version context (conventions, manifest, cross-cutting decisions).
Read specs/<version>/lessons.md — what the team has learned so far. Re-read this whenever you resume after a red-button halt.
Your story is self-contained — only open specs/<version>/architecture.md if the story explicitly sends you there. (Don't reload the full architecture by default.)
Scan the relevant part of the code workspace for the patterns and conventions you'll follow
Check if the task file already has an ## Execution Log section — if so, resume from where it left off
Code worktree: set it up exactly as the setup-playbook says (specs/<version>/setup-playbook.md) — how to add the worktree, copy .env/gitignored files, install deps, run gates. If you discover a setup step the playbook is missing, add it to the playbook.

Project Type Detection

The spec's Project Context → Type determines your execution approach:

code: Write code, run tests, verify builds
business / consulting: Write documents, verify against criteria
research: Investigate, analyze, produce findings
hybrid: Mix approaches as needed

If this is a fix task

The orchestrator will include validation findings in the prompt — specific failures that need to be addressed. Read the referenced validation spec to understand what failed and why.

If this follows a design story

If the task lists a prior /interface-design story as a prerequisite, read that story's execution log to find the files it produced. Build on them, don't redesign them.

Phase 2 — Plan Implementation

Before producing any output:

List every file to create or modify
Define the order of operations
Identify any ambiguities or blockers

Red-button check (do this now, and any time during execution). If the story turns out much larger or different than specified, or you hit an unexpected blocker, do NOT grind for a long time and do NOT split the story yourself:

Broadcast a halt to the other engineers (SendMessage) so they don't hit the same wall.
Report to the team lead: the challenge, what you found, and 2–3 concrete options.
Wait for direction (the lead decides with the user; scope issues go to the live PO to re-refine). On resume, re-read lessons.md first.

This early-flag discipline is what prevents mid-flight story splits.

Do it the short way — use the work-modes primitives (dispatch them as agents, or apply their technique) instead of grep-spelunking:

explore-conventions before writing a new <thing> (controller, subscriber, error, factory, test) — match the codebase's established pattern instead of inventing one.
verify-symbol before calling any method/field/endpoint you didn't write — confirm it exists and get its real signature.
probe-contract to see how something actually behaves (run it in a REPL) instead of guessing from the source.
trace-flow when a value crosses layers and you need to know exactly what happens to it.

Phase 3 — Execute

For Code Projects

New stories:

Write the code following the architecture and patterns
Write tests alongside the implementation (see Phase 4)
Ensure everything compiles/runs

Fix tasks:

Understand the failure first — read validation findings carefully
Write a failing test that reproduces the failure
Fix the bug with minimal code change
Verify the test passes
Run the full suite — ensure no regressions

Integrating Interface-Design Output:

Read the design story's execution log for produced files
Read .interface-design/system.md for design tokens and patterns
Preserve the visual design — don't restyle or restructure
Your job is to wire: data fetching, state management, API calls, routing

For Non-Code Projects

New stories:

Research and gather inputs needed for the deliverable
Write the deliverable following the architecture's delivery structure
Check the deliverable against each acceptance criterion
If the deliverable references other documents or data, verify those references are accurate

Fix tasks:

Read the validation findings — understand what's missing or incorrect
Address each finding specifically
Re-verify the affected acceptance criteria

For Research Projects

New stories:

Conduct the investigation described in the task
Collect and organize findings
Produce the output in the format specified by the architecture
Verify findings are supported by sources
Flag uncertainties or areas needing deeper investigation

Phase 4 — Verify (Code Projects)

Write automated tests alongside the implementation. The goal is pragmatic coverage — not 100% unit testing, but confidence that key behaviors work.

Testing Philosophy

Complex logic (engines, validators, state machines, algorithms): thorough test suite
Key components (database layers, providers, API routes): at least one test pass verifying primary behavior
Simple glue code (factories, config loading, re-exports): tested implicitly
Integration points: test that components work together

Run Tests

After writing tests, run them and ensure they all pass. Tests MUST pass before moving to Phase 5.

Phase 4 — Verify (Non-Code Projects)

For non-code projects, verification is criteria-based:

Walk through each acceptance criterion from the task file
Check completeness — does the deliverable address everything?
Check quality — is the writing clear? Are claims supported? Is the format correct?
Check language — verify the output language matches the project's language setting
Report what's done and what meets criteria

Phase 5 — Final Checks & QA Handover

Check every acceptance criterion from the task file
For code: run all tests, linters, type checks, builds
For non-code: re-read deliverable against criteria, verify references and links
Integration check — verify the output connects properly to what exists
Hand over to the live QA before declaring done. There is one QA agent for the whole execution. SendMessage it: what you built, how to exercise it, and which Definition-of-Done items it covers. Address QA's findings now, while the work is fresh — don't defer them to an end-of-version gate.

Phase 6 — Update Task File

Append an ## Execution Log section to the task file:

## Execution Log

### Session: <date>

**Status**: completed | in-progress

**Completed:**
- What was done (with file paths)

**Decisions Made:**
- Any implementation decisions not in the original task

**Issues Encountered:**
- Problems hit and how they were resolved

**Struggled With:**
- Things that took multiple attempts
- Process difficulty that future agents should know about

**Pending:** (only if in-progress)
- What's left to do

Then update specs/<version>/stories.md — mark the task as completed ONLY if ALL acceptance criteria pass. Otherwise mark as in-progress. After updating, re-read stories.md to verify the change was saved.

Also append your durable learnings (surprises, under-specified spots, setup gotchas, decisions) to specs/<version>/logs/engineer-<N>.md — your own file. The PO consolidates these into lessons.md for everyone.

Spec-workspace git: write all of the above (execution log, stories.md, your engineer log) on the current branch, but do not run git in the spec workspace — the team lead commits it (single-committer rule). Your only git is in the code worktree, per your role's merge protocol.

Phase 7 — Make QA's job guess-free (Code Projects)

The live QA verifies your handover (Phase 5), so give it what it needs:

In the handover message — exact steps to exercise what you built, which DoD items it covers, and any new seed scripts, migrations, env vars, or config overrides needed to run it.
Startup commands — if the task changed how services start, update docs/dev-environment.md with the exact commands, and reflect any new gotcha in specs/<version>/setup-playbook.md.
Architecture updates — if the implementation diverged from the architecture doc, note it in your execution log and flag it to the team lead (so the PO can reconcile the spec). Don't silently diverge.

Your task: execute ONE task end-to-end, producing output that meets all acceptance criteria. The task is either a new story (from /build-stories) or a fix (from /validate-execution findings).

Phase 1 — Load Context

Read the task file (your story) at the provided path
Determine the version from the path (e.g., specs/v0.1-core-push/001-... → v0.1-core-push)
Locate specs. If the orchestrator specified a specs repo path in your prompt, read specs from there. Otherwise, look for specs/ in CWD.
Read specs/<version>/context.md — shared version context (conventions, manifest, cross-cutting decisions).
Read specs/<version>/lessons.md — what the team has learned so far. Re-read this whenever you resume after a red-button halt.
Your story is self-contained — only open specs/<version>/architecture.md if the story explicitly sends you there. (Don't reload the full architecture by default.)
Scan the relevant part of the code workspace for the patterns and conventions you'll follow
Check if the task file already has an ## Execution Log section — if so, resume from where it left off
Code worktree: set it up exactly as the setup-playbook says (specs/<version>/setup-playbook.md) — how to add the worktree, copy .env/gitignored files, install deps, run gates. If you discover a setup step the playbook is missing, add it to the playbook.

Project Type Detection

The spec's Project Context → Type determines your execution approach:

code: Write code, run tests, verify builds
business / consulting: Write documents, verify against criteria
research: Investigate, analyze, produce findings
hybrid: Mix approaches as needed

If this is a fix task

The orchestrator will include validation findings in the prompt — specific failures that need to be addressed. Read the referenced validation spec to understand what failed and why.

If this follows a design story

If the task lists a prior /interface-design story as a prerequisite, read that story's execution log to find the files it produced. Build on them, don't redesign them.

Phase 2 — Plan Implementation

Before producing any output:

List every file to create or modify
Define the order of operations
Identify any ambiguities or blockers

Broadcast a halt to the other engineers (SendMessage) so they don't hit the same wall.
Report to the team lead: the challenge, what you found, and 2–3 concrete options.
Wait for direction (the lead decides with the user; scope issues go to the live PO to re-refine). On resume, re-read lessons.md first.

This early-flag discipline is what prevents mid-flight story splits.

Do it the short way — use the work-modes primitives (dispatch them as agents, or apply their technique) instead of grep-spelunking:

explore-conventions before writing a new <thing> (controller, subscriber, error, factory, test) — match the codebase's established pattern instead of inventing one.
verify-symbol before calling any method/field/endpoint you didn't write — confirm it exists and get its real signature.
probe-contract to see how something actually behaves (run it in a REPL) instead of guessing from the source.
trace-flow when a value crosses layers and you need to know exactly what happens to it.

Phase 3 — Execute

For Code Projects

New stories:

Write the code following the architecture and patterns
Write tests alongside the implementation (see Phase 4)
Ensure everything compiles/runs

Fix tasks:

Understand the failure first — read validation findings carefully
Write a failing test that reproduces the failure
Fix the bug with minimal code change
Verify the test passes
Run the full suite — ensure no regressions

Integrating Interface-Design Output:

Read the design story's execution log for produced files
Read .interface-design/system.md for design tokens and patterns
Preserve the visual design — don't restyle or restructure
Your job is to wire: data fetching, state management, API calls, routing

For Non-Code Projects

New stories:

Research and gather inputs needed for the deliverable
Write the deliverable following the architecture's delivery structure
Check the deliverable against each acceptance criterion
If the deliverable references other documents or data, verify those references are accurate

Fix tasks:

Read the validation findings — understand what's missing or incorrect
Address each finding specifically
Re-verify the affected acceptance criteria

For Research Projects

New stories:

Conduct the investigation described in the task
Collect and organize findings
Produce the output in the format specified by the architecture
Verify findings are supported by sources
Flag uncertainties or areas needing deeper investigation

Phase 4 — Verify (Code Projects)

Write automated tests alongside the implementation. The goal is pragmatic coverage — not 100% unit testing, but confidence that key behaviors work.

Testing Philosophy

Complex logic (engines, validators, state machines, algorithms): thorough test suite
Key components (database layers, providers, API routes): at least one test pass verifying primary behavior
Simple glue code (factories, config loading, re-exports): tested implicitly
Integration points: test that components work together

Run Tests

After writing tests, run them and ensure they all pass. Tests MUST pass before moving to Phase 5.

Phase 4 — Verify (Non-Code Projects)

For non-code projects, verification is criteria-based:

Walk through each acceptance criterion from the task file
Check completeness — does the deliverable address everything?
Check quality — is the writing clear? Are claims supported? Is the format correct?
Check language — verify the output language matches the project's language setting
Report what's done and what meets criteria

Phase 5 — Final Checks & QA Handover

Check every acceptance criterion from the task file
For code: run all tests, linters, type checks, builds
For non-code: re-read deliverable against criteria, verify references and links
Integration check — verify the output connects properly to what exists
Hand over to the live QA before declaring done. There is one QA agent for the whole execution. SendMessage it: what you built, how to exercise it, and which Definition-of-Done items it covers. Address QA's findings now, while the work is fresh — don't defer them to an end-of-version gate.

Phase 6 — Update Task File

Append an ## Execution Log section to the task file:

## Execution Log

### Session: <date>

**Status**: completed | in-progress

**Completed:**
- What was done (with file paths)

**Decisions Made:**
- Any implementation decisions not in the original task

**Issues Encountered:**
- Problems hit and how they were resolved

**Struggled With:**
- Things that took multiple attempts
- Process difficulty that future agents should know about

**Pending:** (only if in-progress)
- What's left to do

Phase 7 — Make QA's job guess-free (Code Projects)

The live QA verifies your handover (Phase 5), so give it what it needs:

In the handover message — exact steps to exercise what you built, which DoD items it covers, and any new seed scripts, migrations, env vars, or config overrides needed to run it.
Startup commands — if the task changed how services start, update docs/dev-environment.md with the exact commands, and reflect any new gotcha in specs/<version>/setup-playbook.md.
Architecture updates — if the implementation diverged from the architecture doc, note it in your execution log and flag it to the team lead (so the PO can reconcile the spec). Don't silently diverge.

Adoption

jaisonerick/execute-task

$ install --global

Security Scan Results

SKILL.md

Phase 1 — Load Context

Project Type Detection

If this is a fix task

If this follows a design story

Phase 2 — Plan Implementation

Phase 3 — Execute

For Code Projects

For Non-Code Projects

For Research Projects

Phase 4 — Verify (Code Projects)

Testing Philosophy

Run Tests

Phase 4 — Verify (Non-Code Projects)

Phase 5 — Final Checks & QA Handover

Phase 6 — Update Task File

Phase 7 — Make QA's job guess-free (Code Projects)

Related Skills

jaisonerick/web-design-guidelines

jaisonerick/mcp-builder

jaisonerick/markdown-converter

jaisonerick/validate-execution

jaisonerick/execute-task

$ install --global

Security Scan Results

SKILL.md

Phase 1 — Load Context

Project Type Detection

If this is a fix task

If this follows a design story

Phase 2 — Plan Implementation

Phase 3 — Execute

For Code Projects

For Non-Code Projects

For Research Projects

Phase 4 — Verify (Code Projects)

Testing Philosophy

Run Tests

Phase 4 — Verify (Non-Code Projects)

Phase 5 — Final Checks & QA Handover

Phase 6 — Update Task File

Phase 7 — Make QA's job guess-free (Code Projects)

Related Skills

jaisonerick/web-design-guidelines

jaisonerick/mcp-builder

jaisonerick/markdown-converter

jaisonerick/validate-execution