create-app — From Description to Verified Application

Identity

create-app is the full pipeline from "I want an app that does X" to a verified, working application. It integrates the prd skill, architect skill, and archon skill into a single flow with human checkpoints at the right moments.

This exists because no tool has built the complete pipeline. They either generate fast with no verification (Bolt, v0) or run autonomously with no guardrails (Devin, Replit Agent). This does both: fast generation with phase-by-phase verification, self-correction, and circuit breakers.

Tier Detection

Classify the user's input into one of five tiers:

Tier 1: Blank Project

Trigger: "create a blank project", "new project", "scaffold"
Action: Run the scaffold skill with stack detection. No PRD, no architecture.
Use when: The user knows what they're building and just wants the infrastructure.

Tier 2: Guided

Trigger: "I want to build...", "help me create...", description with questions
Action: Run prd skill (asks clarifying questions) → user approves → architect skill → user approves → archon skill
Use when: The user has an idea but needs help structuring it.
Human checkpoints: after PRD, after architecture, before each major phase.

Tier 3: Templated

Trigger: Describes a well-known app type ("a todo app", "a blog", "a dashboard with auth")
Action: Load template PRD if available → architect skill with template defaults → archon skill
Use when: The app shape is known and common. Speeds up the planning phase.
Template detection: check .citadel/_templates/app-types/ for matching templates. If no template matches, fall through to Tier 2.

Tier 4: Generated (Full Autonomy)

Trigger: "build me [detailed description]", "create [app] and deploy it"
Action: prd skill (minimal questions) → architect skill (auto-approve if confidence high) → archon skill with self-correction loop
Use when: The user wants maximum autonomy. Trust the pipeline.
Human checkpoints: after PRD only. Architecture and execution are autonomous.
Safety: all Archon self-correction mechanisms active. Direction alignment every 2 phases. Quality spot-checks every phase. Circuit breakers armed.

Tier 5: Feature Addition (Existing Codebase)

Trigger: user has an existing project + describes a feature, not a whole app ("add auth", "add a dashboard", "add payment processing", "add dark mode")
Detection: project has source files (src/, app/, lib/, package.json with deps) AND the user's description is a feature, not a standalone app
Action: prd skill in feature mode → architect skill in existing codebase mode → archon skill
Key differences from greenfield tiers:
- PRD reads existing codebase before asking questions
- Architecture describes changes to existing files, not a standalone system
- Phase 0 is always "Baseline" — record current typecheck/test state
- Every phase end condition includes "no new typecheck errors" + "existing tests pass"
- Risk register always includes "regression in existing functionality"
Human checkpoints: after feature spec (PRD). Architecture can auto-approve if the feature is well-scoped and all conditions are machine-verifiable.

Tier Classification

| Input Pattern | Tier | |---|---| | "blank project", "scaffold", "new empty" | 1 | | "help me build", "I want to create", "guide me" | 2 | | "todo app", "blog", "dashboard", well-known app type | 3 | | "build me [detailed]", "create [app]", confident description | 4 | | "add [feature]", "implement [feature]", existing project + feature description | 5 | | Ambiguous | Default to Tier 2 (safest) |

Protocol

Step 1: CLASSIFY

Read the user's input. Determine the tier.

Use plain language, not tier numbers. Announce what you'll do, not what tier they're in:

Tier 1: "I'll scaffold a blank project for you."
Tier 2: "I'll help you plan this out step by step. First I'll draft what we're building, then we'll agree on the approach before I write any code."
Tier 3: "This looks like a [type] app — I have a starting point for that. I'll show you the plan and you can adjust before I build."
Tier 4: "I'll plan this, show you the plan for approval, then build and verify it end to end."
Tier 5: "I'll read your existing codebase first, then plan how to add [feature] without breaking anything. You'll approve the plan before I touch any code."

If the classification is wrong, the user can override: "Actually, just scaffold it" → Tier 1 "Walk me through it" → Tier 2 "Just build it" → Tier 4

Step 2: EXECUTE TIER

Tier 1: Invoke the scaffold skill. Done.

Tier 2:

Invoke the prd skill with the user's description
Wait for user approval of the PRD
Invoke the architect skill with the approved PRD
Wait for user approval of the architecture
Create Archon campaign from the architecture
Execute campaign with standard Archon protocol
After each major phase: brief the user on progress

Tier 3:

Check .citadel/_templates/app-types/ for a matching template
If found: present the template PRD, ask "Does this match? What would you change?"
User approves or modifies → architect skill with template defaults
Brief architecture review (faster than Tier 2 since the shape is known)
Create and execute Archon campaign

Tier 4:

Invoke the prd skill in express mode (ask at most 1 question, or 0 if the description is complete)
User approves PRD (this is the only mandatory human checkpoint)
Invoke the architect skill (auto-approve if all end conditions are machine-verifiable)
Create Archon campaign with ALL safety systems active:
- Direction alignment every 2 phases
- Quality spot-check every phase
- Regression guard every build phase
- Anti-pattern scan every build phase
- Circuit breakers: 3 failures = new approach, 5+ type errors = park
- Phase end conditions must pass before proceeding
Execute autonomously until complete or parked
On completion: run full verification of all end conditions from the PRD
Present results to user

Tier 5 (Feature Addition):

Read the existing codebase — file tree, package.json, key entry points, existing patterns
Invoke the prd skill in feature mode (reads codebase before asking questions, max 2 questions)
User approves feature spec (one mandatory checkpoint)
Invoke the architect skill in existing codebase mode:
- Phase 0 is always "Baseline" — run typecheck and tests, record counts
- File tree shows only new + modified files
- Auto-approve if all conditions are machine-verifiable
Create Archon campaign. Every phase end condition includes:
- "No new typecheck errors vs baseline"
- "Existing tests pass"
Execute with all Archon safety systems active
On completion: verify all feature end conditions PLUS baseline regression check
Present results — what was added, what was verified, what still passes

Step 3: VERIFY (All Tiers except 1)

After the campaign completes:

Read the PRD's End Conditions
Check each one:
- Run commands, check file existence, invoke the live-preview skill for visual checks
Report:
- PASS: all end conditions met. App is v1-complete.
- PARTIAL: some conditions met, some failed. List what's missing.
- FAIL: critical conditions not met. Suggest next steps.

Step 4: DELIVER

Present the final state:

What was built (feature ledger from the campaign)
What was verified (end conditions that passed)
What needs attention (anything that failed or was sent to Review Queue)
How to run it (the command to start the app)
How to continue (what v2 could look like)
If deployed: "App is live at {URL}"
If deploy failed or skipped: "App verified locally. To deploy: {specific command}"
Suggest the postmortem skill to generate a campaign postmortem

Safety Systems

All of Archon's existing safety applies. Additionally:

PRD as contract: the PRD's end conditions are the acceptance criteria. The app is not "done" until those conditions pass. This prevents the 80% wall.
No stack lock-in: the architect skill chooses the stack based on the PRD, not a hardcoded default. Different apps get different stacks.
Graduated autonomy: Tier 2 has 3 human checkpoints. Tier 4 has 1. The user chooses how much control to keep.
Circuit breakers prevent death loops: if the build enters a fix-break cycle, the circuit breaker parks the campaign instead of burning tokens.

What create-app Does NOT Do

Skip the PRD (even Tier 4 generates one)
Build without end conditions (every phase has verifiable criteria)
Choose a stack without reasoning (every choice is justified)
Run fully unsupervised (Tier 4 still has PRD approval + all Archon safety)
Deploy without verification (deployment is an end condition, not automatic)

Quality Gates

PRD exists and is approved before any code is written
Architecture exists before Archon starts
Every campaign phase has machine-verifiable end conditions
Final verification checks all PRD end conditions
User receives a clear report of what was built and what needs attention

create-app — From Description to Verified Application

Identity

Tier Detection

Classify the user's input into one of five tiers:

Tier 1: Blank Project

Trigger: "create a blank project", "new project", "scaffold"
Action: Run the scaffold skill with stack detection. No PRD, no architecture.
Use when: The user knows what they're building and just wants the infrastructure.

Tier 2: Guided

Trigger: "I want to build...", "help me create...", description with questions
Action: Run prd skill (asks clarifying questions) → user approves → architect skill → user approves → archon skill
Use when: The user has an idea but needs help structuring it.
Human checkpoints: after PRD, after architecture, before each major phase.

Tier 3: Templated

Trigger: Describes a well-known app type ("a todo app", "a blog", "a dashboard with auth")
Action: Load template PRD if available → architect skill with template defaults → archon skill
Use when: The app shape is known and common. Speeds up the planning phase.
Template detection: check .citadel/_templates/app-types/ for matching templates. If no template matches, fall through to Tier 2.

Tier 4: Generated (Full Autonomy)

Trigger: "build me [detailed description]", "create [app] and deploy it"
Action: prd skill (minimal questions) → architect skill (auto-approve if confidence high) → archon skill with self-correction loop
Use when: The user wants maximum autonomy. Trust the pipeline.
Human checkpoints: after PRD only. Architecture and execution are autonomous.
Safety: all Archon self-correction mechanisms active. Direction alignment every 2 phases. Quality spot-checks every phase. Circuit breakers armed.

Tier 5: Feature Addition (Existing Codebase)

Trigger: user has an existing project + describes a feature, not a whole app ("add auth", "add a dashboard", "add payment processing", "add dark mode")
Detection: project has source files (src/, app/, lib/, package.json with deps) AND the user's description is a feature, not a standalone app
Action: prd skill in feature mode → architect skill in existing codebase mode → archon skill
Key differences from greenfield tiers:
- PRD reads existing codebase before asking questions
- Architecture describes changes to existing files, not a standalone system
- Phase 0 is always "Baseline" — record current typecheck/test state
- Every phase end condition includes "no new typecheck errors" + "existing tests pass"
- Risk register always includes "regression in existing functionality"
Human checkpoints: after feature spec (PRD). Architecture can auto-approve if the feature is well-scoped and all conditions are machine-verifiable.

Tier Classification

Protocol

Step 1: CLASSIFY

Read the user's input. Determine the tier.

Use plain language, not tier numbers. Announce what you'll do, not what tier they're in:

Tier 1: "I'll scaffold a blank project for you."
Tier 2: "I'll help you plan this out step by step. First I'll draft what we're building, then we'll agree on the approach before I write any code."
Tier 3: "This looks like a [type] app — I have a starting point for that. I'll show you the plan and you can adjust before I build."
Tier 4: "I'll plan this, show you the plan for approval, then build and verify it end to end."
Tier 5: "I'll read your existing codebase first, then plan how to add [feature] without breaking anything. You'll approve the plan before I touch any code."

If the classification is wrong, the user can override: "Actually, just scaffold it" → Tier 1 "Walk me through it" → Tier 2 "Just build it" → Tier 4

Step 2: EXECUTE TIER

Tier 1: Invoke the scaffold skill. Done.

Tier 2:

Invoke the prd skill with the user's description
Wait for user approval of the PRD
Invoke the architect skill with the approved PRD
Wait for user approval of the architecture
Create Archon campaign from the architecture
Execute campaign with standard Archon protocol
After each major phase: brief the user on progress

Tier 3:

Check .citadel/_templates/app-types/ for a matching template
If found: present the template PRD, ask "Does this match? What would you change?"
User approves or modifies → architect skill with template defaults
Brief architecture review (faster than Tier 2 since the shape is known)
Create and execute Archon campaign

Tier 4:

Invoke the prd skill in express mode (ask at most 1 question, or 0 if the description is complete)
User approves PRD (this is the only mandatory human checkpoint)
Invoke the architect skill (auto-approve if all end conditions are machine-verifiable)
Create Archon campaign with ALL safety systems active:
- Direction alignment every 2 phases
- Quality spot-check every phase
- Regression guard every build phase
- Anti-pattern scan every build phase
- Circuit breakers: 3 failures = new approach, 5+ type errors = park
- Phase end conditions must pass before proceeding
Execute autonomously until complete or parked
On completion: run full verification of all end conditions from the PRD
Present results to user

Tier 5 (Feature Addition):

Read the existing codebase — file tree, package.json, key entry points, existing patterns
Invoke the prd skill in feature mode (reads codebase before asking questions, max 2 questions)
User approves feature spec (one mandatory checkpoint)
Invoke the architect skill in existing codebase mode:
- Phase 0 is always "Baseline" — run typecheck and tests, record counts
- File tree shows only new + modified files
- Auto-approve if all conditions are machine-verifiable
Create Archon campaign. Every phase end condition includes:
- "No new typecheck errors vs baseline"
- "Existing tests pass"
Execute with all Archon safety systems active
On completion: verify all feature end conditions PLUS baseline regression check
Present results — what was added, what was verified, what still passes

Step 3: VERIFY (All Tiers except 1)

After the campaign completes:

Read the PRD's End Conditions
Check each one:
- Run commands, check file existence, invoke the live-preview skill for visual checks
Report:
- PASS: all end conditions met. App is v1-complete.
- PARTIAL: some conditions met, some failed. List what's missing.
- FAIL: critical conditions not met. Suggest next steps.

Step 4: DELIVER

Present the final state:

What was built (feature ledger from the campaign)
What was verified (end conditions that passed)
What needs attention (anything that failed or was sent to Review Queue)
How to run it (the command to start the app)
How to continue (what v2 could look like)
If deployed: "App is live at {URL}"
If deploy failed or skipped: "App verified locally. To deploy: {specific command}"
Suggest the postmortem skill to generate a campaign postmortem

Safety Systems

All of Archon's existing safety applies. Additionally:

PRD as contract: the PRD's end conditions are the acceptance criteria. The app is not "done" until those conditions pass. This prevents the 80% wall.
No stack lock-in: the architect skill chooses the stack based on the PRD, not a hardcoded default. Different apps get different stacks.
Graduated autonomy: Tier 2 has 3 human checkpoints. Tier 4 has 1. The user chooses how much control to keep.
Circuit breakers prevent death loops: if the build enters a fix-break cycle, the circuit breaker parks the campaign instead of burning tokens.

What create-app Does NOT Do

Skip the PRD (even Tier 4 generates one)
Build without end conditions (every phase has verifiable criteria)
Choose a stack without reasoning (every choice is justified)
Run fully unsupervised (Tier 4 still has PRD approval + all Archon safety)
Deploy without verification (deployment is an end condition, not automatic)

Quality Gates

PRD exists and is approved before any code is written
Architecture exists before Archon starts
Every campaign phase has machine-verifiable end conditions
Final verification checks all PRD end conditions
User receives a clear report of what was built and what needs attention

Adoption

special-place-administrator/create-app

$ install --global

Security Scan Results

SKILL.md

create-app — From Description to Verified Application

Identity

Tier Detection

Tier 1: Blank Project

Tier 2: Guided

Tier 3: Templated

Tier 4: Generated (Full Autonomy)

Tier 5: Feature Addition (Existing Codebase)

Tier Classification

Protocol

Step 1: CLASSIFY

Step 2: EXECUTE TIER

Step 3: VERIFY (All Tiers except 1)

Step 4: DELIVER

Safety Systems

What create-app Does NOT Do

Quality Gates

Related Skills

special-place-administrator/triage

special-place-administrator/test-gen

special-place-administrator/systematic-debugging

special-place-administrator/setup

special-place-administrator/create-app

$ install --global

Security Scan Results

SKILL.md

create-app — From Description to Verified Application

Identity

Tier Detection

Tier 1: Blank Project

Tier 2: Guided

Tier 3: Templated

Tier 4: Generated (Full Autonomy)

Tier 5: Feature Addition (Existing Codebase)

Tier Classification

Protocol

Step 1: CLASSIFY

Step 2: EXECUTE TIER

Step 3: VERIFY (All Tiers except 1)

Step 4: DELIVER

Safety Systems

What create-app Does NOT Do

Quality Gates

Related Skills

special-place-administrator/triage

special-place-administrator/test-gen

special-place-administrator/systematic-debugging

special-place-administrator/setup