engineering/skills/agentic-development/SKILL.md
Use for end-to-end software execution in an unfamiliar or complex repo: orienting the codebase, choosing an execution model, planning and verifying changes, reviewing architecture or PRs, improving agent-first harnesses, and coordinating work across the specialized `frontend` and `backend` engineering skills.
npx skillsauth add alvarovillalbaa/agent-suite agentic-developmentInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
This skill is the orchestration layer for software engineering work. It owns repo orientation, execution-mode selection, planning, review discipline, verification gates, release flow, cross-domain coordination, and agent-native architecture design.
<intake> ## What do you need?Pick a number or describe what you want. The routing table below maps your answer to the right references. </intake>
<routing> | Answer | Read | |--------|------| | 0, "brainstorm", "design", "before building", "what should we" | [brainstorming-design-gate.md](./references/brainstorming-design-gate.md) — explore context, one-at-a-time questions, 2-3 approach proposals, design approval gate | | 1, "implement", "execute", "feature" | Triage input → [subagents-and-parallelism.md](./references/subagents-and-parallelism.md); search learnings → [institutional-learnings.md](./references/institutional-learnings.md); orient → [repo-orientation.md](./references/repo-orientation.md) | | 2, "debug", "bug", "root cause", "error", "trace" | [debug-investigation.md](./references/debug-investigation.md), then [observability.md](./references/observability.md) for signal sources | | 3, "review", "PR", "code review" | [reviews-and-comments.md](./references/reviews-and-comments.md) — includes multi-agent persona selection and severity scale | | 4, "plan", "architect", "design", "system" | Search learnings first → [institutional-learnings.md](./references/institutional-learnings.md); then [prd-to-plan.md](./references/prd-to-plan.md), [system-design-workflows.md](./references/system-design-workflows.md), [architecture-analysis.md](./references/architecture-analysis.md), [codebase-architecture-language.md](./references/codebase-architecture-language.md) | | 5, "harness", "agent-readiness", "CI", "evals" | [harness-engineering.md](./references/harness-engineering.md) | | 6, "agent-native", "MCP", "tools", "first-class agents" | [agent-native-architecture.md](./references/agent-native-architecture.md) | | 7, "orchestrate", "cloud agents", "parallel agents" | [orchestrate-roles.md](./references/orchestrate-roles.md), then `/orchestrate` | | 8, "release", "merge", "deploy", "hotfix" | [land-and-deploy.md](./references/land-and-deploy.md), [hotfix-procedures.md](./references/hotfix-procedures.md) | | 9, "learn", "improve", "skill", "memory" | [institutional-learnings.md](./references/institutional-learnings.md) to capture findings; [self-improvement.md](./references/self-improvement.md), [skill-extraction.md](./references/skill-extraction.md) | | 10, "tech debt", "debt", "migrations", "dependency", "feature flags", "test coverage", "triage", "lint", "anti-pattern" | [tech-debt-cloud-agents.md](./references/tech-debt-cloud-agents.md) — 10 cloud-agent cadences for schedulable debt elimination, including lint-enforced anti-pattern elimination | </routing>../frontend/SKILL.md or ../backend/SKILL.md.BLOCKED or genuine ambiguity. See subagents-and-parallelism.md.DONE, DONE_WITH_CONCERNS, BLOCKED, or NEEDS_CONTEXT.../frontend/SKILL.md or ../backend/SKILL.md.quality-assurance before declaring done, code-documentation after implementing any public API surface, autoimprove or memory-management at session end. If the companion skill isn't installed, install it. If it is installed but hasn't run, run it now.| Scope / Complexity | Default execution model |
|--------------------|-------------------------|
| Small, single proof path | Direct execution |
| Medium, separable concerns | Subagents-driven development |
| Large, cross-domain feature | Team of agents |
| Epic, multi-iteration work | Supervised harness loop |
| Agent-first repo audit or harness improvement | Harness engineering pass |
| Distributed, multi-scope or context-exhausting epic | Cloud orchestration (/orchestrate) |
Read subagents-and-parallelism.md for controller/worker patterns, Parallel Safety Check, worktree isolation, git index safety, post-batch merge flow, two-stage review (spec compliance → code quality), implementer status handling (DONE/DONE_WITH_CONCERNS/BLOCKED/NEEDS_CONTEXT), and model routing for sub-agents.
Read subagent-prompt-templates.md for concrete prompt templates: implementer, spec compliance reviewer, and code quality reviewer. Use when dispatching sub-agents to avoid vague prompts.
Read repo-orientation.md for startup discovery, repo-shape detection, and initial command selection.
Read instruction-file-design.md when structuring or auditing AGENTS.md, CLAUDE.md, or other instruction surfaces.
Read harness-engineering.md when improving agent-first repo readiness: context architecture, subsystem guidance, mechanical enforcement, CI feedback, evals, observability, persistent knowledge, deslop passes, CI watching, compatibility scans, or background quality agents.
Read harness-loops.md for repeated agent iterations, disk-backed specs/plans, builder/reviewer loops, restore points, and sequential review pipelines with auto-decisions.
Read institutional-learnings.md when capturing a solved problem, searching docs/solutions/ before starting work in a documented area, or deciding whether a finding belongs in structured solutions vs prose lessons.
Read scratch-space.md when a harness run or skill needs to write intermediate files, checkpoints, delegation prompts, or any state that is not a final deliverable. Use this to avoid polluting the repo and to make state discoverable by sibling agents.
Read architecture-analysis.md when explaining how the system works, tracing dependencies, or evaluating change impact.
Read codebase-architecture-language.md when producing an architecture audit, writing AGENTS.md guidance, or describing structural problems in a PR or report. This reference defines the shared vocabulary: god objects, shotgun surgery, blast radius, layer violations, cohesion, coupling, lint gates, and the 7-dimension scoring rubric. Use this language consistently so findings can be compared across sessions.
Read architecture-patterns.md when choosing or comparing software architecture patterns.
Read system-design-workflows.md when designing a system, migration, API surface, or scaling plan from scratch.
Read interface-design.md when the right module or API shape is unclear and should be explored before implementation.
Read tech-decision-guide.md when the user needs to choose a technology.
Read interviewer-pattern.md before any non-trivial creation task where key constraints are still ambiguous.
Read prd-to-plan.md for phased implementation plans and tracer-bullet slicing.
Read refactor-planning.md for safe refactor decomposition, commit slicing, and RFC/issue planning.
Read debug-investigation.md for the full debug workflow: Triage → Investigate → Root Cause → Fix → Handoff. Includes assumption audits, causal chain gates, smart escalation, root cause tracing (trace backward through the call chain — never fix the symptom), defense-in-depth validation (validate at every layer), and condition-based waiting (replace arbitrary sleep() with real condition checks).
Read observability.md before adding logs or instrumentation, and whenever debugging should start from existing signals.
Read reviews-and-comments.md for code review, PR feedback, multi-agent persona selection, P0–P3 severity scale, and adversarial review passes.
Read code-simplification.md before opening a PR or after a long dev-loop: resolves scope from the branch diff, runs 3 parallel review agents (reuse/quality/efficiency), applies fixes, and verifies behavior is preserved.
Read specs-plans-tests.md when the workflow needs explicit separation between behavior spec, execution plan, and proof.
Read verification-and-finish.md before declaring success, opening a PR, or cleaning up a branch/worktree.
Read tdd-advanced.md for spec-first TDD, property-based testing, mutation testing, and autonomy limits.
Read tdd-framework-guide.md when setting up or changing a test framework.
Read tdd-ci-integration.md when wiring coverage into CI.
Read github-actions-templates.md, gitlab-ci-templates.md, and deployment-gates.md for baseline CI pipeline generation and deployment gates.
Read agent-native-architecture.md when building systems where agents are first-class citizens: the five principles (Parity, Granularity, Composability, Emergent Capability, Improvement Over Time), MCP tool design, action parity, architecture checklist, anti-patterns, and success criteria.
Read agent-native-architecture.md before any new agent-facing API, MCP server, or AI-powered product feature. Run the agent-native-auditor agent to score an existing system against the eight principles.
Read agentic-system-design.md when designing agent architectures from scratch.
Read prompt-engineering-patterns.md when writing or improving prompts.
Read llm-evaluation-frameworks.md when evaluating LLM or RAG output quality.
Read decision-principles.md when running autonomous loops that need to auto-decide intermediate choices, or when surfacing structured decision briefs to users. Use the decision classification (mechanical/taste/user-challenge) to determine whether to auto-decide silently, queue for a final gate, or pause and ask.
Read model-routing.md when spawning sub-agents to route tasks to the right model tier (Haiku for routine file ops, Sonnet for implementation, Opus for security/architecture). The 80/15/5 split achieves ~10× cost reduction vs defaulting everything to a premium model.
Read orchestrate-roles.md when distributing a large task across parallel cloud agents: planner/worker/verifier role separation, plan.json state management, Andon escalation, and when to use /orchestrate vs. /dev-loop.
Read collaboration-and-git.md for branch and PR workflow.
Read conventional-commits.md for commit message standards and release tooling.
Read release-workflow-comparison.md when choosing a branching strategy.
Read hotfix-procedures.md for emergency release flow.
Read land-and-deploy.md when the user asks to merge, land, or deploy.
Read tech-debt-cloud-agents.md when planning or executing debt elimination at scale: large-scale migrations, dependency updates, feature-flag retirement, test coverage fills, Sentry triage, design-system drift, monolith splits, docs sync, and automated bug reproduction. All nine patterns are designed to run as autonomous cloud-agent cadences — scheduled, parallelized, and reviewed as PRs rather than consuming sprint capacity.
Read self-improvement.md at session end or after recurring mistakes.
Read autoimprove.md for autonomous reference optimization.
Read skill-extraction.md when session learnings should become a reusable skill.
Read continual-learning.md to update durable repo/user memory in instruction files.
Read learning-system.md and its companion references when operating the repo-local learning/ system.
<reference_index>
Orientation and planning
references/repo-orientation.md — startup discovery, repo-shape detectionreferences/instruction-file-design.md — AGENTS.md/CLAUDE.md structurereferences/brainstorming-design-gate.md — design-first gate: explore, one-question-at-a-time, 2-3 proposals, approval before code ★ newreferences/prd-to-plan.md — phased implementation plans and tracer-bullet slicingreferences/refactor-planning.md — safe refactor decompositionreferences/interviewer-pattern.md — ambiguity resolution before non-trivial tasksArchitecture and design
references/architecture-analysis.md — tracing dependencies, evaluating change impactreferences/codebase-architecture-language.md — shared vocabulary for architecture audits: anti-patterns, health properties, enforcement terms, 7-dimension scoring rubric ★ newreferences/architecture-patterns.md — comparing software architecture patternsreferences/system-design-workflows.md — designing systems from scratchreferences/interface-design.md — exploring module or API shape before implementationreferences/tech-decision-guide.md — technology selectionExecution and parallelism
references/subagents-and-parallelism.md — Parallel Safety Check, worktree isolation, two-stage review, implementer status handling, model routingreferences/subagent-prompt-templates.md — concrete implementer, spec-reviewer, and code-quality-reviewer prompt templates ★ newreferences/harness-loops.md — disk-backed loops, builder/reviewer loops, restore pointsreferences/orchestrate-roles.md — planner/worker/verifier for cloud agentsAgent-native and AI systems
references/agent-native-architecture.md — five principles, MCP tool design, checklist, anti-patterns ★ newreferences/agentic-system-design.md — agent architecture from scratchreferences/prompt-engineering-patterns.md — prompt writing and improvementreferences/llm-evaluation-frameworks.md — LLM/RAG eval qualityreferences/decision-principles.md — mechanical/taste/user-challenge classificationTech debt and quality cadences
references/tech-debt-cloud-agents.md — 10 cloud-agent patterns for schedulable debt elimination (migrations, deps, flags, coverage, triage, design drift, monolith splits, docs, bug reproduction, lint-enforced anti-pattern elimination)Harness improvement
references/harness-engineering.md — audit dimensions, improvement patterns, scratch space disciplinereferences/observability.md — logs, traces, agent-legible debugging signalsreferences/institutional-learnings.md — docs/solutions/ schema, capture workflow, grep-first search ★ newreferences/scratch-space.md — OS-temp vs .context/ vs durable; decision tree for intermediate state ★ newDebugging
references/debug-investigation.md — phased debug workflow: Triage, Investigate, Root Cause, Fix, Handoff; assumption audits, causal chain gates; root cause tracing (backward call-chain tracing); defense-in-depth validation; condition-based waiting for flaky tests; find-polluter.sh bisection scriptReview, testing, and CI
references/reviews-and-comments.md — multi-agent code review, P0–P3 severity scale, persona selection, adversarial passesreferences/code-simplification.md — 3-agent simplification pass (reuse/quality/efficiency) before PR; scope resolution, verificationreferences/specs-plans-tests.md — behavior spec, plan, proof separationreferences/verification-and-finish.md — success gates before declaring donereferences/tdd-advanced.md — spec-first TDD, property-based testingreferences/tdd-framework-guide.md — test framework setupreferences/tdd-ci-integration.md — coverage wired to CIreferences/github-actions-templates.md, references/gitlab-ci-templates.md — CI baselinesreferences/deployment-gates.md — deployment gate patternsGit, release, and landing
references/collaboration-and-git.md — branch and PR workflowreferences/conventional-commits.md — commit standardsreferences/release-workflow-comparison.md — branching strategy selectionreferences/hotfix-procedures.md — emergency release flowreferences/land-and-deploy.md — merge and deployLearning and self-improvement
references/self-improvement.md — session-end learningreferences/autoimprove.md — autonomous reference optimizationreferences/skill-extraction.md — session learnings → reusable skillreferences/continual-learning.md — durable repo/user memory updatesreferences/learning-system.md — learning/ folder system
</reference_index>This skill should reference the specialized engineering skills repeatedly and intentionally:
../frontend/SKILL.md.../backend/SKILL.md.../frontend/SKILL.md and ../backend/SKILL.md.../frontend/SKILL.md.../backend/SKILL.md.Review agents (dispatch in parallel for multi-agent code review):
references/agents/correctness-reviewer.md — logic errors, off-by-one, null propagation, race conditions, broken error propagation; always-on for any code review.references/agents/adversarial-reviewer.md — chaos engineering perspective; constructs failure scenarios from assumption violations, composition failures, cascades, and abuse cases; conditional (diff ≥50 lines or high-risk domain).references/agents/performance-oracle.md — algorithmic complexity, N+1 queries, memory, caching, network, frontend bundle; conditional (DB-touching or data-scale code).references/agents/architecture-strategist.md — layer violations, coupling, SOLID, structural decisions; conditional (new services, refactors, cross-cutting concerns).Harness and CI agents:
references/agents/ci-watcher.yaml — watches CI for the current branch's PR; classifies failures as FLAKY, REAL, or INFRA; feeds /fix-ci.references/agents/compatibility-scanner.yaml — orchestrates the four-phase compatibility pass (harness scan, docs reliability, startup review, validation efficiency); feeds /check-agent-compat.Memory and improvement agents:
references/agents/agents-memory-updater.md — mines transcript deltas and updates CLAUDE.md / AGENTS.md with durable preferences and workspace facts.references/agents/learnings-researcher.md — grep-first search of docs/solutions/ for applicable past learnings before planning or implementing in a documented area. ★ newOrchestration agents:
references/agents/orchestrate.yaml — acts as Dispatcher for /orchestrate; spawns a cloud Planner, monitors progress, handles Andon escalations, and reconciles the final handoff.references/agents/agent-native-auditor.md — runs 8 parallel sub-agents scoring an existing codebase against agent-native principles (Action Parity, Tool Primitives, Context Injection, Shared Workspace, CRUD Completeness, UI Integration, Capability Discovery, Prompt-Native Features); produces a scored report with prioritized recommendations.scripts/repo_scan.py for repo scanning and startup context.scripts/harness_audit.py for agent-first readiness audits across instructions, CI, evals, observability, enforcement, and persistent knowledge.scripts/find-polluter.sh for test isolation bisection — finds which test creates unwanted files or state by running tests one-by-one until the polluter is identified.scripts/project_architect.py for architecture detection and layer violations.scripts/architecture_diagram_generator.py for Mermaid, PlantUML, or ASCII diagrams.scripts/stack_detector.py and scripts/pipeline_generator.py for baseline CI generation.scripts/coverage_analyzer.py, scripts/fixture_generator.py, and scripts/tdd_workflow.py for verification workflows.scripts/prompt_optimizer.py, scripts/rag_evaluator.py, and scripts/agent_orchestrator.py for LLM/agent systems work.scripts/init-learning.sh, scripts/init-learning.py, scripts/capture-item.py, scripts/scan-learning.py, and scripts/refresh-learning.py for the learning system.../frontend/scripts/.../backend/scripts/.docs/solutions/ for prior decisions and pitfalls in the affected area. See institutional-learnings.md or dispatch the learnings-researcher agent.../frontend/SKILL.md, ../backend/SKILL.md, or both when domain-specific detail is needed.docs/solutions/. Surface follow-up risks, rollout notes, and cleanup for the next engineer.Three slash commands come with this skill. Wire them from engineering/commands/.
/dev-loopStarts an agentic development loop in the current session. The stop hook re-injects the original task prompt each iteration until a <promise> completion tag is output or --max-iterations is reached.
/dev-loop Fix the auth bug --completion-promise 'FIXED' --max-iterations 10
/dev-loop Add test coverage --verify-cmd 'npm test' --completion-promise 'DONE'
/dev-loop Improve API layer --spec-file TASK.md --max-iterations 20
Options: --max-iterations N, --completion-promise TEXT, --verify-cmd CMD, --spec-file PATH.
/cancel-dev-loopCancels an active dev loop by removing .claude/agentic-dev-loop.local.md.
/check-agent-compatRuns the four-phase compatibility scan before extending agent autonomy: harness score, docs reliability, startup path, and validation efficiency. Produces a P0/P1/P2 finding list. Start here before running a long harness-loop on an unfamiliar repo.
/check-agent-compat [repo-root]
/fix-ciWatches CI for the current branch, classifies failures (FLAKY/REAL/INFRA), and applies targeted fixes in a watch → identify → fix → re-watch loop. Uses the ci-watcher agent internally.
/fix-ci [--max-iterations N] [--check <check-name>]
/harness-loopStarts a harness engineering improvement loop. Each iteration runs harness_audit.py, picks the highest-priority P0/P1 finding, enforces it as a CI gate or structural test, and commits. Stops when a fresh audit shows no P0 or P1 items.
/harness-loop --max-iterations 20
/harness-loop path/to/repo --max-iterations 15
/orchestrateDecomposes a large goal into parallel cloud-agent tasks using the planner/worker/verifier model. The local IDE session acts as the Dispatcher; a Planner cloud agent owns scope decomposition; Worker agents execute sub-tasks in isolation. Andon signals pause the pipeline and escalate to the user.
/orchestrate "Migrate the legacy auth module to the new SDK, add test coverage, and update AGENTS.md"
Use when a task would exhaust a single agent's context budget, benefits from parallelism, or spans independently verifiable sub-scopes. Read orchestrate-roles.md before invoking.
This skill ships four hooks. Wire them in Claude Code settings, replacing /absolute/path/to/agentic-development with the actual path.
hooks/check-completion.sh operates in two modes:
/dev-loop or /harness-loop): checks .claude/agentic-dev-loop.local.md for active loop state, parses the JSONL transcript for the last assistant message, detects <promise> completion tags, and re-injects the original task prompt via reason each iteration.hooks/pre-tool.sh logs every Bash, Write, and Edit invocation to stderr.hooks/post-bash.sh captures non-zero exit codes into memory/working/last_error.json.hooks/session-end.sh writes memory/working/session_end.json so the self-improvement phase can detect a finished session.development
Use for frontend engineering work such as components, routes, state management, accessibility, performance, design-system integration, and browser-facing debugging or refactors.
development
This skill should be used when the user asks to write, update, review, scaffold, move, remove, or continuously improve documentation for code, folders, services, repos, workflows, architectural decisions, or operational processes. Trigger for inline docs, `README.md`, `ARCHITECTURE.md`, `TESTS.md`, `SETUP.md`, `RUNBOOK.md`, `CHANGELOG.md`, `SECURITY.md`, `OVERVIEW.md`, `FAQ.md`, `DECISIONS.md`, `DEPENDENCIES.md`, `AGENTS.md`, `PLAN.md`, `SPEC.md`, `SOUL.md`, `PRINCIPLES.md`, `DESIGN.md`, `logs/`, `lessons/`, `items/`, `fixes/`, `audits/`, `raw/`, `plans/`, `specs/`, `sources/`, `lib/`, `references/`, `cookbook/`, `knowledge/`, `runbooks/`, `research/`, `official-documentation/`, `context/`, MDX docs, JSDoc/TSDoc, docstrings, ADRs, post-mortems, migration guides, documentation cleanups, and documentation-impact reviews.
tools
Cross-cloud CLI-first cloud operations for AWS, Azure, and GCP. Use when the assistant needs to identify which cloud provider or multi-cloud estate a repo uses, deploy new resources or services, wire automatic deployments, inventory and optimize infrastructure, or diagnose and repair cloud failures entirely from the terminal, with explicit approval gates for high-cost, destructive, identity-sensitive, or hard-to-reverse changes. Covers AWS Amplify full-stack projects, serverless workloads (Lambda, API Gateway, Step Functions, SAM, CDK), and the full AWS database portfolio (RDS, Aurora, Aurora DSQL, DynamoDB, ElastiCache), as well as deep Azure references for diagnostics, storage, compute, compliance, identity, Foundry, and cross-cloud migrations.
development
Use for backend engineering work such as APIs, services, data models, persistence, queues, caching, auth, background jobs, and server-side debugging or refactors.