plugins/agile-workflow/skills/ln-310-multi-agent-validator/SKILL.md
Use when validating Stories, plans, or tasks through the evaluation platform with mandatory research, parallel evidence lanes, sequential merge, and bounded refinement. Modes: story | plan_review.
npx skillsauth add levnikolaevich/claude-code-skills ln-310-multi-agent-validatorInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Paths: File paths (
references/,../ln-*) are relative to this skill directory.
Type: L2 Coordinator Category: 3XX Planning
Evaluation-platform coordinator for:
mode=storymode=plan_reviewThis skill uses the evaluation platform for:
| Input | Required | Source | Description |
|-------|----------|--------|-------------|
| storyId | mode=story | args, git branch, kanban, user | Story to validate |
| plan {file} | mode=plan_review | args or auto | Plan file to validate |
Mode detection:
plan or plan {file} -> mode=plan_reviewmode=storyMANDATORY READ: Load references/environment_state_contract.md, references/storage_mode_detection.md, references/input_resolution_pattern.md
MANDATORY READ: Load references/evaluation_coordinator_runtime_contract.md, references/evaluation_summary_contract.md, references/evaluation_parallelism_policy.md, references/evaluation_research_contract.md
MANDATORY READ: Load references/agent_delegation_pattern.md
MANDATORY READ: Load references/penalty_points.md
MANDATORY READ: Load references/researchgraph_mcp_usage.md when researchgraph files changed or the target claims hypothesis, goal, benchmark, or proposal readiness.
Conditional read: load references/phase2_research_audit.md only when the coordinator performs inline criteria mapping instead of consuming ln-312 findings summaries.
Agent review policy: run health check, record skipped reason when no advisor is available, verify every advisor claim before merge, and treat transport/auth/tool failures as operator evidence rather than domain findings. Load references/agent_review_workflow.md only when debugging lifecycle/liveness details outside the evaluation runtime.
The coordinator uses these evaluation workers:
ln-311-review-research-workerln-312-review-findings-workerln-313-review-docs-workerln-314-review-repair-workerln-315-review-merge-workerln-316-review-refinement-workerHost Skill Invocation: Skill(skill: "...", args: "...") is mandatory delegation.
SKILL.md, treat args as $ARGUMENTS, execute that skill workflow, then return here with its result/artifact.Use the Skill tool for delegated workers. Do not inline worker logic inside the coordinator.
TodoWrite format (mandatory):
Resolve target and build runtime manifestLoad target artifacts and metadataLaunch external agents and verify healthRun research and findings workers in parallelGenerate documentation updatesApply accepted low-risk repairsSync agents and merge all evidenceRun refinement (MANDATORY in ALL modes when advisor available — do NOT skip)Compute verdict and write review outputVerify runtime cleanup and self-checkRepresentative invocations:
Skill(skill: "ln-311-review-research-worker", args: "{identifier} research")
Skill(skill: "ln-312-review-findings-worker", args: "{identifier} findings")
Skill(skill: "ln-313-review-docs-worker", args: "{identifier} docs")
Skill(skill: "ln-314-review-repair-worker", args: "{identifier} repair")
Skill(skill: "ln-315-review-merge-worker", args: "{identifier} merge")
Skill(skill: "ln-316-review-refinement-worker", args: "{identifier} refinement")
MANDATORY READ: Load references/loop_health_contract.md
Runtime family:
evaluation-runtimeIdentifier:
story-{storyId} for story modeplan-{slug} for plan reviewPhase order:
PHASE_0_CONFIGPHASE_1_DISCOVERYPHASE_2_AGENT_LAUNCHPHASE_3_EVIDENCE_LANESPHASE_4_DOCSPHASE_5_REPAIRPHASE_6_MERGEPHASE_7_REFINEMENTPHASE_8_APPROVALPHASE_9_SELF_CHECKPhase policy:
delegate_phases = [PHASE_3_EVIDENCE_LANES, PHASE_4_DOCS, PHASE_5_REPAIR, PHASE_6_MERGE, PHASE_7_REFINEMENT]aggregate_phase = PHASE_6_MERGEreport_phase = PHASE_8_APPROVALcleanup_phase = PHASE_9_SELF_CHECKself_check_phase = PHASE_9_SELF_CHECKagent_resolve_before = [PHASE_6_MERGE]required_phases_when_advisor_available = [PHASE_7_REFINEMENT]Allowed overlap:
ln-311ln-312Sequential only:
ln-313ln-314ln-315ln-316mode, identifier, and storage mode.expected_agentsrequired_research=truephase_orderphase_policynode references/scripts/evaluation-runtime/cli.mjs start \
--skill ln-310 \
--identifier {identifier} \
--manifest-file .hex-skills/evaluation/{identifier}_manifest.json
mode=story, resolve Story and child tasks.mode=plan_review, resolve the plan file.H##, G##, run IDs, benchmark manifests, or readiness claims, run read-only researchgraph verification/audits and attach the result as validation evidence..hex-skills/environment_state.json.agents_skipped_reasonnode references/scripts/evaluation-runtime/cli.mjs register-agent \
--skill ln-310 \
--identifier {identifier} \
--agent {name} \
--prompt-file {promptPath} \
--result-file {resultPath} \
--metadata-file {metadataPath}
health_check_done, agents_available, agents_required, and optional agents_skipped_reason.rate_limited, tool_missing, auth_missing, permission_denial, and asked_question are transport/operator states.NO-GO without domain evidence from artifacts or findings.This phase is the mandatory parallel evidence barrier.
worker_plan with:
ln-311 lane research (mandatory)ln-312 lane findings (mandatory)node references/scripts/evaluation-runtime/cli.mjs record-worker-result \
--skill ln-310 \
--identifier {identifier} \
--payload-file {childSummaryArtifactPath}
Research is mandatory in every mode:
For mode=story, findings must still produce penalty-point evidence and coverage analysis.
mode=story, run ln-313-review-docs-worker when documentation changes are required.mode=plan_review, skip only when there is no documentation delta to create.ln-314-review-repair-worker.Preconditions:
Steps:
node references/scripts/evaluation-runtime/cli.mjs sync-agent --skill ln-310 --identifier {identifier}
ln-315-review-merge-worker.aggregation_summary.NEVER SKIP THIS PHASE. Phase 7 applies to ALL modes:
story,plan_review. The ONLY valid skip reason is no advisor available in health check. Mode is NOT a skip reason. Complexity is NOT a skip reason. Time is NOT a skip reason. If you are about to checkpoint Phase 7 without running ln-316 while an advisor is available — STOP. You are making an error.
| Mode | Phase 7 required? | Skip allowed? |
|------|-------------------|---------------|
| story | YES | NO (only if no advisor available) |
| plan_review | YES | NO (only if no advisor available) |
Phase 7 is MANDATORY when an advisor is available. The coordinator MUST NOT checkpoint Phase 7 without a recorded review-refinement worker summary from ln-316. The runtime advance command will reject the transition if an advisor was available in health check but no refinement summary exists.
Run ln-316-review-refinement-worker. Refinement uses a 2-stage state machine:
Rules:
agent_runner.mjs (NOT host-native sub-agents)failure_class, progress_signals, and session_usable from agent_runner.mjs; classified transport failures pause/defer instead of becoming domain findingssync-agent; Claude hosts may use Monitor for observabilityStory mode:
| Metric | Before | After | Meaning |
|--------|--------|-------|---------|
| Penalty Points | from ln-312 | from ln-314 | 0 = all fixed |
| Readiness Score | clamp(1,10,10-floor(before/5)) | clamp(1,10,10-floor(after/5)) | Quality (1-10) |
| Anti-Hallucination | — | from ln-311 | VERIFIED/FLAGGED |
| AC Coverage | — | N/N | 100% = pass |
| Gate | — | GO/NO_GO | Final verdict |
GO = penalty_after=0 AND no FLAGGED items AND ac_coverage=100%NO_GO = otherwiseNO_GONO_GOGO: mutate Story status to Todo; update kanban_board.md to APPROVED.NO_GO.Plan mode:
Write coordinator summary:
node references/scripts/evaluation-runtime/cli.mjs record-summary \
--skill ln-310 \
--identifier {identifier} \
--payload '{...evaluation-coordinator summary...}'
Required checks:
Then:
node references/scripts/evaluation-runtime/cli.mjs complete --skill ln-310 --identifier {identifier}
Coordinator summary kind:
evaluation-coordinatorRecommended payload fields:
statusfinal_resultreport_pathworker_countagent_countissues_totalseverity_countswarningscleanup_verifiedresearch_completedpenalty_beforepenalty_afterreadiness_scoreac_coveragegate (GO/NO_GO)flagged_itemsevaluation-coordinator summary writtenOptional reference: load references/meta_analysis_protocol.md only when the user asks for post-run meta-analysis or protocol-formatted run reflection.
When requested after the coordinator run, analyze the session per protocol section 7 and include the protocol-formatted output with the final review result.
references/evaluation_coordinator_runtime_contract.md, references/evaluation_summary_contract.mdreferences/evaluation_research_contract.md, references/research_tool_fallback.md, references/plan_review_pipeline.mdreferences/evaluation_parallelism_policy.md../ln-311-review-research-worker/SKILL.md, ../ln-312-review-findings-worker/SKILL.md, ../ln-313-review-docs-worker/SKILL.md, ../ln-314-review-repair-worker/SKILL.md, ../ln-315-review-merge-worker/SKILL.md, ../ln-316-review-refinement-worker/SKILL.mdreferences/phase2_research_audit.md, references/penalty_points.mdreferences/cross_reference_validation.md, references/dependency_validation.md, references/domain_patterns.md, references/templates/mcp_ref_findings_template.md, references/premortem_validation.md, references/quality_validation.md, references/risk_validation.md, references/solution_validation.md, references/standards_validation.md, references/structural_validation.md, references/traceability_validation.md, references/workflow_validation.mdVersion: 8.0.0 Last Updated: 2026-03-22
testing
Audits architecture config boundaries: typed settings, scattered env reads, config leakage, and layer ownership. Use for config architecture.
tools
Finds architecture-level modernization opportunities: obsolete custom mechanisms, overbuilt extension points, and simplifiable architecture. Use when auditing architecture evolution.
development
Builds dependency topology, detects cycles, validates import rules, and calculates coupling metrics. Use when auditing architecture topology.
testing
Checks layer, resource ownership, and orchestration boundaries. Use when auditing architecture boundary enforcement.