skills/mb-harness/SKILL.md
Set up deterministic commands, worktrees, and quality gates so agents can run safely in this repository.
npx skillsauth add mrvladd-d/memobank mb-harnessInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Turn the repo into a reliable “harness” for agents:
If you use Codex:
.codex/ folder..codex/config.toml from assets/codex-config.toml.Usage examples:
codexcodex --profile deep-reviewIn AGENTS.md (keep it short), list the canonical commands (examples):
If the repo has UI or browser flows, explicitly document:
If the repo lacks them, add minimal scripts/Make targets.
If multiple agents work in parallel:
Example:
git worktree add ../wt-agent-1 -b agent-1
If not already present, install the packaged Memory Bank gate assets:
scripts/mb-lint.mjs from mb-garden/assets/mb-lint.mjsscripts/mb-doctor.mjs from mb-garden/assets/mb-doctor.mjsThe mb-garden/assets path is the current packaged asset location, not conceptual ownership of the doctor role. mb-lint covers structural/mechanical hygiene. mb-doctor covers workflow/autonomous readiness over mb-lint. Run default mb-doctor for ordinary pre-queue health checks. Run strict mode only after the JSON task queue exists: after /prd-to-tasks, before scheduler execution inside /autonomous, or before /autopilot when the queue is already prepared:
node scripts/mb-lint.mjs
node scripts/mb-doctor.mjs --strict
The harness should treat missing task.tier as a blocking policy error. Task state is JSON-only through .memory-bank/tasks/index.json and indexed TASK-*.task.json records; backlog.md, markdown task cards, and old risk / risk.level routing are unsupported.
If the product has a UI:
.tasks/TASK-XXX/.memory-bank/testing/index.mdIf you iterate on skills heavily:
codex exec --json runs + deterministic graders (see OpenAI evals guidance).codex/config.toml exists (if using Codex) with coding + review profiles.testing
Verify one TASK-* against acceptance criteria and record reproducible evidence.
testing
Review a Memory Bank with fresh-context specialists and produce a prioritized fix list.
testing
Adversarial semantic verification for one TASK-* so teams can catch solutions that pass process checks but are still wrong in substance.
development
Map an existing codebase into an as-is Memory Bank without inventing roadmap items.