skills/evidence-heavy-evaluator/SKILL.md
Generate an evidence-first, read-only repository evaluation report with deterministic scoring and actionable recommendations. Use when the user asks to assess readiness, maintainability, release-readiness, documentation gaps, or engineering health and wants auditable artifacts (`json` + `markdown` + raw command logs).
npx skillsauth add 0xsero/vllm-studio evidence-heavy-evaluatorInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Run a deterministic repo evaluation and emit auditable artifacts in test-output.
target_dir: repo or subdirectory to evaluate.profile: readiness, maintainability, or release-readiness.depth: quick or deep.execute_checks: include to run lint/test/typecheck/build evidence.skills/evidence-heavy-evaluator/scripts/collect_evidence.sh \
--target-dir <target_dir> \
--profile <profile> \
--depth <depth> \
[--execute-checks]
<target_dir>/test-output/evidence-heavy-evaluator/:readiness-scorecard.jsonreadiness-report.mdchecks-summary.tsvmetrics.tsvsignals.tsv--execute-checks is omitted, call out that quality execution criteria are not evaluated.Use references/criteria-matrix.md as the source of truth for scoring criteria and profile weights.
render_report.py after evidence collection.uv is required because render_report.py is executed with uv run.development
Generate beautiful, self-contained HTML pages that visually explain systems, code changes, plans, and data. Use when the user asks for a diagram, architecture overview, diff review, plan review, project recap, comparison table, or any visual explanation of technical concepts. Also use proactively when you are about to render a complex ASCII table (4+ rows or 3+ columns) — present it as a styled HTML page instead.
testing
Create, edit, improve, or audit AgentSkills. Use when creating a new skill from scratch or when asked to improve, review, audit, tidy up, or clean up an existing skill or SKILL.md file. Also use when editing or restructuring a skill directory (moving files to references/ or scripts/, removing stale content, validating against the AgentSkills spec). Triggers on phrases like "create a skill", "author a skill", "tidy up a skill", "improve this skill", "review the skill", "clean up the skill", "audit the skill".
testing
Host security hardening and risk-tolerance configuration for OpenClaw deployments. Use when a user asks for security audits, firewall/SSH/update hardening, risk posture, exposure review, OpenClaw cron scheduling for periodic checks, or version status checks on a machine running OpenClaw (laptop, workstation, Pi, VPS).
testing
Create, edit, improve, or audit AgentSkills. Use when creating a new skill from scratch or when asked to improve, review, audit, tidy up, or clean up an existing skill or SKILL.md file. Also use when editing or restructuring a skill directory (moving files to references/ or scripts/, removing stale content, validating against the AgentSkills spec). Triggers on phrases like "create a skill", "author a skill", "tidy up a skill", "improve this skill", "review the skill", "clean up the skill", "audit the skill".