agentharness-audit/SKILL.md
Audits any agent harness codebase against 13 meta-principles of quality, scores each 0 to 10 with code citations, generates a self-contained HTML report for operators, and writes an agent-readable AGENTS.md improvement directive with file-level architecture recommendations to reach 9.5 or better. Use when the user wants to audit or rate an agent harness, evaluate harness quality, score an agent framework, get a roadmap for improving a harness, run agentharness-audit, audit this harness, rate Hermes, audit Codex, or audit a Claude Code harness. Not for individual skill audits or general code review.
npx skillsauth add ingpoc/skills agentharness-audit
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Self-validate after edits. Any change to this skill's files must be followed by ./scripts/validate.sh from the skill directory.
Audits an agent harness codebase against 13 evidence-based meta-principles derived from comparative analysis of Hermes, Claude Code, Codex CLI, and OpenClaw. Each principle is scored 0–10 with code citations. The operator receives a print-ready Tufte HTML report; the harness itself receives a concrete self-improvement directive in AGENTS.md format.
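The scoring model described above can be sketched as a small data structure. This is a minimal illustration, not the skill's actual implementation: the names `PrincipleScore`, `below_target`, and the example principle labels are hypothetical, and only the 0 to 10 range, the code citations, and the 9.5 target come from the description.

```python
from dataclasses import dataclass, field

TARGET_SCORE = 9.5  # improvement target named in the skill description


@dataclass
class PrincipleScore:
    principle: str  # hypothetical label; the real 13 principles are defined by the skill
    score: float    # 0 to 10
    citations: list[str] = field(default_factory=list)  # e.g. "src/loop.py:42"

    def __post_init__(self) -> None:
        # Reject scores outside the documented 0 to 10 range.
        if not 0 <= self.score <= 10:
            raise ValueError(f"score out of range for {self.principle}: {self.score}")


def below_target(scores: list[PrincipleScore],
                 target: float = TARGET_SCORE) -> list[PrincipleScore]:
    """Return the principles scoring under the target, lowest score first."""
    return sorted((s for s in scores if s.score < target), key=lambda s: s.score)


if __name__ == "__main__":
    # Hypothetical example data, not results from a real audit.
    example = [
        PrincipleScore("token-efficiency", 7.0, ["src/loop.py:42"]),
        PrincipleScore("operator-accessibility", 9.5),
    ]
    for item in below_target(example):
        print(item.principle, item.score, item.citations)
```

Sorting the shortfalls lowest first is one plausible way to order an improvement roadmap; the skill itself may prioritize differently.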
Full step-by-step: references/procedure.md
Copy templates/audit.html.template to docs/agentharness-audit/<name>-audit.html and fill placeholders using references/html-template.md. Do not regenerate the report from scratch or require loading another skill.
Write the improvement directive to docs/agentharness-audit/<name>-AGENTS.md (references/agents-template.md).
If no path is specified, audit the current working directory.
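The output locations and the default audit target named above can be sketched as a small path helper. The function name `resolve_audit_paths` is hypothetical, and placing `docs/` under the audited directory is an assumption; the source only gives the relative paths.

```python
from pathlib import Path
from typing import Optional


def resolve_audit_paths(name: str, target: Optional[str] = None) -> dict[str, Path]:
    """Derive the audit target and the two output files for a harness called `name`."""
    # Fall back to the current working directory when no path is specified.
    root = Path(target) if target else Path.cwd()
    # Assumption: outputs live under the audited directory's docs/ folder.
    out_dir = root / "docs" / "agentharness-audit"
    return {
        "target": root,
        "report": out_dir / f"{name}-audit.html",    # filled from templates/audit.html.template
        "directive": out_dir / f"{name}-AGENTS.md",  # follows references/agents-template.md
    }


if __name__ == "__main__":
    for label, path in resolve_audit_paths("hermes").items():
        print(f"{label}: {path}")
```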
html-artifact is provenance only; do not depend on loading it during an audit run.
Agent harnesses are often evaluated on marketing claims rather than on the mechanisms that determine token efficiency, self-improvement, and operator accessibility. This skill makes those mechanisms legible and scoreable, enabling evidence-based harness selection and a concrete improvement roadmap that the harness agent can execute autonomously.