/audit-undocumented - Undocumented Decisions Audit

Discover design decisions that exist in the code but have no ADR. Each candidate is challenged by critic-design (Step 6.2) before promotion. The final list is what survived the adversarial pass, not what the initial scan found. Produce a ranked list of ADR promotion candidates so the next refactor has a complete decision baseline.

When to Use

Repo has accumulated implicit decisions that are not in ADRs
Before refactoring, you want to know which decisions to preserve vs question
New maintainers need a map of "why is the code shaped this way"

This skill is complementary to /audit-adr-drift. Run drift scan first if ADRs exist; the gaps it cannot answer become the input here.

For cross-repo scope: cluster detection on ADRs already tagged with the scope frontmatter (YAML or - Scope: bullet), invoke scripts/audit-adr-scopes.py directly. That sibling script aggregates ADRs from ~/.claude and ~/GitHub/{cli,apps,plugins}/*/docs/decisions/ and surfaces same-scope title clusters (horizontal-expansion candidates).

Input

$ARGUMENTS holds the full argument string. Parse before use:

Split on whitespace into tokens
--threshold=N: integer, default 400 if unset or non-numeric (must be > 0)
--paths=p1,p2,...: comma-separated paths to documents. If unset, auto-detect via Step 2 table
--no-challenge: skip the critic-design challenge step (default: challenge runs)
Reject unknown flags with an explicit error rather than silently ignore
All flags optional; with no input, auto-detect everything and run challenge

Execution

| Step | Action | | ---- | ------------------------------------------------------------------------------------------ | | 1 | Detect large files exceeding threshold (default >400 lines) | | 2 | Detect prose documents likely to contain decisions (README, CONTRIBUTING, etc.) | | 3 | Run language-appropriate reviewer on each large file; cross-reference with existing ADRs | | 4 | Extract decision-shaped sentences from prose documents; cross-reference with existing ADRs | | 5 | Tag each candidate with impact (H/M/L) and reversibility (high/medium/low) | | 6 | Rank initial candidates; spawn critic-design to challenge them (skip if --no-challenge) | | 7 | Write report to docs/audit/<YYYY-MM-DD>-undocumented-decisions.md | | 8 | List post-challenge ADR promotion candidates as follow-up issues |

Step 1: Large File Detection

# Find source files exceeding the threshold
# NOTE: Do not use awk '$1 > t' here. SKILL loader expands `$1` to the second
# whitespace-token of $ARGUMENTS (0-indexed split). Use find+while instead.
bfs <repo-root> -type f \( -name '*.rs' -o -name '*.ts' -o -name '*.tsx' -o -name '*.py' -o -name '*.go' -o -name '*.swift' \) \
  -not -path '*/target/*' -not -path '*/node_modules/*' -not -path '*/.git/*' \
  -print0 \
  | while IFS= read -r -d '' file; do
      lines=$(wc -l < "$file")
      [ "$lines" -gt "$THRESHOLD" ] && printf '%s %s\n' "$lines" "$file"
    done | sort -rn

Default threshold is 400 lines (one-screen ceiling, see rules/development/THRESHOLDS.md).

Step 2: Document Detection

Scan top-level and docs/ for files matching:

| Filename pattern | Likely content | | -------------------------------------------------------------------- | ------------------------------------ | | README.md | Design intent, naming, prohibitions | | CONTRIBUTING.md | Code style decisions, workflow rules | | SECURITY.md | Threat boundaries, security policies | | THREAT_MODEL.md | Trust boundaries, mitigations | | ARCHITECTURE.md | Module decomposition, layer policy | | DESIGN.md / *.design.md | Component rationale | | CLAUDE.md / AGENTS.md | AI-agent operating decisions | | Makefile / justfile | Build flow decisions | | Linter config (Cargo.toml [lints.*], .eslintrc, oxlint.json) | Rule selection rationale |

Step 3: Large File Decision Mining

For files exceeding 800 lines (2x threshold), run a triage sub-step first: spawn the reviewer with a "scan first 200 lines and estimate finding density (findings per 100 lines)" instruction, then AskUserQuestion with three options:

Continue full scan
Truncate to first 600 lines (covers module preamble and main types)
Skip this file (record as skipped_for_size)

For files at or below 800 lines, proceed directly to full scan.

For each large file (full or truncated scope), spawn the matching reviewer via Task (same routing as /audit-adr-drift Step 5). The reviewer answers:

Why is this file shaped at this granularity?
What invariant or contract does it carry that a reader cannot derive from the code?
Is there a comment or module-doc that already records the rationale?
Does the comment describe the current state but omit the rule for future contributors? (this is the incomplete-contract pattern)

Findings format: file:line + decision summary + evidence (code comment, naming, module-doc) + documented? (Yes/Partial/No) + incomplete-contract? (Yes/No).

The incomplete-contract flag captures findings where the code has a comment describing what is true, but not what must remain true. Example: an SSRF-safe HTTP client field is annotated "redirect disabled for SSRF" but no rule says "future commands handling user URLs MUST use this client." This pattern is common with security invariants and design rationale that depend on the reader inferring "and this should stay this way." Findings with incomplete-contract=Yes are strong ADR candidates regardless of documented? value, because the missing rule is what an ADR uniquely provides.

After collecting findings, cross-reference each one against the ADR directory (if any). Drop findings already covered by an ADR (record count as "ADR-covered (excluded)" in the summary).

Step 4: Prose Decision Extraction

For each detected document, find sentences containing decision verbs and check for ADR coverage:

Decision verbs (English): "decide", "choose", "adopt", "reject", "deprecate", "must not", "never", "always"
Decision verbs (Japanese): "決定", "採用", "禁止", "方針", "選定", "排除", "従う", "規約"

Each match is a candidate. Then for each candidate, search the ADR directory (if any) for cross-reference. Drop candidates already covered by an ADR.

External ADR Dependency Detection

In addition to prose docs, scan source code for per ADR-NNNN / see ADR-NNNN / ADR-NNNN reference patterns:

ugrep -r -n -E "(per |see |governed by )?ADR-[0-9]{4}" --include="*.rs" --include="*.ts" --include="*.tsx" --include="*.py" --include="*.go" --include="*.md" --include="*.toml" .

For each captured ADR id, check whether the matching NNNN-*.md exists in the local ADR directory:

Local match → existing ADR coverage, skip
No local match → record as External ADR Dependency candidate

External ADR dependencies are flagged for promotion (write a scout-local ADR that supersedes or imports the external decision). This is impact=H by default because cross-repo ADR drift is silent and hard to detect after the fact.

Step 5: Impact + Reversibility Tagging

| Impact | Criteria | | ------ | --------------------------------------------------------------------- | | H | Crosses module boundary, affects public API, or governs 2+ subsystems | | M | Affects single module's internal contract | | L | Local style or naming choice |

| Reversibility | Criteria | | ------------- | ---------------------------------------------------------------- | | high | Decision can be reversed by editing one location | | medium | Reversal requires coordinated changes across 2-5 files | | low | Reversal requires migration, deprecation cycle, or schema change |

Step 6: Ranking and Challenge

6.1 Initial Ranking

ADR promotion candidates = (impact=H) AND (reversibility=low OR medium).

Findings with incomplete-contract=Yes are also promoted regardless of documented? value, because the missing rule for future contributors is precisely what an ADR records (and a comment, by itself, cannot).

Other findings are recorded but not promoted (informational).

6.2 Devil's Advocate Challenge

Skip this sub-step if --no-challenge is set.

Spawn critic-design via Task with the initial promotion candidate list. The agent challenges each candidate with:

Does a future contributor actually benefit from the rule? Who is the reader?
Can a code comment + test suffice instead of an ADR?
Does ADR risk lock-in (over-documenting decisions that should evolve)?
For statement-of-fact configs (deny.toml, Cargo.toml lints): is the config file itself already the source of truth, making an ADR redundant?
For monolithic-boundary candidates: would the ADR justify the status quo and reduce pressure to split?
Is there already an enforcement mechanism (type system, lint, test) that makes the rule mechanical, leaving the ADR with nothing to add?
Bug vs Invariant: is this candidate describing a fix-the-bug case (current code is wrong and should change) or an invariant-to-document case (current code is intentional and should be preserved)? Bugs must be surfaced as bug-fix follow-ups, not ADRs — documenting wrong behavior as intentional locks in the bug.

ADR worth heuristic (empirically derived from scout試運転 2026-05-13): existing enforcement mechanisms (lint config, type system, automated tests) are stronger than ADR text for mechanical decisions. Reserve ADR for two categories where mechanisms cannot help:

Invariants not enforceable by tools (e.g., "field X must not be used with Y" when both are same type)
Public API compatibility commitments (e.g., exit code convention, JSON output schema)

Statement-of-fact configs (deny.toml, Cargo.toml [lints.*]) are themselves the source of truth — duplicating into an ADR creates drift risk. A 1-2 line policy comment in the config block usually suffices.

For each candidate, critic-design returns one of:

| Verdict | Meaning | | ----------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | keep | ADR worth, file as standalone or merge with related candidates | | downgrade | Not standalone ADR; absorb into a related ADR section or strengthen comments | | drop | Not ADR-worthy; config/comment/test already covers it, cost > value, or the candidate is a bug (surface as bug-fix follow-up, do not document the wrong behavior) |

Record the challenge verdict alongside the initial ranking. The final candidate list = initial candidates marked keep + those marked downgrade (with target ADR), minus those marked drop.

When agent is unavailable or times out, fall back to initial ranking with a challenge_skipped: timeout note in the summary.

Step 7: Report Output

Compute the date with date -u +%Y-%m-%d (UTC, ISO format) for cross-timezone consistency. Ensure the output directory exists:

mkdir -p docs/audit
DATE=$(date -u +%Y-%m-%d)
REPORT="docs/audit/${DATE}-undocumented-decisions.md"

Write the report with:

# Undocumented Decisions Audit: <YYYY-MM-DD>

## Summary

| Metric                   | Value |
| ------------------------ | ----- |
| Large files scanned      | N     |
| Documents scanned        | N     |
| Decision candidates      | N     |
| ADR-covered (excluded)   | N     |
| Net new candidates       | N     |
| ADR promotion candidates | N     |

## Large File Decisions

### src/foo.rs (NNN lines)

| # | Line | Decision | Documented? | Incomplete-contract? | Impact | Reversibility |
| - | ---- | -------- | ----------- | -------------------- | ------ | ------------- |
| 1 | 42   | ...      | Partial     | Yes                  | H      | low           |

## Prose Document Decisions

### README.md

| # | Line | Decision Verb | Decision | ADR Coverage |
| - | ---- | ------------- | -------- | ------------ |
| 1 | 12   | must not      | ...      | None         |

## ADR Promotion Candidates (post-challenge)

| # | Candidate                       | Initial | Challenge | Final          |
| - | ------------------------------- | ------- | --------- | -------------- |
| 1 | `<source>:<line>` — `<summary>` | promote | keep      | ADR            |
| 2 | `<source>:<line>` — `<summary>` | promote | downgrade | inline-comment |
| 3 | `<source>:<line>` — `<summary>` | promote | drop      | skip           |

Summary line per file: `keep N / downgrade N / drop N`.

If `--no-challenge` was set, omit Challenge and Final columns and use the initial ranking directly.

Step 8: Follow-up Hand-off

Print only the post-challenge keep candidates and offer to invoke /adr for each, or aggregate them into a single tracking issue via /issue. downgrade candidates are listed as comment-strengthening tasks (not ADRs). drop candidates are recorded in the report for traceability but not surfaced as follow-up.

Output

Report path: docs/audit/<YYYY-MM-DD>-undocumented-decisions.md
Console summary: candidate count, promotion candidate count
Optional: ADR drafting via /adr or tracking issue via /issue

Out of Scope

Drafting the ADRs themselves (use /adr after this skill produces the candidate list)
Implementing code changes
Updating README/CONTRIBUTING/etc. bodies (only extraction here)
Drift scan against existing ADRs (use /audit-adr-drift)
Cross-repo scope cluster detection (use scripts/audit-adr-scopes.py directly; this skill is single-repo by design)

Acceptance Criteria

[ ] Report file exists at docs/audit/<YYYY-MM-DD>-undocumented-decisions.md
[ ] Every large file exceeding threshold has a section
[ ] Every scanned document has an extraction section (or "no decisions found")
[ ] Every candidate has impact + reversibility tags
[ ] ADR promotion candidates are listed at the end with a one-line rationale

/audit-undocumented - Undocumented Decisions Audit

When to Use

Repo has accumulated implicit decisions that are not in ADRs
Before refactoring, you want to know which decisions to preserve vs question
New maintainers need a map of "why is the code shaped this way"

This skill is complementary to /audit-adr-drift. Run drift scan first if ADRs exist; the gaps it cannot answer become the input here.

Input

$ARGUMENTS holds the full argument string. Parse before use:

Split on whitespace into tokens
--threshold=N: integer, default 400 if unset or non-numeric (must be > 0)
--paths=p1,p2,...: comma-separated paths to documents. If unset, auto-detect via Step 2 table
--no-challenge: skip the critic-design challenge step (default: challenge runs)
Reject unknown flags with an explicit error rather than silently ignore
All flags optional; with no input, auto-detect everything and run challenge

Execution

Step 1: Large File Detection

# Find source files exceeding the threshold
# NOTE: Do not use awk '$1 > t' here. SKILL loader expands `$1` to the second
# whitespace-token of $ARGUMENTS (0-indexed split). Use find+while instead.
bfs <repo-root> -type f \( -name '*.rs' -o -name '*.ts' -o -name '*.tsx' -o -name '*.py' -o -name '*.go' -o -name '*.swift' \) \
  -not -path '*/target/*' -not -path '*/node_modules/*' -not -path '*/.git/*' \
  -print0 \
  | while IFS= read -r -d '' file; do
      lines=$(wc -l < "$file")
      [ "$lines" -gt "$THRESHOLD" ] && printf '%s %s\n' "$lines" "$file"
    done | sort -rn

Default threshold is 400 lines (one-screen ceiling, see rules/development/THRESHOLDS.md).

Step 2: Document Detection

Scan top-level and docs/ for files matching:

Step 3: Large File Decision Mining

Continue full scan
Truncate to first 600 lines (covers module preamble and main types)
Skip this file (record as skipped_for_size)

For files at or below 800 lines, proceed directly to full scan.

For each large file (full or truncated scope), spawn the matching reviewer via Task (same routing as /audit-adr-drift Step 5). The reviewer answers:

Why is this file shaped at this granularity?
What invariant or contract does it carry that a reader cannot derive from the code?
Is there a comment or module-doc that already records the rationale?
Does the comment describe the current state but omit the rule for future contributors? (this is the incomplete-contract pattern)

Findings format: file:line + decision summary + evidence (code comment, naming, module-doc) + documented? (Yes/Partial/No) + incomplete-contract? (Yes/No).

After collecting findings, cross-reference each one against the ADR directory (if any). Drop findings already covered by an ADR (record count as "ADR-covered (excluded)" in the summary).

Step 4: Prose Decision Extraction

For each detected document, find sentences containing decision verbs and check for ADR coverage:

Decision verbs (English): "decide", "choose", "adopt", "reject", "deprecate", "must not", "never", "always"
Decision verbs (Japanese): "決定", "採用", "禁止", "方針", "選定", "排除", "従う", "規約"

Each match is a candidate. Then for each candidate, search the ADR directory (if any) for cross-reference. Drop candidates already covered by an ADR.

External ADR Dependency Detection

In addition to prose docs, scan source code for per ADR-NNNN / see ADR-NNNN / ADR-NNNN reference patterns:

ugrep -r -n -E "(per |see |governed by )?ADR-[0-9]{4}" --include="*.rs" --include="*.ts" --include="*.tsx" --include="*.py" --include="*.go" --include="*.md" --include="*.toml" .

For each captured ADR id, check whether the matching NNNN-*.md exists in the local ADR directory:

Local match → existing ADR coverage, skip
No local match → record as External ADR Dependency candidate

Step 5: Impact + Reversibility Tagging

Step 6: Ranking and Challenge

6.1 Initial Ranking

ADR promotion candidates = (impact=H) AND (reversibility=low OR medium).

Other findings are recorded but not promoted (informational).

6.2 Devil's Advocate Challenge

Skip this sub-step if --no-challenge is set.

Spawn critic-design via Task with the initial promotion candidate list. The agent challenges each candidate with:

Does a future contributor actually benefit from the rule? Who is the reader?
Can a code comment + test suffice instead of an ADR?
Does ADR risk lock-in (over-documenting decisions that should evolve)?
For statement-of-fact configs (deny.toml, Cargo.toml lints): is the config file itself already the source of truth, making an ADR redundant?
For monolithic-boundary candidates: would the ADR justify the status quo and reduce pressure to split?
Is there already an enforcement mechanism (type system, lint, test) that makes the rule mechanical, leaving the ADR with nothing to add?
Bug vs Invariant: is this candidate describing a fix-the-bug case (current code is wrong and should change) or an invariant-to-document case (current code is intentional and should be preserved)? Bugs must be surfaced as bug-fix follow-ups, not ADRs — documenting wrong behavior as intentional locks in the bug.

Invariants not enforceable by tools (e.g., "field X must not be used with Y" when both are same type)
Public API compatibility commitments (e.g., exit code convention, JSON output schema)

For each candidate, critic-design returns one of:

Record the challenge verdict alongside the initial ranking. The final candidate list = initial candidates marked keep + those marked downgrade (with target ADR), minus those marked drop.

When agent is unavailable or times out, fall back to initial ranking with a challenge_skipped: timeout note in the summary.

Step 7: Report Output

Compute the date with date -u +%Y-%m-%d (UTC, ISO format) for cross-timezone consistency. Ensure the output directory exists:

mkdir -p docs/audit
DATE=$(date -u +%Y-%m-%d)
REPORT="docs/audit/${DATE}-undocumented-decisions.md"

Write the report with:

# Undocumented Decisions Audit: <YYYY-MM-DD>

## Summary

| Metric                   | Value |
| ------------------------ | ----- |
| Large files scanned      | N     |
| Documents scanned        | N     |
| Decision candidates      | N     |
| ADR-covered (excluded)   | N     |
| Net new candidates       | N     |
| ADR promotion candidates | N     |

## Large File Decisions

### src/foo.rs (NNN lines)

| # | Line | Decision | Documented? | Incomplete-contract? | Impact | Reversibility |
| - | ---- | -------- | ----------- | -------------------- | ------ | ------------- |
| 1 | 42   | ...      | Partial     | Yes                  | H      | low           |

## Prose Document Decisions

### README.md

| # | Line | Decision Verb | Decision | ADR Coverage |
| - | ---- | ------------- | -------- | ------------ |
| 1 | 12   | must not      | ...      | None         |

## ADR Promotion Candidates (post-challenge)

| # | Candidate                       | Initial | Challenge | Final          |
| - | ------------------------------- | ------- | --------- | -------------- |
| 1 | `<source>:<line>` — `<summary>` | promote | keep      | ADR            |
| 2 | `<source>:<line>` — `<summary>` | promote | downgrade | inline-comment |
| 3 | `<source>:<line>` — `<summary>` | promote | drop      | skip           |

Summary line per file: `keep N / downgrade N / drop N`.

If `--no-challenge` was set, omit Challenge and Final columns and use the initial ranking directly.

Step 8: Follow-up Hand-off

Output

Report path: docs/audit/<YYYY-MM-DD>-undocumented-decisions.md
Console summary: candidate count, promotion candidate count
Optional: ADR drafting via /adr or tracking issue via /issue

Out of Scope

Drafting the ADRs themselves (use /adr after this skill produces the candidate list)
Implementing code changes
Updating README/CONTRIBUTING/etc. bodies (only extraction here)
Drift scan against existing ADRs (use /audit-adr-drift)
Cross-repo scope cluster detection (use scripts/audit-adr-scopes.py directly; this skill is single-repo by design)

Acceptance Criteria

[ ] Report file exists at docs/audit/<YYYY-MM-DD>-undocumented-decisions.md
[ ] Every large file exceeding threshold has a section
[ ] Every scanned document has an extraction section (or "no decisions found")
[ ] Every candidate has impact + reversibility tags
[ ] ADR promotion candidates are listed at the end with a one-line rationale

Adoption

thkt/audit-undocumented

$ install --global

Security Scan Results

SKILL.md

/audit-undocumented - Undocumented Decisions Audit

When to Use

Input

Execution

Step 1: Large File Detection

Step 2: Document Detection

Step 3: Large File Decision Mining

Step 4: Prose Decision Extraction

External ADR Dependency Detection

Step 5: Impact + Reversibility Tagging

Step 6: Ranking and Challenge

6.1 Initial Ranking

6.2 Devil's Advocate Challenge

Step 7: Report Output

Step 8: Follow-up Hand-off

Output

Out of Scope

Acceptance Criteria

Related Skills

thkt/use-cli-herdr

thkt/scribe

thkt/dr

thkt/shake

thkt/audit-undocumented

$ install --global

Security Scan Results

SKILL.md

/audit-undocumented - Undocumented Decisions Audit

When to Use

Input

Execution

Step 1: Large File Detection

Step 2: Document Detection

Step 3: Large File Decision Mining

Step 4: Prose Decision Extraction

External ADR Dependency Detection

Step 5: Impact + Reversibility Tagging

Step 6: Ranking and Challenge

6.1 Initial Ranking

6.2 Devil's Advocate Challenge

Step 7: Report Output

Step 8: Follow-up Hand-off

Output

Out of Scope

Acceptance Criteria

Related Skills

thkt/use-cli-herdr

thkt/scribe

thkt/dr

thkt/shake