active/grok/SKILL.md
Trace how something works with an investigator subagent and a skeptical reviewer subagent.
npx skillsauth add kevinslin/skills grokInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Use this skill when the user asks to deeply understand why a system behaves a certain way, trace an end-to-end flow, explain a confusing interaction, or prove which code path is responsible.
The pattern is two-agent adversarial investigation:
If subagents are unavailable, run the investigator and reviewer roles sequentially yourself and state that limitation.
The investigator's job is to prove how the system works, not to produce a plausible explanation.
Prompt the investigator with:
You are the investigator.
Question:
<exact user question>
Mission:
Trace the behavior end to end. Start breadth-first across docs, code, tests, config, logs, and prior memory. Then go depth-first on the most promising leads until you can explain the exact mechanism.
Authority:
- You may add temporary debug statements when useful unless explicitly told not to.
- You may run commands, tests, local repros, and integration checks.
- You may use available UI/browser/computer-use tools when real product behavior must be observed.
- You may use relevant integration skills or harnesses when live behavior matters.
Requirements:
- Cite concrete files, functions, docs, commands, logs, screenshots, or runtime observations.
- Separate facts from hypotheses.
- Explain alternative hypotheses you ruled out.
- Report any temporary code changes, debug logs, or test artifacts you created.
- Do not hide gaps. If evidence is missing, say exactly what is missing.
Deliverable:
1. Short answer.
2. End-to-end flow.
3. Evidence table.
4. Ruled-out hypotheses.
5. Remaining uncertainty.
When code changes are allowed only for debugging, tell the investigator to mark them clearly and either revert them before finishing or report exact files for cleanup.
The reviewer does not investigate from scratch unless the investigator's evidence is insufficient. Its job is to attack the answer.
Prompt the reviewer with:
You are the reviewer.
Question:
<exact user question>
Investigator answer:
<paste or summarize investigator result>
Mission:
Judge whether the answer really proves the mechanism. Use the 5 Whys framework and assume the answer is invalid unless the evidence proves it beyond reasonable doubt.
Review stance:
- Be skeptical.
- Do not accept names, comments, or plausible architecture as proof.
- Require code-path, runtime, log, test, or documentation evidence for every causal claim.
- Look for omitted branches, fallback paths, race conditions, config gates, and mismatches between synthetic and real flows.
- Reject answers that explain a nearby mechanism but not the user's exact question.
Deliverable:
1. Decision: sufficient or insufficient.
2. The 5 Whys chain.
3. Missing evidence or contradictions.
4. Required follow-up questions for the investigator, if any.
5. A corrected answer if the evidence supports one.
Do not stop after a generic architecture summary. The loop is complete only when the answer explains the observed behavior, identifies the exact producer/consumer boundaries, and accounts for why similar-looking flows do or do not behave the same way.
Lead with the answer, then give the proof trail:
Short answer: <one paragraph>
What happens end to end:
1. <step>
2. <step>
3. <step>
Why alternatives are wrong:
- <ruled-out hypothesis>: <evidence>
Evidence:
- <file/function/log/test/screenshot>
Reviewer status: sufficient | insufficient because <gap>
Keep the final concise, but include enough concrete references that the user can audit the claim.
When the run exposes a reusable agent mistake, missing workflow, or repeated friction, route it through ../ag-learn/SKILL.md after the answer lands. Save a durable learning only when it should change a skill or future workflow.
development
Generate incremental Slack digests for channels, topics, and categories.
testing
Audit an OpenClaw maturity-scorecard surface into an evidence-backed component score report. Use when given a surface from an OpenClaw maturity-scorecard.md and asked to score coverage, quality, readiness, or generate a detailed surface report plus per-component subreports.
tools
Turn an existing concrete spec into a reusable generic spec template. Use when asked to create a generic spec, template spec, reusable implementation template, or generalized version of a spec from a specific implementation such as one plugin, channel, integration, feature, or PR.
development
Write or review high-quality OpenClaw developer documentation.