skills/test-designer/SKILL.md
Design failing tests for complex features using Independent Evaluation — dispatches a context-free agent that sees only the requirement spec and code paths (not the implementation approach), then returns executable failing tests. Use when starting TDD for a non-trivial feature, when the requirement is ambiguous enough that biased tests are a risk, or when the user asks for independent test design.
npx skillsauth add pangcheng1849/g-claude-code-plugins test-designerInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Independent test-design orchestrator. Encodes Independent Evaluation: the agent writing the tests must not be the agent implementing the feature, and must not inherit the implementation's assumptions.
/test-designerDon't use for:
The agent designing the tests must not carry the implementation's context. If you (the main Agent) are about to implement the feature, you are disqualified from designing its tests. Dispatch.
Violating this = tests that pass because they mirror the buggy implementation.
Collect only these inputs — nothing else:
Explicitly exclude:
| Task shape | Executor | Reason |
|---|---|---|
| Complex, architectural implications | Independent Agent (e.g., codex-agent or claude-code-agent with fresh session) | True zero-context isolation; can use strongest model at highest effort |
| Medium complexity, current conversation clean | In-conversation subagent | Cheaper; still acceptable if main Agent hasn't yet proposed an implementation |
| Trivial | Don't dispatch — write tests inline |
Default to Independent Agent when the main Agent has already discussed or sketched implementation. Subagent isolation within the same conversation doesn't undo prior context pollution.
Test design is a correctness-critical reasoning task, not a rote mechanical one. Use:
xhigh (the maximum level the runtime supports). Escalation ladder: low → medium → high → xhighExample dispatch prompt skeleton:
You are designing failing tests for a feature. You will NOT see or write the
implementation. Your job is to produce executable tests that fail today and
pass only when the feature is correctly implemented.
Requirement:
<paste requirement description + acceptance criteria>
Code paths (read-only, for understanding context):
<list of file paths>
Existing test framework and conventions:
<infer from repo or specify>
Produce:
1. A test plan — enumerate the behaviors being tested (happy path + edge
cases), grouped by category (boundary / concurrency / lifecycle /
invariants / adversarial).
2. Executable test files that fail against the current code (or against
an empty implementation).
3. For each test, one-line rationale explaining the bug it would catch.
Constraints:
- Do NOT propose an implementation.
- Do NOT edit files outside the test directory.
- Cover edge cases explicitly; don't only test the happy path.
- Use the project's existing test framework and style.
Before handing the tests to the implementation phase:
ImportError, missing fixture, syntax error, or "module not found" is fake red — the test isn't actually exercising the behavior it claims to. Fix the test or drop it.[1, 2, 3] for this fixture") is shaping the test to the example, not to the requirement. Such a test passes when the implementation matches the fixture and breaks for any valid variant input. Replace with property-style assertions ("output is sorted and contains all input elements") or add a second test with a different input that exercises the same property.With the validated failing tests in place, implementation proceeds per test-driven-development skill: write minimal code to make them pass (green), then regression.
Require the agent to return:
A test plan (bullet list, grouped by category) followed by the test files. Each test must include a one-line rationale comment. No implementation code. No commentary on how to implement. If assumptions about the code are needed, list them explicitly at the top of the test file.
ImportError, missing fixture, or "module not found" looks red but isn't testing anything. Step 4 must verify the test fails for the reason the rationale predictsbrainstorming → clarifies the requirement (upstream of test-designer)test-driven-development → governs the red-green-refactor loop (downstream; consumes the failing tests)systematic-debugging → kicks in if tests unexpectedly fail after implementation (downstream)verification-before-completion → runs the tests at the "done" gate (downstream)User: Starting work on the new plugin dependency resolver. Design tests first.
Assistant:
1. Requirement: "Resolver takes a plugin manifest and returns install order
respecting deps and detecting cycles. Must handle: transitive deps,
diamond deps, self-references, missing deps, cycles."
2. Code paths: src/plugins.ts, .claude/plugins.json schema, tests/ dir
3. Dispatch to an independent-agent skill (fresh session) at `xhigh` effort,
read-only on src/, writable on tests/
4. Agent returns: test plan (5 categories, 18 tests), tests/resolver.test.ts
with failing assertions + per-test rationale comments
5. Main Agent runs tests → all red → validates rationale → hands off
tools
Plan how to slice a non-trivial coding task across parallel subagents. Returns a dispatch plan (file assignments, dependencies, output-format contracts) — the main Agent then executes it with the Agent tool + `isolation: "worktree"`. Invoke only when work justifies multi-agent overhead: (a) greenfield 0→1 across multiple independent modules, (b) change touches ≥3 modules, or (c) ≥5 files each with >50 lines of diff. Small changes write inline.
development
在 macOS + Chrome 上排查公网 IPv4/IPv6 出口、国家/地区、ASN/组织、DNS、默认路由、utun 状态,以及浏览器侧 Server Response 与 WebRTC 暴露情况。适用于用户要求检查 IP、地区一致性、VPN/代理接管情况、IPv6 问题或浏览器网络暴露,并输出详细运维报告与复查链接。
tools
通过 Gemini CLI 将编码、审查、诊断、规划和结构化输出任务委派给独立的 Gemini 会话。使用场景包括 `gemini -p` 非交互执行、`gemini -r latest` 续接最近会话、`gemini -r "<session-id>"` 指定会话恢复,以及需要 `--output-format json` / `stream-json`、`--approval-mode plan` 只读审查、`--sandbox` 隔离执行,或 `--worktree` 在独立 git worktree 中跑任务的 scripted / CI 调用。
tools
通过 Codex CLI 将编码、审查、诊断、规划、结构化输出和本机浏览器调研任务委派给独立的 Codex 会话。使用场景包括 `codex exec` 新建任务、`codex exec resume` 续接多轮会话、`codex exec review` 做只读审查,以及需要 `--json` 事件流、`-o` 最终消息落盘、图片输入或 Computer Use 浏览器操作时。