skills/devel/hack-through-testing/SKILL.md
Manual invocation only. Drive a crashy, hanging, or half-broken system forward along a real production user path using real data. Two subskills: `prepare` to analyze the target and set up `<htt-home>/` with infrastructure dirs (logs, runs, issues); optionally creates `<htt-home>/autotest/` with automatic scripts and interactive guides only when the developer explicitly requests test-case generation. `run` drives testing — with or without autotest artifacts — patching forward through blockers. Run subskill operates in-place by default (stash + test on current branch) or in a disposable snapshot worktree when explicitly requested. Supports automatic and interactive driving. Default when ambiguous: both subskills, in-place, automatic. Not for CI-oriented unit, smoke, or mock-based integration tests.
npx skillsauth add igamenovoer/magic-context hack-through-testingInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Manual invocation only: use this skill only when the developer explicitly wants this workflow.
Drive a fragile system to the end by patching forward instead of solving each issue cleanly on first contact. Keep each workaround reviewable and finish with a synthesis that guides the real implementation.
If the developer wants testing without code changes, use test-and-log instead. If the developer wants a slow, stepwise session with approval before each action, use do-interactive-test instead.
Hack-through-testing targets production-level end-to-end paths: real data, real user workflows, real API calls, real output artifacts. It is not a CI smoke run, not a unit test harness, and not a mock-based integration check.
The distinction matters for choosing what to test:
If the only testable surface you can identify is CI-style (unit tests, smoke scripts, mock integrations), stop and ask the developer what the real production user path or end-to-end scenario is before proceeding. For example:
I can see unit/smoke/integration tests already covered by CI. What's the real production user path you want to exercise — the end-to-end scenario, the live data workflow, or a specific user journey?
Do not invent a CI-style test run and call it hack-through-testing.
This skill has exactly two subskills.
prepare
Analyze the target and set up <htt-home>/ with infrastructure dirs (logs, runs, issues). If the developer explicitly requests test-case generation (e.g., "prepare test cases", "prepare for auto test", "create autotest", "set up autotest"), also create <htt-home>/autotest/ with automatic scripts and interactive guides.
Primary guide: references/prepare.md
run
Drive testing, patching forward through blockers. Uses autotest artifacts when they exist; otherwise drives testing directly from the target analysis. Operates in-place (default) or in a disposable snapshot worktree.
Primary guide: references/run.md
prepare (infrastructure only, no autotest)prepare with autotest generationrun onlyprepare then runrun onlyUnless the developer says otherwise, use these defaults across both subskills:
htt-home): <repo-root>/.agent-automation/hacktest/<topic-slug> unless the developer explicitly sets htt-home=<dir><htt-home>/logs<htt-home>/runs<log-root>/<ts>.md<log-root>/issues/<ts>-<what>.md — one file per underlying issue; append later fixes to the same note rather than creating a new one<runs-root>/<run-ts>/ — always copy generated artifacts here so they survive session cleanupHT-01, HT-02, ...hack-through: <issue-id> <short workaround>If this skill creates .agent-automation/hacktest/, add it to .gitignore. If .gitignore already has commented .agent-automation/hacktest entries, do not auto-add the rule.
htt-branch (worktree mode) or mix them into stash snapshots (in-place mode).case-<id>.md) to wrappers that just say "run the automatic script"; they must be independent step-by-step procedures../scripts/create_snapshot_worktree.sh: Create a snapshot branch and separate worktree without touching the active checkout.Use $hack-through-testing on this CLI and keep patching forward until the happy path finishes.Use $hack-through-testing to prepare for the demo under scripts/demo/foo. (infrastructure only)Use $hack-through-testing to prepare autotest cases for the demo under scripts/demo/foo. (with autotest generation)Use $hack-through-testing in run mode with interactive driving — I want to watch each step.Use $hack-through-testing with a worktree so my checkout stays clean.Use $hack-through-testing to exercise the full build-then-launch sequence — build a brain, start a session, send a prompt, stop — and patch through every failure.Use $hack-through-testing, but pause if the only workaround would change the protocol or persistent data format.Use $hack-through-testing with htt-home=/tmp/my-htt-home so the whole session state is easy to revisit later.Use $hack-through-testing to prepare test cases, then run them automatically in a shadow repo, stop after 8 blockers.data-ai
Create readable Mermaid diagrams inside Markdown files. Use for flowcharts and sequence diagrams that must render cleanly in common Markdown renderers (e.g., GitHub) without horizontal scrolling. Covers fenced mermaid blocks, init/theme styling, label wrapping with <br/>, and sequenceDiagram layout rules (short IDs, wrapped labels, don’t break identifiers).
development
Manual invocation only; use only when the user explicitly requests `make-program-tutorial` by exact name, OR when the user asks to use a skill to create an SDK/API/library tutorial. Create a clear, reproducible, step-by-step tutorial for a specific API/SDK/library (or a set of functions/classes), with runnable examples, expected outputs, and basic troubleshooting.
testing
Use when the user wants to create a self-hosted, offline-installable Conda channel (mirror) containing a specific subset of packages using Pixi.
tools
Guides the agent to setup a new or existing Pixi environment for compiling C++ and CUDA code. It ensures the correct compilers, toolkits, and CMake configurations are in place for a robust user-space build.