opencode/skills/systematic-debugging/SKILL.md
Four-phase debugging framework with root cause tracing - understand the source before proposing fixes. Use when investigating bugs, errors, unexpected behavior, or failed tests.
npx skillsauth add third774/dotfiles systematic-debuggingInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Random fixes waste time and create new bugs. Quick patches mask underlying issues.
Core principle: ALWAYS find root cause before attempting fixes. Symptom fixes are failure.
Violating the letter of this process is violating the spirit of debugging.
NO FIXES WITHOUT ROOT CAUSE INVESTIGATION FIRST
If you haven't completed Phase 1, you cannot propose fixes.
Use for ANY technical issue:
Use this ESPECIALLY when:
Don't skip when:
You MUST complete each phase before proceeding to the next.
Copy this checklist and track your progress:
Debugging Progress:
- [ ] Phase 1: Root Cause Investigation
- [ ] Read error messages carefully
- [ ] Reproduce consistently
- [ ] Check recent changes
- [ ] Gather evidence at component boundaries
- [ ] Trace data flow backward to source
- [ ] Phase 2: Pattern Analysis
- [ ] Find working examples
- [ ] Compare against references
- [ ] Identify differences
- [ ] Phase 3: Hypothesis and Testing
- [ ] Form single hypothesis
- [ ] Test minimally (one change)
- [ ] Verify before continuing
- [ ] Phase 4: Implementation
- [ ] Create failing test case
- [ ] Implement single fix at root cause
- [ ] Apply defense-in-depth
- [ ] Remove all // debug-shim markers
- [ ] Verify fix and tests pass
BEFORE attempting ANY fix:
WHEN system has multiple components (CI → build → signing, API → service → database):
For log-heavy investigations: When errors appear in application logs, use the reading-logs skill for efficient analysis. Never load entire log files into context - use targeted grep and filtering.
BEFORE proposing fixes, add diagnostic instrumentation:
For EACH component boundary:
- Log what data enters component
- Log what data exits component
- Verify environment/config propagation
- Check state at each layer
Run once to gather evidence showing WHERE it breaks
THEN analyze evidence to identify failing component
THEN investigate that specific component
Example (multi-layer system):
# Layer 1: Workflow
echo "=== Secrets available in workflow: ==="
echo "IDENTITY: ${IDENTITY:+SET}${IDENTITY:-UNSET}"
# Layer 2: Build script
echo "=== Env vars in build script: ==="
env | grep IDENTITY || echo "IDENTITY not in environment"
# Layer 3: Signing script
echo "=== Keychain state: ==="
security list-keychains
security find-identity -v
# Layer 4: Actual signing
codesign --sign "$IDENTITY" --verbose=4 "$APP"
This reveals: Which layer fails (secrets → workflow ✓, workflow → build ✗)
WHEN error is deep in call stack or unclear where invalid data originated:
Don't fix symptoms. Trace backward through the call chain to find the original trigger, then fix at the source.
Use Five Whys + Backward Tracing:
Symptom: git init creates .git in source code directory
Why? → cwd parameter is empty string, defaults to process.cwd()
Why? → projectDir variable passed to git init is ''
Why? → Session.create() received empty tempDir
Why? → Test accessed context.tempDir before beforeEach initialized it
Why? → setupCoreTest() returns object with tempDir: '' initially
Root Cause: Top-level variable initialization accessing uninitialized value
Trace the Call Chain backward:
execFileAsync('git', ['init'], { cwd: projectDir }) // Symptom
← WorktreeManager.createSessionWorktree(projectDir, sessionId)
← Session.initializeWorkspace()
← Session.create(tempDir)
← Test: Project.create('name', context.tempDir) // Root trigger
Adding Instrumentation when call chain is unclear:
async function gitInit(directory: string) {
// debug-shim
const stack = new Error().stack;
console.error("DEBUG:", { directory, cwd: process.cwd(), stack });
// end debug-shim
await execFileAsync("git", ["init"], { cwd: directory });
}
Key points:
console.error() in tests (logger may be suppressed)ALL temporary debug code MUST include the // debug-shim marker:
console.error("DEBUG:", { value, context }); // debug-shim
This enables reliable cleanup via grep. Before completing Phase 4:
grep -r "debug-shim" .For language-specific variants (Python, Bash, JSX), see references/debugging-techniques.md#debug-shim-markers.
Verify the Root Cause:
When executing the four phases, use these techniques to gather evidence:
Find the pattern before fixing:
Find Working Examples
Compare Against References
Identify Differences
Understand Dependencies
Scientific method:
Form Single Hypothesis
Test Minimally
Verify Before Continuing
When You Don't Know
Fix the root cause, not the symptom:
Don't just fix the root cause - add validation at each layer:
Result: Bug impossible to reintroduce, even with future code changes.
Pattern indicating architectural problem:
STOP and question fundamentals:
Discuss with your human partner before attempting more fixes
This is NOT a failed hypothesis - this is a wrong architecture.
If you catch yourself thinking:
ALL of these mean: STOP. Return to Phase 1.
If 3+ fixes failed: Question the architecture (see Phase 4.6)
Watch for these redirections:
When you see these: STOP. Return to Phase 1.
| Excuse | Reality | |--------|---------| | "Issue is simple, don't need process" | Simple issues have root causes too. Process is fast for simple bugs. | | "Emergency, no time for process" | Systematic debugging is FASTER than guess-and-check thrashing. | | "Just try this first, then investigate" | First fix sets the pattern. Do it right from the start. | | "I'll write test after confirming fix works" | Untested fixes don't stick. Test first proves it. | | "Multiple fixes at once saves time" | Can't isolate what worked. Causes new bugs. | | "Reference too long, I'll adapt the pattern" | Partial understanding guarantees bugs. Read it completely. | | "I see the problem, let me fix it" | Seeing symptoms ≠ understanding root cause. | | "One more fix attempt" (after 2+ failures) | 3+ failures = architectural problem. Question pattern, don't fix again. |
| Phase | Key Activities | Success Criteria | |-------|----------------|------------------| | 1. Root Cause | Read errors, reproduce, check changes, trace data flow | Understand WHAT and WHY | | 2. Pattern | Find working examples, compare | Identify differences | | 3. Hypothesis | Form theory, test minimally | Confirmed or new hypothesis | | 4. Implementation | Create test, fix with defense-in-depth, verify | Bug resolved, tests pass |
After completing the debugging process:
## Root Cause
[Explain the underlying issue in 1-3 sentences]
Located in: `file.ts:123`
## What Was Wrong
[Describe the specific problem - mutation, race condition, missing validation,
incorrect assumption, etc. Be technical and specific.]
## The Fix
[Describe the changes made and why they address the root cause]
Changes in:
- `file.ts:123-125` - [what changed and why]
- `test.ts:45` - [added regression test]
## Verification
- [x] Bug reproduced and confirmed fixed
- [x] Existing tests pass
- [x] Added regression test
- [x] Checked for similar issues in related code
- [x] No new errors or warnings introduced
If systematic investigation reveals issue is truly environmental, timing-dependent, or external:
But: 95% of "no root cause" cases are incomplete investigation.
Complementary skills:
writing-tests - For creating failing test case in Phase 4condition-based-waiting - Replace arbitrary timeouts identified in Phase 2verification-before-completion - Verify fix worked before claiming successreading-logs - Efficient log analysis for evidence gathering in Phases 1-2From debugging sessions:
Remember: Fixing symptoms creates technical debt. Finding root causes eliminates entire classes of bugs.
data-ai
Extract captions and transcripts from YouTube videos for agent context. Tries manual subtitles, then auto-generated, then falls back to audio transcription via Whisper. Use when a user provides a YouTube URL and wants to understand, summarize, reference, or search video content.
tools
Official skill for XcodeBuildMCP. Use when doing iOS/macOS/watchOS/tvOS/visionOS work (build, test, run, debug, log, UI automation).
development
Write behavior-focused tests following Testing Trophy model with real dependencies, avoiding common anti-patterns like testing mocks and polluting production code. Use when writing new tests, reviewing test quality, or improving test coverage.
data-ai
Create professional Mermaid diagrams with proper styling and visual hierarchy. Use when creating flowcharts, sequence diagrams, state machines, class diagrams, or architecture visualizations.