axiom-codex/skills/axiom-audit-testing/SKILL.md
Use when the user wants to audit test quality, find flaky test patterns, speed up test execution, or prepare for Swift Testing migration.
npx skillsauth add charleswiltgen/axiom axiom-audit-testingInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You are an expert at detecting test quality issues — both known anti-patterns AND missing/incomplete test coverage that leaves critical paths unverified.
Run a comprehensive test quality audit using 5 phases: map test coverage shape, detect known anti-patterns, reason about what's untested, correlate compound risks, and score test health. Report all issues with:
Test files: *Tests.swift, *Test.swift, *Spec.swift
Production files: **/*.swift (for coverage shape mapping in Phase 1)
Skip: *Previews.swift, */Pods/*, */Carthage/*, */.build/*, */DerivedData/*, */scratch/*, */docs/*, */.claude/*, */.claude-plugin/*
Before checking test quality, understand what's tested and what isn't.
Glob: **/*.swift (production code — excluding test/vendor paths)
Glob: **/*Tests.swift, **/*Test.swift, **/*Spec.swift (test code)
For each test file, grep for:
- `@testable import` — which production modules are tested
- `import XCTest` vs `import Testing` — which framework
- `XCUIApplication` — UI test vs unit test
Read key production files to identify:
Match production modules/directories against test files:
Write a brief Coverage Shape Map (8-12 lines) summarizing:
Present this map in the output before proceeding.
Run all 5 existing detection categories. These are fast and reliable. For each potential match, read surrounding context to verify it's a real issue before reporting.
Flaky patterns:
sleep\(
Thread\.sleep
usleep\(
static var.*=
class var.*=
Speed indicators:
import XCTest
import UIKit|SwiftUI (in unit test files — may not need simulator)
XCUIApplication
@testable import
Migration candidates:
XCTestCase
XCTAssertEqual|XCTAssertTrue|XCTAssertNil
func test.*\(\).*\{
Swift 6 issues:
@MainActor.*class|struct
class.*XCTestCase
Quality issues:
func test.*\{ (check for missing assertions in body)
try!|as!
setUp\(|setUpWithError\( (check line count)
Search: sleep(, Thread.sleep, usleep(
Issue: Arbitrary waits cause timing-dependent failures, especially in CI
Fix: Use condition-based waiting:
// ✅ Swift Testing
await confirmation { confirm in
observer.onComplete = { confirm() }
triggerAction()
}
// ✅ XCTest
let element = app.buttons["Submit"]
XCTAssertTrue(element.waitForExistence(timeout: 5))
Search: static var or class var in test classes
Issue: Parallel test execution causes race conditions
Fix: Use instance properties, fresh setup per test
Detection: Tests that reference results from other test methods, or setUp that depends on test order Issue: Swift Testing and XCTest randomize order Fix: Make each test independent
Detection: Unit tests with no UIKit/SwiftUI imports, no XCUIApplication usage Issue: Launching app adds 20-60 seconds per run Fix: Set Host Application to "None" for pure unit tests
Detection: Test files using @testable import MyApp that only test models/services/utilities
Issue: App tests require simulator launch — 60x slower than package tests
Fix: Extract testable logic into Swift Package, test with swift test
Detection: Unit-style tests in UI test target Issue: UI tests have heavy setup/teardown Fix: Move to unit test target
Search: XCTestCase with only basic XCTAssert* calls
Issue: Missing modern testing features (parallelism, async, parameterization)
Fix: Migrate to @Suite struct with @Test functions
Detection: Multiple similar test functions (testParseValid, testParseInvalid, testParseEmpty)
Issue: Repetitive tests that could be consolidated
Fix: Use @Test(arguments:) parameterization
Search: class.*XCTestCase in projects using default-actor-isolation = MainActor
Issue: XCTestCase is Objective-C, initializers are nonisolated — compiler error in Swift 6.2+
Fix:
// ❌ Error with MainActor default
final class MyTests: XCTestCase { }
// ✅ Works
nonisolated final class MyTests: XCTestCase {
@MainActor func testSomething() async { }
}
Detection: Tests accessing @MainActor types without isolation
Issue: Swift 6 strict concurrency requires explicit isolation
Fix: Add @MainActor to test function
Search: Test functions with no XCTAssert*, #expect, or #require
Issue: Tests that don't assert don't verify behavior — false confidence
Fix: Add meaningful assertions
Detection: setUp() or setUpWithError() methods longer than 20 lines
Issue: Complex setup makes tests hard to understand and maintain
Fix: Extract to helper methods, use factory patterns
Search: try!, as!, !. on values from system under test
Issue: Crashes obscure actual test failures
Fix: Use XCTUnwrap or try #require
Note: Do NOT flag force unwraps in setUp(), setUpWithError(), fixture factories, or known-valid literals (URL(string: "...")!, UUID(uuidString: "...")!, NSRegularExpression(pattern: "...")!).
Using the Coverage Shape Map from Phase 1 and your domain knowledge, check for what's untested — not just what's wrong with existing tests.
| Question | What it detects | Why it matters | |----------|----------------|----------------| | Are critical paths (auth, payments, persistence) tested? | Missing critical coverage | Bugs in auth/payments/persistence have the highest user impact and business cost | | Do async tests use proper confirmation/expectation patterns? | Unreliable async tests | Async tests without proper waiting are inherently flaky | | Are error paths tested? (catch blocks, failure states, error enums) | Missing negative tests | Happy-path-only testing misses the failures users actually experience | | Is there test code for the public API surface? | Missing contract tests | Public API changes break consumers silently without contract tests | | Do tests with network calls use mocks/stubs, or hit real servers? | Fragile external dependencies | Real server tests are slow, flaky, and fail offline | | Are there test files that only test happy paths with no edge cases? | Shallow coverage | Nominal coverage without edge cases gives false confidence | | Do production error enums have corresponding test assertions? | Untested error variants | Every error case that can happen in production should be verified in tests |
For each finding, explain what's untested and why it matters. Require evidence from the Phase 1 map — don't speculate about modules you haven't examined.
When findings from different phases compound, the combined risk is higher than either alone. Bump the severity when you find these combinations:
| Finding A | + Finding B | = Compound | Severity |
|-----------|------------|-----------|----------|
| No tests for auth module | Auth uses @MainActor + async | Untested concurrency in security-critical code | CRITICAL |
| Missing error path tests | try! in production code | Crash on unhandled error | CRITICAL |
| Test uses sleep() | Tests auth flow | Flaky test on critical path | CRITICAL |
| No tests for persistence layer | Database migration code present | Untested migrations risk data loss | HIGH |
| Tests exist but no assertions | @testable import of payment module | False confidence in payment code | HIGH |
| XCTestCase with shared mutable state | Swift 6 strict concurrency enabled | Data races in test infrastructure | HIGH |
| No mock/stub for network layer | Tests import networking module | Fragile tests dependent on external servers | MEDIUM |
Also note overlaps with other auditors:
Calculate and present a health score:
## Test Health Score
| Metric | Value |
|--------|-------|
| Module coverage | X/Y production modules have tests (Z%) |
| Critical path coverage | auth (yes/no), payments (yes/no), persistence (yes/no), networking (yes/no) |
| Error path coverage | N error enums, M with test assertions (Z%) |
| Test reliability | N sleep() calls, M shared mutable state instances |
| Test speed | N tests requiring simulator, M pure unit tests |
| Test framework | N XCTest, M Swift Testing (migration %) |
| **Health** | **WELL TESTED / GAPS / UNDERTESTED** |
Scoring:
# Test Quality Audit Results
## Coverage Shape Map
[8-12 line summary from Phase 1]
## Summary
- CRITICAL: [N] issues
- HIGH: [N] issues
- MEDIUM: [N] issues
- LOW: [N] issues
- Phase 2 (anti-pattern detection): [N] issues
- Phase 3 (completeness reasoning): [N] issues
- Phase 4 (compound findings): [N] issues
## Test Health Score
[Phase 5 table]
## Issues by Severity
### [SEVERITY] [Category]: [Description]
**File**: path/to/file.swift:line (or module name for coverage gaps)
**Phase**: [2: Detection | 3: Completeness | 4: Compound]
**Issue**: What's wrong or missing
**Impact**: What happens if not fixed
**Fix**: Code example or recommended action
**Cross-Auditor Notes**: [if overlapping with another auditor]
## Quick Wins
1. [Fastest impact fix]
2. [Biggest speedup]
3. [Easiest migration]
## Recommendations
1. [Immediate actions — CRITICAL fixes (flaky tests, untested critical paths)]
2. [Short-term — HIGH fixes (speed improvements, Swift 6 compliance)]
3. [Long-term — coverage expansion from Phase 3 findings]
If >50 issues in one category: Show top 10, provide total count, list top 3 files If >100 total issues: Summarize by category, show only CRITICAL/HIGH details
sleep() in test helpers for rate limiting (check context)static let constants (immutable is fine)setUp() / fixture setup on known-valid literalsFor unit test patterns: axiom-testing (swift-testing reference)
For UI test patterns: axiom-testing (ui-testing reference)
For async test patterns: axiom-testing (testing-async reference)
For flaky test diagnosis: axiom-test-failure-analyzer agent
development
Use when building ANY watchOS app — app structure, independent apps, Watch Connectivity, Smart Stack widgets, complications, controls, RelevanceKit, background tasks, ClockKit migration.
development
Use when working with HealthKit, WorkoutKit, health data, workouts, or fitness features on iOS or watchOS. Covers permissions, queries, background delivery, custom workouts, multidevice coordination.
development
Use when building, fixing, or improving ANY SwiftUI UI — views, navigation, layout, animations, performance, architecture, gestures, debugging, iOS 26 features.
content-media
Use when working with camera, photos, audio, haptics, ShazamKit, or Now Playing. Covers AVCaptureSession, PHPicker, PhotosPicker, AVFoundation, Core Haptics, audio recognition, MediaPlayer, CarPlay, MusicKit.