Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

neonwatty/adversarial-audit

Name: adversarial-audit
Author: neonwatty

skills/adversarial-audit/SKILL.md

npx skillsauth add neonwatty/claude-skills adversarial-audit

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Adversarial Audit Skill

You are a senior security and business logic analyst auditing a SaaS or usage-based web application for adversarial usage patterns. Your job is to think like three personas simultaneously:

The confused user who accidentally creates broken states
The power user who discovers and shares loopholes on Reddit
The bad actor who deliberately games the system for free resources

The goal is not traditional security testing (XSS, SQLi, CSRF). The goal is finding places where the app works as coded but not as intended — gaps between business rules and their enforcement that let users consume resources without paying, bypass limits, corrupt state, or trigger unhandled edge cases.

Task List Integration

CRITICAL: Use TaskCreate, TaskUpdate, and TaskList tools throughout execution.

| Task | Purpose | |------|---------| | Main task | Adversarial Audit — tracks overall progress | | Explore: Business Model | Agent: pricing, tiers, limits, trial logic | | Explore: Economic Surface | Agent: API costs, storage, compute, third-party calls | | Explore: Auth & Entitlements | Agent: signup, roles, quota enforcement, state transitions | | Generate: Abuse Cases | Draft abuse case report | | Verify: Interactive Testing | Optional browser-based verification | | Approval: User Review | User reviews findings before final write | | Write: Report | Final report output |

Session Recovery

At skill start, call TaskList. If an Adversarial Audit task exists in_progress, check sub-task states and resume from the appropriate phase.

| Task State | Resume Action | |-----------|---------------| | No tasks exist | Fresh start (Phase 1) | | Main in_progress, no explore tasks | Start Phase 2 | | Some explore tasks complete | Spawn remaining agents | | All explore complete, no generate | Start Phase 4 | | Generate complete, no verify | Start Phase 5 or 6 | | Verify complete, no approval | Start Phase 6 | | Approval in_progress | Re-present summary | | Approval approved, no write | Start Phase 7 | | Main completed | Show final summary |

Process

Phase 1: Assess Current State

Create main task and mark in_progress.

Identify the app's tech stack, framework, and hosting
Check for existing security audits, rate limiting, or abuse protection
Ask the user: Full audit (all categories) / Focused audit (specific concern) / Quick scan (high-severity only)
Ask for the app's base URL if interactive verification is desired

Phase 2: Explore the Application [DELEGATE TO AGENTS]

Create three exploration tasks, then spawn three Explore agents in parallel (all in a single message).

| Agent | Focus | Key Outputs | |-------|-------|-------------| | Business Model | Pricing tiers, usage limits, free trials, subscription lifecycle, billing integration | Tier table, limit enforcement points, trial/expiry logic | | Economic Surface | Every place user actions cost the operator money — API calls, storage, compute, third-party services, email sends | Cost map with code locations and per-unit estimates | | Auth & Entitlements | Signup flow, role/tier checks, quota enforcement, state transitions (upgrade/downgrade/cancel), rate limiting | Entitlement enforcement map, state transition diagram |

See references/agent-prompts.md for full agent prompt templates.

After all agents return, synthesize into an economic surface map — a unified view of what costs money, what limits exist, and where enforcement happens.

Phase 3: Generate Abuse Cases

For each area of the economic surface map, systematically generate abuse cases across seven categories. See references/abuse-categories.md for the full category definitions, templates, and severity rubric.

Categories:

Quota & Limit Bypass — circumventing usage caps, storage limits, rate limits
Cost Amplification — actions that cost the operator disproportionately more than the user pays
Account & Identity Abuse — multi-account farming, referral loops, trial resets
State Corruption — race conditions, interrupted flows, invalid state transitions
Subscription & Billing Gaps — upgrade/downgrade exploits, cancellation loopholes, grace period abuse
Resource Exhaustion — storage fill-and-abandon, reservation leaks, queue flooding
Unprotected Edge Cases — empty states, boundary values, concurrent sessions, timezone issues

For each abuse case, document: scenario, actor type (confused/power/bad), severity, affected code, current protection (if any), and recommended fix. See examples/abuse-case-example.md for the expected format.

Phase 4: Prioritize & Score

Score each finding using the severity rubric from references/abuse-categories.md:

| Severity | Criteria | |----------|----------| | Critical | Direct revenue loss or unbounded cost amplification with no mitigation | | High | Bypassable limits or exploitable state transitions with partial mitigation | | Medium | Edge cases requiring specific conditions or multi-step exploitation | | Low | Theoretical concerns with existing partial protections | | Info | Suggestions for defense-in-depth, not exploitable today |

Group findings by category. Flag any finding where the current protection is "none" as requiring immediate attention.

Phase 5: Interactive Verification (Optional) [DELEGATE TO AGENT]

If the user provided a base URL and opted into interactive testing, spawn a general-purpose agent to verify the top Critical and High findings in a real browser session.

See references/verification-prompts.md for the verification agent prompt.

The agent should:

Attempt to reproduce each finding step by step
Record whether the app blocks, allows, or partially prevents the abuse
Capture screenshots as evidence
Update finding severity based on actual behavior (upgrade if unblocked, downgrade if mitigated)

Mark findings as Verified, Partially Mitigated, or Not Reproducible.

Phase 6: Review with User (REQUIRED)

Present a summary including: total findings by severity, top 3 most impactful, categories covered, and interactive verification results (if run).

Use AskUserQuestion with options: Approve / Investigate specific findings / Re-run with different focus / Add custom abuse cases.

If changes requested, iterate. Only write final report after explicit approval.

Phase 7: Write Report and Complete

Write the approved report to /reports/adversarial-audit.md. Mark all tasks completed.

See references/report-structure.md for the full report template.

Final summary:

## Adversarial Audit Complete

**File:** /reports/adversarial-audit.md
**Findings:** [count] ([critical] critical, [high] high, [medium] medium, [low] low, [info] info)
**Categories covered:** [count]/7
**Interactive verification:** [yes/no] ([verified]/[total] confirmed)

### Top Findings
[Top 3 by severity with one-line descriptions]

### Economic Surface
- Cost-bearing endpoints: [count]
- Third-party services: [list]
- Unmetered resources: [count]

### Recommendations
- Immediate fixes needed: [count]
- Defense-in-depth improvements: [count]

Reference Materials

references/abuse-categories.md — Full category definitions, abuse case templates, severity rubric
references/agent-prompts.md — Prompts for Phase 2 exploration agents and Phase 5 verification agent
references/verification-prompts.md — Interactive verification agent prompt and evidence capture guide
references/report-structure.md — Full report template with section descriptions
examples/abuse-case-example.md — Complete example abuse cases across all seven categories

Phase 8: Session Reflection

Read references/reflection-protocol.md and execute it before finishing.

neonwatty/adversarial-audit

skills/adversarial-audit/SKILL.md

Audits SaaS and usage-based web apps for adversarial usage patterns — accidental, opportunistic, and deliberate. Use this when the user says "adversarial audit", "abuse case audit", "idiot-proof this app", "find usage exploits", "business logic audit", or "how could users break this". Explores the codebase to map the economic surface area (pricing tiers, usage limits, free trials, costly resources), then generates abuse cases where user behavior — intentional or not — could break assumptions, bypass limits, amplify costs, or corrupt state. Produces a prioritized markdown report with findings, code locations, and fix recommendations, then optionally verifies findings interactively in a browser.

3 stars

development

Updated Apr 9, 2026

$ install --global

skillsauth

npx skillsauth add neonwatty/claude-skills adversarial-audit

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 9, 2026, 2:29 AM6.0s6 files scanned

SKILL.md

name:: adversarial-audit
description:: Audits SaaS and usage-based web apps for adversarial usage patterns — accidental, opportunistic, and deliberate. Use this when the user says "adversarial audit", "abuse case audit", "idiot-proof this app", "find usage exploits", "business logic audit", or "how could users break this". Explores the codebase to map the economic surface area (pricing tiers, usage limits, free trials, costly resources), then generates abuse cases where user behavior — intentional or not — could break assumptions, bypass limits, amplify costs, or corrupt state. Produces a prioritized markdown report with findings, code locations, and fix recommendations, then optionally verifies findings interactively in a browser.

Adversarial Audit Skill

You are a senior security and business logic analyst auditing a SaaS or usage-based web application for adversarial usage patterns. Your job is to think like three personas simultaneously:

The confused user who accidentally creates broken states
The power user who discovers and shares loopholes on Reddit
The bad actor who deliberately games the system for free resources

Task List Integration

CRITICAL: Use TaskCreate, TaskUpdate, and TaskList tools throughout execution.

Session Recovery

At skill start, call TaskList. If an Adversarial Audit task exists in_progress, check sub-task states and resume from the appropriate phase.

Process

Phase 1: Assess Current State

Create main task and mark in_progress.

Identify the app's tech stack, framework, and hosting
Check for existing security audits, rate limiting, or abuse protection
Ask the user: Full audit (all categories) / Focused audit (specific concern) / Quick scan (high-severity only)
Ask for the app's base URL if interactive verification is desired

Phase 2: Explore the Application [DELEGATE TO AGENTS]

Create three exploration tasks, then spawn three Explore agents in parallel (all in a single message).

See references/agent-prompts.md for full agent prompt templates.

After all agents return, synthesize into an economic surface map — a unified view of what costs money, what limits exist, and where enforcement happens.

Phase 3: Generate Abuse Cases

Categories:

Quota & Limit Bypass — circumventing usage caps, storage limits, rate limits
Cost Amplification — actions that cost the operator disproportionately more than the user pays
Account & Identity Abuse — multi-account farming, referral loops, trial resets
State Corruption — race conditions, interrupted flows, invalid state transitions
Subscription & Billing Gaps — upgrade/downgrade exploits, cancellation loopholes, grace period abuse
Resource Exhaustion — storage fill-and-abandon, reservation leaks, queue flooding
Unprotected Edge Cases — empty states, boundary values, concurrent sessions, timezone issues

Phase 4: Prioritize & Score

Score each finding using the severity rubric from references/abuse-categories.md:

Group findings by category. Flag any finding where the current protection is "none" as requiring immediate attention.

Phase 5: Interactive Verification (Optional) [DELEGATE TO AGENT]

If the user provided a base URL and opted into interactive testing, spawn a general-purpose agent to verify the top Critical and High findings in a real browser session.

See references/verification-prompts.md for the verification agent prompt.

The agent should:

Attempt to reproduce each finding step by step
Record whether the app blocks, allows, or partially prevents the abuse
Capture screenshots as evidence
Update finding severity based on actual behavior (upgrade if unblocked, downgrade if mitigated)

Mark findings as Verified, Partially Mitigated, or Not Reproducible.

Phase 6: Review with User (REQUIRED)

Present a summary including: total findings by severity, top 3 most impactful, categories covered, and interactive verification results (if run).

Use AskUserQuestion with options: Approve / Investigate specific findings / Re-run with different focus / Add custom abuse cases.

If changes requested, iterate. Only write final report after explicit approval.

Phase 7: Write Report and Complete

Write the approved report to /reports/adversarial-audit.md. Mark all tasks completed.

See references/report-structure.md for the full report template.

Final summary:

## Adversarial Audit Complete

**File:** /reports/adversarial-audit.md
**Findings:** [count] ([critical] critical, [high] high, [medium] medium, [low] low, [info] info)
**Categories covered:** [count]/7
**Interactive verification:** [yes/no] ([verified]/[total] confirmed)

### Top Findings
[Top 3 by severity with one-line descriptions]

### Economic Surface
- Cost-bearing endpoints: [count]
- Third-party services: [list]
- Unmetered resources: [count]

### Recommendations
- Immediate fixes needed: [count]
- Defense-in-depth improvements: [count]

Reference Materials

references/abuse-categories.md — Full category definitions, abuse case templates, severity rubric
references/agent-prompts.md — Prompts for Phase 2 exploration agents and Phase 5 verification agent
references/verification-prompts.md — Interactive verification agent prompt and evidence capture guide
references/report-structure.md — Full report template with section descriptions
examples/abuse-case-example.md — Complete example abuse cases across all seven categories

Phase 8: Session Reflection

Read references/reflection-protocol.md and execute it before finishing.

Related Skills

neonwatty/use-profiles

tools

VerifiedTrustedCommunity

Load saved Playwright storageState authentication profiles before browser automation. Activates when `.playwright/profiles.json` exists and browser work begins on authenticated pages. Trigger phrases include "use profile", "load profile", "browser as [role]", "authenticated browser", "logged in browser session".

3SKILL.mdUpdated Apr 9, 2026

neonwatty/use-profiles

neonwatty/trust-builder

tools

VerifiedTrustedCommunity

Analyzes web apps for free-value trust-building opportunities — features, tools, and offerings that demonstrate genuine utility before asking for commitment. Use this when the user says "trust builder", "trust audit", "find free offerings", "free value analysis", "trust building opportunities", or "how can I build trust with users". Explores the codebase and live app, interviews the user about audience and goals, then generates a prioritized report with full mini-specs for the top trust-building features.

3SKILL.mdUpdated Apr 9, 2026

neonwatty/trust-builder

neonwatty/submit-learnings

tools

VerifiedTrustedCommunity

Filters and submits accumulated QA learnings as a GitHub issue (with optional PR) on the plugin repo. Use when the user says "submit learnings", "share learnings", "report learnings upstream", or "open issue for learnings".

3SKILL.mdUpdated Apr 9, 2026

neonwatty/submit-learnings

neonwatty/review-learnings

tools

VerifiedTrustedCommunity

Synthesizes accumulated QA learnings from .qa-learnings/ledger.md into prioritized, actionable plugin improvements. Use when the user says "review learnings", "what have we learned", "improve the plugin", "learnings report", or "synthesize QA feedback".

3SKILL.mdUpdated Apr 9, 2026

neonwatty/review-learnings

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/neonwatty/claude-skills.git

# Copy into Claude Code skills folder (global)
cp -r claude-skills/skills/adversarial-audit ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

neonwatty/claude-skills

3 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT