Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

abelrguezr/ai-fuzzing-assistant

Name: ai-fuzzing-assistant
Author: abelrguezr

skills/AI/AI-Assisted-Fuzzing-and-Vulnerability-Discovery/SKILL.md

npx skillsauth add abelrguezr/hacktricks-skills ai-fuzzing-assistant

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

AI-Assisted Fuzzing & Vulnerability Discovery

This skill helps you leverage large language models to supercharge traditional vulnerability research pipelines. It covers seed generation, grammar evolution, crash analysis, exploit generation, and AI-guided patching.

When to Use This Skill

Use this skill when you need to:

Generate semantically valid fuzzing seeds for complex input formats (SQL, URLs, binary protocols)
Evolve fuzzing grammars based on coverage feedback
Analyze crashes and generate proof-of-vulnerability (PoV) exploits
Create mutation dictionaries for directed fuzzing
Cluster crash signatures and generate unified patches
Set up an end-to-end AI-assisted vulnerability discovery workflow

Core Techniques

1. LLM-Generated Seed Inputs

Traditional fuzzers mutate bytes blindly. LLMs can generate syntax-correct, security-relevant inputs that reach deeper code paths faster.

Use the seed generator script:

python scripts/gen_seeds.py --format <format> --count <N> --output <file>

Supported formats:

sql - SQL injection payloads
xss - Cross-site scripting payloads
path - Path traversal payloads
url - URL manipulation payloads
custom - Custom format (provide prompt)

Example:

python scripts/gen_seeds.py --format sql --count 200 --output seeds.txt
afl-fuzz -i seeds.txt -o findings/ -- ./target @@

Tips:

Ask for diverse payload lengths and encodings (UTF-8, URL-encoded, UTF-16-LE)
Keep payloads under common length limits (≤256 bytes)
Regenerate with modified prompts to target specific vulnerabilities

2. Grammar-Evolution Fuzzing

Let the LLM evolve a grammar based on coverage feedback instead of just generating seeds.

Workflow:

Generate initial grammar via prompt
Fuzz for N minutes, collect coverage metrics
Feed uncovered areas back to LLM for grammar refinement
Repeat until coverage plateaus

Use the grammar evolution script:

python scripts/evolve_grammar.py \
  --grammar grammar.txt \
  --coverage-report coverage.json \
  --output grammar_v2.txt

Key parameters:

--max-epochs - Number of refinement iterations (default: 5)
--coverage-threshold - Stop when Δcoverage < threshold (default: 0.01)
--diff-mode - Use diff/patch instructions for efficient edits

Example prompt for grammar refinement:

The previous grammar triggered 12% of program edges.
Functions not reached: parse_auth, handle_upload.
Add or modify rules to cover these areas.

3. Agent-Based PoV Generation

After finding a crash, you need a deterministic proof-of-vulnerability.

Use the crash analyzer script:

python scripts/analyze_crashes.py \
  --crash-db crashes/ \
  --target ./binary \
  --output povs/

What it does:

Reads crash signatures (PC, input slice, sanitizer messages)
Attempts to reproduce locally with gdb
Generates minimal exploit payloads
Validates in sandbox
Saves working PoVs, re-queues failures as fuzzing seeds

Output structure:

povs/
├── crash_001/
│   ├── input.bin          # Minimal triggering input
│   ├── gdb-session.txt    # Reproduction steps
│   └── analysis.md        # Vulnerability explanation
└── failed_seeds.txt       # Re-queued for fuzzing

4. Directed Fuzzing with Mutation Dictionaries

Fine-tuned code models can suggest targeted mutation patterns for specific functions.

Generate mutation dictionaries:

python scripts/gen_seeds.py \
  --format custom \
  --prompt "Give mutation dictionary entries likely to break memory safety in sprintf wrapper" \
  --output mutations.txt

Example output:

{"pattern": "%99999999s"}
{"pattern": "AAAAAAAA....<1024>....%n"}

Integrate with AFL++:

afl-fuzz -i seeds.txt -o findings/ \
  -x mutations.txt \
  -- ./target @@

5. AI-Guided Patching

Super Patches

Cluster crash signatures and generate unified patches that fix multiple bugs from a common root cause.

python scripts/analyze_crashes.py \
  --crash-db crashes/ \
  --mode super-patch \
  --output patches/

Prompt template:

Here are N stack traces + file snippets.
Identify the shared mistake and generate a unified diff fixing all occurrences.

Speculative Patch Queue

Interleave confirmed PoV-validated patches with speculative patches at a tunable ratio.

Configuration:

{
  "confirmed_ratio": 1,
  "speculative_ratio": 2,
  "penalty_threshold": 0.3
}

End-to-End Workflow

graph TD
    subgraph Discovery
        A[LLM Seed/Grammar Gen] --> B[Fuzzer]
        C[Fine-Tuned Model Dicts] --> B
    end
    B --> D[Crash DB]
    D --> E[Agent PoV Gen]
    E -->|valid PoV| PatchQueue
    D -->|cluster| F[LLM Super-Patch]
    PatchQueue --> G[Patch Submitter]

Recommended sequence:

Generate seeds with gen_seeds.py
Run fuzzer (AFL++, libFuzzer, Honggfuzz)
Collect crashes in database
Run analyze_crashes.py for PoV generation
Generate patches with super-patch mode
Submit patches, monitor scoring
Feed failed PoVs back as fuzzing seeds

Best Practices

Seed Generation

Diversify encodings: Ask for UTF-8, URL-encoded, UTF-16-LE variants
Respect limits: Keep payloads under common length thresholds
Single script: Request self-contained Python scripts to avoid JSON parsing issues

Grammar Evolution

Budget tokens: Each refinement costs tokens; set reasonable limits
Use diffs: Prefer patch instructions over full rewrites
Stop early: Halt when coverage improvement plateaus (Δ < 0.01)

Crash Analysis

Parallelize: Spawn multiple agents with different models/temperatures
Validate: Always test PoVs in sandbox before submission
Feedback loop: Failed attempts become new fuzzing seeds

Patching

Cluster first: Group crashes by signature before patching
Cost model: Track penalties vs. points to tune speculative ratio
Unified diffs: Prefer single patches fixing multiple bugs

Integration with Existing Tools

AFL++

# Generate seeds
python scripts/gen_seeds.py --format sql --output seeds/

# Run with mutation dictionary
afl-fuzz -i seeds/ -o findings/ -x mutations.txt -- ./target @@

libFuzzer

# Generate grammar
python scripts/evolve_grammar.py --grammar grammar.txt

# Compile with grammar
clang -fsanitize=fuzzer -o fuzzer fuzzer.cpp
./fuzzer grammar.txt

Honggfuzz

# Generate seeds
python scripts/gen_seeds.py --format custom --prompt "..." --output seeds/

# Run
hfuzz_run -i seeds/ -o findings/ -- ./target @@

Troubleshooting

Seeds not triggering new coverage:

Increase payload diversity (ask for more encodings)
Try grammar evolution instead of static seeds
Check if target has input validation blocking malformed inputs

Grammar not improving:

Verify coverage metrics are accurate
Increase refinement epochs
Try different LLM or temperature settings

PoV generation failing:

Check crash reproducibility manually first
Increase agent count for parallel attempts
Lower temperature for more deterministic outputs

Patches being rejected:

Validate PoVs before patching
Reduce speculative patch ratio
Review crash clustering for false positives

References

Trail of Bits – AIxCC finals: Tale of the tape
CTF Radiooo AIxCC finalist interviews
AFL++ Documentation
libFuzzer Documentation

abelrguezr/ai-fuzzing-assistant

skills/AI/AI-Assisted-Fuzzing-and-Vulnerability-Discovery/SKILL.md

AI-assisted fuzzing and vulnerability discovery. Use this skill whenever the user wants to generate fuzzing seeds, evolve grammars, analyze crashes, create proof-of-vulnerability exploits, or generate patches for discovered bugs. Trigger on mentions of fuzzing, AFL++, libFuzzer, vulnerability discovery, crash analysis, exploit generation, or security testing with LLMs.

5 stars

testing

Updated Apr 16, 2026

$ install --global

skillsauth

npx skillsauth add abelrguezr/hacktricks-skills ai-fuzzing-assistant

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 16, 2026, 2:34 AM44.7s4 files scanned

SKILL.md

name:: ai-fuzzing-assistant
description:: AI-assisted fuzzing and vulnerability discovery. Use this skill whenever the user wants to generate fuzzing seeds, evolve grammars, analyze crashes, create proof-of-vulnerability exploits, or generate patches for discovered bugs. Trigger on mentions of fuzzing, AFL++, libFuzzer, vulnerability discovery, crash analysis, exploit generation, or security testing with LLMs.

AI-Assisted Fuzzing & Vulnerability Discovery

When to Use This Skill

Use this skill when you need to:

Generate semantically valid fuzzing seeds for complex input formats (SQL, URLs, binary protocols)
Evolve fuzzing grammars based on coverage feedback
Analyze crashes and generate proof-of-vulnerability (PoV) exploits
Create mutation dictionaries for directed fuzzing
Cluster crash signatures and generate unified patches
Set up an end-to-end AI-assisted vulnerability discovery workflow

Core Techniques

1. LLM-Generated Seed Inputs

Traditional fuzzers mutate bytes blindly. LLMs can generate syntax-correct, security-relevant inputs that reach deeper code paths faster.

Use the seed generator script:

python scripts/gen_seeds.py --format <format> --count <N> --output <file>

Supported formats:

sql - SQL injection payloads
xss - Cross-site scripting payloads
path - Path traversal payloads
url - URL manipulation payloads
custom - Custom format (provide prompt)

Example:

python scripts/gen_seeds.py --format sql --count 200 --output seeds.txt
afl-fuzz -i seeds.txt -o findings/ -- ./target @@

Tips:

Ask for diverse payload lengths and encodings (UTF-8, URL-encoded, UTF-16-LE)
Keep payloads under common length limits (≤256 bytes)
Regenerate with modified prompts to target specific vulnerabilities

2. Grammar-Evolution Fuzzing

Let the LLM evolve a grammar based on coverage feedback instead of just generating seeds.

Workflow:

Generate initial grammar via prompt
Fuzz for N minutes, collect coverage metrics
Feed uncovered areas back to LLM for grammar refinement
Repeat until coverage plateaus

Use the grammar evolution script:

python scripts/evolve_grammar.py \
  --grammar grammar.txt \
  --coverage-report coverage.json \
  --output grammar_v2.txt

Key parameters:

--max-epochs - Number of refinement iterations (default: 5)
--coverage-threshold - Stop when Δcoverage < threshold (default: 0.01)
--diff-mode - Use diff/patch instructions for efficient edits

Example prompt for grammar refinement:

The previous grammar triggered 12% of program edges.
Functions not reached: parse_auth, handle_upload.
Add or modify rules to cover these areas.

3. Agent-Based PoV Generation

After finding a crash, you need a deterministic proof-of-vulnerability.

Use the crash analyzer script:

python scripts/analyze_crashes.py \
  --crash-db crashes/ \
  --target ./binary \
  --output povs/

What it does:

Reads crash signatures (PC, input slice, sanitizer messages)
Attempts to reproduce locally with gdb
Generates minimal exploit payloads
Validates in sandbox
Saves working PoVs, re-queues failures as fuzzing seeds

Output structure:

povs/
├── crash_001/
│   ├── input.bin          # Minimal triggering input
│   ├── gdb-session.txt    # Reproduction steps
│   └── analysis.md        # Vulnerability explanation
└── failed_seeds.txt       # Re-queued for fuzzing

4. Directed Fuzzing with Mutation Dictionaries

Fine-tuned code models can suggest targeted mutation patterns for specific functions.

Generate mutation dictionaries:

python scripts/gen_seeds.py \
  --format custom \
  --prompt "Give mutation dictionary entries likely to break memory safety in sprintf wrapper" \
  --output mutations.txt

Example output:

{"pattern": "%99999999s"}
{"pattern": "AAAAAAAA....<1024>....%n"}

Integrate with AFL++:

afl-fuzz -i seeds.txt -o findings/ \
  -x mutations.txt \
  -- ./target @@

5. AI-Guided Patching

Super Patches

Cluster crash signatures and generate unified patches that fix multiple bugs from a common root cause.

python scripts/analyze_crashes.py \
  --crash-db crashes/ \
  --mode super-patch \
  --output patches/

Prompt template:

Here are N stack traces + file snippets.
Identify the shared mistake and generate a unified diff fixing all occurrences.

Speculative Patch Queue

Interleave confirmed PoV-validated patches with speculative patches at a tunable ratio.

Configuration:

{
  "confirmed_ratio": 1,
  "speculative_ratio": 2,
  "penalty_threshold": 0.3
}

End-to-End Workflow

graph TD
    subgraph Discovery
        A[LLM Seed/Grammar Gen] --> B[Fuzzer]
        C[Fine-Tuned Model Dicts] --> B
    end
    B --> D[Crash DB]
    D --> E[Agent PoV Gen]
    E -->|valid PoV| PatchQueue
    D -->|cluster| F[LLM Super-Patch]
    PatchQueue --> G[Patch Submitter]

Recommended sequence:

Generate seeds with gen_seeds.py
Run fuzzer (AFL++, libFuzzer, Honggfuzz)
Collect crashes in database
Run analyze_crashes.py for PoV generation
Generate patches with super-patch mode
Submit patches, monitor scoring
Feed failed PoVs back as fuzzing seeds

Best Practices

Seed Generation

Diversify encodings: Ask for UTF-8, URL-encoded, UTF-16-LE variants
Respect limits: Keep payloads under common length thresholds
Single script: Request self-contained Python scripts to avoid JSON parsing issues

Grammar Evolution

Budget tokens: Each refinement costs tokens; set reasonable limits
Use diffs: Prefer patch instructions over full rewrites
Stop early: Halt when coverage improvement plateaus (Δ < 0.01)

Crash Analysis

Parallelize: Spawn multiple agents with different models/temperatures
Validate: Always test PoVs in sandbox before submission
Feedback loop: Failed attempts become new fuzzing seeds

Patching

Cluster first: Group crashes by signature before patching
Cost model: Track penalties vs. points to tune speculative ratio
Unified diffs: Prefer single patches fixing multiple bugs

Integration with Existing Tools

AFL++

# Generate seeds
python scripts/gen_seeds.py --format sql --output seeds/

# Run with mutation dictionary
afl-fuzz -i seeds/ -o findings/ -x mutations.txt -- ./target @@

libFuzzer

# Generate grammar
python scripts/evolve_grammar.py --grammar grammar.txt

# Compile with grammar
clang -fsanitize=fuzzer -o fuzzer fuzzer.cpp
./fuzzer grammar.txt

Honggfuzz

# Generate seeds
python scripts/gen_seeds.py --format custom --prompt "..." --output seeds/

# Run
hfuzz_run -i seeds/ -o findings/ -- ./target @@

Troubleshooting

Seeds not triggering new coverage:

Increase payload diversity (ask for more encodings)
Try grammar evolution instead of static seeds
Check if target has input validation blocking malformed inputs

Grammar not improving:

Verify coverage metrics are accurate
Increase refinement epochs
Try different LLM or temperature settings

PoV generation failing:

Check crash reproducibility manually first
Increase agent count for parallel attempts
Lower temperature for more deterministic outputs

Patches being rejected:

Validate PoVs before patching
Reduce speculative patch ratio
Review crash clustering for false positives

References

Trail of Bits – AIxCC finals: Tale of the tape
CTF Radiooo AIxCC finalist interviews
AFL++ Documentation
libFuzzer Documentation

Related Skills

abelrguezr/house-of-lore-exploit

testing

VerifiedTrustedCommunity

How to perform a House of Lore (small bin attack) heap exploitation. Use this skill whenever the user mentions heap exploitation, small bin attacks, fake chunks, glibc heap vulnerabilities, or needs to insert fake chunks into small bins for arbitrary read/write. Trigger for CTF challenges involving heap corruption, glibc 2.31+ exploitation, or when the user needs to bypass malloc sanity checks using fake chunk linking.

5SKILL.mdUpdated Apr 16, 2026

abelrguezr/house-of-lore-exploit

abelrguezr/house-of-force-exploit

testing

VerifiedTrustedCommunity

How to perform House of Force heap exploitation attacks. Use this skill whenever the user mentions heap exploitation, House of Force, top chunk manipulation, arbitrary memory allocation, malloc manipulation, or wants to allocate chunks at specific addresses. Also trigger for CTF challenges involving heap overflows, top chunk size overwrites, or when the user needs to calculate evil_size for heap attacks. Make sure to use this skill for any binary exploitation task involving glibc heap manipulation, even if they don't explicitly say "House of Force".

5SKILL.mdUpdated Apr 16, 2026

abelrguezr/house-of-force-exploit

abelrguezr/house-of-einherjar

tools

VerifiedTrustedCommunity

How to perform House of Einherjar heap exploitation to allocate memory at arbitrary addresses. Use this skill whenever the user mentions heap exploitation, glibc heap attacks, arbitrary memory allocation, off-by-one overflow exploitation, tcache poisoning, fast bin attacks, or any CTF challenge involving heap manipulation. This is essential for binary exploitation tasks where you need to control malloc() return addresses.

5SKILL.mdUpdated Apr 16, 2026

abelrguezr/house-of-einherjar

abelrguezr/heap-overflow-exploitation

testing

VerifiedTrustedCommunity

How to identify, analyze, and exploit heap overflow vulnerabilities in binary exploitation challenges and real-world scenarios. Use this skill whenever the user mentions heap overflows, memory corruption, heap grooming, tcache poisoning, fast-bin attacks, or any heap-related vulnerability in CTF challenges, binary analysis, or security research. This skill covers heap overflow fundamentals, exploitation techniques, heap grooming strategies, and real-world CVE analysis.

5SKILL.mdUpdated Apr 16, 2026

abelrguezr/heap-overflow-exploitation

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/abelrguezr/hacktricks-skills.git

# Copy into Claude Code skills folder (global)
cp -r hacktricks-skills/skills/AI/AI-Assisted-Fuzzing-and-Vulnerability-Discovery ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

abelrguezr/hacktricks-skills

5 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT