offensive-tools/osint/theharvester/SKILL.md
Harvest emails, subdomains, hostnames, employee names, open ports, and banners for a target domain from public sources. Use at the start of recon to build an attack surface map: enumerate email addresses for phishing, discover subdomains for web app testing, and identify infrastructure from passive sources.
npx skillsauth add aeondave/malskill theharvesterInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Email, subdomain, and hostname harvester from public OSINT sources.
# Install
pip install theHarvester
# or on Kali: already installed
# Basic harvest (google + bing sources)
theHarvester -d target.com -b google,bing
# All available sources
theHarvester -d target.com -b all -l 500
# Save results
theHarvester -d target.com -b google,bing,linkedin -l 300 -f results
# Outputs: results.html + results.xml
| Flag | Purpose |
|------|---------|
| -d <domain> | Target domain |
| -b <sources> | Data sources (comma-separated or all) |
| -l <n> | Limit results per source (default: 500) |
| -f <filename> | Save to HTML/XML (no extension needed) |
| -n | DNS lookup on discovered hosts |
| -c | DNS brute force (uses built-in wordlist) |
| -v | Verify hostnames via DNS resolution |
| -e <ip> | Use custom DNS server |
| -p | Port scan open ports on discovered hosts |
| -s <n> | Start result offset |
| --screenshot <dir> | Screenshot discovered web services |
# Passive (no key needed)
-b google,bing,yahoo,duckduckgo,baidu,crtsh,certspotter,hackertarget,dnsdumpster,rapiddns,sublist3r
# Requires API keys
-b hunter,securitytrails,shodan,censys,fullhunt,intelx,virustotal,bevigil,binaryedge
# LinkedIn (extracts names/titles — no key needed, rate limited)
-b linkedin,linkedin_links
# All sources at once
-b all
Recon on target company:
# Phase 1: passive — no noise
theHarvester -d corp.com -b google,bing,duckduckgo,crtsh,certspotter,hackertarget -l 500 -f corp_passive
# Phase 2: extended — with APIs
theHarvester -d corp.com -b all -l 1000 -f corp_full
# Phase 3: verify + port scan discovered hosts
theHarvester -d corp.com -b google,crtsh -v -p -l 200
Email harvest for phishing prep:
theHarvester -d target.com -b google,bing,linkedin,hunter -l 500 -f emails
# Check results.html for email list
grep -oE '[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,}' results.xml | sort -u > emails.txt
Subdomain discovery:
theHarvester -d target.com -b crtsh,certspotter,dnsdumpster,sublist3r,hackertarget -l 500 -f subdomains
# Config: /etc/theHarvester/api-keys.yaml or ~/.theHarvester/api-keys.yaml
apikeys:
hunter:
key: YOUR_KEY # hunter.io — free 50/month
securitytrails:
key: YOUR_KEY # securitytrails.com — free 50/month
shodan:
key: YOUR_KEY # shodan.io — free tier
virustotal:
key: YOUR_KEY # virustotal.com — free
intelx:
key: YOUR_KEY # intelx.io — free tier
# Extract emails from XML output
grep -oE '[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,}' results.xml | sort -u
# Extract subdomains
grep -oE '([a-zA-Z0-9_-]+\.)+target\.com' results.xml | sort -u
# Use holehe on harvested emails
while read email; do holehe "$email" --only-used; done < emails.txt
| File | When to load |
|------|--------------|
| references/sources.md | Full source list, API key setup, output parsing, integration with amass/holehe |
data-ai
Scoped routing: Linux operator; hosts, sessions, users, services, packages, logs, containers, SSH, network paths, privilege evidence.
development
Offensive methodology for ICS/OT/SCADA environments in authorized industrial penetration testing and red team operations. Use when assessing PLCs, RTUs, HMIs, engineering workstations, historians, or field devices running Modbus, DNP3, EtherNet/IP, S7comm/S7+, Profinet, IEC 60870-5-104, BACnet, or OPC-UA. Covers passive OT network enumeration, protocol-level device interrogation, PLC coil/register read-write attacks, HMI session exploitation, historian and engineering workstation compromise, and safe escalation rules for critical infrastructure scope. Does not cover: general IT network exploitation (network-technique), physical hardware interfaces UART/JTAG/SPI (hardware-technique), wireless sensor network attacks (wireless-technique), RF/SDR signal analysis (hardware-ctf or wireless-technique), or CTF-framed ICS lab tasks (ics-ctf).
tools
Offensive methodology for authorized game security assessments, game client security research, and game-adjacent penetration testing in real-world engagements. Use when assessing game clients for cheating vulnerabilities, testing anti-cheat effectiveness, auditing game server protocols for score manipulation or economic fraud, reverse engineering game DRM or license validation, analyzing game save file protection, or assessing game mod/plugin security. Covers: process memory scanning and manipulation (Cheat Engine methodology), game binary reversing for license and DRM bypass, game network protocol analysis and packet replay, anti-cheat mechanism analysis, save file format reversing and tampering, speed hack and value injection techniques. Does NOT cover: CTF game challenges (game-ctf), game engine source code auditing (web-exploit-technique or vuln-search-technique for the backend), or general binary exploitation (pwn-ctf or reversing-technique).
development
Auth assessment: hardware/embedded methodology; UART/JTAG/SWD/SPI/I2C, firmware extraction, boot/debug paths, embedded OS evidence.