claude/skills/wcb/SKILL.md
Web Context Builder — async web scraper that converts websites (docs sites, blogs, API references) into LLM-optimized Markdown. Crawls a domain, extracts content with BeautifulSoup, converts to clean Markdown, and merges pages into a single consumable file. Supports subdomain control, depth limits, include/exclude regex filters, concurrency tuning, and optional headless browser for JS-heavy sites. Use when the user asks to: 'build docs for this site', 'scrape this documentation', 'convert a website to markdown', 'make an LLM reference from a URL', 'download and flatten docs'. Also use when Claude needs to feed an external docs site into its own context. Triggers: 'wcb', 'scrape site', 'build docs from URL', 'crawl website to markdown', 'web to LLM', 'download docs site'.
npx skillsauth add kendreaditya/.config wcbInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Async crawler that scrapes a website and emits LLM-optimized Markdown. Outputs one file per page plus a merged <domain>.md for easy ingestion.
wcb https://docs.example.com # crawl, merged output in CWD
wcb https://docs.example.com -o ./my-docs # custom output dir
wcb https://docs.example.com -d 3 # max depth 3
wcb https://js-heavy-site.com --browser # headless browser for SPAs
wcb https://docs.example.com -i '/api/' -e '/old/' # include/exclude patterns
wcb https://docs.example.com --cross-subdomain # allow subdomain crawl
-o PATH — output dir-d N — max crawl depth (default: unlimited)-c N — concurrent requests (default 5; raise for fast hosts, lower to be polite)--delay SEC — per-request delay (default 0.1)--cross-subdomain — follow into subdomains (default: stay on exact subdomain)--browser — use Playwright for JS-rendered pages (slower but needed for SPAs)-i REGEX / -e REGEX — URL include/exclude filters (repeatable)--no-merge — skip the merged <domain>.md file-m NAME — custom merged filename--browser helps some but not all.curl | html2markdown or the browser "Save as".shortn afterwards: wcb ... -o out && shortn out/site.md -t 50000.--browser requires playwright install chromium first.testing
Reviews test coverage and suggests missing test cases for error paths, edge cases, and business logic. Activates when users write tests or implement new features.
development
Identify, categorize, and prioritize technical debt. Trigger with "tech debt", "technical debt audit", "what should we refactor", "code health", or when the user asks about code quality, refactoring priorities, or maintenance backlog.
tools
Comprehensive security scanning and vulnerability detection. Includes input validation, path traversal prevention, CVE detection, and secure coding pattern enforcement. Use when: authentication implementation, authorization logic, payment processing, user data handling, API endpoint creation, file upload handling, database queries, external API integration. Skip when: read-only operations on public data, internal development tooling, static documentation, styling changes.
development
Optimizes application performance. Use when performance requirements exist, when you suspect performance regressions, or when Core Web Vitals or load times need improvement. Use when profiling reveals bottlenecks that need fixing.