skills/harvest-deep-crawl/SKILL.md
Multi-page deep crawling - documentation sites, wikis, knowledge bases
npx skillsauth add rubicanjr/FinCognis harvest-deep-crawl

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Crawl multi-page websites following internal links to a specified depth. Ideal for building complete knowledge bases from documentation sites, wikis, and reference materials.
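Depth-limited link following can be pictured as a breadth-first traversal. The sketch below is illustrative only and is not the skill's actual implementation: `crawl_order` and the in-memory `links` map are hypothetical stand-ins for fetching pages and extracting their internal links, and it assumes the start URL counts as depth 0.

```python
from collections import deque


def crawl_order(start, links, max_depth=2, max_pages=100):
    """Return (url, depth) pairs in breadth-first order.

    `links` is a hypothetical in-memory stand-in for fetching a page and
    extracting its internal links: it maps each URL to the URLs it links to.
    Assumption: the start URL is depth 0.
    """
    seen = {start}
    queue = deque([(start, 0)])
    visited = []
    while queue and len(visited) < max_pages:
        url, depth = queue.popleft()
        visited.append((url, depth))
        if depth == max_depth:
            continue  # do not follow links beyond the depth limit
        for target in links.get(url, []):
            if target not in seen:  # skip pages already queued or visited
                seen.add(target)
                queue.append((target, depth + 1))
    return visited


# Toy site used only for illustration.
site = {
    "/": ["/guide", "/api"],
    "/guide": ["/guide/install", "/"],
    "/api": ["/api/auth"],
    "/guide/install": ["/guide/install/linux"],
}
```

With this toy site, `crawl_order("/", site, max_depth=2)` visits the start page, its two children, and their children, but never reaches `/guide/install/linux`, which sits at depth 3.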
/crawl <url> --depth <N>
# Crawl docs site 3 levels deep
/crawl https://docs.example.com --depth 3
# Crawl a specific section
/crawl https://docs.example.com/api --depth 2
# Crawl with page limit
/crawl https://wiki.example.com --depth 5 --max-pages 50
| Param | Default | Description |
|-------|---------|-------------|
| --depth | 2 | Max link-following depth |
| --max-pages | 100 | Max pages to crawl |
| --same-domain | true | Stay on same domain |
| --include | * | URL pattern to include |
| --exclude | - | URL pattern to exclude |
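As a rough illustration of how the filtering parameters might combine, the sketch below checks a candidate URL against the same-domain, include, and exclude settings. It assumes the patterns are shell-style globs matched against the full URL, which the table does not specify, and `should_crawl` is a hypothetical helper rather than part of the skill.

```python
from fnmatch import fnmatchcase
from urllib.parse import urlparse


def should_crawl(url, start_url, same_domain=True, include="*", exclude=None):
    """Decide whether a discovered URL passes the crawl filters.

    Assumption: include/exclude are glob patterns matched against the
    full URL; exclude=None mirrors the "-" (no exclusion) default.
    """
    if same_domain and urlparse(url).netloc != urlparse(start_url).netloc:
        return False  # link leaves the starting domain
    if exclude and fnmatchcase(url, exclude):
        return False  # explicitly excluded
    return fnmatchcase(url, include)


start = "https://docs.example.com"
```

Under these assumptions, `should_crawl("https://docs.example.com/api/auth", start, include="*/api/*")` passes, while a link to another host is rejected regardless of the patterns.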
.claude/cache/agents/harvest/crawl-{domain}/crawl-{domain}-{timestamp}/
index.md # Table of contents + summary
page-001.md # First page content
page-002.md # Second page content
...
metadata.json # Crawl stats, URLs, timings
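A finished crawl directory could be read back along these lines. This is a minimal sketch that relies only on the file names shown above; `load_crawl` is a hypothetical helper, and because the keys inside `metadata.json` are not documented here, the JSON is returned as-is without assuming any structure.

```python
import json
from pathlib import Path


def load_crawl(crawl_dir):
    """Return the page files in order plus the parsed metadata.json.

    Relies only on the page-NNN.md naming and the metadata.json file
    name from the layout above; no metadata keys are assumed.
    """
    crawl_dir = Path(crawl_dir)
    # Zero-padded numbering means lexical sort matches crawl order.
    pages = sorted(crawl_dir.glob("page-*.md"))
    metadata = json.loads((crawl_dir / "metadata.json").read_text(encoding="utf-8"))
    return pages, metadata
```

Calling `load_crawl` on a crawl directory returns the page paths sorted as `page-001.md`, `page-002.md`, and so on, leaving `index.md` out of the page list.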
curl -s http://localhost:11235/crawl \
-H "Content-Type: application/json" \
-d '{
"urls": ["https://docs.example.com"],
"max_depth": 3,
"same_domain": true,
"word_count_threshold": 50
}'
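The same request can be assembled from Python using only the standard library. The sketch mirrors the endpoint and JSON fields of the curl call above; `build_crawl_request` is a hypothetical helper, and the response format is not documented here, so the sending step is left commented out.

```python
import json
import urllib.request


def build_crawl_request(urls, max_depth=3, same_domain=True,
                        word_count_threshold=50,
                        endpoint="http://localhost:11235/crawl"):
    """Build a POST request carrying the same JSON body as the curl example."""
    payload = {
        "urls": list(urls),
        "max_depth": max_depth,
        "same_domain": same_domain,
        "word_count_threshold": word_count_threshold,
    }
    return urllib.request.Request(
        endpoint,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_crawl_request(["https://docs.example.com"])

# To send it (requires the service to be listening on the endpoint):
# with urllib.request.urlopen(request) as response:
#     result = json.load(response)
```

Building the request separately from sending it makes the payload easy to inspect before anything touches the network.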
When Docker is unavailable:
| Scenario | Depth | Max Pages |
|----------|-------|-----------|
| API reference | 2-3 | 50 |
| Full documentation site | 3-5 | 100 |
| Wiki section | 2 | 30 |
| Changelog history | 1-2 | 20 |
| Tutorial series | 2-3 | 30 |