skills/harvest-structured/SKILL.md
Structured data extraction - tables, pricing, products, API endpoints with schema
npx skillsauth add vibeeval/vibecosystem harvest-structuredInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Extract structured data from web pages using user-defined schemas. Turns messy HTML into clean JSON/CSV - pricing tables, product listings, API endpoint docs, comparison matrices.
/scrape <url> --schema "<field descriptions>"
# Extract pricing data
/scrape https://example.com/pricing --schema "plan_name, price, features[], cta_text"
# Extract product listings
/scrape https://store.example.com/products --schema "name, price, rating, reviews_count, image_url"
# Extract API endpoints
/scrape https://docs.api.com/reference --schema "method, path, description, parameters[], response_code"
Define fields as comma-separated names. Use [] for arrays:
name → Single text value
price → Single value (auto-detects currency)
features[] → Array of items
description → Long text
url → Auto-detects links
image_url → Auto-detects image sources
[
{
"plan_name": "Pro",
"price": "$29/mo",
"features": ["Unlimited projects", "Priority support", "API access"],
"source_url": "https://example.com/pricing"
}
]
plan_name,price,features,source_url
Pro,"$29/mo","Unlimited projects; Priority support; API access",https://example.com/pricing
development
Goal-based workflow orchestration - routes tasks to specialist agents based on user goals
tools
Wiring Verification
development
Connection management, room patterns, reconnection strategies, message buffering, and binary protocol design.
testing
VP Engineering perspective - org design (team topologies), process improvement, cross-team dependencies, engineering culture, OKRs, incident management maturity, platform strategy, DX optimization, release management at scale