.claude/skills/dpla-community-webs-ingest/SKILL.md
Run Community Webs ingest. Use when the user says harvest community-webs, run community-webs ingest, export community webs, or process community webs DB.
npx skillsauth add dpla/ingestion3 dpla-community-webs-ingestInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Community Webs ingest is handled by the dpla-hub-ingest skill with community-webs as the hub name.
The pre-processing step (SQLite DB → JSONL → ZIP → i3.conf update) runs entirely on EC2 via SSM — no local steps needed. See the Community Webs Pre-processing section in the dpla-hub-ingest skill for the full procedure.
Trigger phrase to use: "ingest community-webs" or "harvest community-webs"
data-ai
Show key i3.conf config for a hub (provider, harvest.type, harvest.endpoint, schedule, email, setlist). Use when user asks for hub config, harvest type/endpoint, who gets emails, schedule months, or OAI setlist details.
testing
Verify ingest outcomes and send failure or status notifications to Slack or [email protected]. Use when the user asks to verify the ingest, check if it succeeded, notify about a failure, or post to tech-alerts.
business
Report which hubs have new JSONL staged in S3 for a given month, and optionally post the report to Slack. Use when user asks what hubs are staged/ready for indexing, /ingest staged, or what changed this month in S3.
development
Follow the project workflow when adding or modifying shell or Python scripts. Use when the user asks to add a script, create a new script, modify a script, or write a script for a task. Ensures POSIX bash, common.sh, documentation in SCRIPTS.md, and tests are created or updated and run.