docs/ai-context/archive/cursor-skills/dpla-staged-report/SKILL.md
Report which hubs have new JSONL staged in S3 for a given month, and optionally post the report to Slack. Use when the user asks what hubs are staged/ready for indexing, /ingest staged, or what changed this month in S3.
npx skillsauth add dpla/ingestion3 dpla-staged-report
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Uses scheduler/orchestrator/staged_report.py to scan s3://dpla-master-dataset/<hub>/jsonl/ for timestamped dirs in a target month.
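The month filter described above can be sketched roughly as follows. This is not the code in staged_report.py; it is a minimal illustration that assumes each staged directory name begins with a YYYYMMDD_HHMMSS timestamp, and the function name staged_in_month is hypothetical.

```python
import re
from datetime import datetime

# Assumption: staged directory names start with a YYYYMMDD_HHMMSS timestamp.
_STAMP = re.compile(r"^(\d{8}_\d{6})")


def staged_in_month(dir_names, year, month):
    """Return, sorted, the names whose leading timestamp falls in year/month."""
    matches = []
    for name in dir_names:
        found = _STAMP.match(name)
        if not found:
            continue  # not a timestamped directory
        try:
            stamp = datetime.strptime(found.group(1), "%Y%m%d_%H%M%S")
        except ValueError:
            continue  # digits present but not a real date or time
        if stamp.year == year and stamp.month == month:
            matches.append(name)
    return sorted(matches)
```

Names that lack a leading timestamp, or whose digits do not form a valid date, are skipped rather than raising, so one stray key under the prefix would not abort the whole report.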
# Export vars from .env so child processes see them
set -a
source .env
set +a
# Current month (console)
./venv/bin/python -m scheduler.orchestrator.staged_report
# February (month=2)
./venv/bin/python -m scheduler.orchestrator.staged_report --month=2
# JSON output (for scripting)
./venv/bin/python -m scheduler.orchestrator.staged_report --month=2 --json
# Post to Slack (uses SLACK_TECH_WEBHOOK then SLACK_WEBHOOK)
./venv/bin/python -m scheduler.orchestrator.staged_report --month=2 --slack
# If needed, set AWS profile explicitly
./venv/bin/python -m scheduler.orchestrator.staged_report --month=2 --profile=dpla
dpla-master-dataset.
SLACK_TECH_WEBHOOK or SLACK_WEBHOOK is set (usually via source .env).
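The webhook lookup order noted for --slack (SLACK_TECH_WEBHOOK first, then SLACK_WEBHOOK) could look something like the sketch below. The helper name resolve_slack_webhook is hypothetical, and treating an empty value as unset is an assumption rather than documented behavior of the script.

```python
import os


def resolve_slack_webhook(env=None):
    """Return the first non-empty webhook URL, or None if neither is set."""
    env = os.environ if env is None else env
    # Preference order taken from the --slack comment in the commands above.
    for name in ("SLACK_TECH_WEBHOOK", "SLACK_WEBHOOK"):
        value = env.get(name, "").strip()
        if value:  # assumption: a blank value counts as unset
            return value
    return None
```

Calling resolve_slack_webhook() with no arguments reads the process environment, which is why the variables need to be exported (set -a before source .env) rather than only defined in the shell.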