docs/ai-context/archive/cursor-skills/dpla-monitor-ingest-remap/SKILL.md
Monitor a running IngestRemap (remap.sh / ingest.sh step 2) or orchestrator remap stages. Use when user asks whether mapping/enrichment/jsonl is done yet, to monitor remap progress, or to check which stage is currently running.
Prefer orchestrator status when available; otherwise monitor the latest mapping/enrichment/jsonl output dirs and _SUCCESS markers.
set -a
source .env
set +a
./scripts/status/ingest-status.sh --watch
# One hub only
./scripts/status/ingest-status.sh <hub> --watch
set -a
source .env
set +a
HUB=<hub>
LOG="logs/remap-${HUB}-$(date +%Y%m%d_%H%M%S).log"
# If you're running remap directly:
./scripts/remap.sh "$HUB" 2>&1 | tee "$LOG"
# Or if you're running full ingest and want to watch step 2:
# ./scripts/ingest.sh "$HUB" 2>&1 | tee "$LOG"
# Poll the latest mapping/enrichment/jsonl output dirs for _SUCCESS markers
set -a
source .env
set +a
HUB=<hub>
DATA_ROOT="${DPLA_DATA:-$HOME/dpla/data}"
while true; do
  echo ""
  date
  for step in mapping enrichment jsonl; do
    step_dir="$DATA_ROOT/$HUB/$step"
    # Newest run directory for this step (names sort chronologically)
    latest=$(ls -1d "$step_dir"/*/ 2>/dev/null | sort -r | head -1 || true)
    latest="${latest%/}"
    if [[ -z "$latest" ]]; then
      echo "$step: (no output yet)"
      continue
    fi
    if [[ -f "$latest/_SUCCESS" ]]; then
      echo "$step: SUCCESS $(basename "$latest")"
    else
      echo "$step: RUNNING $(basename "$latest")"
    fi
  done
  sleep 30
done
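To answer "is it done yet?" without leaving a loop running, the same check can be run once and reported through the exit status. The following is a minimal sketch, assuming the same `$DATA_ROOT/$HUB/<step>/<run>/` layout as the polling loop; `latest_run_dir` and `remap_is_done` are hypothetical names and are not part of the ingestion3 scripts.

```bash
# Hypothetical helpers for a one-shot check; not part of ingestion3.

latest_run_dir() {
    # Print the lexicographically last subdirectory of $1 (empty if none).
    local step_dir="$1" latest
    latest=$(ls -1d "$step_dir"/*/ 2>/dev/null | sort -r | head -1 || true)
    echo "${latest%/}"
}

remap_is_done() {
    # $1 = data root, $2 = hub.
    # Return 0 only if mapping, enrichment, and jsonl each have a _SUCCESS
    # marker in their latest run directory; return 1 otherwise.
    local data_root="$1" hub="$2" step latest
    for step in mapping enrichment jsonl; do
        latest=$(latest_run_dir "$data_root/$hub/$step")
        [[ -n "$latest" && -f "$latest/_SUCCESS" ]] || return 1
    done
    return 0
}
```

Calling `remap_is_done "$DATA_ROOT" "$HUB" && echo "remap complete"` gives a yes/no answer. As with the loop, a `_SUCCESS` marker in the latest directory may belong to an earlier run if the current stage has not yet created its output directory, so compare the directory name against the time the remap started.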
If an output directory contains _temporary but no _SUCCESS, the write is incomplete (job still running or failed).
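The marker rule can be folded into a small helper that labels a single output directory. This is a sketch under stated assumptions: `classify_output_dir` is a hypothetical function name, not part of the ingestion3 scripts, and it assumes the `_temporary` entry sits directly inside the run directory being checked.

```bash
# Hypothetical helper; not part of ingestion3.
classify_output_dir() {
    # $1 = one run directory, e.g. a timestamped dir under mapping/.
    local dir="$1"
    if [[ ! -d "$dir" ]]; then
        echo "MISSING"       # directory does not exist yet
    elif [[ -e "$dir/_SUCCESS" ]]; then
        echo "SUCCESS"       # write finished and was committed
    elif [[ -e "$dir/_temporary" ]]; then
        echo "INCOMPLETE"    # job still running, or it failed mid-write
    else
        echo "UNKNOWN"       # neither marker present
    fi
}
```

An `INCOMPLETE` result alone cannot distinguish a running job from a failed one; check the orchestrator status or the remap log to tell them apart.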