.agents/skills/run_pipeline/SKILL.md
How to set up and run the DataPipeline OS extraction pipeline
Windows:
# Option 1: Double-click run.bat in the project root
# Option 2: Manual
cd datapipeline_os
venv\Scripts\activate
python app.py
# Open http://localhost:8080
macOS/Linux:
# Option 1: Run ./run.sh from the project root
# Option 2: Manual
cd datapipeline_os
source venv/bin/activate
python app.py
# Open http://localhost:8080
Create virtual environment:
python -m venv venv
Activate venv:
venv\Scripts\activate # Windows
source venv/bin/activate # macOS/Linux
Install dependencies:
pip install -r requirements.txt --timeout 120
Configure environment:
copy .env.example .env # Windows
cp .env.example .env # macOS/Linux
Then edit .env with your API credentials.
For Twitter live scraping (optional):
pip install playwright
playwright install chromium
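The setup steps above can be sanity-checked before launching app.py. The following is a minimal sketch, not part of the project: the helper names (`in_virtualenv`, `has_module`, `parse_env`, `missing_env_keys`) are hypothetical, and it assumes that .env and .env.example use plain `KEY=value` lines, which the source does not confirm.

```python
import importlib.util
import sys
from pathlib import Path


def in_virtualenv() -> bool:
    """Return True when the interpreter runs inside a virtual environment."""
    # Inside a venv, sys.prefix points at the venv while sys.base_prefix
    # points at the interpreter the venv was created from.
    return sys.prefix != sys.base_prefix


def has_module(name: str) -> bool:
    """Return True when a top-level module can be found without importing it."""
    return importlib.util.find_spec(name) is not None


def parse_env(path: Path) -> dict:
    """Parse KEY=value lines (assumed format); a missing file yields {}."""
    values = {}
    if not path.exists():
        return values
    for raw in path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values


def missing_env_keys(example_path: Path, env_path: Path) -> list:
    """List keys named in .env.example that are absent or empty in .env."""
    actual = parse_env(env_path)
    return [key for key in parse_env(example_path) if not actual.get(key)]


if __name__ == "__main__":
    print("venv active:", in_virtualenv())
    print("playwright installed (optional):", has_module("playwright"))
    print("unset .env keys:", missing_env_keys(Path(".env.example"), Path(".env")))
```

Run from the datapipeline_os directory, this reports whether the venv is active, whether the optional Playwright package is importable, and which keys from .env.example still lack a value in .env.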
Open http://localhost:8080 in a browser.
Example project name: Analisis_Tren_Korea
To start a run through the API:
curl -X POST http://localhost:8080/api/run \
  -H "Content-Type: application/json" \
  -d '{
    "project_name": "Analisis_Tren_Korea",
    "keywords": ["Drama Korea", "Kpop"],
    "date_range": {"start": "2020-01-01", "end": "2024-12-31"},
    "platforms": ["google_trends", "twitter"],
    "language": "id",
    "chunk_size": 500,
    "output": {"format": "csv", "path": "./data_output/"}
  }'
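The same request can be issued from Python. This is a minimal sketch using only the standard library; the endpoint and payload are copied from the curl call above, while the function names `build_run_request` and `start_run` are hypothetical. The format of the server's response is not documented here, so the sketch returns it as raw text.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8080"  # address used throughout this guide

# Same payload as the curl example above.
config = {
    "project_name": "Analisis_Tren_Korea",
    "keywords": ["Drama Korea", "Kpop"],
    "date_range": {"start": "2020-01-01", "end": "2024-12-31"},
    "platforms": ["google_trends", "twitter"],
    "language": "id",
    "chunk_size": 500,
    "output": {"format": "csv", "path": "./data_output/"},
}


def build_run_request(cfg: dict, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) the POST request for /api/run."""
    return urllib.request.Request(
        f"{base_url}/api/run",
        data=json.dumps(cfg).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def start_run(cfg: dict, base_url: str = BASE_URL, timeout: float = 30.0) -> str:
    """Send the request and return the raw response body as text."""
    # Requires the app to be running (python app.py); raises URLError otherwise.
    with urllib.request.urlopen(build_run_request(cfg, base_url), timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

Calling `start_run(config)` while app.py is running submits the job. Keeping request construction separate from sending makes the payload easy to inspect or log before anything goes over the network.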
Output is written to data_output/YYYYMMDD-HHMMSS/:
<project>_<source>.csv
<project>_combined.csv
To stop a running pipeline:
curl -X POST http://localhost:8080/api/stop
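To pick up the results of the most recent run from a script, the timestamped folder naming can be used directly. The sketch below assumes the data_output/YYYYMMDD-HHMMSS/ layout with `<project>_<source>.csv` and `<project>_combined.csv` files named above; the helpers `latest_run_dir` and `split_outputs` are hypothetical and not part of the project.

```python
import re
from pathlib import Path

# Folder names follow YYYYMMDD-HHMMSS, so sorting by name sorts by time.
RUN_DIR_PATTERN = re.compile(r"^\d{8}-\d{6}$")


def latest_run_dir(output_root: Path):
    """Return the newest timestamped run folder, or None if there is none."""
    runs = [
        p for p in output_root.iterdir()
        if p.is_dir() and RUN_DIR_PATTERN.match(p.name)
    ]
    return max(runs, key=lambda p: p.name, default=None)


def split_outputs(run_dir: Path, project: str):
    """Return (per-source CSV paths, combined CSV path or None) for a project."""
    combined = run_dir / f"{project}_combined.csv"
    per_source = sorted(
        p for p in run_dir.glob(f"{project}_*.csv") if p != combined
    )
    return per_source, (combined if combined.exists() else None)


if __name__ == "__main__":
    root = Path("./data_output")
    run_dir = latest_run_dir(root) if root.is_dir() else None
    if run_dir is None:
        print("No runs found under", root)
    else:
        sources, combined = split_outputs(run_dir, "Analisis_Tren_Korea")
        print("Latest run:", run_dir.name)
        print("Per-source files:", [p.name for p in sources])
        print("Combined file:", combined.name if combined else None)
```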
| Problem | Fix |
|---|---|
| Port 8080 in use | Change the port on the last line of app.py |
| Venv not activated | Run venv\Scripts\activate (Windows) or source venv/bin/activate (macOS/Linux) first |
| Module not found | Run pip install -r requirements.txt |
| Playwright missing | Run pip install playwright && playwright install chromium |
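For the "Port 8080 in use" row, a quick way to confirm the conflict before editing app.py is to try connecting to the port. This is a minimal sketch with a hypothetical helper name (`is_port_in_use`); it only detects listeners reachable on the given host, which is assumed to be the local machine.

```python
import socket


def is_port_in_use(port: int, host: str = "127.0.0.1") -> bool:
    """Return True when something accepts TCP connections on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(1.0)
        # connect_ex returns 0 on success instead of raising an exception.
        return sock.connect_ex((host, port)) == 0


if __name__ == "__main__":
    port = 8080  # default port used in this guide
    state = "in use" if is_port_in_use(port) else "free"
    print(f"Port {port} is {state}")
```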