Elmanda1/Troubleshoot DataPipeline OS

Name: Troubleshoot DataPipeline OS
Author: Elmanda1

.agents/skills/troubleshoot/SKILL.md

npx skillsauth add Elmanda1/nexus_datagen Troubleshoot DataPipeline OS

Troubleshooting Guide

Diagnostic Steps

When an error occurs, follow this triage order:

1. Check the terminal log: the Flask server prints all engine logs with timestamps.
2. Check the UI terminal: the center panel shows color-coded logs (SYS/OK/WARN/ERR).
3. Check debug screenshots: config/login_*.png for Twitter login issues.
4. Check the output folder: data_output/ for any partial data that was written.

Known Issues and How to Handle Them

Twitter Login Failures

Symptom: Engine falls back to error state with login failure message

Root Cause: Twitter frequently changes HTML selectors on the login page.

Fix:

1. Check the screenshot at config/login_debug.png or config/login_debug_pwd.png.
2. Open engines/twitter_engine.py.
3. Find USERNAME_SELECTORS (around line 314) and PASSWORD_SELECTORS (around line 320).
4. Open the Twitter login page manually in a browser, use Inspect Element, and find the current selector.
5. Update the selector list.
6. Delete the old session: use the "Clear Session" button in the UI, or delete config/chromium_profile/ and config/twitter_session.json.
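
Selector lists like the ones in steps 3 to 5 are usually tried in order until one matches. A minimal sketch of that fallback, assuming a Playwright-style page object that exposes `query_selector`; the function name and the example selectors are hypothetical and are not taken from engines/twitter_engine.py.

```python
from typing import Any, Optional, Sequence


def first_matching_selector(page: Any, selectors: Sequence[str]) -> Optional[str]:
    """Return the first selector that matches an element on the page, or None.

    `page` is assumed to expose Playwright's `query_selector(selector)`,
    which returns an element handle or None.
    """
    for selector in selectors:
        if page.query_selector(selector) is not None:
            return selector
    return None


# Hypothetical selector list for illustration only; the real values live in
# USERNAME_SELECTORS inside engines/twitter_engine.py.
EXAMPLE_USERNAME_SELECTORS = [
    'input[autocomplete="username"]',
    'input[name="text"]',
]
```

If the function returns None for every entry, the list is stale and needs the selector found in step 4 added to it.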

Twitter White/Blank Login Page

Symptom: Playwright opens Twitter login but page is blank white/grey (no form rendered)

Root Cause: Twitter login is a React SPA. The page DOM loads but JavaScript hasn't mounted the React app yet.

Fix (already applied in current engine):

1. The engine uses wait_until="networkidle" instead of "domcontentloaded".
2. Stealth JS injection overrides navigator.webdriver to prevent detection.
3. The engine auto-detects a blank page via document.body.scrollHeight < 200 and reloads.
4. If the page is still blank after the reload, check:
   - Internet connection speed (slow connections need a longer wait)
   - The Chromium profile may be corrupted: clear the session from the UI
   - Twitter may be blocking the IP: try a VPN or a different IP
5. Manual fix: delete config/chromium_profile/ and config/twitter_session.json, then retry.
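
The blank-page check described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the engine's actual code: `page` is assumed to be a Playwright sync-API page (`goto`, `evaluate`, `reload`), and the function name and `max_reloads` parameter are hypothetical. Only the 200-pixel threshold and the "networkidle" wait come from the guide.

```python
from typing import Any

BLANK_HEIGHT_THRESHOLD = 200  # pixels; the threshold quoted in the guide


def open_login_page(page: Any, url: str, max_reloads: int = 1) -> bool:
    """Load `url`, reloading while the body still looks blank.

    Returns True once the page body is at least BLANK_HEIGHT_THRESHOLD tall,
    or False if it is still blank after `max_reloads` reloads.
    """
    page.goto(url, wait_until="networkidle")
    for attempt in range(max_reloads + 1):
        height = page.evaluate("document.body.scrollHeight")
        if height >= BLANK_HEIGHT_THRESHOLD:
            return True
        if attempt < max_reloads:
            page.reload(wait_until="networkidle")
    return False
```

A False return corresponds to the "still blank after reload" case, where the connection, profile, and IP checks apply.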

Twitter Rate Limiting

Symptom: Log shows rate limit warning, engine stops early

Fix: This is by design. Engine auto-skips to next keyword. Wait 15 minutes before rerunning. For large date ranges, reduce keywords or narrow the range.
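
The skip-to-next-keyword behavior can be pictured with a small loop. This is a sketch only: the `RateLimited` exception, the `collect` function, and the `fetch` callback are hypothetical names, and the real engine may track rate limits differently.

```python
from typing import Callable, Dict, Iterable, List


class RateLimited(Exception):
    """Hypothetical signal that the source refused further requests."""


def collect(keywords: Iterable[str], fetch: Callable[[str], List[dict]]) -> Dict[str, list]:
    """Fetch rows per keyword, skipping any keyword that hits a rate limit."""
    rows: List[dict] = []
    skipped: List[str] = []
    for keyword in keywords:
        try:
            rows.extend(fetch(keyword))
        except RateLimited:
            # Move on to the next keyword instead of aborting the whole run.
            skipped.append(keyword)
    return {"rows": rows, "skipped": skipped}
```

Keywords that end up in `skipped` are the ones worth rerunning after the 15-minute wait.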

Port 5000/8080 Conflict

Symptom: OSError: [Errno 48] Address already in use (errno 48 is the macOS value; Linux reports Errno 98 and Windows reports WinError 10048)

Fix:

1. Check what's using the port (substitute 5000 if that is the port in conflict):

```
# Windows
netstat -ano | findstr :8080

# macOS/Linux
lsof -i :8080
```

2. Kill the process, or change the port in app.py (last line):
```
app.run(debug=True, port=8081, use_reloader=False)
```
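
As a cross-platform alternative to netstat and lsof, the same check can be done from Python with the standard library. A minimal sketch; the helper names are hypothetical and not part of app.py.

```python
import socket
from typing import Iterable, Optional


def port_in_use(port: int, host: str = "127.0.0.1") -> bool:
    """Return True if something accepts TCP connections on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(0.5)
        # connect_ex returns 0 on success instead of raising an exception.
        return sock.connect_ex((host, port)) == 0


def first_free_port(candidates: Iterable[int], host: str = "127.0.0.1") -> Optional[int]:
    """Return the first candidate port with no listener, or None."""
    for port in candidates:
        if not port_in_use(port, host):
            return port
    return None
```

For example, `first_free_port([8080, 8081, 5000])` gives a value that could be passed as `port=` to `app.run`.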

Reddit API Rejection

Symptom: 403 Forbidden from Reddit API

Fix: Reddit tends to reject API requests from new accounts with no activity history. Use the account normally for 5-7 days, then try again.

Google Trends Rate Limit

Symptom: TooManyRequestsError from pytrends

Fix: Google throttles Trends requests, and in practice pytrends is limited to roughly 10 requests per minute. Reduce the number of keywords or increase the delay between requests in trends_engine.py.
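
One way to increase the delay is to retry with exponential backoff. The helper below is a generic sketch, not code from trends_engine.py; the function name and parameters are hypothetical, and the 6-second base delay is simply 60 seconds divided by the ~10 requests per minute mentioned above.

```python
import time
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


def call_with_backoff(
    func: Callable[[], T],
    retry_on: Tuple[Type[BaseException], ...],
    max_attempts: int = 4,
    base_delay: float = 6.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call func, sleeping base_delay * 2**attempt after each retryable failure.

    The last failure is re-raised once max_attempts calls have been made.
    """
    for attempt in range(max_attempts):
        try:
            return func()
        except retry_on:
            if attempt == max_attempts - 1:
                raise
            sleep(base_delay * (2 ** attempt))
    raise RuntimeError("max_attempts must be at least 1")
```

With pytrends, `retry_on` would be the TooManyRequestsError class from the symptom above, assuming the installed version exposes it.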

ModuleNotFoundError

Symptom: ModuleNotFoundError: No module named 'xxx'

Fix:

```
# Ensure the venv is active
# Windows
venv\Scripts\activate
# macOS/Linux
source venv/bin/activate

# Reinstall dependencies
pip install -r requirements.txt --timeout 120
```
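
Before reinstalling, it can help to confirm that the interpreter really is the venv one and to see which imports are missing. A small standard-library sketch; the helper names are hypothetical, and the module names passed in should be top-level import names.

```python
import importlib.util
import sys
from typing import Iterable, List


def in_virtualenv() -> bool:
    """Return True when the running interpreter belongs to a venv."""
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


def missing_modules(names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be imported."""
    return [name for name in names if importlib.util.find_spec(name) is None]
```

Calling `missing_modules(["flask", "playwright", "pytrends"])` from the project's interpreter lists whichever of those packages still need installing.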

Empty Output / No Data

Symptom: Pipeline completes but CSV is empty or very small

Possible Causes:

- Keywords too specific: try broader terms.
- Date range too narrow: expand the range.
- API credentials missing: check .env; engines without credentials fall back to simulation mode.
- All feeds down: the News engine depends on RSS feed availability.

Diagnostic:

Check whether the engines ran in simulation mode: look for "_sim" in the source column of the output CSV.
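
The "_sim" check can be scripted with the standard csv module. A minimal sketch, assuming the output CSV has a header row with a column literally named `source`; the function names and the example file path are hypothetical.

```python
import csv
from collections import Counter
from pathlib import Path


def count_sources(csv_path, column="source"):
    """Count rows per value of the given column in a CSV with a header row."""
    counts = Counter()
    with Path(csv_path).open(newline="", encoding="utf-8") as handle:
        for row in csv.DictReader(handle):
            counts[row.get(column) or ""] += 1
    return counts


def simulated_share(counts, marker="_sim"):
    """Return the fraction of rows whose source value contains the marker."""
    total = sum(counts.values())
    simulated = sum(n for source, n in counts.items() if marker in source)
    return simulated / total if total else 0.0
```

For instance, `simulated_share(count_sources("data_output/example.csv"))` close to 1.0 would suggest the run used simulated data rather than live credentials.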

Playwright / Chromium Issues

Symptom: playwright._impl._errors.Error: Executable doesn't exist

Fix:

```
pip install playwright
playwright install chromium
```

If behind a proxy:

```
# Windows (cmd)
set HTTPS_PROXY=http://proxy:port

# macOS/Linux
export HTTPS_PROXY=http://proxy:port

playwright install chromium
```

Engine-Specific Debugging

Check Engine Status via API

```
curl http://localhost:8080/api/state | python -m json.tool
```

Look at the engines object: each engine shows status, rows, and ram_mb.
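
The same endpoint can be read from Python to print one line per engine. This sketch assumes the response is a JSON object whose `engines` key maps engine names to objects with `status`, `rows`, and `ram_mb`; that shape is inferred from the description above, and the function names are hypothetical.

```python
import json
from urllib.request import urlopen


def fetch_state(base_url="http://localhost:8080"):
    """Fetch and decode the JSON document served at /api/state."""
    with urlopen(f"{base_url}/api/state", timeout=5) as response:
        return json.load(response)


def summarize_engines(state):
    """Return one summary line per engine, sorted by engine name."""
    lines = []
    for name, info in sorted(state.get("engines", {}).items()):
        lines.append(
            f"{name}: status={info.get('status')} "
            f"rows={info.get('rows')} ram_mb={info.get('ram_mb')}"
        )
    return lines
```

With the server running, `print("\n".join(summarize_engines(fetch_state())))` gives a quick view of which engine is stuck or returned zero rows.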

Force Simulation Mode

To test without any API keys, ensure all API-related variables in .env are empty. All engines will automatically fall back to simulation.
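
To confirm that nothing in .env is still set, a short script can list every variable with a non-empty value. A minimal sketch that assumes simple `NAME=value` lines; the function name is hypothetical, and it reports all non-empty variables because the guide does not name the API-related ones.

```python
from pathlib import Path


def non_empty_env_vars(env_path=".env"):
    """Return the names of variables in a .env file that have a value."""
    names = []
    for raw in Path(env_path).read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        # Skip blank lines, comments, and anything that is not NAME=value.
        if not line or line.startswith("#") or "=" not in line:
            continue
        name, _, value = line.partition("=")
        # Treat empty quotes ('' or "") the same as an empty value.
        if value.strip().strip("'\""):
            names.append(name.strip())
    return names
```

An empty list means every variable is blank, so the engines should fall back to simulation as described above.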

Check Twitter Session

```
curl http://localhost:8080/api/twitter/info
```

Returns: session_exists, profile_exists, username, has_password.

Description: Diagnose and fix common issues in DataPipeline OS
Category: development
Updated: Apr 16, 2026

npx skillsauth add Elmanda1/nexus_datagen Troubleshoot DataPipeline OS

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy: Container and dependency vulnerability scanner. Clean, 95%
Semgrep: Static code analysis for vulnerabilities. Clean, 95%
mcp-scan (Snyk): Model Context Protocol security validation. Clean, 95%
Snyk (dep): Open source security scanning. Skipped, 50%
Socket.dev: Supply chain security analysis. Skipped, 50%
VirusTotal: Multi-engine malware detection. Skipped, 50%
CrowdStrike: Advanced threat intelligence. Skipped, 50%
OSV-Scanner: Open Source Vulnerability database check. Skipped, 50%
OWASP Dep-Check: Skipped, 50%

Last scanned: Apr 17, 2026, 12:28 AM (146.2s, 1 file scanned)


Related Skills

- Elmanda1/Run Pipeline (development, updated Apr 16, 2026): How to set up and run the DataPipeline OS extraction pipeline.
- Elmanda1/Add New Engine (development, updated Apr 16, 2026): Step-by-step guide to add a new data source engine to DataPipeline OS.
- openclaw/openclaw-secret-scanning-maintainer (development, updated Apr 15, 2026): Maintainer-only workflow for handling GitHub Secret Scanning alerts on OpenClaw. Use when Codex needs to triage, redact, clean up, and resolve secret leakage found in issue comments, issue bodies, PR comments, or other GitHub content.
- openclaw/openclaw-release-maintainer (development, updated Apr 10, 2026): Maintainer workflow for OpenClaw releases, prereleases, changelog release notes, and publish validation. Use when Codex needs to prepare or verify stable or beta release steps, align version naming, assemble release notes, check release auth requirements, or validate publish-time commands and artifacts.

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Install manually

```
# Clone the repo
git clone https://github.com/Elmanda1/nexus_datagen.git

# Copy into the Claude Code skills folder (global); create it first if it does not exist
mkdir -p ~/.claude/skills
cp -r nexus_datagen/.agents/skills/troubleshoot ~/.claude/skills/
```

Repository: Elmanda1/nexus_datagen

Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT