skills/aitherhub-deploy/SKILL.md
Aitherhub project deployment guide. Use when deploying changes to Aitherhub frontend, backend, or worker. Covers GitHub Actions workflows, Azure App Service, Azure Static Web Apps deployment, and RunPod Serverless GPU Worker operations (MuseTalk lipsync, FaceFusion face-swap).
npx skillsauth add lcj-group/aitherhub aitherhub-deployInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
LCJ-Group/aitherhubgh repo clone LCJ-Group/aitherhub/home/ubuntu/aitherhub-repomaster (single branch, push to deploy)| Component | Technology | Hosting | Deploy Trigger |
|---|---|---|---|
| Frontend | Vite + React | Azure Static Web Apps | Push to master |
| Backend (main) | FastAPI | Azure App Service (aitherhubAPI) | Push to master |
| GPU Worker | RunPod Serverless (FastAPI) | RunPod Serverless (L4/A5000) | Push to master (gpu-worker/serverless/) |
All workflows trigger on push to master branch automatically.
deploy-swa-frontend.yml)cd frontend && npm install && npm run buildmaster_aitherhubapi.yml)requirements.txtaitherhubAPIbuild-gpu-worker.yml)gpu-worker/serverless/Dockerfile)ghcr.io/lcj-group/aitherhub-gpu-worker:latest)gpu-worker/serverless/ path in master branch.The GPU Worker has been migrated from a persistent RunPod Pod to a RunPod Serverless endpoint. This provides auto-scaling, eliminates the need for manual deployments, and ensures high availability without managing a specific Pod.
The core of this architecture is a self-contained Docker image that includes all necessary code (FaceFusion, MuseTalk), dependencies, and essential models. This removes the dependency on a persistent /workspace/ volume.
| Setting | Value |
|---|---|
| Endpoint ID | 2noptqoq7n8f8g |
| Docker Image | ghcr.io/lcj-group/aitherhub-gpu-worker:latest |
| GPUs | L4, RTX A5000, etc. (auto-scales) |
| Cold Start Time | ~1-2 minutes |
Deployment is now fully automated:
gpu-worker/serverless/ directory in the master branch will automatically trigger the build-gpu-worker.yml GitHub Actions workflow.latest tag of the Docker image. When a new image is pushed, RunPod automatically initiates a rolling update, replacing old workers with new ones without downtime.There are no manual steps required to deploy the GPU worker anymore.
aitherhubAPI) now uses the RunPodServerlessService to send jobs to the Serverless endpoint./run or /runsync) and check their status (/status).RUNPOD_API_KEY and RUNPOD_ENDPOINT_ID are configured as fallback values directly in runpod_serverless_service.py. This was done to avoid complexities with Azure App Service environment variable management.https://aitherhubapi-cpcjcnezbgf5f7e2.japaneast-01.azurewebsites.net/api/v1/digital-human/musetalk/healthX-Admin-Key: aither:hubcold_start status is normal if the worker is idle.https://api.runpod.ai/v2/2noptqoq7n8f8g/runsyncPOST{"input": {"action": "health"}}Bearer <RUNPOD_API_KEY>https://aitherhubapi-cpcjcnezbgf5f7e2.japaneast-01.azurewebsites.nethttps://www.aitherhub.comhttps://api.runpod.ai/v2/2noptqoq7n8f8gdevelopment
Maintainer-only workflow for handling GitHub Secret Scanning alerts on OpenClaw. Use when Codex needs to triage, redact, clean up, and resolve secret leakage found in issue comments, issue bodies, PR comments, or other GitHub content.
development
Maintainer workflow for OpenClaw releases, prereleases, changelog release notes, and publish validation. Use when Codex needs to prepare or verify stable or beta release steps, align version naming, assemble release notes, check release auth requirements, or validate publish-time commands and artifacts.
development
Run, watch, debug, and extend OpenClaw QA testing with qa-lab and qa-channel. Use when Codex needs to execute the repo-backed QA suite, inspect live QA artifacts, debug failing scenarios, add new QA scenarios, or explain the OpenClaw QA workflow. Prefer the live OpenAI lane with regular openai/gpt-5.4 in fast mode; do not use gpt-5.4-pro or gpt-5.4-mini unless the user explicitly overrides that policy.
development
End-to-end Parallels smoke, upgrade, and rerun workflow for OpenClaw across macOS, Windows, and Linux guests. Use when Codex needs to run, rerun, debug, or interpret VM-based install, onboarding, gateway smoke tests, latest-release-to-main upgrade checks, fresh snapshot retests, or optional Discord roundtrip verification under Parallels.