Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

hsliuustc0106/skills/vllm-omni-nightly-local

Name: skills/vllm-omni-nightly-local
Author: hsliuustc0106

skills/vllm-omni-nightly-local/SKILL.md

npx skillsauth add hsliuustc0106/vllm-omni-skills skills/vllm-omni-nightly-local

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

vLLM-Omni Nightly Local (cluster run & log sync)

Overview

Login - SSH, squeue / srun --overlap, docker exec without -it (see sections below).
Run cases — In the same container shell as the script: source /rebase/.venv/bin/activate (see references/nightly-local-environment.md), then set Hugging Face / vLLM env (see below and that reference), optionally nvidia-smi → CUDA_VISIBLE_DEVICES, then bash tools/nightly/run_nightly_jobs.sh with LOG_DIR="$REPO_ROOT/logs/nightly_jobs"。Long runs: run inside tmux / screen on the cluster, or start the script with nohup and redirect stdout/stderr to a log file under $REPO_ROOT/logs/, so an SSH disconnect does not stop the workload (details: references/nightly-local-environment.md — Long runs / SSH disconnect).
Sync logs — On your laptop, (1) if $REPO_ROOT/logs/nightly_perf_manual.xlsx exists from the last run, copy it to nightly_perf_manual.prev.xlsx (baseline for report ↑/↓). (2) Remove the local $REPO_ROOT/logs/nightly_jobs tree (rm -rf) so the new pull does not mix old job folders with the latest run. (3) Copy nightly_jobs and nightly_perf_manual.xlsx into $REPO_ROOT/logs/: references/nightly-local-log-fetch.md.

Analyze and write the HTML test report in vllm-omni-test-report (report kind nightly): export REPO_ROOT=/path/to/local/vllm-omni then python scripts/nightly_local_log_report.py --html-report ... (defaults to $REPO_ROOT/logs/nightly_jobs; pass --log-dir only if you used a different tree).

Required user inputs

| Input | Meaning | |-------|---------| | SSH connection name | Host alias or user@host. | | Slurm username | For squeue -u .... | | Docker container name | For docker exec. | | Empty GPU count | Optional but recommended before run_nightly_jobs.sh: how many free GPUs to use (X). The agent runs nvidia-smi, picks X indices with no (or minimal) load, and export CUDA_VISIBLE_DEVICES=… in the same shell / docker exec bash -lc as the script. See §2 and the section CUDA_VISIBLE_DEVICES — empty GPUs in references/nightly-local-environment.md. |

Optional: REPO_ROOT inside the container.

1. Login environment

1.1 SSH

ssh -v "<SSH_CONNECTION_NAME>"

Load module load slurm (or site equivalent) before srun if needed.

1.2 Find JOBID

SLURM_USER="<username>"
squeue -u "$SLURM_USER" -t RUNNING -h -o "%i"

Confirm JOBID when multiple rows exist.

1.3 Run in container (no TTY)

JOBID="<chosen_jobid>"
srun --jobid="$JOBID" --overlap docker exec "<CONTAINER_NAME>" bash -lc '<commands>'

Nightly one-liner:

srun --jobid="$JOBID" --overlap docker exec "<CONTAINER_NAME>" bash -lc 'source /rebase/.venv/bin/activate && export REPO_ROOT=/path/to/vllm-omni && cd "$REPO_ROOT" && bash tools/nightly/run_nightly_jobs.sh'

1.4 New allocation if no JOBID

srun -p q-fq9hpsac -w hk01dgx006 --gres=gpu:0 --mem-per-cpu=8G --pty  --job-name=ci_local_test

Then docker exec "<CONTAINER_NAME>" bash -lc '<commands>'.

1.5 Optional: `docker exec -it` for debugging only

1.6 Agent: BatchMode SSH

ssh -o BatchMode=yes -o ConnectTimeout=30 "<SSH_CONNECTION_NAME>" \
  "bash -lc 'type module >/dev/null 2>&1 && module load slurm 2>/dev/null; squeue -u \"<SLURM_USER>\" -t RUNNING -h -o \"%i\"'"

ssh -o BatchMode=yes -o ConnectTimeout=120 "<SSH_CONNECTION_NAME>" \
  "bash -lc 'type module >/dev/null 2>&1 && module load slurm 2>/dev/null; srun --jobid=\"<JOBID>\" --overlap docker exec \"<CONTAINER_NAME>\" bash -lc \"<INNER_CMD>\"'"

Details: references/nightly-local-environment.md.

2. Run test cases

Before bash tools/nightly/run_nightly_jobs.sh (inside the same docker exec … bash -lc '…' or interactive shell on the node):

Python venv (required inside the container) — run first in that inner shell:
```
source /rebase/.venv/bin/activate
```
Details: references/nightly-local-environment.md (Python venv inside the container).
Model / HF / vLLM environment (required unless the user gives a different site policy) — same shell as the script:
```
export HF_HOME="/home/models/"
unset HF_HUB_CACHE
unset TRANSFORMERS_CACHE
export VLLM_ALLOW_LONG_MAX_MODEL_LEN="1"
```
Details: references/nightly-local-environment.md (Hugging Face cache and vLLM).
Ask the user for X = number of empty / free GPUs to use (or use a value they already gave in Required user inputs).
Run nvidia-smi, select X GPU indices that are idle (typically 0 MiB used and 0% util — site-specific thresholds in the reference), then set:
```
export CUDA_VISIBLE_DEVICES='<comma-separated GPU indices>'
```
In that same environment, cd to $REPO_ROOT and run bash tools/nightly/run_nightly_jobs.sh (or your local test entrypoint).

Copy-paste patterns and fallback when fewer than X GPUs are strictly empty: references/nightly-local-environment.md (CUDA_VISIBLE_DEVICES — empty GPUs).

Example inner command (replace placeholders; X and device list come from nvidia-smi selection):

srun --jobid="$JOBID" --overlap docker exec "$CONTAINER_NAME" bash -lc '
  source /rebase/.venv/bin/activate
  export REPO_ROOT=/path/to/vllm-omni
  export HF_HOME="/home/models/"
  unset HF_HUB_CACHE
  unset TRANSFORMERS_CACHE
  export VLLM_ALLOW_LONG_MAX_MODEL_LEN="1"
  export CUDA_VISIBLE_DEVICES="0,1"
  cd "$REPO_ROOT" && bash tools/nightly/run_nightly_jobs.sh
'

(CUDA_VISIBLE_DEVICES shown as "0,1" only for illustration — derive from nvidia-smi, do not hardcode unless the user insists. HF_HOME must match the cluster mount for shared models; if the user specifies another path, use that instead.)

2.1 Background / resilient shell (recommended when tests run for a long time)

Preferred: open tmux new -s nightly (or screen) before srun / docker exec, run the full §2 command inside that session, then detach (Ctrl-b d in tmux). Re-attach later with tmux attach -t nightly to watch progress.
Alternative: inside docker exec … bash -lc '…', start the nightly script with nohup and append logs to a file under $REPO_ROOT/logs/ (see copy-paste in references/nightly-local-environment.md).

3. Sync logs off-cluster

Follow references/nightly-local-log-fetch.md (scp / rsync / tarball). Include logs/nightly_perf_manual.xlsx when it exists on the server.

Agent workflow

Collect SSH name, Slurm user, container name, and optional X empty GPUs for CUDA_VISIBLE_DEVICES; do not guess JOBID when ambiguous.
Run sections 1–2; inside the container, run source /rebase/.venv/bin/activate first; before the nightly/local test script, set HF / vLLM env vars (§2 step 1). Then, if X is set (or the user asks for GPU selection), run nvidia-smi, export CUDA_VISIBLE_DEVICES, and run the script in the same environment. For multi-hour jobs, use §2.1 (tmux / nohup) so a dropped SSH session does not kill the run.
Verify files under logs/nightly_jobs (and logs/nightly_perf_manual.xlsx when your workflow produces it).
For fetch to a laptop: (a) copy nightly_perf_manual.xlsx → nightly_perf_manual.prev.xlsx when baselines are needed; (b) rm -rf local logs/nightly_jobs before sync (see references/nightly-local-log-fetch.md); (c) then pull. For HTML report, point to vllm-omni-test-report nightly_local_log_report.py and layout ../vllm-omni-test-report/references/nightly-local-log-layout.md.

References

Fetch logs (scp / rsync / tar): references/nightly-local-log-fetch.md
Environment notes: references/nightly-local-environment.md

hsliuustc0106/skills/vllm-omni-nightly-local

skills/vllm-omni-nightly-local/SKILL.md

--- name: vllm-omni-nightly-local description: On HK - SSH, Slurm, non-interactive docker exec (bash -lc): **`source /rebase/.venv/bin/activate`** inside the container before repo commands, then run `tools/nightly/run_nightly_jobs.sh` and write logs under logs/nightly_jobs. Sync logs and optional logs/nightly_perf_manual.xlsx to your laptop, then use vllm-omni-test-report report kind nightly + scripts/nightly_local_log_report.py — **default output HTML** (`--html-report`) unless the user explici

64 stars

tools

Updated May 19, 2026

$ install --global

skillsauth

npx skillsauth add hsliuustc0106/vllm-omni-skills skills/vllm-omni-nightly-local

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 19, 2026, 2:53 AM394.2s3 files scanned

SKILL.md

name:: vllm-omni-nightly-local
description:: On HK - SSH, Slurm, non-interactive docker exec (bash -lc): **`source /rebase/.venv/bin/activate`** inside the container before repo commands, then run `tools/nightly/run_nightly_jobs.sh` and write logs under logs/nightly_jobs. Sync logs and optional logs/nightly_perf_manual.xlsx to your laptop, then use vllm-omni-test-report report kind nightly + scripts/nightly_local_log_report.py — **default output HTML** (`--html-report`) unless the user explicitly asks for Markdown. Use when allocating GPUs, running cluster nightly jobs, or fetching nightly_jobs before offline analysis.

vLLM-Omni Nightly Local (cluster run & log sync)

Overview

Login - SSH, squeue / srun --overlap, docker exec without -it (see sections below).
Run cases — In the same container shell as the script: source /rebase/.venv/bin/activate (see references/nightly-local-environment.md), then set Hugging Face / vLLM env (see below and that reference), optionally nvidia-smi → CUDA_VISIBLE_DEVICES, then bash tools/nightly/run_nightly_jobs.sh with LOG_DIR="$REPO_ROOT/logs/nightly_jobs"。Long runs: run inside tmux / screen on the cluster, or start the script with nohup and redirect stdout/stderr to a log file under $REPO_ROOT/logs/, so an SSH disconnect does not stop the workload (details: references/nightly-local-environment.md — Long runs / SSH disconnect).
Sync logs — On your laptop, (1) if $REPO_ROOT/logs/nightly_perf_manual.xlsx exists from the last run, copy it to nightly_perf_manual.prev.xlsx (baseline for report ↑/↓). (2) Remove the local $REPO_ROOT/logs/nightly_jobs tree (rm -rf) so the new pull does not mix old job folders with the latest run. (3) Copy nightly_jobs and nightly_perf_manual.xlsx into $REPO_ROOT/logs/: references/nightly-local-log-fetch.md.

Required user inputs

Optional: REPO_ROOT inside the container.

1. Login environment

1.1 SSH

ssh -v "<SSH_CONNECTION_NAME>"

Load module load slurm (or site equivalent) before srun if needed.

1.2 Find JOBID

SLURM_USER="<username>"
squeue -u "$SLURM_USER" -t RUNNING -h -o "%i"

Confirm JOBID when multiple rows exist.

1.3 Run in container (no TTY)

JOBID="<chosen_jobid>"
srun --jobid="$JOBID" --overlap docker exec "<CONTAINER_NAME>" bash -lc '<commands>'

Nightly one-liner:

srun --jobid="$JOBID" --overlap docker exec "<CONTAINER_NAME>" bash -lc 'source /rebase/.venv/bin/activate && export REPO_ROOT=/path/to/vllm-omni && cd "$REPO_ROOT" && bash tools/nightly/run_nightly_jobs.sh'

1.4 New allocation if no JOBID

srun -p q-fq9hpsac -w hk01dgx006 --gres=gpu:0 --mem-per-cpu=8G --pty  --job-name=ci_local_test

Then docker exec "<CONTAINER_NAME>" bash -lc '<commands>'.

1.5 Optional: `docker exec -it` for debugging only

1.6 Agent: BatchMode SSH

ssh -o BatchMode=yes -o ConnectTimeout=30 "<SSH_CONNECTION_NAME>" \
  "bash -lc 'type module >/dev/null 2>&1 && module load slurm 2>/dev/null; squeue -u \"<SLURM_USER>\" -t RUNNING -h -o \"%i\"'"

ssh -o BatchMode=yes -o ConnectTimeout=120 "<SSH_CONNECTION_NAME>" \
  "bash -lc 'type module >/dev/null 2>&1 && module load slurm 2>/dev/null; srun --jobid=\"<JOBID>\" --overlap docker exec \"<CONTAINER_NAME>\" bash -lc \"<INNER_CMD>\"'"

Details: references/nightly-local-environment.md.

2. Run test cases

Before bash tools/nightly/run_nightly_jobs.sh (inside the same docker exec … bash -lc '…' or interactive shell on the node):

Python venv (required inside the container) — run first in that inner shell:
```
source /rebase/.venv/bin/activate
```
Details: references/nightly-local-environment.md (Python venv inside the container).
Model / HF / vLLM environment (required unless the user gives a different site policy) — same shell as the script:
```
export HF_HOME="/home/models/"
unset HF_HUB_CACHE
unset TRANSFORMERS_CACHE
export VLLM_ALLOW_LONG_MAX_MODEL_LEN="1"
```
Details: references/nightly-local-environment.md (Hugging Face cache and vLLM).
Ask the user for X = number of empty / free GPUs to use (or use a value they already gave in Required user inputs).
Run nvidia-smi, select X GPU indices that are idle (typically 0 MiB used and 0% util — site-specific thresholds in the reference), then set:
```
export CUDA_VISIBLE_DEVICES='<comma-separated GPU indices>'
```
In that same environment, cd to $REPO_ROOT and run bash tools/nightly/run_nightly_jobs.sh (or your local test entrypoint).

Copy-paste patterns and fallback when fewer than X GPUs are strictly empty: references/nightly-local-environment.md (CUDA_VISIBLE_DEVICES — empty GPUs).

Example inner command (replace placeholders; X and device list come from nvidia-smi selection):

srun --jobid="$JOBID" --overlap docker exec "$CONTAINER_NAME" bash -lc '
  source /rebase/.venv/bin/activate
  export REPO_ROOT=/path/to/vllm-omni
  export HF_HOME="/home/models/"
  unset HF_HUB_CACHE
  unset TRANSFORMERS_CACHE
  export VLLM_ALLOW_LONG_MAX_MODEL_LEN="1"
  export CUDA_VISIBLE_DEVICES="0,1"
  cd "$REPO_ROOT" && bash tools/nightly/run_nightly_jobs.sh
'

2.1 Background / resilient shell (recommended when tests run for a long time)

Preferred: open tmux new -s nightly (or screen) before srun / docker exec, run the full §2 command inside that session, then detach (Ctrl-b d in tmux). Re-attach later with tmux attach -t nightly to watch progress.
Alternative: inside docker exec … bash -lc '…', start the nightly script with nohup and append logs to a file under $REPO_ROOT/logs/ (see copy-paste in references/nightly-local-environment.md).

3. Sync logs off-cluster

Follow references/nightly-local-log-fetch.md (scp / rsync / tarball). Include logs/nightly_perf_manual.xlsx when it exists on the server.

Agent workflow

Collect SSH name, Slurm user, container name, and optional X empty GPUs for CUDA_VISIBLE_DEVICES; do not guess JOBID when ambiguous.
Run sections 1–2; inside the container, run source /rebase/.venv/bin/activate first; before the nightly/local test script, set HF / vLLM env vars (§2 step 1). Then, if X is set (or the user asks for GPU selection), run nvidia-smi, export CUDA_VISIBLE_DEVICES, and run the script in the same environment. For multi-hour jobs, use §2.1 (tmux / nohup) so a dropped SSH session does not kill the run.
Verify files under logs/nightly_jobs (and logs/nightly_perf_manual.xlsx when your workflow produces it).
For fetch to a laptop: (a) copy nightly_perf_manual.xlsx → nightly_perf_manual.prev.xlsx when baselines are needed; (b) rm -rf local logs/nightly_jobs before sync (see references/nightly-local-log-fetch.md); (c) then pull. For HTML report, point to vllm-omni-test-report nightly_local_log_report.py and layout ../vllm-omni-test-report/references/nightly-local-log-layout.md.

References

Fetch logs (scp / rsync / tar): references/nightly-local-log-fetch.md
Environment notes: references/nightly-local-environment.md

Related Skills

hsliuustc0106/vllm-omni-pre-check

development

VerifiedTrustedCommunity

Use before submitting a PR to vllm-project/vllm-omni — self-check the branch against project conventions, catch dead code, verify accuracy/performance claims, and confirm merge readiness. Use when the user says "pre-check", "self review", "pre-submit check", or "check my PR before I open it."

69SKILL.mdUpdated May 29, 2026

hsliuustc0106/vllm-omni-pre-check

hsliuustc0106/skills/vllm-omni-test-report

development

VerifiedTrustedCommunity

--- name: vllm-omni-test-report description: Two report kinds; **default output is always HTML** unless the user explicitly asks for Markdown (.md). **Release** — `scripts/compose_full_report.py` (**测试结论**, Buildkite metrics, **Test Result** = Common stack + optional `--log-dir-h*` nightly-style summaries + H100/CI block, **Issue tracking** = GitHub `ci-failure` + *local test* in:title, Open bugs); use `--format markdown` only when the user wants .md or `patch_report_*.py`. **Nightly** — `script

69SKILL.mdUpdated May 3, 2026

hsliuustc0106/skills/vllm-omni-test-report

hsliuustc0106/vllm-omni-review

testing

VerifiedTrustedCommunity

Review PRs on vllm-project/vllm-omni by routing to the right domain skills, checking critical evidence, and focusing comments on blocking issues. Use when reviewing pull requests or local branches, triaging review depth, running detailed or default review, or checking tests, benchmarks, and breaking changes in vllm-omni.

69SKILL.mdUpdated May 3, 2026

hsliuustc0106/vllm-omni-review

hsliuustc0106/vllm-omni-video-gen

data-ai

VerifiedTrustedCommunity

Generate videos with vLLM-Omni using Wan2.2 and other video generation models. Use when generating videos from text, creating videos from images, configuring video generation parameters, or working with text-to-video or image-to-video models.

67SKILL.mdUpdated May 3, 2026

hsliuustc0106/vllm-omni-video-gen

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/hsliuustc0106/vllm-omni-skills.git

# Copy into Claude Code skills folder (global)
cp -r vllm-omni-skills/skills/vllm-omni-nightly-local ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

hsliuustc0106/vllm-omni-skills

64 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT

Adoption

hsliuustc0106/skills/vllm-omni-nightly-local

$ install --global

Security Scan Results

SKILL.md

vLLM-Omni Nightly Local (cluster run & log sync)

Overview

Required user inputs

1. Login environment

1.1 SSH

1.2 Find JOBID

1.3 Run in container (no TTY)

1.4 New allocation if no JOBID

1.5 Optional: docker exec -it for debugging only

1.6 Agent: BatchMode SSH

2. Run test cases

2.1 Background / resilient shell (recommended when tests run for a long time)

3. Sync logs off-cluster

Agent workflow

References

Related Skills

hsliuustc0106/vllm-omni-pre-check

hsliuustc0106/skills/vllm-omni-test-report

hsliuustc0106/vllm-omni-review

hsliuustc0106/vllm-omni-video-gen

hsliuustc0106/skills/vllm-omni-nightly-local

$ install --global

Security Scan Results

SKILL.md

vLLM-Omni Nightly Local (cluster run & log sync)

Overview

Required user inputs

1. Login environment

1.1 SSH

1.2 Find JOBID

1.3 Run in container (no TTY)

1.4 New allocation if no JOBID

1.5 Optional: docker exec -it for debugging only

1.6 Agent: BatchMode SSH

2. Run test cases

2.1 Background / resilient shell (recommended when tests run for a long time)

3. Sync logs off-cluster

Agent workflow

References

Related Skills

hsliuustc0106/vllm-omni-pre-check

hsliuustc0106/skills/vllm-omni-test-report

hsliuustc0106/vllm-omni-review

hsliuustc0106/vllm-omni-video-gen

1.5 Optional: `docker exec -it` for debugging only

1.5 Optional: `docker exec -it` for debugging only