Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

hsliuustc0106/vllm-omni-contrib

Name: vllm-omni-contrib
Author: hsliuustc0106

skills/vllm-omni-contrib/SKILL.md

npx skillsauth add hsliuustc0106/vllm-omni-skills vllm-omni-contrib

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Contributing to vLLM-Omni

Overview

vLLM-Omni welcomes contributions including new model integrations, bug fixes, performance improvements, and documentation. This skill covers the development workflow, model integration process, and testing practices.

Development Environment Setup

Step 1: Fork and Clone

git clone https://github.com/<your-username>/vllm-omni.git
cd vllm-omni

Step 2: Install in Development Mode

uv venv --python $PYTHON_VERSION --seed
source .venv/bin/activate
uv pip install vllm==$VLLM_VERSION --torch-backend=auto
uv pip install -e ".[dev]"

Step 3: Install Pre-commit Hooks

pre-commit install

Code Organization

vllm_omni/
├── entrypoints/          # API entry points (Omni, AsyncOmni, API server)
├── engine/               # OmniRouter, pipeline orchestration
├── stages/               # Stage implementations (AR, Diffusion)
├── models/               # Model-specific implementations
├── connectors/           # OmniConnector for disaggregation
├── worker/               # Worker processes for distributed execution
└── utils/                # Shared utilities

Adding a New Model

Step 1: Identify Model Architecture

Determine which category your model falls into:

AR-only: Text generation models (use existing vLLM model support)
Diffusion-only: Image/video generation (DiT architecture)
Multi-stage: AR + Diffusion pipeline (e.g., Qwen-Image)
Omni: Full multi-modal input/output (e.g., Qwen-Omni)

Step 2: Implement Model Class

Create a new file in vllm_omni/models/:

# vllm_omni/models/my_new_model.py

from vllm_omni.stages.base import BaseStage

class MyNewModelPipeline:
    """Pipeline for MyNewModel."""

    def __init__(self, model_config, ...):
        ...

    def generate(self, prompts, ...):
        ...

Step 3: Register the Model

Add your model to the model registry so vLLM-Omni can discover it:

# In the appropriate registry file
SUPPORTED_MODELS = {
    ...
    "MyNewModelPipeline": ("my_new_model", "MyNewModelPipeline"),
}

For out-of-tree plugins, use the public API instead:

from vllm_omni.diffusion.registry import register_diffusion_model

register_diffusion_model(
    model_arch="MyNewModel",
    module_name="my_plugin.models.my_new_model",
    class_name="MyNewModelPipeline",
    pre_process_func_name="pre_process",  # optional
    post_process_func_name="post_process",  # optional
)

This registers custom diffusion pipelines without modifying core source. For out-of-tree plugins, module_name should be the full import path of the module containing the pipeline class.

Step 4: Add Stage Configuration

Create a default stage config YAML:

# vllm_omni/configs/my_new_model.yaml
stages:
  - name: "main"
    stage_type: "diffusion"  # or "ar"
    stage_args:
      runtime:
        max_batch_size: 1

Step 5: Write Tests

# tests/models/test_my_new_model.py
import pytest
from vllm_omni.entrypoints.omni import Omni

@pytest.mark.parametrize("prompt", [
    "a simple test image",
    "a red circle on white background",
])
def test_basic_generation(prompt):
    omni = Omni(model="my-org/my-new-model")
    outputs = omni.generate(prompt)
    assert len(outputs) > 0
    assert outputs[0].request_output[0].images is not None

Step 6: Add Documentation

Add your model to docs/models/supported_models.md with:

Architecture name
Model name
Example HF model ID
Any special requirements

Testing

Run Unit Tests

pytest tests/ -v

Run Specific Test File

pytest tests/models/test_my_new_model.py -v

Run with Coverage

pytest tests/ --cov=vllm_omni --cov-report=html

Code Style

Follow the existing code patterns in the repository
Use type hints consistently
Run pre-commit hooks before committing:
```
pre-commit run --all-files
```

Pull Request Process

Create a feature branch: git checkout -b feat/add-my-model
Make changes and write tests
Run tests locally: pytest tests/
Run linting: pre-commit run --all-files
Push and open a PR against main
Fill in the PR template with description and test results
Address review feedback

Troubleshooting Development

Import errors after install: Reinstall with uv pip install -e .

Tests fail with GPU errors: Some tests require a GPU. Run with pytest -m "not gpu" to skip GPU tests.

Pre-commit hook fails: Run pre-commit run --all-files to see specific issues.

OmniDiffusionConfig field name collision with vLLM attention_config: Fixed in #3489. Use diffusion_attention_config (not attention_config) in deploy YAML and code for diffusion pipelines. The old key is silently dropped.

RMSNorm inductor KeyError under HSDP + torch.compile: Fixed in #3460. fused_rms_norm inductor tracing now avoids calling .data on DTensor objects.

References

For detailed model integration patterns, see references/model-integration.md

hsliuustc0106/vllm-omni-contrib

skills/vllm-omni-contrib/SKILL.md

Contribute to vLLM-Omni by adding new model support, fixing bugs, or improving features. Use when integrating a new model into vllm-omni, setting up a development environment, writing tests, or submitting pull requests to the vllm-omni project.

67 stars

development

Updated May 25, 2026

$ install --global

skillsauth

npx skillsauth add hsliuustc0106/vllm-omni-skills vllm-omni-contrib

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 25, 2026, 2:17 AM10.0s2 files scanned

SKILL.md

name:: vllm-omni-contrib
description:: Contribute to vLLM-Omni by adding new model support, fixing bugs, or improving features. Use when integrating a new model into vllm-omni, setting up a development environment, writing tests, or submitting pull requests to the vllm-omni project.

Contributing to vLLM-Omni

Overview

Development Environment Setup

Step 1: Fork and Clone

git clone https://github.com/<your-username>/vllm-omni.git
cd vllm-omni

Step 2: Install in Development Mode

uv venv --python $PYTHON_VERSION --seed
source .venv/bin/activate
uv pip install vllm==$VLLM_VERSION --torch-backend=auto
uv pip install -e ".[dev]"

Step 3: Install Pre-commit Hooks

pre-commit install

Code Organization

vllm_omni/
├── entrypoints/          # API entry points (Omni, AsyncOmni, API server)
├── engine/               # OmniRouter, pipeline orchestration
├── stages/               # Stage implementations (AR, Diffusion)
├── models/               # Model-specific implementations
├── connectors/           # OmniConnector for disaggregation
├── worker/               # Worker processes for distributed execution
└── utils/                # Shared utilities

Adding a New Model

Step 1: Identify Model Architecture

Determine which category your model falls into:

AR-only: Text generation models (use existing vLLM model support)
Diffusion-only: Image/video generation (DiT architecture)
Multi-stage: AR + Diffusion pipeline (e.g., Qwen-Image)
Omni: Full multi-modal input/output (e.g., Qwen-Omni)

Step 2: Implement Model Class

Create a new file in vllm_omni/models/:

# vllm_omni/models/my_new_model.py

from vllm_omni.stages.base import BaseStage

class MyNewModelPipeline:
    """Pipeline for MyNewModel."""

    def __init__(self, model_config, ...):
        ...

    def generate(self, prompts, ...):
        ...

Step 3: Register the Model

Add your model to the model registry so vLLM-Omni can discover it:

# In the appropriate registry file
SUPPORTED_MODELS = {
    ...
    "MyNewModelPipeline": ("my_new_model", "MyNewModelPipeline"),
}

For out-of-tree plugins, use the public API instead:

from vllm_omni.diffusion.registry import register_diffusion_model

register_diffusion_model(
    model_arch="MyNewModel",
    module_name="my_plugin.models.my_new_model",
    class_name="MyNewModelPipeline",
    pre_process_func_name="pre_process",  # optional
    post_process_func_name="post_process",  # optional
)

This registers custom diffusion pipelines without modifying core source. For out-of-tree plugins, module_name should be the full import path of the module containing the pipeline class.

Step 4: Add Stage Configuration

Create a default stage config YAML:

# vllm_omni/configs/my_new_model.yaml
stages:
  - name: "main"
    stage_type: "diffusion"  # or "ar"
    stage_args:
      runtime:
        max_batch_size: 1

Step 5: Write Tests

# tests/models/test_my_new_model.py
import pytest
from vllm_omni.entrypoints.omni import Omni

@pytest.mark.parametrize("prompt", [
    "a simple test image",
    "a red circle on white background",
])
def test_basic_generation(prompt):
    omni = Omni(model="my-org/my-new-model")
    outputs = omni.generate(prompt)
    assert len(outputs) > 0
    assert outputs[0].request_output[0].images is not None

Step 6: Add Documentation

Add your model to docs/models/supported_models.md with:

Architecture name
Model name
Example HF model ID
Any special requirements

Testing

Run Unit Tests

pytest tests/ -v

Run Specific Test File

pytest tests/models/test_my_new_model.py -v

Run with Coverage

pytest tests/ --cov=vllm_omni --cov-report=html

Code Style

Follow the existing code patterns in the repository
Use type hints consistently
Run pre-commit hooks before committing:
```
pre-commit run --all-files
```

Pull Request Process

Create a feature branch: git checkout -b feat/add-my-model
Make changes and write tests
Run tests locally: pytest tests/
Run linting: pre-commit run --all-files
Push and open a PR against main
Fill in the PR template with description and test results
Address review feedback

Troubleshooting Development

Import errors after install: Reinstall with uv pip install -e .

Tests fail with GPU errors: Some tests require a GPU. Run with pytest -m "not gpu" to skip GPU tests.

Pre-commit hook fails: Run pre-commit run --all-files to see specific issues.

RMSNorm inductor KeyError under HSDP + torch.compile: Fixed in #3460. fused_rms_norm inductor tracing now avoids calling .data on DTensor objects.

References

For detailed model integration patterns, see references/model-integration.md

Related Skills

hsliuustc0106/vllm-omni-pre-check

development

VerifiedTrustedCommunity

Use before submitting a PR to vllm-project/vllm-omni — self-check the branch against project conventions, catch dead code, verify accuracy/performance claims, and confirm merge readiness. Use when the user says "pre-check", "self review", "pre-submit check", or "check my PR before I open it."

69SKILL.mdUpdated May 29, 2026

hsliuustc0106/vllm-omni-pre-check

hsliuustc0106/skills/vllm-omni-test-report

development

VerifiedTrustedCommunity

--- name: vllm-omni-test-report description: Two report kinds; **default output is always HTML** unless the user explicitly asks for Markdown (.md). **Release** — `scripts/compose_full_report.py` (**测试结论**, Buildkite metrics, **Test Result** = Common stack + optional `--log-dir-h*` nightly-style summaries + H100/CI block, **Issue tracking** = GitHub `ci-failure` + *local test* in:title, Open bugs); use `--format markdown` only when the user wants .md or `patch_report_*.py`. **Nightly** — `script

69SKILL.mdUpdated May 3, 2026

hsliuustc0106/skills/vllm-omni-test-report

hsliuustc0106/vllm-omni-review

testing

VerifiedTrustedCommunity

Review PRs on vllm-project/vllm-omni by routing to the right domain skills, checking critical evidence, and focusing comments on blocking issues. Use when reviewing pull requests or local branches, triaging review depth, running detailed or default review, or checking tests, benchmarks, and breaking changes in vllm-omni.

69SKILL.mdUpdated May 3, 2026

hsliuustc0106/vllm-omni-review

hsliuustc0106/vllm-omni-video-gen

data-ai

VerifiedTrustedCommunity

Generate videos with vLLM-Omni using Wan2.2 and other video generation models. Use when generating videos from text, creating videos from images, configuring video generation parameters, or working with text-to-video or image-to-video models.

67SKILL.mdUpdated May 3, 2026

hsliuustc0106/vllm-omni-video-gen

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/hsliuustc0106/vllm-omni-skills.git

# Copy into Claude Code skills folder (global)
cp -r vllm-omni-skills/skills/vllm-omni-contrib ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

hsliuustc0106/vllm-omni-skills

67 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT