Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

hsliuustc0106/vllm-omni-recipe

Name: vllm-omni-recipe
Author: hsliuustc0106

skills/vllm-omni-recipe/SKILL.md

npx skillsauth add hsliuustc0106/vllm-omni-skills vllm-omni-recipe

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

vLLM-Omni Recipe Creation

Overview

vLLM-Omni extends vLLM to support non-autoregressive models like Diffusion Transformers (DiT) for omnimodal generation: text-to-image, text-to-video, text-to-audio, image-to-video, and any-to-any generation.

This skill guides creating deployment guides for omnimodal models in the vLLM recipes repository.

When to Use

Adding text-to-image, text-to-video, text-to-audio, any-to-any model recipes
Documenting Diffusion Transformer (DiT) deployments
Creating recipes for hybrid AR + diffusion architectures

Recipe Structure

Every recipe follows this structure. Sections marked ⚪ are optional.

# ModelName Usage Guide

[Introduction with HuggingFace link, architecture description]

## Installing vLLM-Omni
[Version-variable based installation]

## [Modality] Generation
[Python API and CLI examples]

## Recommended Deployment Strategy
[Hardware recommendations by model size]

## Key Parameters
[Generation config table]

## Expected Performance ⚪
[Only if verified measurements available]

## Accuracy Comparison ⚪
[Only if verified measurements available]

## Online Serving ⚪
[If supported]

## Additional Resources
[Model card, examples, related links]

For detailed section templates and code examples, see references/recipe-template.md.

Required Sections

1. Introduction

Include:

HuggingFace model link
Architecture type (DiT, AR+Diffusion, MoE)
Key capabilities and parameters

2. Installing vLLM-Omni

Use version variables:

export VLLM_VERSION="0.16.0"

uv venv
source .venv/bin/activate
uv pip install vllm==$VLLM_VERSION
uv pip install git+https://github.com/vllm-project/vllm-omni.git

Add modality-specific dependencies: pillow/diffusers for image/video, soundfile for audio.

3. Generation Examples

Provide Python API examples for all supported modalities. See references/recipe-template.md for code examples.

4. Recommended Deployment Strategy

Include hardware recommendations table with:

Model sizes and variants
Recommended GPU configurations
Memory requirements
Notes on MoE, batching, etc.

5. Key Parameters Table

Document generation parameters: height, width, num_inference_steps, guidance_scale, negative_prompt, num_frames (video), audio_end_in_s (audio).

Optional Sections

Performance & Accuracy ⚪

Only include if you have verified measurements. Do not fabricate benchmark numbers.

Expected Performance: generation time, memory usage on specific hardware
Accuracy Comparison: FID/CLIP scores vs Diffusers baseline

Online Serving ⚪

If model supports OpenAI-compatible serving:

vllm serve org/model-name --omni

Cache-DiT Acceleration ⚪

For DiT models that support caching:

omni = Omni(model="org/model-name", cache_backend="cache_dit")

File Naming

Directory: {OrgName}/ (e.g., Qwen/, DeepSeek/)
File: {ModelName}.md (e.g., Qwen-Image.md)
Use underscores for versions: Wan2_2.md or Wan2.2.md

Common Mistakes

| Mistake | Fix | |---------|-----| | Placeholder version (0.XX.0) | Use $VLLM_VERSION variable | | Missing modality dependencies | Add soundfile for audio, diffusers for video | | Wrong Omni import | Use from vllm_omni.entrypoints.omni import Omni | | Fabricated benchmarks | Only include verified measurements | | Missing from README | Add to skills index |

Checklist

[ ] Title follows # ModelName Usage Guide format
[ ] HuggingFace link in introduction
[ ] Architecture description (DiT, AR+Diffusion, MoE)
[ ] Installing vLLM-Omni with $VLLM_VERSION
[ ] Modality-specific dependencies
[ ] Python API examples for supported modalities
[ ] Recommended deployment strategy by hardware
[ ] Key parameters table
[ ] Performance/accuracy sections (optional, only if verified)
[ ] Online serving section (if supported)
[ ] File named correctly
[ ] README.md updated with new entry

References

recipe-template.md - Detailed section templates and code examples

Related Skills

vllm-omni-contrib: Contributing new models and development workflow to vLLM-Omni
For standard LLM/vLLM recipes (autoregressive models), refer to the vLLM recipes repository for examples

hsliuustc0106/vllm-omni-recipe

skills/vllm-omni-recipe/SKILL.md

Use when adding a recipe for omnimodal models (text-to-image, text-to-video, text-to-audio, image-to-video, any-to-any, diffusion transformers) to the vLLM recipes repository, or documenting vLLM-Omni deployment

59 stars

testing

Updated May 3, 2026

$ install --global

skillsauth

npx skillsauth add hsliuustc0106/vllm-omni-skills vllm-omni-recipe

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 3, 2026, 2:55 AM116.5s2 files scanned

SKILL.md

name:: vllm-omni-recipe
description:: Use when adding a recipe for omnimodal models (text-to-image, text-to-video, text-to-audio, image-to-video, any-to-any, diffusion transformers) to the vLLM recipes repository, or documenting vLLM-Omni deployment

vLLM-Omni Recipe Creation

Overview

This skill guides creating deployment guides for omnimodal models in the vLLM recipes repository.

When to Use

Adding text-to-image, text-to-video, text-to-audio, any-to-any model recipes
Documenting Diffusion Transformer (DiT) deployments
Creating recipes for hybrid AR + diffusion architectures

Recipe Structure

Every recipe follows this structure. Sections marked ⚪ are optional.

# ModelName Usage Guide

[Introduction with HuggingFace link, architecture description]

## Installing vLLM-Omni
[Version-variable based installation]

## [Modality] Generation
[Python API and CLI examples]

## Recommended Deployment Strategy
[Hardware recommendations by model size]

## Key Parameters
[Generation config table]

## Expected Performance ⚪
[Only if verified measurements available]

## Accuracy Comparison ⚪
[Only if verified measurements available]

## Online Serving ⚪
[If supported]

## Additional Resources
[Model card, examples, related links]

For detailed section templates and code examples, see references/recipe-template.md.

Required Sections

1. Introduction

Include:

HuggingFace model link
Architecture type (DiT, AR+Diffusion, MoE)
Key capabilities and parameters

2. Installing vLLM-Omni

Use version variables:

export VLLM_VERSION="0.16.0"

uv venv
source .venv/bin/activate
uv pip install vllm==$VLLM_VERSION
uv pip install git+https://github.com/vllm-project/vllm-omni.git

Add modality-specific dependencies: pillow/diffusers for image/video, soundfile for audio.

3. Generation Examples

Provide Python API examples for all supported modalities. See references/recipe-template.md for code examples.

4. Recommended Deployment Strategy

Include hardware recommendations table with:

Model sizes and variants
Recommended GPU configurations
Memory requirements
Notes on MoE, batching, etc.

5. Key Parameters Table

Document generation parameters: height, width, num_inference_steps, guidance_scale, negative_prompt, num_frames (video), audio_end_in_s (audio).

Optional Sections

Performance & Accuracy ⚪

Only include if you have verified measurements. Do not fabricate benchmark numbers.

Expected Performance: generation time, memory usage on specific hardware
Accuracy Comparison: FID/CLIP scores vs Diffusers baseline

Online Serving ⚪

If model supports OpenAI-compatible serving:

vllm serve org/model-name --omni

Cache-DiT Acceleration ⚪

For DiT models that support caching:

omni = Omni(model="org/model-name", cache_backend="cache_dit")

File Naming

Directory: {OrgName}/ (e.g., Qwen/, DeepSeek/)
File: {ModelName}.md (e.g., Qwen-Image.md)
Use underscores for versions: Wan2_2.md or Wan2.2.md

Common Mistakes

Checklist

[ ] Title follows # ModelName Usage Guide format
[ ] HuggingFace link in introduction
[ ] Architecture description (DiT, AR+Diffusion, MoE)
[ ] Installing vLLM-Omni with $VLLM_VERSION
[ ] Modality-specific dependencies
[ ] Python API examples for supported modalities
[ ] Recommended deployment strategy by hardware
[ ] Key parameters table
[ ] Performance/accuracy sections (optional, only if verified)
[ ] Online serving section (if supported)
[ ] File named correctly
[ ] README.md updated with new entry

References

recipe-template.md - Detailed section templates and code examples

Related Skills

vllm-omni-contrib: Contributing new models and development workflow to vLLM-Omni
For standard LLM/vLLM recipes (autoregressive models), refer to the vLLM recipes repository for examples

Related Skills

hsliuustc0106/vllm-omni-pre-check

development

VerifiedTrustedCommunity

Use before submitting a PR to vllm-project/vllm-omni — self-check the branch against project conventions, catch dead code, verify accuracy/performance claims, and confirm merge readiness. Use when the user says "pre-check", "self review", "pre-submit check", or "check my PR before I open it."

69SKILL.mdUpdated May 29, 2026

hsliuustc0106/vllm-omni-pre-check

hsliuustc0106/skills/vllm-omni-test-report

development

VerifiedTrustedCommunity

--- name: vllm-omni-test-report description: Two report kinds; **default output is always HTML** unless the user explicitly asks for Markdown (.md). **Release** — `scripts/compose_full_report.py` (**测试结论**, Buildkite metrics, **Test Result** = Common stack + optional `--log-dir-h*` nightly-style summaries + H100/CI block, **Issue tracking** = GitHub `ci-failure` + *local test* in:title, Open bugs); use `--format markdown` only when the user wants .md or `patch_report_*.py`. **Nightly** — `script

69SKILL.mdUpdated May 3, 2026

hsliuustc0106/skills/vllm-omni-test-report

hsliuustc0106/vllm-omni-review

testing

VerifiedTrustedCommunity

Review PRs on vllm-project/vllm-omni by routing to the right domain skills, checking critical evidence, and focusing comments on blocking issues. Use when reviewing pull requests or local branches, triaging review depth, running detailed or default review, or checking tests, benchmarks, and breaking changes in vllm-omni.

69SKILL.mdUpdated May 3, 2026

hsliuustc0106/vllm-omni-review

hsliuustc0106/vllm-omni-video-gen

data-ai

VerifiedTrustedCommunity

Generate videos with vLLM-Omni using Wan2.2 and other video generation models. Use when generating videos from text, creating videos from images, configuring video generation parameters, or working with text-to-video or image-to-video models.

67SKILL.mdUpdated May 3, 2026

hsliuustc0106/vllm-omni-video-gen

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/hsliuustc0106/vllm-omni-skills.git

# Copy into Claude Code skills folder (global)
cp -r vllm-omni-skills/skills/vllm-omni-recipe ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

hsliuustc0106/vllm-omni-skills

59 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT