BBuf/model-architecture-diagram

Name: model-architecture-diagram
Author: BBuf

skills/model-architecture-diagram/SKILL.md

npx skillsauth add BBuf/AI-Infra-Auto-Driven-SKILLS model-architecture-diagram

Model Architecture Diagram

Workflow

Return only public original diagrams indexed by this skill.

Run the bundled resolver:

python3 skills/model-architecture-diagram/scripts/model_architecture_diagram.py "<model name>"

If the resolver returns kind: existing, return the raw image Markdown it prints and preserve the source attribution line.
If the resolver returns kind: no_match, tell the user that no public original architecture diagram is indexed for that model.
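
The two resolver outcomes above can be sketched as a small dispatch function. This is an illustration under stated assumptions, not the skill's own code: the helper name reply_for and the idea of passing the resolver's printed Markdown in as a string are hypothetical; only the kind values existing and no_match come from the workflow.

```python
def reply_for(kind: str, printed_markdown: str, model_name: str) -> str:
    """Map a resolver outcome to the text returned to the user.

    Hypothetical helper: only the two kind values come from the workflow.
    """
    if kind == "existing":
        # Pass the resolver output through untouched so the image Markdown
        # and the source attribution line are both preserved.
        return printed_markdown
    if kind == "no_match":
        return (
            "No public original architecture diagram is indexed "
            f"for {model_name}."
        )
    # Any other value is unexpected; fail loudly instead of guessing.
    raise ValueError(f"unexpected resolver kind: {kind!r}")
```

Returning the printed Markdown unchanged is the simplest way to keep the attribution line intact, since nothing is re-rendered.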

Source Priority

Use references/diagram-index.json as the source of truth. It stores raw GitHub image URLs from:

datawhalechina/self-llm
CalvinXKY/InfraTech
Tongyi-MAI/Z-Image
Wan-Video/Wan2.1
Wan-Video/Wan2.2
Tencent-Hunyuan/HunyuanVideo
Tencent-Hunyuan/Hunyuan3D-2
brayevalerien/Flux.1-Architecture-Diagram

Prefer detailed implementation, cookbook, or architecture-card diagrams over paper figures. Good sources show module boundaries, dataflow, MoE / attention / cache paths, or model-specific runtime structure rather than only a high-level paper overview. Official repository diagrams and curated implementation diagrams are first choice; paper figures are fallback only when no more detailed public original diagram is indexed.
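
One way to read the preference rule above is as a two-tier filter. The sketch below assumes each candidate is a dict with a source_kind label; that field name and the label values are hypothetical and are not taken from references/diagram-index.json.

```python
from typing import Dict, List

# Hypothetical labels for the detailed diagram categories named above.
DETAILED_KINDS = {"implementation", "cookbook", "architecture-card"}


def pick_diagrams(candidates: List[Dict[str, str]]) -> List[Dict[str, str]]:
    """Return detailed diagrams if any exist, else fall back to paper figures."""
    detailed = [c for c in candidates if c["source_kind"] in DETAILED_KINDS]
    if detailed:
        return detailed
    # Paper figures are used only when nothing more detailed is available.
    return [c for c in candidates if c["source_kind"] == "paper"]
```

For example, given one paper figure and one cookbook diagram for the same model, only the cookbook diagram is returned.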

Do not copy remote image binaries into the skill. Return the raw GitHub URLs so the chat renderer can display the original image.

Existing Diagram Rule

For a direct match, show the original image. Good direct matches include:

DeepSeek V3/V3.2/V4, GLM-5, Kimi K2/K2.5, MiniMax M2.5, Qwen3.5, Qwen3-VL, and Step 3.5 Flash from InfraTech.
Hunyuan-A13B, Kimi-VL, Qwen3, Qwen3-VL detail flows, MiniMax M2, and Llama 4 architecture/module diagrams from self-llm.
Z-Image, Wan2.1, Wan2.2, HunyuanVideo, Hunyuan3D 2.0, and FLUX.1 diffusion architecture/module diagrams from public GitHub sources.

If multiple diagrams match, show all high-confidence matches up to the resolver's default limit. For example, DeepSeek V3 may return the full architecture plus MLA MHA/MQA diagrams.

Hosted Original Diagram Gallery

Do not commit the sgl-cookbook-model-architecture-images/ gallery into the repository. The public-original image set is hosted as a GitHub Release asset and indexed by a GitHub issue.

Current hosted artifact:

Issue index: https://github.com/BBuf/AI-Infra-Auto-Driven-SKILLS/issues/31
Release page: https://github.com/BBuf/AI-Infra-Auto-Driven-SKILLS/releases/tag/sgl-cookbook-architecture-images-2026-05-02
Zip download: https://github.com/BBuf/AI-Infra-Auto-Driven-SKILLS/releases/download/sgl-cookbook-architecture-images-2026-05-02/sgl-cookbook-model-architecture-images-2026-05-02.zip
Digest: sha256:ea432081849a250429d3d1ecf246e267c5cc42f989aaf4b9ca695b581e7fa50f

The artifact contains 44 public original diagram image files from the indexed upstream repositories, plus lightweight supporting files: index.html, index.md, manifest.json, an HTML contact sheet, and architecture-audit.md.

To inspect the gallery locally:

# Download the release archive
curl -L -o /tmp/sgl-cookbook-model-architecture-images-2026-05-02.zip \
  https://github.com/BBuf/AI-Infra-Auto-Driven-SKILLS/releases/download/sgl-cookbook-architecture-images-2026-05-02/sgl-cookbook-model-architecture-images-2026-05-02.zip
# Unpack into /tmp
unzip -q /tmp/sgl-cookbook-model-architecture-images-2026-05-02.zip -d /tmp
# Open the gallery index (open is macOS; on Linux use xdg-open)
open /tmp/sgl-cookbook-model-architecture-images/index.html
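
The published digest can be checked against the downloaded archive before unpacking it. The following is a minimal sketch using Python's standard hashlib module; the helper names sha256_of and matches_digest are illustrative and not part of the skill.

```python
import hashlib
from pathlib import Path

# Digest as published for the current hosted artifact.
EXPECTED = "sha256:ea432081849a250429d3d1ecf246e267c5cc42f989aaf4b9ca695b581e7fa50f"


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_digest(path: Path, expected: str = EXPECTED) -> bool:
    """Compare a file's SHA-256 with a digest that may carry a 'sha256:' prefix."""
    expected_hex = expected.split(":", 1)[-1].lower()
    return sha256_of(path) == expected_hex
```

Calling matches_digest(Path("/tmp/sgl-cookbook-model-architecture-images-2026-05-02.zip")) after the download step returns True only if the archive matches the published value.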

Useful Commands

List known original diagram aliases:

python3 skills/model-architecture-diagram/scripts/model_architecture_diagram.py --list-known

Emit JSON for automation:

python3 skills/model-architecture-diagram/scripts/model_architecture_diagram.py "GLM-5" --format json
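
For automation, the JSON mode can be wrapped in a short script. This sketch only assumes that the command prints a single JSON document on standard output; the shape of that document is not described here, so the parsed value is returned as-is. The helper names resolver_command and run_json are hypothetical.

```python
import json
import subprocess
from typing import Any, List

RESOLVER = "skills/model-architecture-diagram/scripts/model_architecture_diagram.py"


def resolver_command(model_name: str) -> List[str]:
    """Build the documented JSON-mode command line for one model name."""
    return ["python3", RESOLVER, model_name, "--format", "json"]


def run_json(command: List[str]) -> Any:
    """Run a command and parse its standard output as JSON.

    check=True raises CalledProcessError on a non-zero exit status, so a
    failed resolver run is not mistaken for an empty result.
    """
    completed = subprocess.run(
        command, capture_output=True, text=True, check=True
    )
    return json.loads(completed.stdout)
```

From the repository root, run_json(resolver_command("GLM-5")) would return the parsed resolver output. Passing the command as a list avoids shell quoting problems with model names that contain spaces.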

References

references/diagram-index.json: original diagram link index and aliases.
references/source-notes.md: audited source repositories and local cache paths.

Return public original model architecture diagrams for user-specified LLM, VLM, MoE, diffusion, OCR, and SGLang/sgl-cookbook model families. Use when the user asks for a model structure chart, architecture diagram, or rendered image link for a specific model such as DeepSeek, GLM, Qwen, Kimi, MiniMax, Step, Hunyuan, or Qwen3-VL.

Stars: 202
Category: testing
Updated: May 3, 2026

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

npx skillsauth add BBuf/AI-Infra-Auto-Driven-SKILLS model-architecture-diagram

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy: Clean, 95%. Container and dependency vulnerability scanner.
Semgrep: Clean, 95%. Static code analysis for vulnerabilities.
mcp-scan (Snyk): Clean, 95%. Model Context Protocol security validation.
Snyk (dep): Skipped, 50%. Open source security scanning.
Socket.dev: Skipped, 50%. Supply chain security analysis.
VirusTotal: Skipped, 50%. Multi-engine malware detection.
CrowdStrike: Skipped, 50%. Advanced threat intelligence.
OSV-Scanner: Skipped, 50%. Open Source Vulnerability database check.
OWASP Dep-Check: Skipped, 50%.

Last scanned: May 3, 2026, 2:27 AM
Scan time: 143.5s
Files scanned: 4

Install manually

# Clone the repo
git clone https://github.com/BBuf/AI-Infra-Auto-Driven-SKILLS.git

# Create the global Claude Code skills folder if it does not exist yet
mkdir -p ~/.claude/skills

# Copy the skill into it
cp -r AI-Infra-Auto-Driven-SKILLS/skills/model-architecture-diagram ~/.claude/skills/

See the official Claude Code Skills documentation for the skills path.

Repository: BBuf/AI-Infra-Auto-Driven-SKILLS (202 stars)

Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT