tools/image/qwen-image-2/SKILL.md
Generate and edit images with Alibaba Qwen-Image-2.0 models via inference.sh CLI. Models: Qwen-Image-2.0 (fast), Qwen-Image-2.0-Pro (professional text rendering). Capabilities: text-to-image, multi-image editing, complex text rendering. Triggers: qwen image, qwen-image, alibaba image, dashscope image, qwen image 2, qwen image pro
npx skillsauth add inference-sh/agent-skills-registry qwen-image-2
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Generate and edit images with Alibaba Qwen-Image-2.0 models via inference.sh CLI.

Requires the inference.sh CLI (belt); see its install instructions.
belt login
belt app run alibaba/qwen-image-2 --input '{"prompt": "A serene mountain landscape at sunset"}'
| Model | App ID | Speed | Text Rendering | Best For |
|-------|--------|-------|----------------|----------|
| Qwen-Image-2.0 | alibaba/qwen-image-2 | Fast | Good | General use |
| Qwen-Image-2.0-Pro | alibaba/qwen-image-2-pro | Standard | Professional | Posters, text-heavy designs |
belt app list --search "qwen image"
belt app run alibaba/qwen-image-2 --input '{
"prompt": "A futuristic cityscape at sunset with flying cars"
}'
belt app run alibaba/qwen-image-2 --input '{
"prompt": "Minimalist logo design for a coffee shop",
"num_images": 4
}'
belt app run alibaba/qwen-image-2-pro --input '{
"prompt": "Panoramic mountain landscape with northern lights",
"width": 1536,
"height": 1024
}'
belt app run alibaba/qwen-image-2-pro --input '{
"prompt": "Poster with title \"Summer Sale!\" in bold red text at the top. Subtitle \"50% Off Everything\" in blue below. Beach background with palm trees.",
"width": 1024,
"height": 1536,
"prompt_extend": false
}'
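Hand-escaping quotes inside inline JSON, as in the poster example above, is easy to get wrong. The sketch below shows one way to build the same input from a plain shell string. `json_escape` and `build_poster_input` are hypothetical helper names, not part of the belt CLI, and the escaping only covers backslashes and double quotes (not newlines or other control characters).

```bash
#!/usr/bin/env bash
# Hypothetical helpers (not part of belt): build the --input JSON for a
# text-rendering prompt without hand-escaping the quotes.

# Escape backslashes and double quotes so the string can sit inside a JSON
# string literal. Newlines and other control characters are NOT handled.
json_escape() {
  local s=$1
  s=${s//\\/\\\\}   # backslashes first, so later escapes are not doubled
  s=${s//\"/\\\"}   # then double quotes
  printf '%s' "$s"
}

# Print a JSON object using fields from the parameter table:
# prompt, width, height, and prompt_extend set to false.
build_poster_input() {
  local prompt width=$2 height=$3
  prompt=$(json_escape "$1")
  printf '{"prompt": "%s", "width": %d, "height": %d, "prompt_extend": false}\n' \
    "$prompt" "$width" "$height"
}
```

The result can then be passed inline, for example `belt app run alibaba/qwen-image-2-pro --input "$(build_poster_input 'Poster with title "Summer Sale!" in bold red text' 1024 1536)"`.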
belt app run alibaba/qwen-image-2 --input '{
"prompt": "Make the girl from Image 1 wear the dress from Image 2 in the pose from Image 3",
"reference_images": [
{"uri": "https://example.com/person.jpg"},
{"uri": "https://example.com/dress.jpg"},
{"uri": "https://example.com/pose.jpg"}
]
}'
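When the image URLs come from variables, the `reference_images` array from the example above can be assembled programmatically. This is a minimal sketch: `build_reference_images` is a hypothetical helper name, it enforces the 1-3 image limit from the parameter table, and it assumes the URLs contain no double quotes or backslashes.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of belt): turn 1-3 image URLs into the
# reference_images JSON array shown in the editing example.
# Assumes URLs contain no double quotes or backslashes.
build_reference_images() {
  local n=$#
  if (( n < 1 || n > 3 )); then
    echo "expected 1-3 image URLs, got $n" >&2
    return 1
  fi
  local out="" url
  for url in "$@"; do
    # Prefix a comma separator for every element after the first.
    out+="${out:+, }{\"uri\": \"$url\"}"
  done
  printf '[%s]\n' "$out"
}
```

The order of the arguments matters, since the prompt refers to the images as Image 1, Image 2, and Image 3.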
belt app run alibaba/qwen-image-2-pro --input '{
"prompt": "Professional headshot portrait, studio lighting",
"negative_prompt": "low resolution, blurry, deformed, oversaturated"
}'
belt app run alibaba/qwen-image-2 --input '{
"prompt": "Abstract geometric art in blue and gold",
"seed": 12345
}'
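A fixed seed only helps if it is recorded. The sketch below shows one possible way to draw a seed within the documented 0-2147483647 range and embed it in the input, so the value can be logged and reused later. `new_seed` and `seeded_input` are hypothetical helper names, and the prompt is assumed to contain no characters that need JSON escaping.

```bash
#!/usr/bin/env bash
# Hypothetical helpers (not part of belt) for recording and reusing seeds.

# Draw a seed in the documented range 0-2147483647.
# Bash's RANDOM yields 15 bits, so three draws are mixed and masked to 31 bits.
new_seed() {
  echo $(( ((RANDOM << 16) ^ (RANDOM << 8) ^ RANDOM) & 0x7fffffff ))
}

# Print an input object with an explicit seed.
# Assumes the prompt needs no JSON escaping.
seeded_input() {
  local prompt=$1 seed=$2
  if ! [[ $seed =~ ^(0|[1-9][0-9]{0,9})$ ]] || (( seed > 2147483647 )); then
    echo "seed must be an integer in 0-2147483647" >&2
    return 1
  fi
  printf '{"prompt": "%s", "seed": %d}\n' "$prompt" "$seed"
}

# Example (not executed here):
#   seed=$(new_seed); echo "seed=$seed" >> seeds.log
#   belt app run alibaba/qwen-image-2 --input "$(seeded_input 'Abstract geometric art in blue and gold' "$seed")"
```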
| Parameter | Type | Description |
|-----------|------|-------------|
| prompt | string | Required. What to generate or edit (max 800 chars) |
| reference_images | array | Input images for editing (1-3 images) |
| num_images | integer | Number of images to generate (1-6) |
| width | integer | Output width in pixels (512-2048) |
| height | integer | Output height in pixels (512-2048) |
| watermark | boolean | Add "Qwen-Image" watermark |
| negative_prompt | string | Content to avoid (max 500 chars) |
| prompt_extend | boolean | Enable prompt rewriting (default: true) |
| seed | integer | Random seed for reproducibility (0-2147483647) |
Size constraint: the total pixel count (width × height) must be between 512×512 (262,144) and 2048×2048 (4,194,304).
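The size limits can be checked locally before a request is sent. This is a minimal sketch based only on the limits stated above; `qwen_size_ok` is a hypothetical helper name, and the service may apply further rules that are not documented here.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of belt): check width/height against the
# documented limits. Returns 0 when the size is acceptable, 1 otherwise.
qwen_size_ok() {
  local w=$1 h=$2
  local min_side=512 max_side=2048
  # Accept plain positive integers only (no leading zeros, at most 5 digits).
  [[ $w =~ ^[1-9][0-9]{0,4}$ && $h =~ ^[1-9][0-9]{0,4}$ ]] || return 1
  # Per-side limits from the parameter table.
  (( w >= min_side && w <= max_side )) || return 1
  (( h >= min_side && h <= max_side )) || return 1
  # Total-pixel limit. It is implied by the per-side limits, but is kept
  # explicit to mirror the stated constraint.
  local pixels=$(( w * h ))
  (( pixels >= min_side * min_side && pixels <= max_side * max_side ))
}
```

A call such as `qwen_size_ok 1536 1024 && belt app run ...` skips the request when the size is out of range.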
| Field | Type | Description |
|-------|------|-------------|
| images | array | The generated or edited images (PNG format) |
| output_meta | object | Metadata with dimensions and count |
For Text Rendering (use Pro model):
- Quote the text to render, e.g. "Title: \"Hello World!\""
- Set prompt_extend: false for precise control

Styles: photorealistic, illustration, watercolor, oil painting, digital art, anime, 3D render
Composition: close-up, wide shot, aerial view, macro, portrait, landscape
Lighting: natural light, studio lighting, golden hour, dramatic shadows, neon
# 1. Generate sample input to see all options
belt app sample alibaba/qwen-image-2-pro --save input.json
# 2. Edit the prompt
# 3. Run
belt app run alibaba/qwen-image-2-pro --input input.json
| Feature | qwen-image-2 | qwen-image-2-pro |
|---------|--------------|------------------|
| Speed | Faster | Standard |
| Text Rendering | Good | Professional |
| Realism | Standard | Fine-grained |
| Semantic Adherence | Good | Enhanced |
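In a script, the choice between the two models can be reduced to a small lookup. The sketch below is illustrative only: `qwen_app_for` and its use-case keywords are hypothetical, while the app IDs come from the model table.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of belt): map a use case to an app ID.
# The keywords are illustrative; the app IDs are the ones listed in this doc.
qwen_app_for() {
  case $1 in
    fast|general) echo "alibaba/qwen-image-2" ;;      # faster, good text rendering
    text|poster)  echo "alibaba/qwen-image-2-pro" ;;  # professional text rendering
    *) echo "unknown use case: $1" >&2; return 1 ;;
  esac
}
```

For example, `belt app run "$(qwen_app_for poster)" --input input.json` would run the Pro model.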
# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli
# All image generation models
npx skills add inference-sh/skills@ai-image-generation
# Video generation (for image-to-video)
npx skills add inference-sh/skills@ai-video-generation
Browse all image apps: belt app list --category image