Skills/invokeai-image-gen/SKILL.md
Generate images using InvokeAI's local API. Use when asked to generate, create, or make images with InvokeAI, FLUX.2 Klein, Z-Image Turbo, FLUX, or SDXL models. Supports text-to-image generation, automatic model detection, image download, and parameter selection based on model architecture.
npx skillsauth add sammcj/agentic-coding invokeai-image-genInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Generate images via InvokeAI's REST API. Supports FLUX.2 Klein (default), Z-Image Turbo, FLUX.1, and SDXL.
Simply call the script with your prompt and the output file name:
python scripts/generate.py -p "A dramatic sunset over snow-capped mountains, warm orange light reflecting off a still alpine lake in the foreground. Soft clouds catch the fading light." -o sunset.png
If the user asks you to use a specific model, first find the model key, then use it in the command:
python scripts/generate.py --list-models | grep -i 'flux'
python scripts/generate.py -p "A tabby cat with bright green eyes sits on a weathered wooden windowsill, soft afternoon light streaming through lace curtains. Cosy, intimate mood." --model MODEL_KEY -o cat.png
| Option | Description |
|--------|-------------|
| --prompt, -p | Generation prompt (required) |
| --negative, -n | Negative prompt (SDXL only) |
| --model, -m | Model key (UUID) or partial name match |
| --width, -W / --height, -H | Dimensions |
| --steps, -s | Denoising steps |
| --cfg, -c | CFG scale |
| --guidance, -g | Guidance strength (FLUX.1 only) |
| --scheduler | Sampling scheduler |
| --seed | Random seed |
| --output, -o | Output path (default: invokeai-{seed}.png) |
| --list-models | List installed models |
| --json | JSON output |
Note: FLUX.2 Klein is the latest model which is used by default.
| Model | Steps | Guidance | CFG | Scheduler | |-------|-------|----------|-----|-----------| | FLUX.2 Klein | 4 | 3.5 | 1.0 | euler | | Z-Image Turbo | 9 | - | 1.0 | euler | | FLUX.1 dev | 28 | 3.5 | 1.0 | euler | | FLUX.1 Krea dev | 28 | 4.5 | 1.0 | euler | | FLUX.1 Kontext dev | 28 | 2.5 | 1.0 | euler | | FLUX.1 schnell | 4 | 0.0 | 1.0 | euler | | SDXL | 25 | - | 6.0 | dpmpp_2m_k | | SDXL Turbo | 8 | - | 1.0 | dpmpp_sde |
All models default to 1024x1024. FLUX requires dimensions divisible by 16, SDXL by 8.
Auto-priority: Klein > Z-Image > FLUX > SDXL
Detection by name/base:
Write prose, not keywords. Structure: Subject -> Setting -> Details -> Lighting -> Atmosphere
A weathered fisherman in his late sixties stands at the bow of a wooden boat,
wearing a salt-stained wool sweater. Golden hour sunlight filters through
morning mist, creating quiet determination and solitude.
Key techniques:
Good: "A woman with short blonde hair poses against a light neutral background wearing colourful earrings, resting her chin on her hand."
Bad: "woman, blonde, short hair, neutral background, earrings"
Append style tags: Style: Country chic. Mood: Serene, romantic.
| Issue | Solution |
|-------|----------|
| Connection refused | Check InvokeAI is running |
| Model not found | Use --list-models for valid keys |
| Dimensions error | FLUX: multiples of 16, SDXL: 8 |
| Black images (macOS) | Set precision: bfloat16 in invokeai.yaml |
If the script fails to find the URL or authentication token, you can set or ask the user to set environment variables:
export INVOKEAI_API_URL='http://localhost:9090'
export INVOKEAI_AUTH_TOKEN='your-token' # Optional
scripts/generate.py - Main generation scriptdevelopment
Use when answering questions from this machine-learning knowledge base. Triggers: questions about transformers, attention cost and efficiency, and long-context scaling; 'what do we know about attention', 'check the ML wiki'. Read-only querying of compiled knowledge; to add, update, supersede, lint, or audit, use the llm-wiki skill instead.
development
Use when building or maintaining a self-contained personal knowledge base (an LLM wiki) as plain markdown, optionally opened as an Obsidian vault. Triggers: ingesting sources into a wiki, querying wiki knowledge, linting wiki health, auditing article claims against their sources, superseding stale knowledge, 'add to wiki', or any mention of 'LLM wiki' or 'Karpathy wiki'.
tools
Provides guidance and tools for hardware design. Activate when using KiCAD, looking up electronic parts or designing PCBs.
testing
Grilling session that challenges your plan against the existing domain model, sharpens terminology, and updates documentation (CONTEXT.md, ADRs) inline as decisions crystallise.