skills/image-generation-workflow-engine/SKILL.md
Build image generation pipelines with Stable Diffusion, FLUX, ControlNet, LoRA, and ComfyUI workflows. Activate on: image generation pipeline, ComfyUI workflow, ControlNet, LoRA training, diffusion model. NOT for: video generation (ai-video-production-master), image classification (computer-vision-pipeline).
npx skillsauth add curiositech/windags-skills image-generation-workflow-engineInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Build production image generation pipelines with FLUX, Stable Diffusion 3.5, ControlNet, LoRA, and ComfyUI for automated creative workflows.
Activate on: "image generation pipeline", "ComfyUI workflow", "ControlNet conditioning", "LoRA training", "FLUX generation", "Stable Diffusion pipeline", "batch image generation", "img2img workflow", "inpainting pipeline"
NOT for: Video generation from images (ai-video-production-master), image classification or object detection (computer-vision-pipeline), or multimodal search embeddings (multimodal-embedding-generator)
| Domain | Technologies | Notes | |--------|-------------|-------| | Models | FLUX.1-dev/schnell, SD 3.5, SDXL, Kandinsky 3 | FLUX is 2025-2026 standard for quality | | Conditioning | ControlNet (canny, depth, pose, segmentation), IP-Adapter | Structural and style guidance | | Fine-Tuning | LoRA, DreamBooth, textual inversion | Custom concepts in 20 min on consumer GPU | | Workflows | ComfyUI, diffusers (Python), A1111 | ComfyUI for complex multi-step; diffusers for code | | APIs | Replicate, fal.ai, Together AI, HF Inference | Managed GPU, pay-per-image | | Local | qwen-image-mps (Apple Silicon), CUDA, ROCm | M4 Max: FLUX.1-schnell in 4-8 sec/image |
[Load Checkpoint] ──→ [CLIP Text Encode] ──→ [KSampler] ──→ [VAE Decode] ──→ [Save Image]
│ │ │
FLUX.1-dev positive + negative steps: 20-30
or SD 3.5 prompts with weights cfg: 3.5-7.5
scheduler: euler
│
[ControlNet Apply] (optional)
│
canny/depth/pose
from reference image
ComfyUI workflows are JSON-serializable. Store them in version control:
workflows/
├── txt2img-flux-base.json # Basic FLUX text-to-image
├── controlnet-canny-sd35.json # Canny edge guided generation
├── lora-character-flux.json # Character LoRA application
├── inpaint-background-swap.json # Background replacement
└── batch-product-shots.json # Automated product photography
# FLUX.1-dev with ControlNet conditioning
from diffusers import FluxPipeline, FluxControlNetPipeline
from diffusers.utils import load_image
import torch
# Basic text-to-image
pipe = FluxPipeline.from_pretrained(
"black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
).to("cuda")
image = pipe(
prompt="A cozy library with warm lighting, bookshelves floor to ceiling",
num_inference_steps=28,
guidance_scale=3.5,
width=1024, height=1024,
).images[0]
# Batch generation with parameter sweep
params = [
{"guidance_scale": 3.0, "num_inference_steps": 20},
{"guidance_scale": 3.5, "num_inference_steps": 28},
{"guidance_scale": 4.0, "num_inference_steps": 35},
]
for i, p in enumerate(params):
img = pipe(prompt=prompt, **p).images[0]
img.save(f"output/sweep_{i}_cfg{p['guidance_scale']}.png")
Training:
20-50 images of concept ──→ [kohya_ss / ai-toolkit] ──→ LoRA weights (.safetensors)
│
captioned with BLIP/Florence2
trained 1000-3000 steps
rank 16-32, alpha = rank
Application:
[Base Model] + [LoRA weights @ strength 0.6-0.9] ──→ [Generate with trigger word]
File organization:
loras/
├── character-alice-v2.safetensors # trigger: "alice_character"
├── style-watercolor-v1.safetensors # trigger: "watercolor_style"
└── product-shoe-v3.safetensors # trigger: "brandx_shoe"
tools
Building resilient distributed systems with circuit breakers, retries with full-jitter exponential backoff, retry budgets (per-request 3-attempt + per-client 10% ratio per Google SRE), deadline propagation, and the cascading-failure math (4 layers × 3 retries = 64x amplification). Grounded in Resilience4j, Microsoft Cloud Patterns, AWS Architecture Blog (Marc Brooker), and Google SRE Book.
testing
Designing HTTP cache headers that work correctly across browsers, CDNs, and shared proxies — `Cache-Control` directives per RFC 9111, `stale-while-revalidate` and `stale-if-error` per RFC 5861, the Vary header for varying responses, and surrogate keys for tag-based purging. Grounded in IETF RFCs and Cloudflare/Fastly docs.
development
Use when designing or fixing a Content Security Policy on a real site, choosing between nonce-based and hash-based CSP, adding strict-dynamic, debugging "Refused to execute inline script" errors, deploying CSP in report-only mode first, configuring report-to / report-uri, or auditing an existing policy for unsafe-inline / unsafe-eval / wildcards. Triggers: "CSP blocks legitimate inline script", strict-dynamic, nonce-{RANDOM}, sha256-{HASH}, object-src none, base-uri none, frame-ancestors, Trusted Types, X-Content-Security-Policy obsolete, report-only vs enforced. NOT for general HTTP security headers (HSTS, COOP/COEP), Trusted Types deep dive, CORS configuration, or building a WAF.
tools
Choosing and operating an HTTP API versioning strategy that doesn't break clients — Stripe's date-based pinned versions, the Deprecation/Sunset header pair (RFC 9745 + RFC 8594), URI vs header vs media-type approaches, and the version-transformer pattern. Grounded in Stripe's published architecture and IETF RFCs.