Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

curiositech/image-generation-workflow-engine

Name: image-generation-workflow-engine
Author: curiositech

skills/image-generation-workflow-engine/SKILL.md

npx skillsauth add curiositech/windags-skills image-generation-workflow-engine

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Image Generation Workflow Engine

Build production image generation pipelines with FLUX, Stable Diffusion 3.5, ControlNet, LoRA, and ComfyUI for automated creative workflows.

Activation Triggers

Activate on: "image generation pipeline", "ComfyUI workflow", "ControlNet conditioning", "LoRA training", "FLUX generation", "Stable Diffusion pipeline", "batch image generation", "img2img workflow", "inpainting pipeline"

NOT for: Video generation from images (ai-video-production-master), image classification or object detection (computer-vision-pipeline), or multimodal search embeddings (multimodal-embedding-generator)

Quick Start

Choose model — FLUX.1-dev for quality, FLUX.1-schnell for speed, SD 3.5 for ControlNet ecosystem, SDXL for LoRA abundance.
Design workflow — Text-to-image (simplest), img2img (style transfer), ControlNet (structural guidance), inpainting (targeted edits).
Build pipeline — ComfyUI for visual node graphs, diffusers library for code-first, or API services (Replicate, fal.ai) for managed.
Add conditioning — ControlNet (canny, depth, pose), IP-Adapter (style transfer), LoRA (fine-tuned concepts).
Automate — Batch generation with parameter sweeps, quality filtering, and output organization.

Core Capabilities

| Domain | Technologies | Notes | |--------|-------------|-------| | Models | FLUX.1-dev/schnell, SD 3.5, SDXL, Kandinsky 3 | FLUX is 2025-2026 standard for quality | | Conditioning | ControlNet (canny, depth, pose, segmentation), IP-Adapter | Structural and style guidance | | Fine-Tuning | LoRA, DreamBooth, textual inversion | Custom concepts in 20 min on consumer GPU | | Workflows | ComfyUI, diffusers (Python), A1111 | ComfyUI for complex multi-step; diffusers for code | | APIs | Replicate, fal.ai, Together AI, HF Inference | Managed GPU, pay-per-image | | Local | qwen-image-mps (Apple Silicon), CUDA, ROCm | M4 Max: FLUX.1-schnell in 4-8 sec/image |

Architecture Patterns

Pattern 1: ComfyUI Production Pipeline

[Load Checkpoint] ──→ [CLIP Text Encode] ──→ [KSampler] ──→ [VAE Decode] ──→ [Save Image]
       │                      │                    │
   FLUX.1-dev          positive + negative     steps: 20-30
   or SD 3.5           prompts with weights    cfg: 3.5-7.5
                                                scheduler: euler
                                                    │
                                    [ControlNet Apply] (optional)
                                           │
                                    canny/depth/pose
                                    from reference image

ComfyUI workflows are JSON-serializable. Store them in version control:

workflows/
├── txt2img-flux-base.json          # Basic FLUX text-to-image
├── controlnet-canny-sd35.json      # Canny edge guided generation
├── lora-character-flux.json        # Character LoRA application
├── inpaint-background-swap.json    # Background replacement
└── batch-product-shots.json        # Automated product photography

Pattern 2: Code-First with diffusers

# FLUX.1-dev with ControlNet conditioning
from diffusers import FluxPipeline, FluxControlNetPipeline
from diffusers.utils import load_image
import torch

# Basic text-to-image
pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
).to("cuda")

image = pipe(
    prompt="A cozy library with warm lighting, bookshelves floor to ceiling",
    num_inference_steps=28,
    guidance_scale=3.5,
    width=1024, height=1024,
).images[0]

# Batch generation with parameter sweep
params = [
    {"guidance_scale": 3.0, "num_inference_steps": 20},
    {"guidance_scale": 3.5, "num_inference_steps": 28},
    {"guidance_scale": 4.0, "num_inference_steps": 35},
]
for i, p in enumerate(params):
    img = pipe(prompt=prompt, **p).images[0]
    img.save(f"output/sweep_{i}_cfg{p['guidance_scale']}.png")

Pattern 3: LoRA Training and Application

Training:
  20-50 images of concept ──→ [kohya_ss / ai-toolkit] ──→ LoRA weights (.safetensors)
                                    │
                              captioned with BLIP/Florence2
                              trained 1000-3000 steps
                              rank 16-32, alpha = rank

Application:
  [Base Model] + [LoRA weights @ strength 0.6-0.9] ──→ [Generate with trigger word]

File organization:
  loras/
  ├── character-alice-v2.safetensors    # trigger: "alice_character"
  ├── style-watercolor-v1.safetensors   # trigger: "watercolor_style"
  └── product-shoe-v3.safetensors       # trigger: "brandx_shoe"

Anti-Patterns

CFG scale too high — FLUX works best at 3.0-4.0 CFG. Using 7-12 (old SD habits) produces oversaturated, artifact-heavy images. Check model-specific recommendations.
Ignoring negative prompts where supported — SD 3.5 and SDXL benefit from negative prompts ("blurry, low quality, distorted"). FLUX does not use negatives the same way.
Training LoRA on uncaptioned data — Images without accurate captions produce LoRAs that activate unpredictably. Always caption training data with BLIP-2 or Florence-2.
No seed tracking — Without recording seeds, you cannot reproduce good results or iterate systematically. Always log seed, prompt, and all parameters.
Single-image evaluation — Generating one image and judging the model is like evaluating a coin flip from one toss. Generate 4-8 images per prompt and evaluate the distribution.

Quality Checklist

[ ] Model chosen based on use case: FLUX for quality, schnell for speed, SD 3.5 for ControlNet
[ ] CFG scale appropriate for model (FLUX: 3.0-4.0, SD 3.5: 4.0-7.5, SDXL: 7.0-12.0)
[ ] Seeds logged for every generation (reproducibility)
[ ] ControlNet conditioning validated: reference image preprocessed correctly
[ ] LoRA training data captioned with automated tool (BLIP-2, Florence-2)
[ ] Batch generation used for evaluation (minimum 4 images per prompt)
[ ] Output organized with metadata (prompt, seed, model, params in filename or sidecar)
[ ] ComfyUI workflows version-controlled as JSON
[ ] GPU memory managed: model offloading for large models, attention slicing for VRAM limits
[ ] Generation latency profiled: target < 10 sec for interactive, < 60 sec for batch

curiositech/image-generation-workflow-engine

skills/image-generation-workflow-engine/SKILL.md

Build image generation pipelines with Stable Diffusion, FLUX, ControlNet, LoRA, and ComfyUI workflows. Activate on: image generation pipeline, ComfyUI workflow, ControlNet, LoRA training, diffusion model. NOT for: video generation (ai-video-production-master), image classification (computer-vision-pipeline).

development

Updated Apr 4, 2026

$ install --global

skillsauth

npx skillsauth add curiositech/windags-skills image-generation-workflow-engine

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 4, 2026, 2:15 PM7.7s1 file scanned

SKILL.md

license:: Apache-2.0
name:: image-generation-workflow-engine
description:: Build image generation pipelines with Stable Diffusion, FLUX, ControlNet, LoRA, and ComfyUI workflows. Activate on: image generation pipeline, ComfyUI workflow, ControlNet, LoRA training, diffusion model. NOT for: video generation (ai-video-production-master), image classification (computer-vision-pipeline).
allowed-tools:: Read,Write,Edit,Bash(python:*,pip:*,npm:*,npx:*)
category:: AI & Machine Learning
- skill:: multimodal-embedding-generator
reason:: CLIP/SigLIP embeddings guide generation and enable style search

Image Generation Workflow Engine

Build production image generation pipelines with FLUX, Stable Diffusion 3.5, ControlNet, LoRA, and ComfyUI for automated creative workflows.

Activation Triggers

Quick Start

Choose model — FLUX.1-dev for quality, FLUX.1-schnell for speed, SD 3.5 for ControlNet ecosystem, SDXL for LoRA abundance.
Design workflow — Text-to-image (simplest), img2img (style transfer), ControlNet (structural guidance), inpainting (targeted edits).
Build pipeline — ComfyUI for visual node graphs, diffusers library for code-first, or API services (Replicate, fal.ai) for managed.
Add conditioning — ControlNet (canny, depth, pose), IP-Adapter (style transfer), LoRA (fine-tuned concepts).
Automate — Batch generation with parameter sweeps, quality filtering, and output organization.

Core Capabilities

Architecture Patterns

Pattern 1: ComfyUI Production Pipeline

[Load Checkpoint] ──→ [CLIP Text Encode] ──→ [KSampler] ──→ [VAE Decode] ──→ [Save Image]
       │                      │                    │
   FLUX.1-dev          positive + negative     steps: 20-30
   or SD 3.5           prompts with weights    cfg: 3.5-7.5
                                                scheduler: euler
                                                    │
                                    [ControlNet Apply] (optional)
                                           │
                                    canny/depth/pose
                                    from reference image

ComfyUI workflows are JSON-serializable. Store them in version control:

workflows/
├── txt2img-flux-base.json          # Basic FLUX text-to-image
├── controlnet-canny-sd35.json      # Canny edge guided generation
├── lora-character-flux.json        # Character LoRA application
├── inpaint-background-swap.json    # Background replacement
└── batch-product-shots.json        # Automated product photography

Pattern 2: Code-First with diffusers

# FLUX.1-dev with ControlNet conditioning
from diffusers import FluxPipeline, FluxControlNetPipeline
from diffusers.utils import load_image
import torch

# Basic text-to-image
pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
).to("cuda")

image = pipe(
    prompt="A cozy library with warm lighting, bookshelves floor to ceiling",
    num_inference_steps=28,
    guidance_scale=3.5,
    width=1024, height=1024,
).images[0]

# Batch generation with parameter sweep
params = [
    {"guidance_scale": 3.0, "num_inference_steps": 20},
    {"guidance_scale": 3.5, "num_inference_steps": 28},
    {"guidance_scale": 4.0, "num_inference_steps": 35},
]
for i, p in enumerate(params):
    img = pipe(prompt=prompt, **p).images[0]
    img.save(f"output/sweep_{i}_cfg{p['guidance_scale']}.png")

Pattern 3: LoRA Training and Application

Training:
  20-50 images of concept ──→ [kohya_ss / ai-toolkit] ──→ LoRA weights (.safetensors)
                                    │
                              captioned with BLIP/Florence2
                              trained 1000-3000 steps
                              rank 16-32, alpha = rank

Application:
  [Base Model] + [LoRA weights @ strength 0.6-0.9] ──→ [Generate with trigger word]

File organization:
  loras/
  ├── character-alice-v2.safetensors    # trigger: "alice_character"
  ├── style-watercolor-v1.safetensors   # trigger: "watercolor_style"
  └── product-shoe-v3.safetensors       # trigger: "brandx_shoe"

Anti-Patterns

CFG scale too high — FLUX works best at 3.0-4.0 CFG. Using 7-12 (old SD habits) produces oversaturated, artifact-heavy images. Check model-specific recommendations.
Ignoring negative prompts where supported — SD 3.5 and SDXL benefit from negative prompts ("blurry, low quality, distorted"). FLUX does not use negatives the same way.
Training LoRA on uncaptioned data — Images without accurate captions produce LoRAs that activate unpredictably. Always caption training data with BLIP-2 or Florence-2.
No seed tracking — Without recording seeds, you cannot reproduce good results or iterate systematically. Always log seed, prompt, and all parameters.
Single-image evaluation — Generating one image and judging the model is like evaluating a coin flip from one toss. Generate 4-8 images per prompt and evaluate the distribution.

Quality Checklist

[ ] Model chosen based on use case: FLUX for quality, schnell for speed, SD 3.5 for ControlNet
[ ] CFG scale appropriate for model (FLUX: 3.0-4.0, SD 3.5: 4.0-7.5, SDXL: 7.0-12.0)
[ ] Seeds logged for every generation (reproducibility)
[ ] ControlNet conditioning validated: reference image preprocessed correctly
[ ] LoRA training data captioned with automated tool (BLIP-2, Florence-2)
[ ] Batch generation used for evaluation (minimum 4 images per prompt)
[ ] Output organized with metadata (prompt, seed, model, params in filename or sidecar)
[ ] ComfyUI workflows version-controlled as JSON
[ ] GPU memory managed: model offloading for large models, attention slicing for VRAM limits
[ ] Generation latency profiled: target < 10 sec for interactive, < 60 sec for batch

Related Skills

curiositech/revisiting-interview-data-analysing-turn

data-ai

VerifiedTrustedCommunity

license: Apache-2.0 NOT for unrelated tasks outside this domain.

8SKILL.mdUpdated Jul 19, 2026

curiositech/revisiting-interview-data-analysing-turn

curiositech/redis-patterns-expert

development

VerifiedTrustedCommunity

Use when designing caching strategies (cache-aside, write-through, write-behind), implementing distributed locks, building rate limiters, leaderboards, real-time streams (XADD/consumer groups), pub/sub, or tuning eviction policies. Triggers: thundering-herd on cache miss, dogpile on key expiry, Redlock vs SET-NX-PX choice, sliding-window rate limiter, hot-key on a single cluster slot, big-key blowup, MULTI/EXEC across slots, KEYS in production. NOT for Redis Cluster operations/admin (different domain), embedded KV (SQLite, leveldb), in-process LRU caches, or Memcached.

8SKILL.mdUpdated Jul 19, 2026

curiositech/redis-patterns-expert

curiositech/react-server-components-boundary

tools

VerifiedTrustedCommunity

Drawing the `'use client'` boundary correctly in React Server Components apps (Next.js App Router, RSC frameworks) — leaf-pushing, slot composition, serialization rules, and environment poisoning prevention. Grounded in react.dev and Next.js 16 docs.

8SKILL.mdUpdated Jul 19, 2026

curiositech/react-server-components-boundary

curiositech/rate-limiting-strategy

development

VerifiedTrustedCommunity

Use when designing rate limiting for an API, choosing between token bucket / sliding window / leaky bucket / fixed window, implementing it in Redis, deciding edge (Cloudflare/Upstash) vs origin enforcement, sizing per-user vs per-IP vs per-endpoint quotas, returning the right 429 response with Retry-After, or fixing the boundary-burst bug in fixed-window limiters. Triggers: 429 too many requests, INCR + EXPIRE, ZADD + ZREMRANGEBYSCORE + ZCARD, X-RateLimit-Remaining header, Cloudflare WAF rate limiting rules, Upstash @upstash/ratelimit, leaky bucket shaping vs policing, distributed rate limiter consistency. NOT for DDoS mitigation specifically (different scale), CAPTCHA / bot management, full WAF design, or per-user quota billing.

8SKILL.mdUpdated Jul 19, 2026

curiositech/rate-limiting-strategy

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/curiositech/windags-skills.git

# Copy into Claude Code skills folder (global)
cp -r windags-skills/skills/image-generation-workflow-engine ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

curiositech/windags-skills

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT