Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

scaryrawr/image-gen

Name: image-gen
Author: scaryrawr

plugins/ollama/skills/image-gen/SKILL.md

npx skillsauth add scaryrawr/scarypilot image-gen

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Ollama Image Generation

Generate images locally with a single helper that supports both Ollama CLI and REST backends. This feature is currently macOS-only and uses Ollama's experimental image generation support.

Installed Models

Before generating images, check which models are locally available by calling the Ollama API:

curl -s http://localhost:11434/api/tags | jq -r '.models[].name'

Important: If you see a quantized variant with a suffix (e.g. x/z-image-turbo:q4_K_M), use that exact ID rather than the base name.

Models

Z-Image Turbo (default)

6B parameter model from Alibaba's Tongyi Lab. Best for photorealistic images and bilingual (English/Chinese) text rendering. Apache 2.0 licensed.

FLUX.2 Klein

Black Forest Labs' fast image-generation model (4B and 9B sizes). Best for readable text in images, UI mockups, and typography-heavy designs.

4B: Apache 2.0 (commercial use OK)
9B: FLUX Non-Commercial License v2.1

Model Selection Guide

| Need | Recommended Model | | ----------------------------------- | ----------------------------------------- | | Photorealistic portraits/scenes | x/z-image-turbo | | Chinese text rendering | x/z-image-turbo | | Readable text in images (signs, UI) | x/flux2-klein | | Commercial use | x/z-image-turbo or x/flux2-klein (4B) | | General purpose | x/z-image-turbo |

Default to x/z-image-turbo unless the user has a specific need for text rendering in images.

Generating Images

Use the helper script (backend defaults to auto, which prefers CLI when available):

./scripts/generate-image.sh --prompt "Young woman in a cozy coffee shop, natural window lighting, wearing a cream knit sweater, holding a ceramic mug, soft bokeh background"

Choose a specific model and image size:

./scripts/generate-image.sh \
  --model x/flux2-klein \
  --size 512x512 \
  --output my-image.png \
  --prompt "A neon sign reading OPEN 24 HOURS in a rainy alley"

Use richer generation controls (CLI backend):

./scripts/generate-image.sh \
  --backend cli \
  --model x/flux2-klein \
  --width 1024 \
  --height 1024 \
  --steps 20 \
  --seed 42 \
  --negative-prompt "blurry, low quality, distorted" \
  --output detailed.png \
  --prompt "UI dashboard mockup with clean typography and clear labels"

Images are saved to the current working directory by default unless --output is provided. Always tell the user where the generated image was saved.

Backend Selection

| Backend | Description | | ------- | ----------- | | auto (default) | Prefers ollama run when available; falls back to REST | | cli | Uses ollama run directly (supports richer options) | | rest | Uses POST /v1/images/generations |

If requested model is missing, the script attempts ollama pull <model>.

Script Options

| Option | Default | Description | | ------ | ------- | ----------- | | --prompt | (required) | The text prompt | | --model | x/z-image-turbo | Ollama model name | | --size | 1024x1024 | Image dimensions (WxH) | | --width / --height | unset | Size aliases (must be used together) | | --output | image_YYYYMMDD_HHMMSS.png | Output file path | | --backend | auto | auto, cli, or rest | | --steps | unset | Denoising steps (CLI backend only) | | --seed | unset | Random seed (CLI backend only) | | --negative-prompt | unset | Negative prompt text (CLI backend only) |

Capability Matrix

| Capability | CLI backend | REST backend | | ---------- | ----------- | ------------ | | Basic prompt + model + size | ✅ | ✅ | | steps / seed / negative prompt | ✅ | ❌ | | width / height aliases | ✅ | ✅ (normalized to size) |

If --backend rest is forced with CLI-only options, the script fails with an actionable error.

Direct curl Example

You can still call the REST API directly:

curl -s http://localhost:11434/v1/images/generations \
  -H "Content-Type: application/json" \
  -d '{
    "model": "x/z-image-turbo",
    "prompt": "A sunset over the ocean with dramatic clouds",
    "size": "1024x1024",
    "response_format": "b64_json"
  }' | jq -r '.data[0].b64_json' | base64 -d > output.png

Image Size

Control the output dimensions with --size (or --width + --height). Format is WxH. Smaller images generate faster and use less memory.

Common sizes: 512x512, 768x768, 1024x1024

Workflow

Check installed models — run curl -s http://localhost:11434/api/tags to list available models. Note any quantized variants (e.g. :q4_K_M) and use the exact ID.
Confirm the user's prompt and preferences (model, size, style, optional steps/seed/negative prompt)
Pick the starting model based on the request: default to x/z-image-turbo, but prefer x/flux2-klein for text-heavy images, signage, UI, or other typography-sensitive work
Run the helper script with --backend auto unless the user explicitly requests a backend (use the exact model ID from step 1, including any quantization suffix)
If advanced controls are requested, prefer --backend cli (or rely on auto when CLI is available)
Inspect the generated image before surfacing it to the user and compare it against the prompt, with extra attention to text accuracy, composition, subject fidelity, and obvious artifacts
If the image misses the prompt, refine the next attempt instead of immediately returning it: tighten the prompt, switch to a more suitable model, adjust size, or add CLI-only controls such as --negative-prompt, --steps, or --seed
Repeat the evaluate-and-refine loop for up to 4 total attempts, keeping track of which output best matches the request
Return the best image you produced, report the output file location, and note any meaningful limitation only if the best result still falls short of the prompt
If the user wants further changes after that, continue iterating with modified prompt, size, model, or advanced controls

Prompting Tips

Be specific and descriptive: include lighting, composition, style, and mood
For photorealistic results, mention camera settings (e.g., "shot on 35mm film", "soft bokeh")
For creative work, reference art styles (e.g., "watercolor", "surreal double exposure")
Keep prompts focused on a single subject or scene for best results

scaryrawr/image-gen

plugins/ollama/skills/image-gen/SKILL.md

Generate images from text prompts using Ollama's local image generation models.

2 stars

data-ai

Updated May 13, 2026

$ install --global

skillsauth

npx skillsauth add scaryrawr/scarypilot image-gen

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 13, 2026, 4:47 AM206.4s2 files scanned

SKILL.md

name:: image-gen
description:: Generate images from text prompts using Ollama's local image generation models.

Ollama Image Generation

Generate images locally with a single helper that supports both Ollama CLI and REST backends. This feature is currently macOS-only and uses Ollama's experimental image generation support.

Installed Models

Before generating images, check which models are locally available by calling the Ollama API:

curl -s http://localhost:11434/api/tags | jq -r '.models[].name'

Important: If you see a quantized variant with a suffix (e.g. x/z-image-turbo:q4_K_M), use that exact ID rather than the base name.

Models

Z-Image Turbo (default)

6B parameter model from Alibaba's Tongyi Lab. Best for photorealistic images and bilingual (English/Chinese) text rendering. Apache 2.0 licensed.

FLUX.2 Klein

Black Forest Labs' fast image-generation model (4B and 9B sizes). Best for readable text in images, UI mockups, and typography-heavy designs.

4B: Apache 2.0 (commercial use OK)
9B: FLUX Non-Commercial License v2.1

Model Selection Guide

Default to x/z-image-turbo unless the user has a specific need for text rendering in images.

Generating Images

Use the helper script (backend defaults to auto, which prefers CLI when available):

./scripts/generate-image.sh --prompt "Young woman in a cozy coffee shop, natural window lighting, wearing a cream knit sweater, holding a ceramic mug, soft bokeh background"

Choose a specific model and image size:

./scripts/generate-image.sh \
  --model x/flux2-klein \
  --size 512x512 \
  --output my-image.png \
  --prompt "A neon sign reading OPEN 24 HOURS in a rainy alley"

Use richer generation controls (CLI backend):

./scripts/generate-image.sh \
  --backend cli \
  --model x/flux2-klein \
  --width 1024 \
  --height 1024 \
  --steps 20 \
  --seed 42 \
  --negative-prompt "blurry, low quality, distorted" \
  --output detailed.png \
  --prompt "UI dashboard mockup with clean typography and clear labels"

Images are saved to the current working directory by default unless --output is provided. Always tell the user where the generated image was saved.

Backend Selection

If requested model is missing, the script attempts ollama pull <model>.

Script Options

Capability Matrix

If --backend rest is forced with CLI-only options, the script fails with an actionable error.

Direct curl Example

You can still call the REST API directly:

curl -s http://localhost:11434/v1/images/generations \
  -H "Content-Type: application/json" \
  -d '{
    "model": "x/z-image-turbo",
    "prompt": "A sunset over the ocean with dramatic clouds",
    "size": "1024x1024",
    "response_format": "b64_json"
  }' | jq -r '.data[0].b64_json' | base64 -d > output.png

Image Size

Control the output dimensions with --size (or --width + --height). Format is WxH. Smaller images generate faster and use less memory.

Common sizes: 512x512, 768x768, 1024x1024

Workflow

Check installed models — run curl -s http://localhost:11434/api/tags to list available models. Note any quantized variants (e.g. :q4_K_M) and use the exact ID.
Confirm the user's prompt and preferences (model, size, style, optional steps/seed/negative prompt)
Pick the starting model based on the request: default to x/z-image-turbo, but prefer x/flux2-klein for text-heavy images, signage, UI, or other typography-sensitive work
Run the helper script with --backend auto unless the user explicitly requests a backend (use the exact model ID from step 1, including any quantization suffix)
If advanced controls are requested, prefer --backend cli (or rely on auto when CLI is available)
Inspect the generated image before surfacing it to the user and compare it against the prompt, with extra attention to text accuracy, composition, subject fidelity, and obvious artifacts
If the image misses the prompt, refine the next attempt instead of immediately returning it: tighten the prompt, switch to a more suitable model, adjust size, or add CLI-only controls such as --negative-prompt, --steps, or --seed
Repeat the evaluate-and-refine loop for up to 4 total attempts, keeping track of which output best matches the request
Return the best image you produced, report the output file location, and note any meaningful limitation only if the best result still falls short of the prompt
If the user wants further changes after that, continue iterating with modified prompt, size, model, or advanced controls

Prompting Tips

Be specific and descriptive: include lighting, composition, style, and mood
For photorealistic results, mention camera settings (e.g., "shot on 35mm film", "soft bokeh")
For creative work, reference art styles (e.g., "watercolor", "surreal double exposure")
Keep prompts focused on a single subject or scene for best results

Related Skills

scaryrawr/worktrunk

testing

VerifiedTrustedCommunity

Manage parallel git worktrees with Worktrunk (`wt`) and enforce disk-fit preflight checks before creating new worktrees.

2SKILL.mdUpdated Apr 27, 2026

scaryrawr/ghostty

tools

VerifiedTrustedCommunity

Create Ghostty windows/tabs/splits and drive terminals with focus/input for multitasking workflows on macOS.

2SKILL.mdUpdated Apr 27, 2026

scaryrawr/init

testing

VerifiedTrustedCommunity

Quickly bootstrap repo-specific Copilot instructions with high signal and low context bloat.

2SKILL.mdUpdated Apr 27, 2026

scaryrawr/codespaces

tools

VerifiedTrustedCommunity

Connect to and interact with GitHub Codespaces. Manages connections via gh ado-codespaces (port forwarding, Azure auth), runs commands via gh cs ssh, invokes Copilot CLI remotely, and supports multiple codespaces.

2SKILL.mdUpdated Apr 27, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/scaryrawr/scarypilot.git

# Copy into Claude Code skills folder (global)
cp -r scarypilot/plugins/ollama/skills/image-gen ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

scaryrawr/scarypilot

2 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT