skills/ai-image-gen/SKILL.md
Generate images using AI (OpenAI GPT Image or Stability AI). Use when the user asks to generate an image, create an AI image, make an illustration, or produce artwork from a text prompt.
npx skillsauth add OpenClaudia/openclaudia-skills generate-imageInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Generate images from text prompts using OpenAI GPT Image (gpt-image-2 / gpt-image-1 / variants) or Stability AI (SD 3.5 Large).
~/.agents/tools/generate-image.py~/.agents/tools/.env (contains OPENAI_API_KEY and STABILITY_API_KEY)/v1/models 2026-05-15)| Model ID | Notes |
|---|---|
| gpt-image-2 | Flagship. Released 2026-04-21. Best prompt adherence, best photorealism. Default when image quality matters. |
| gpt-image-2-2026-04-21 | Pinned dated variant of gpt-image-2 |
| gpt-image-1.5 | Intermediate release between 1 and 2 |
| gpt-image-1-mini | Smaller/cheaper gpt-image-1 variant — use for batch/draft generation where cost matters |
| gpt-image-1 | Original gpt-image. Still works; superseded by gpt-image-2. |
| chatgpt-image-latest | Always-current alias of the model ChatGPT.com uses (currently gpt-image-2-class). Use when you want "whatever ChatGPT uses today" |
| dall-e-3 | Legacy fallback. Different quality semantics (standard/hd, not low/medium/high). |
Pass any of these to --model. The script branches on gpt-image* for the quality/format handling, so all gpt-image-* variants work out of the box.
Default in the script is still gpt-image-1 — pass --model gpt-image-2 (or set up an alias) when you want the flagship.
python ~/.agents/tools/generate-image.py \
--prompt "a sunset over mountains, oil painting style" \
--output ./sunset.png \
--model gpt-image-2 \
--quality high
python ~/.agents/tools/generate-image.py \
--prompt "a sunset over mountains, oil painting style" \
--output ./sunset.png
python ~/.agents/tools/generate-image.py \
--prompt "modern logo design for a tech company" \
--output ./logo.png \
--size 1024x1024 \
--quality high
python ~/.agents/tools/generate-image.py \
--prompt "abstract digital art with blue tones" \
--output ./banner.png \
--size 1536x1024
python ~/.agents/tools/generate-image.py \
--prompt "portrait of a futuristic city" \
--output ./city.png \
--size 1024x1536
python ~/.agents/tools/generate-image.py \
--prompt "a minimalist cat icon, flat design" \
--output ./icon.png \
--background transparent
python ~/.agents/tools/generate-image.py \
--prompt "watercolor painting of a garden" \
--output ./garden.png \
--provider stability
| Option | Description |
|--------|-------------|
| --prompt, -p | (Required) Text prompt describing the desired image |
| --output, -o | (Required) Output file path |
| --provider | openai (default) or stability |
| --size | Image size for OpenAI: 1024x1024 (default), 1536x1024 (wide), 1024x1536 (tall) |
| --quality | OpenAI quality: low, medium (default), or high |
| --background | OpenAI background: auto (default), transparent, or opaque |
| --model | Override model (default: gpt-image-1 for OpenAI, sd3.5-large for Stability) |
| Feature | OpenAI gpt-image-2 | OpenAI gpt-image-1 | Stability AI SD 3.5 | |---------|--------------------|--------------------|---------------------| | Released | 2026-04-21 | 2025 | — | | Prompt adherence | Best | Excellent | Good | | Size options | 1024x1024, 1536x1024, 1024x1536 | 1024x1024, 1536x1024, 1024x1536 | 1024x1024 | | Quality options | low, medium, high | low, medium, high | N/A | | Transparent bg | Yes | Yes | No | | Style | Photorealistic + artistic | Photorealistic + artistic | Artistic + photorealistic | | Cost per image (high) | Higher than 1 | Baseline | N/A |
requests (for API calls)Install if needed:
pip install requests
The script prints:
testing
Edit podcast audio — trim pre/post-show chat, remove filler words, cut silences, and enhance audio quality. Use when the user asks to edit a podcast, clean up audio, remove fillers, trim a recording, or improve voice quality.
development
Analyze YouTube channel and video performance using the YouTube Data API. Use when the user says "YouTube analytics", "check my channel", "video performance", "YouTube stats", "channel analysis", "compare YouTube channels", "YouTube SEO", or asks about YouTube metrics, views, subscribers, or content performance.
development
Create high-converting landing page copy and structure. Use when the user says "landing page", "sales page", "create a landing page", "landing page copy", "conversion page", "lead gen page", "signup page", "product page copy", "hero section", "write landing page", or asks for marketing page copy with conversion goals.
documentation
Generate a full SEO-optimized blog post. Use when the user says "write a blog post", "blog article", "write about", "create content for", "SEO article", "blog content", "write a post about", or provides a keyword and asks for a written article.