skills/gpt-image-skill/SKILL.md
Generate or edit images using OpenAI GPT Image API (gpt-image-2, gpt-image-1, etc). Triggers: "gpt image", "openai image", "generate image with openai", "draw image", "create image", "image generation", "AI drawing", "图片生成", "AI绘图", "生成图片", "画图". Use this skill whenever the user wants to generate or edit images and mentions OpenAI, GPT, or when OPENAI_API_KEY is available.
npx skillsauth add feiskyer/claude-code-settings gpt-image-skill
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Generate or edit images using OpenAI's GPT Image models through a bundled Python script.
- Configure OPENAI_API_KEY in ~/.gpt-image.env, or export OPENAI_API_KEY=<your-key>.
- Install dependencies with python3 -m pip install -r ./requirements.txt if not installed yet.
- The bundled script is ./gpt_image.py.

Ask the user for:
Run the script:
python3 ./gpt_image.py --prompt "description of image" --output "filename.png"
Show the user the saved image path when complete.
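The setup notes above say the key can live in ~/.gpt-image.env or be exported. The sketch below shows one way such a lookup could work. It is not taken from gpt_image.py: the KEY=VALUE file format, the precedence (environment first, then file), and the helper names parse_env_text and resolve_api_key are all assumptions made for illustration.

```python
import os
from pathlib import Path


def parse_env_text(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines (assumed format); skip blanks and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        # Tolerate an optional leading "export " as used in shell files.
        if line.startswith("export "):
            line = line[len("export "):]
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def resolve_api_key(environ=None, env_file=None):
    """Return OPENAI_API_KEY from the environment, else from the env file, else None."""
    environ = os.environ if environ is None else environ
    env_file = Path.home() / ".gpt-image.env" if env_file is None else Path(env_file)
    # Assumed precedence: an exported variable wins over the file.
    if environ.get("OPENAI_API_KEY"):
        return environ["OPENAI_API_KEY"]
    if env_file.is_file():
        return parse_env_text(env_file.read_text(encoding="utf-8")).get("OPENAI_API_KEY")
    return None
```

Calling resolve_api_key() with no arguments checks the real environment and ~/.gpt-image.env; a None result means neither source provided a key.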
Ask the user for:
Run with input images:
python3 ./gpt_image.py edit --prompt "editing instructions" --input image1.png image2.png --output "edited.png"
--model
- gpt-image-2 (default): Latest model with strong instruction following, text rendering, and broad world knowledge
- gpt-image-1.5: Mid-tier model
- gpt-image-1: First-generation GPT image model
- gpt-image-1-mini: Lightweight, faster generation

--size
- 1024x1024 (default): Square
- 1024x1536: Portrait (2:3)
- 1536x1024: Landscape (3:2)
- auto: Let the model decide

--quality
- auto (default): Model decides optimal quality
- high: Higher detail, slower
- medium: Balanced
- low: Fastest

--format
- png (default): Lossless
- jpeg: Smaller file size
- webp: Modern format, good compression

--background
- auto (default): Model decides
- transparent: Transparent background (png/webp only)
- opaque: Solid background

- --n <count>: Number of images to generate (default: 1)
- --output <filename>: Output filename (default: auto-generated)

python3 ./gpt_image.py --prompt "A serene mountain landscape at sunset with a lake"
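The option values listed above can be checked before the script is called, which catches combinations such as a transparent background with jpeg output. The following is a minimal sketch of such a check. The function name validate_options and its return shape are hypothetical, and the script itself may validate differently or not at all.

```python
# Allowed values copied from the option list in this document.
MODELS = {"gpt-image-2", "gpt-image-1.5", "gpt-image-1", "gpt-image-1-mini"}
SIZES = {"1024x1024", "1024x1536", "1536x1024", "auto"}
QUALITIES = {"auto", "high", "medium", "low"}
FORMATS = {"png", "jpeg", "webp"}
BACKGROUNDS = {"auto", "transparent", "opaque"}


def validate_options(model="gpt-image-2", size="1024x1024", quality="auto",
                     fmt="png", background="auto", n=1) -> list[str]:
    """Return a list of human-readable problems; an empty list means the options look valid."""
    problems = []
    for label, value, allowed in (
        ("model", model, MODELS),
        ("size", size, SIZES),
        ("quality", quality, QUALITIES),
        ("format", fmt, FORMATS),
        ("background", background, BACKGROUNDS),
    ):
        if value not in allowed:
            problems.append(f"unsupported {label}: {value!r}")
    # Transparent backgrounds are documented as png/webp only.
    if background == "transparent" and fmt not in {"png", "webp"}:
        problems.append("transparent background requires png or webp format")
    if n < 1:
        problems.append("n must be at least 1")
    return problems
```

With the defaults the function returns an empty list, matching the documented default values.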
python3 ./gpt_image.py \
--prompt "Modern minimalist logo for a tech startup" \
--size 1024x1024 \
--quality high \
--output "logo.png"
python3 ./gpt_image.py \
--prompt "Futuristic cityscape with flying cars" \
--size 1536x1024 \
--output "cityscape.png"
python3 ./gpt_image.py \
--prompt "A cute cartoon cat mascot" \
--background transparent \
--format png \
--output "mascot.png"
python3 ./gpt_image.py \
--prompt "Abstract art in the style of Kandinsky" \
--n 3 \
--output "art.png"
python3 ./gpt_image.py edit \
--prompt "Add a rainbow in the sky" \
--input photo.png \
--output "photo-with-rainbow.png"
python3 ./gpt_image.py edit \
--prompt "Create a gift basket containing all items shown" \
--input item1.png item2.png item3.png \
--output "gift-basket.png"
python3 ./gpt_image.py \
--prompt "Detailed portrait of a cat in watercolor style" \
--model gpt-image-1 \
--output "cat-portrait.png"
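The example commands above all follow the same shape, so an agent could assemble them programmatically rather than by string concatenation. This sketch builds the argument list using only flags that appear in this document. The helper name build_command is hypothetical, and whether the edit subcommand accepts every generation flag is an assumption that has not been checked against the script.

```python
def build_command(prompt, output=None, inputs=(), **options):
    """Build an argv list for ./gpt_image.py; passing inputs selects the edit subcommand."""
    cmd = ["python3", "./gpt_image.py"]
    if inputs:
        cmd.append("edit")
    cmd += ["--prompt", prompt]
    if inputs:
        cmd += ["--input", *inputs]
    # Only flags shown in this document are emitted, in a fixed order.
    for flag in ("model", "size", "quality", "format", "background", "n"):
        value = options.get(flag)
        if value is not None:
            cmd += [f"--{flag}", str(value)]
    if output:
        cmd += ["--output", output]
    return cmd
```

The resulting list can be passed to subprocess.run(cmd, check=True). Using a list instead of a shell string avoids quoting problems when the prompt contains quotes or spaces.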
If the script fails:
- Check that OPENAI_API_KEY is exported
- Check that OPENAI_API_BASE is correct

Tip: use high quality for final output and auto for quick iterations.
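The troubleshooting checks above can be run as a quick preflight before retrying the script. The sketch below is illustrative only. The function name preflight is hypothetical, and treating a "correct" OPENAI_API_BASE as any well-formed http(s) URL is an assumption, since the document does not say what value the script expects.

```python
from urllib.parse import urlparse


def preflight(environ) -> list[str]:
    """Return a list of likely configuration problems found in the given environment mapping."""
    problems = []
    if not environ.get("OPENAI_API_KEY"):
        problems.append("OPENAI_API_KEY is not set")
    base = environ.get("OPENAI_API_BASE")
    if base:
        parsed = urlparse(base)
        # Assumption: a usable base is an http(s) URL with a host.
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            problems.append(f"OPENAI_API_BASE does not look like a URL: {base!r}")
    return problems
```

Passing os.environ checks the current shell environment. A key stored only in ~/.gpt-image.env will be reported as not set, because this check looks at exported variables only.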