skills/gpt-image-1-5/SKILL.md
Generate and edit images using OpenAI's GPT Image 1.5 model. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports text-to-image generation, multi-image compositing, style transfer, and image editing with optional mask. DO NOT read the image file first - use this skill directly with the --input-image parameter.
npx skillsauth add tamtom/image-generation-skills gpt-image-1-5
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Generate new images or edit existing ones using OpenAI's GPT Image 1.5 model via the Images API.
Run the script using its absolute path (do NOT cd to the skill directory first):
Generate new image:
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "your image description" --filename "output.png" [--quality low|medium|high] [--size 1024x1024|1024x1536|1536x1024|auto] [--background transparent|opaque|auto] [--output-format png|webp|jpeg] [--output-compression 0-100] [--n 1-4] [--api-key KEY]
Generate multiple variations:
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "your image description" --filename "output.png" --n 4 --quality high
Edit existing image (without mask - full image edit):
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "editing instructions" --filename "output.png" --input-image "path/to/input.png" [--quality low|medium|high] [--input-fidelity high|low] [--output-format png|webp|jpeg] [--api-key KEY]
Edit with multiple reference images (compositing/style transfer):
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "combine the subject from first image with the style of second image" --filename "output.png" --input-image "subject.png" --input-image "style-ref.png" [--input-fidelity high] [--api-key KEY]
Edit existing image (with mask - precise inpainting):
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "what to put in masked area" --filename "output.png" --input-image "path/to/input.png" --mask "path/to/mask.png" [--api-key KEY]
Important: Always run from the user's current working directory so images are saved where the user is working, not in the skill directory.
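The generation flags listed above can be assembled programmatically. The following is a minimal sketch, not the skill's own code: `build_generate_command` is a hypothetical helper, and the allowed values are copied from the usage line for generating a new image. It only builds the argument list; actually launching it (for example with `subprocess.run(cmd, cwd=os.getcwd())`) is left to the caller so that images land in the user's working directory.

```python
import os

# Script path as written in the usage lines; expanded so the command is absolute.
SCRIPT = os.path.expanduser("~/.claude/skills/gpt-image-1-5/scripts/generate_image.py")

# Allowed values copied from the usage line in this document.
QUALITIES = {"low", "medium", "high"}
SIZES = {"1024x1024", "1024x1536", "1536x1024", "auto"}


def build_generate_command(prompt, filename, quality=None, size=None, n=None):
    """Return an argv list for a text-to-image run (hypothetical helper)."""
    if quality is not None and quality not in QUALITIES:
        raise ValueError(f"unsupported quality: {quality}")
    if size is not None and size not in SIZES:
        raise ValueError(f"unsupported size: {size}")
    if n is not None and not 1 <= n <= 4:
        raise ValueError("--n must be between 1 and 4")

    cmd = ["uv", "run", SCRIPT, "--prompt", prompt, "--filename", filename]
    if quality is not None:
        cmd += ["--quality", quality]
    if size is not None:
        cmd += ["--size", size]
    if n is not None:
        cmd += ["--n", str(n)]
    return cmd
```

Optional flags are omitted when not set, so the script's own defaults apply.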
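Map user requests to a --quality value: medium, low, or high.
Map user requests to a --size value: auto, 1024x1024, 1024x1536, or 1536x1024.
Map user requests to an --output-format value: png, webp, or jpeg.
- Transparent backgrounds need png (or webp).
- --output-compression applies only to the webp and jpeg output formats.
- The default format is png.
- Multiple images are saved as output-1.png, output-2.png, etc.
Map user requests to an image count: --n 4, --n 2, or a single image (1).
Map user requests that depend on preserving the input image to --input-fidelity high.
The script checks for the API key in this order:
1. --api-key argument (use if the user provided a key in chat)
2. OPENAI_API_KEY environment variable
If neither is available, the script exits with an error message.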
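The lookup order described above can be sketched in a few lines. This is an illustration under stated assumptions rather than the script's actual code: `resolve_api_key` is a hypothetical name, and the error text is invented for the example.

```python
import os


def resolve_api_key(cli_key=None, env=None):
    """Return the API key, preferring the --api-key value over the environment."""
    env = os.environ if env is None else env
    if cli_key:
        # 1. Key passed explicitly on the command line.
        return cli_key
    env_key = env.get("OPENAI_API_KEY")
    if env_key:
        # 2. Key taken from the environment.
        return env_key
    # Neither source is available: exit with an error message.
    raise SystemExit("No API key found: pass --api-key or set OPENAI_API_KEY")
```

Passing `env` as a parameter keeps the function easy to test without touching the real environment.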
Generate filenames with the pattern: yyyy-mm-dd-hh-mm-ss-name.{ext}
Format: {timestamp}-{descriptive-name}.{ext}
- Timestamp: yyyy-mm-dd-hh-mm-ss (24-hour format)
- Extension: matches --output-format (.png, .webp, .jpg)
- Name: if no descriptive name is clear, use a short random identifier (e.g., x9k2, a7b3)
Examples:
- 2025-12-17-14-23-05-japanese-garden.png
- 2025-12-17-15-30-12-sunset-mountains.webp
- 2025-12-17-16-45-33-robot.jpg
All editing uses the Images API (images.edit endpoint) with gpt-image-1.5.
When the user wants to modify an existing image without specifying exact regions:
- Use the --input-image parameter with the path to the image
When the user wants to edit specific regions:
- Use the --input-image parameter with the path to the image
- Use the --mask parameter with a PNG mask file
When the user wants to combine elements from multiple images or apply a style from one image to another:
- Pass --input-image multiple times (e.g., --input-image "subject.png" --input-image "style.png")
- Add --input-fidelity high when identity/detail preservation matters
Common editing tasks: add/remove elements, change style, adjust colors, replace backgrounds, composite images, style transfer, virtual try-on, text translation in images.
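The three editing modes above differ only in which flags are passed. As a rough sketch, a hypothetical helper `build_edit_args` could map them onto the flags named in this document (--input-image, --mask, --input-fidelity). The PNG check on the mask is an assumption drawn from the "PNG mask file" wording, not a confirmed behavior of the script.

```python
def build_edit_args(prompt, filename, input_images, mask=None, high_fidelity=False):
    """Return script arguments for an edit run (hypothetical helper)."""
    if not input_images:
        raise ValueError("editing needs at least one --input-image")
    if mask is not None and not mask.lower().endswith(".png"):
        # Assumption: the mask is expected to be a PNG file.
        raise ValueError("mask should be a PNG file")

    args = ["--prompt", prompt, "--filename", filename]
    for image in input_images:
        # Repeat the flag once per image; order matters for prompts that
        # refer to "the first image" and "the second image".
        args += ["--input-image", image]
    if mask is not None:
        args += ["--mask", mask]
    if high_fidelity:
        args += ["--input-fidelity", "high"]
    return args
```

For example, `build_edit_args("swap the style", "out.png", ["subject.png", "style.png"], high_fidelity=True)` yields the arguments for a two-image composite with identity preservation.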
For generation: Pass the user's image description as-is to --prompt. Only rework it if it is clearly insufficient.
For editing: Pass the editing instructions in --prompt (e.g., "add a rainbow in the sky", "make it look like a watercolor painting").
For multi-image: Reference images by position (e.g., "combine the person from the first image with the background of the second image").
Preserve the user's creative intent in all cases.
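The filename convention described earlier ({timestamp}-{descriptive-name}.{ext}) can be sketched as a small helper. Both function names are hypothetical, and the rule for turning a description into a lowercase hyphenated name is an assumption inferred from the examples. `numbered_variants` only mirrors the name-1.ext, name-2.ext pattern used for multiple images; it does not claim to be how the script numbers its files.

```python
from datetime import datetime

# Extension per --output-format value, following the examples in this document.
EXTENSIONS = {"png": ".png", "webp": ".webp", "jpeg": ".jpg"}


def make_filename(name, output_format="png", now=None):
    """Build yyyy-mm-dd-hh-mm-ss-name.ext using a 24-hour timestamp."""
    now = datetime.now() if now is None else now
    stamp = now.strftime("%Y-%m-%d-%H-%M-%S")
    # Assumption: descriptive names are lowercase words joined by hyphens.
    slug = "-".join(name.lower().split())
    return f"{stamp}-{slug}{EXTENSIONS[output_format]}"


def numbered_variants(filename, count):
    """List the name-1.ext, name-2.ext, ... names for a multi-image run."""
    stem, dot, ext = filename.rpartition(".")
    return [f"{stem}-{i}{dot}{ext}" for i in range(1, count + 1)]
```

Accepting `now` as a parameter makes the timestamp deterministic in tests.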
The saved file's extension follows --output-format. Multiple images are saved as name-1.ext, name-2.ext, etc.
Generate new image:
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "A serene Japanese garden with cherry blossoms" --filename "2025-12-17-14-23-05-japanese-garden.png" --quality high --size 1536x1024
Generate with transparent background:
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "A cute cartoon cat mascot" --filename "2025-12-17-14-25-30-cat-mascot.png" --background transparent --quality high
Generate multiple variations as compressed webp:
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "Minimalist logo for a coffee shop" --filename "2025-12-17-14-26-00-coffee-logo.webp" --n 4 --output-format webp --output-compression 80 --quality high
Edit existing image (full image):
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "make the sky more dramatic with storm clouds" --filename "2025-12-17-14-27-00-dramatic-sky.png" --input-image "original-photo.jpg" --quality high
Edit with identity preservation:
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "change the outfit to a red dress" --filename "2025-12-17-14-28-00-red-dress.png" --input-image "portrait.png" --input-fidelity high --quality high
Composite from multiple images:
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "place the person from the first image into the beach scene from the second image" --filename "2025-12-17-14-29-00-beach-composite.png" --input-image "person.png" --input-image "beach.png" --input-fidelity high
Edit with mask (inpainting):
uv run ~/.claude/skills/gpt-image-1-5/scripts/generate_image.py --prompt "a flamingo swimming" --filename "2025-12-17-14-30-00-lounge-flamingo.png" --input-image "lounge.png" --mask "mask.png"