Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

feiskyer/nanobanana-skill

Name: nanobanana-skill
Author: feiskyer

skills/nanobanana-skill/SKILL.md

npx skillsauth add feiskyer/codex-settings nanobanana-skill

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Nanobanana Image Skill

Use the bundled nanobanana.py tool to generate or edit images with Gemini image models. The default path now targets Nano Banana 2 (gemini-3.1-flash-image-preview) and turns on thinking summaries plus Google Search grounding by default, because those defaults are the main reason to use this skill instead of a generic image prompt.

When to use this skill

Use it for:

Creating a new image from text
Editing or remixing one or more existing images
Combining several reference images into one composition
Requests where factual freshness matters and image grounding should use live web data
Requests that need strong text rendering, layout reasoning, or more deliberate composition

Do not use it for:

Non-image tasks
Image work that explicitly must use another provider or another installed image skill

Requirements

GEMINI_API_KEY must exist in ~/.nanobanana.env or the shell environment.
Python dependencies from requirements.txt must be installed.
The executable is the nanobanana.py file in this same skill directory. Resolve its absolute path once before running it.

Example env file:

GEMINI_API_KEY=sk-dummy

Default behavior

Unless the user explicitly asks otherwise, prefer these defaults:

Model: gemini-3.1-flash-image-preview
Search grounding: enabled
Thinking summaries: enabled
Thinking level: high
Resolution: 1K
Aspect ratio: leave unspecified unless the user clearly wants a shape

Leaving aspect ratio unspecified is usually better for edits because Gemini can match the input image shape. For text-only generation, pick an aspect ratio only when the user implies a format such as poster, square post, banner, phone wallpaper, or ultrawide hero image.

Workflow

1. Clarify only the missing constraints

Ask only for details that materially affect the result:

The image brief or editing instruction
Any input/reference images to use
Target format or aspect ratio if implied by the use case
Output filename if the user cares where it lands

Do not force the user to choose a model, search mode, or thinking mode unless they asked for that level of control. The latest Nanobanana 2 path is already the default.

2. Choose the right mode

Use generate when there are no input images.

Use edit/composite when there are one or more input images. Nanobanana 2 can mix multiple references, so do not artificially limit the task to a single image if the user is clearly asking for a blend, lineup, storyboard, or consistency pass.

3. Run the bundled tool

Use the script beside this skill file.

Basic generation:

python3 /absolute/path/to/nanobanana.py \
  --prompt "Create a high-end coffee bag package design with tactile paper texture and clear typography" \
  --output /absolute/path/to/output/package.png

Editing or compositing:

python3 /absolute/path/to/nanobanana.py \
  --prompt "Turn these product photos into a clean 4:5 ecommerce hero image with a soft studio shadow and subtle headline area" \
  --input /absolute/path/to/ref1.png /absolute/path/to/ref2.png \
  --aspect-ratio 4:5 \
  --output /absolute/path/to/output/hero.png

Grounded generation with saved text metadata:

python3 /absolute/path/to/nanobanana.py \
  --prompt "Use Google Search to ground an editorial illustration about the most recent lunar mission and create a clean magazine cover concept" \
  --aspect-ratio 2:3 \
  --text-output /absolute/path/to/output/cover.txt \
  --metadata-output /absolute/path/to/output/cover.json \
  --output /absolute/path/to/output/cover.png

4. Return the result clearly

Always tell the user:

The saved image path or paths
Whether search grounding stayed enabled
Whether text/thought summaries were saved anywhere
Any relevant limitation or warning from the run

If the model returns text but no image, report that plainly and suggest a more explicit image-focused prompt instead of pretending the run succeeded.

Recommended options

Models

gemini-3.1-flash-image-preview: default. Best default for fast, high-volume image generation and editing with Nanobanana 2 features.
gemini-3-pro-image-preview: slower, but a good override for very detail-heavy or typography-sensitive work.
gemini-2.5-flash-image: legacy fallback if the user specifically wants the older Nanobanana model.

Aspect ratios

Supported ratios:

1:1
1:4
1:8
2:3
3:2
3:4
4:1
4:3
4:5
5:4
8:1
9:16
16:9
21:9

Quick picks:

1:1 for logos, icons, thumbnails, and general social posts
4:5 for feed posts and product cards
2:3 for posters and book-cover style work
9:16 for stories, shorts, and phone wallpaper
16:9 or 21:9 for slides, banners, and desktop hero art
1:4, 4:1, 1:8, 8:1 for very tall or very wide experimental layouts now supported by Nanobanana 2

Resolution

512px: Nanobanana 2 only. Best for quick ideation.
1K: default. Good tradeoff for most requests.
2K: use for polished deliverables.
4K: use when the user explicitly needs a high-resolution final.

Thinking and search

Keep search grounding on by default when the request could benefit from live facts, current events, or real product references.
Keep thinking summaries on by default because Nanobanana 2 often composes better on multi-constraint tasks.
Lower --thinking-level to low or minimal when the user prioritizes latency over refinement.
Disable search only when the user wants a purely imaginative result or explicitly requests no web grounding.

Prompting guidance

Good Nanobanana prompts are direct production briefs, not vague art wishes. Include:

Subject
Visual style
Composition or camera framing
Required text if any
Output use case
Constraints such as brand colors, empty space, or realism level

Prefer prompts like:

Create a premium sparkling water can advertisement. Use a cold studio product-photo look, silver highlights, condensation droplets, and a clean dark-teal background. Leave negative space in the upper-right for headline copy.

Instead of:

make a cool drink ad

For edits, tell the model what to preserve and what to change:

Keep the shoe silhouette and logo placement intact. Replace the background with a bright outdoor basketball court, add dynamic afternoon shadows, and keep the image looking like a real sports campaign photo.

Nanobanana 2 capabilities to lean on

Multi-reference composition with many input images
Better grounded visuals with Google Search and Google Image Search support behind the built-in search tool
Wider aspect-ratio support, including very tall and very wide outputs
512px fast ideation output in addition to 1K, 2K, and 4K
Stronger iterative reasoning via Gemini 3 thinking controls

Error handling

If the run fails:

Check ~/.nanobanana.env and confirm GEMINI_API_KEY is present.
Confirm each input image path exists and is readable.
Confirm the output directory is writable.
If a feature looks unsupported, retry with gemini-3.1-flash-image-preview first.
If the response contains only text, rewrite the prompt so the image deliverable is explicit.

Examples

Fast ideation

python3 /absolute/path/to/nanobanana.py \
  --prompt "Create three-dimensional sticker-style fruit mascots on white" \
  --resolution 512px \
  --output /absolute/path/to/output/stickers.png

Grounded current-events visual

python3 /absolute/path/to/nanobanana.py \
  --prompt "Use Google Search to ground a newspaper-style illustration about the latest Mars mission and create a restrained front-page visual" \
  --aspect-ratio 3:2 \
  --output /absolute/path/to/output/mars.png

Multi-reference composite

python3 /absolute/path/to/nanobanana.py \
  --prompt "Create a single brand moodboard from these references. Keep the ceramic texture from the first image, the palette from the second, and the lighting mood from the third." \
  --input /absolute/path/to/a.png /absolute/path/to/b.png /absolute/path/to/c.png \
  --aspect-ratio 16:9 \
  --output /absolute/path/to/output/moodboard.png

feiskyer/nanobanana-skill

skills/nanobanana-skill/SKILL.md

Generate, remix, or edit images with Nanobanana / Nano Banana 2 through the bundled Gemini CLI wrapper. Use this whenever the user wants AI image generation or editing, especially for reference-image composition, character consistency, grounded visuals that may need live web search, style transfer, marketing graphics, product mockups, social assets, or when they explicitly mention Nanobanana, Gemini image models, Google image generation, AI drawing, 图片生成, AI绘图, 图片编辑, or 生成图片.

170 stars

tools

Updated Apr 4, 2026

$ install --global

skillsauth

npx skillsauth add feiskyer/codex-settings nanobanana-skill

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 4, 2026, 10:28 PM20.1s3 files scanned

SKILL.md

name:: nanobanana-skill
description:: Generate, remix, or edit images with Nanobanana / Nano Banana 2 through the bundled Gemini CLI wrapper. Use this whenever the user wants AI image generation or editing, especially for reference-image composition, character consistency, grounded visuals that may need live web search, style transfer, marketing graphics, product mockups, social assets, or when they explicitly mention Nanobanana, Gemini image models, Google image generation, AI drawing, 图片生成, AI绘图, 图片编辑, or 生成图片.

Nanobanana Image Skill

When to use this skill

Use it for:

Creating a new image from text
Editing or remixing one or more existing images
Combining several reference images into one composition
Requests where factual freshness matters and image grounding should use live web data
Requests that need strong text rendering, layout reasoning, or more deliberate composition

Do not use it for:

Non-image tasks
Image work that explicitly must use another provider or another installed image skill

Requirements

GEMINI_API_KEY must exist in ~/.nanobanana.env or the shell environment.
Python dependencies from requirements.txt must be installed.
The executable is the nanobanana.py file in this same skill directory. Resolve its absolute path once before running it.

Example env file:

GEMINI_API_KEY=sk-dummy

Default behavior

Unless the user explicitly asks otherwise, prefer these defaults:

Model: gemini-3.1-flash-image-preview
Search grounding: enabled
Thinking summaries: enabled
Thinking level: high
Resolution: 1K
Aspect ratio: leave unspecified unless the user clearly wants a shape

Workflow

1. Clarify only the missing constraints

Ask only for details that materially affect the result:

The image brief or editing instruction
Any input/reference images to use
Target format or aspect ratio if implied by the use case
Output filename if the user cares where it lands

Do not force the user to choose a model, search mode, or thinking mode unless they asked for that level of control. The latest Nanobanana 2 path is already the default.

2. Choose the right mode

Use generate when there are no input images.

3. Run the bundled tool

Use the script beside this skill file.

Basic generation:

python3 /absolute/path/to/nanobanana.py \
  --prompt "Create a high-end coffee bag package design with tactile paper texture and clear typography" \
  --output /absolute/path/to/output/package.png

Editing or compositing:

python3 /absolute/path/to/nanobanana.py \
  --prompt "Turn these product photos into a clean 4:5 ecommerce hero image with a soft studio shadow and subtle headline area" \
  --input /absolute/path/to/ref1.png /absolute/path/to/ref2.png \
  --aspect-ratio 4:5 \
  --output /absolute/path/to/output/hero.png

Grounded generation with saved text metadata:

python3 /absolute/path/to/nanobanana.py \
  --prompt "Use Google Search to ground an editorial illustration about the most recent lunar mission and create a clean magazine cover concept" \
  --aspect-ratio 2:3 \
  --text-output /absolute/path/to/output/cover.txt \
  --metadata-output /absolute/path/to/output/cover.json \
  --output /absolute/path/to/output/cover.png

4. Return the result clearly

Always tell the user:

The saved image path or paths
Whether search grounding stayed enabled
Whether text/thought summaries were saved anywhere
Any relevant limitation or warning from the run

If the model returns text but no image, report that plainly and suggest a more explicit image-focused prompt instead of pretending the run succeeded.

Recommended options

Models

gemini-3.1-flash-image-preview: default. Best default for fast, high-volume image generation and editing with Nanobanana 2 features.
gemini-3-pro-image-preview: slower, but a good override for very detail-heavy or typography-sensitive work.
gemini-2.5-flash-image: legacy fallback if the user specifically wants the older Nanobanana model.

Aspect ratios

Supported ratios:

1:1
1:4
1:8
2:3
3:2
3:4
4:1
4:3
4:5
5:4
8:1
9:16
16:9
21:9

Quick picks:

1:1 for logos, icons, thumbnails, and general social posts
4:5 for feed posts and product cards
2:3 for posters and book-cover style work
9:16 for stories, shorts, and phone wallpaper
16:9 or 21:9 for slides, banners, and desktop hero art
1:4, 4:1, 1:8, 8:1 for very tall or very wide experimental layouts now supported by Nanobanana 2

Resolution

512px: Nanobanana 2 only. Best for quick ideation.
1K: default. Good tradeoff for most requests.
2K: use for polished deliverables.
4K: use when the user explicitly needs a high-resolution final.

Thinking and search

Keep search grounding on by default when the request could benefit from live facts, current events, or real product references.
Keep thinking summaries on by default because Nanobanana 2 often composes better on multi-constraint tasks.
Lower --thinking-level to low or minimal when the user prioritizes latency over refinement.
Disable search only when the user wants a purely imaginative result or explicitly requests no web grounding.

Prompting guidance

Good Nanobanana prompts are direct production briefs, not vague art wishes. Include:

Subject
Visual style
Composition or camera framing
Required text if any
Output use case
Constraints such as brand colors, empty space, or realism level

Prefer prompts like:

Create a premium sparkling water can advertisement. Use a cold studio product-photo look, silver highlights, condensation droplets, and a clean dark-teal background. Leave negative space in the upper-right for headline copy.

Instead of:

make a cool drink ad

For edits, tell the model what to preserve and what to change:

Keep the shoe silhouette and logo placement intact. Replace the background with a bright outdoor basketball court, add dynamic afternoon shadows, and keep the image looking like a real sports campaign photo.

Nanobanana 2 capabilities to lean on

Multi-reference composition with many input images
Better grounded visuals with Google Search and Google Image Search support behind the built-in search tool
Wider aspect-ratio support, including very tall and very wide outputs
512px fast ideation output in addition to 1K, 2K, and 4K
Stronger iterative reasoning via Gemini 3 thinking controls

Error handling

If the run fails:

Check ~/.nanobanana.env and confirm GEMINI_API_KEY is present.
Confirm each input image path exists and is readable.
Confirm the output directory is writable.
If a feature looks unsupported, retry with gemini-3.1-flash-image-preview first.
If the response contains only text, rewrite the prompt so the image deliverable is explicit.

Examples

Fast ideation

python3 /absolute/path/to/nanobanana.py \
  --prompt "Create three-dimensional sticker-style fruit mascots on white" \
  --resolution 512px \
  --output /absolute/path/to/output/stickers.png

Grounded current-events visual

python3 /absolute/path/to/nanobanana.py \
  --prompt "Use Google Search to ground a newspaper-style illustration about the latest Mars mission and create a restrained front-page visual" \
  --aspect-ratio 3:2 \
  --output /absolute/path/to/output/mars.png

Multi-reference composite

python3 /absolute/path/to/nanobanana.py \
  --prompt "Create a single brand moodboard from these references. Keep the ceramic texture from the first image, the palette from the second, and the lighting mood from the third." \
  --input /absolute/path/to/a.png /absolute/path/to/b.png /absolute/path/to/c.png \
  --aspect-ratio 16:9 \
  --output /absolute/path/to/output/moodboard.png

Related Skills

feiskyer/youtube-transcribe-skill

content-media

VerifiedTrustedCommunity

Extract subtitles/transcripts from a YouTube video URL and save as a local file. Use when you need to extract subtitles from a YouTube video.

170SKILL.mdUpdated Apr 4, 2026

feiskyer/youtube-transcribe-skill

feiskyer/spec-kit-skill

tools

VerifiedTrustedCommunity

GitHub Spec-Kit integration for constitution-based spec-driven development. 7-phase workflow (constitution, specify, clarify, plan, tasks, analyze, implement). Use when working with spec-kit CLI, .specify/ directories, or creating specifications with constitution-driven development. Triggered by "spec-kit", "speckit", "constitution", "specify", references to .specify/ directory, or spec-kit commands.

170SKILL.mdUpdated Apr 4, 2026

feiskyer/spec-kit-skill

feiskyer/kiro-skill

development

VerifiedTrustedCommunity

Interactive feature development workflow from idea to implementation. Creates requirements (EARS format), design documents, and implementation task lists. Use when creating feature specs, requirements documents, design documents, or implementation plans. Triggered by "kiro" or references to .kiro/specs/ directory.

170SKILL.mdUpdated Apr 4, 2026

feiskyer/deep-research

tools

VerifiedTrustedCommunity

深度调研的多实例（多 Agent）编排工作流：把一个调研目标拆成可并行子目标，用 Codex CLI（`codex exec`）在默认 `workspace-write` 沙箱内运行子进程；联网与采集优先使用已安装的 skills，其次使用 MCP 工具；用脚本聚合子结果并分章精修，最终交付“成品报告文件路径 + 关键结论/建议摘要”。用于：系统性网页/资料调研、竞品/行业分析、批量链接/数据集分片检索、长文写作与证据整合，或用户提及“深度调研/Deep Research/Wide Research/多 Agent 并行调研/多进程调研”等场景。

170SKILL.mdUpdated Apr 4, 2026

feiskyer/deep-research

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/feiskyer/codex-settings.git

# Copy into Claude Code skills folder (global)
cp -r codex-settings/skills/nanobanana-skill ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

feiskyer/codex-settings

170 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT