plugins/nano-banana/skills/nano-banana/SKILL.md
Generates AI images using the nano-banana CLI (Gemini 3.1 Flash default, Pro available). Handles multi-resolution (512-4K), aspect ratios, reference images for style transfer, green screen workflow for transparent assets, cost tracking, and exact dimension control. Use when asked to "generate an image", "create a sprite", "make an asset", "generate artwork", or any image generation task for UI mockups, game assets, videos, or marketing materials.
AI image generation CLI. Default model: Gemini 3.1 Flash Image Preview (Nano Banana 2).
When the user says "init", "setup nano-banana", or "install nano-banana", run these commands to get the CLI tool on their machine. No sudo required.
Prerequisites: Bun must be installed. If not: curl -fsSL https://bun.sh/install | bash
# 1. Clone the repo
git clone https://github.com/kingbootoshi/nano-banana-2-skill.git ~/tools/nano-banana-2
# 2. Install dependencies
cd ~/tools/nano-banana-2 && bun install
# 3. Link globally (creates `nano-banana` command via Bun - no sudo)
cd ~/tools/nano-banana-2 && bun link
# 4. Set up API key
mkdir -p ~/.nano-banana
echo "GEMINI_API_KEY=<ask user for their key>" > ~/.nano-banana/.env
After init, the user can type nano-banana "prompt" from anywhere.
If bun link fails or the command is not found after linking, fall back to:
mkdir -p ~/.local/bin
ln -sf ~/tools/nano-banana-2/src/cli.ts ~/.local/bin/nano-banana
# Then ensure ~/.local/bin is on PATH:
echo 'export PATH="$HOME/.local/bin:$PATH"' >> ~/.zshrc
source ~/.zshrc
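Re-running the fallback appends a duplicate `export` line to `~/.zshrc` each time. A minimal sketch of a guard, assuming hypothetical helper names `path_contains` and `ensure_on_path` that are not part of nano-banana:

```shell
#!/usr/bin/env sh
# Hypothetical helpers (not part of nano-banana).

# Succeed if the given directory is already listed in PATH.
path_contains() {
  case ":$PATH:" in
    *":$1:"*) return 0 ;;
    *)        return 1 ;;
  esac
}

# Append an export line to the rc file only when the directory is missing from PATH.
ensure_on_path() {
  dir="$1"
  rc_file="$2"
  if ! path_contains "$dir"; then
    # Single quotes keep $PATH literal so it expands when the rc file is sourced.
    printf 'export PATH="%s:$PATH"\n' "$dir" >> "$rc_file"
  fi
}

# Example: ensure_on_path "$HOME/.local/bin" "$HOME/.zshrc"
```

After sourcing the rc file, `command -v nano-banana` should print the path of the linked command if the setup worked.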
Get a Gemini API key at: https://aistudio.google.com/apikey
nano-banana "prompt" [options]

| Option | Default | Description |
|--------|---------|-------------|
| -o, --output | nano-gen-{timestamp} | Output filename (no extension) |
| -s, --size | 1K | Image size: 512, 1K, 2K, or 4K |
| -a, --aspect | model default | Aspect ratio: 1:1, 16:9, 9:16, 4:3, 3:4, etc. |
| -m, --model | flash | Model: flash/nb2, pro/nb-pro, or any model ID |
| -d, --dir | current directory | Output directory |
| -r, --ref | - | Reference image (can use multiple times) |
| -t, --transparent | - | Generate on green screen, remove background (FFmpeg) |
| --api-key | - | Gemini API key (overrides env/file) |
| --costs | - | Show cost summary |

| Alias | Model | Use When |
|-------|-------|----------|
| flash, nb2 | Gemini 3.1 Flash | Default. Fast, cheap (~$0.067/1K image) |
| pro, nb-pro | Gemini 3 Pro | Highest quality needed (~$0.134/1K image) |

| Size | Cost (Flash) | Cost (Pro) |
|------|-------------|------------|
| 512 | ~$0.045 | Flash only |
| 1K | ~$0.067 | ~$0.134 |
| 2K | ~$0.101 | ~$0.201 |
| 4K | ~$0.151 | ~$0.302 |
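As a worked example of the pricing table above, the sketch below estimates the cost of a batch before running it. `estimate_cost` is a hypothetical helper, not a CLI feature; it hard-codes the approximate per-image prices from the table, so treat the results as rough estimates.

```shell
#!/usr/bin/env sh
# Hypothetical helper (not part of nano-banana): estimate batch cost in USD
# from the approximate per-image prices in the table above.
estimate_cost() {
  model="$1"
  size="$2"
  count="$3"
  case "$model:$size" in
    flash:512) price=0.045 ;;
    flash:1K)  price=0.067 ;;
    flash:2K)  price=0.101 ;;
    flash:4K)  price=0.151 ;;
    pro:1K)    price=0.134 ;;
    pro:2K)    price=0.201 ;;
    pro:4K)    price=0.302 ;;
    *) echo "unsupported model/size: $model $size" >&2; return 1 ;;
  esac
  # awk does the floating-point multiplication; shell arithmetic is integer-only.
  awk -v p="$price" -v n="$count" 'BEGIN { printf "%.3f\n", p * n }'
}

estimate_cost flash 1K 20   # 20 Flash images at 1K: prints 1.340
estimate_cost pro 4K 5      # 5 Pro images at 4K: prints 1.510
```

The 512 size is rejected for Pro because the table lists it as Flash only.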
Supported: 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3, 4:5, 5:4, 21:9
Use -a flag: nano-banana "cinematic scene" -a 16:9
nano-banana "minimal dashboard UI with dark theme"
nano-banana "cinematic landscape" -s 2K -a 16:9
nano-banana "quick concept sketch" -s 512
# Default (Flash - fast, cheap)
nano-banana "your prompt"
# Pro (highest quality)
nano-banana "detailed portrait" --model pro -s 2K
# Edit existing image
nano-banana "change the background to pure white" -r dark-ui.png -o light-ui
# Style transfer - multiple references
nano-banana "combine these two styles" -r style1.png -r style2.png -o combined
nano-banana "robot mascot character" -t -o mascot
nano-banana "pixel art treasure chest" -t -o chest
The -t flag automatically prompts the model to generate on a green screen, then uses the FFmpeg colorkey and despill filters to key out the background and remove green spill from edge pixels. The result is pixel-perfect transparency with no manual prompting needed.
Requires: brew install ffmpeg imagemagick
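To make the keying step concrete, here is a rough sketch of a colorkey plus despill pass run by hand with FFmpeg. The key color, similarity, and blend values are assumptions chosen for illustration, and `build_key_filter` and `remove_green_screen` are hypothetical names; the filter settings the CLI actually uses may differ.

```shell
#!/usr/bin/env sh
# Sketch only: the parameter values below are assumptions, not nano-banana's settings.

# Print an FFmpeg filter chain: key out a color, then suppress green spill.
build_key_filter() {
  key_color="${1:-0x00FF00}"   # pure green
  similarity="${2:-0.3}"       # how close to the key color a pixel must be
  blend="${3:-0.1}"            # softness of the transparency edge
  printf 'colorkey=%s:%s:%s,despill=type=green' "$key_color" "$similarity" "$blend"
}

# Apply the filter chain to one image and write a PNG with an alpha channel.
remove_green_screen() {
  ffmpeg -y -loglevel error -i "$1" -vf "$(build_key_filter)" "$2"
}

# Example: remove_green_screen mascot-green.png mascot.png
```

Raising the similarity value removes more of the background at the risk of eating into green areas of the subject.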
To get a specific output dimension:
1. First -r flag: your reference/style image
2. Second -r flag: a blank image in the target dimensions

nano-banana "pixel art character in style of first image, 256x256" -r style.png -r blank-256x256.png -o sprite
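The blank reference image has to exist before the command runs. One way to create it is with ImageMagick, sketched below. This assumes ImageMagick 7 (the `magick` command; ImageMagick 6 uses `convert` instead) and a white canvas, which is a guess since the required fill color is not specified. `blank_name` and `make_blank` are hypothetical helpers.

```shell
#!/usr/bin/env sh
# Hypothetical helpers (not part of nano-banana).

# Build the filename used for a blank canvas, e.g. blank-256x256.png.
blank_name() {
  printf 'blank-%sx%s.png\n' "$1" "$2"
}

# Create a white canvas of the given width and height, then print its filename.
# Assumes ImageMagick 7; the white fill is an assumption.
make_blank() {
  file="$(blank_name "$1" "$2")"
  magick -size "${1}x${2}" xc:white "$file" && printf '%s\n' "$file"
}

# Example:
#   blank="$(make_blank 256 256)"
#   nano-banana "pixel art character in style of first image, 256x256" \
#     -r style.png -r "$blank" -o sprite
```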
Every generation is logged to ~/.nano-banana/costs.json. View summary:
nano-banana --costs
# UI mockups
nano-banana "clean SaaS dashboard with analytics charts, white background"
# Widescreen cinematic
nano-banana "cyberpunk cityscape at sunset" -a 16:9 -s 2K
# Product shots with Pro quality
nano-banana "premium software product hero image" --model pro
# Quick low-res concept
nano-banana "rough sketch of a robot" -s 512
# Dark mode UI
nano-banana "Premium SaaS chat interface, dark mode, minimal, Linear-style aesthetic"
# Game assets with transparency (green screen auto-prompted)
nano-banana "pixel art treasure chest" -t -o chest
# Portrait aspect ratio
nano-banana "mobile app onboarding screen" -a 9:16
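The flags above also compose in scripts. The sketch below renders one prompt at several of the supported aspect ratios using only the documented `-a` and `-o` options. `generate_variants` and the `DRY_RUN` variable are hypothetical; with `DRY_RUN=1` the commands are printed instead of executed, so nothing is billed.

```shell
#!/usr/bin/env sh
# Hypothetical helper (not part of nano-banana): one prompt, several aspect ratios.
generate_variants() {
  prompt="$1"
  shift
  for ratio in "$@"; do
    # Replace ":" so the ratio is safe in a filename, e.g. 16:9 becomes variant-16x9.
    name="variant-$(printf '%s' "$ratio" | tr ':' 'x')"
    if [ "${DRY_RUN:-0}" = "1" ]; then
      printf 'nano-banana "%s" -a %s -o %s\n' "$prompt" "$ratio" "$name"
    else
      nano-banana "$prompt" -a "$ratio" -o "$name"
    fi
  done
}

# Preview the commands without calling the API.
DRY_RUN=1 generate_variants "cyberpunk cityscape at sunset" 16:9 1:1 9:16
```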
The CLI resolves the Gemini API key in this order:
1. --api-key flag
2. GEMINI_API_KEY environment variable
3. .env file in current directory
4. .env file next to the CLI script
5. ~/.nano-banana/.env

Get a key at: https://aistudio.google.com/apikey
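The lookup order can be sketched in shell as follows. This illustrates the documented precedence only and is not the CLI's actual implementation. `read_env_key` and `resolve_api_key` are hypothetical names, and the `.env` parsing is simplified: it handles plain `GEMINI_API_KEY=value` lines with no quoting or `export` prefix.

```shell
#!/usr/bin/env sh
# Sketch of the documented precedence; the CLI's real logic may differ.

# Print the GEMINI_API_KEY value from a .env file, or fail if absent.
read_env_key() {
  [ -f "$1" ] || return 1
  line="$(grep -m1 '^GEMINI_API_KEY=' "$1")" || return 1
  printf '%s\n' "${line#GEMINI_API_KEY=}"
}

# Usage: resolve_api_key <value of --api-key or ""> <directory of the CLI script>
resolve_api_key() {
  flag_key="$1"
  script_dir="$2"
  # 1. Explicit flag wins.
  if [ -n "$flag_key" ]; then printf '%s\n' "$flag_key"; return 0; fi
  # 2. Environment variable.
  if [ -n "${GEMINI_API_KEY:-}" ]; then printf '%s\n' "$GEMINI_API_KEY"; return 0; fi
  # 3-5. .env files, nearest first.
  for env_file in "./.env" "$script_dir/.env" "$HOME/.nano-banana/.env"; do
    read_env_key "$env_file" && return 0
  done
  echo "no Gemini API key found" >&2
  return 1
}
```

Because the flag and the environment variable are checked first, a key stored in `~/.nano-banana/.env` acts as a default that any narrower setting overrides.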