skills/nano-banana/SKILL.md
Generate and edit high-quality AI images using Google's Gemini image models (Nano Banana series) via MCP. Use when user wants to create images, edit photos, generate graphics, or needs visual content with text rendering. Supports gemini-3.1-flash-image-preview (Nano Banana 2) and gemini-2.5-flash-image-preview (Nano Banana).
npx skillsauth add ckorhonen/claude-skills nano-banana
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Generate images at up to 4K, edit photos, and create graphics with accurate text rendering using Google's Gemini image models via MCP. The default model is gemini-3.1-flash-image-preview (Nano Banana 2), launched February 2026.
Invoke when user:
Get a free API key from Google AI Studio:
Recommended: NanoBanana-MCP (uses Gemini 3.1 Flash Image for highest quality)
# Quick install via Claude Code CLI
claude mcp add nano-banana --env GEMINI_API_KEY=your-key-here -- npx -y nanobanana-mcp
Or add to ~/.claude/settings.json manually:
{
  "mcpServers": {
    "nano-banana": {
      "command": "npx",
      "args": ["-y", "nanobanana-mcp"],
      "env": {
        "GEMINI_API_KEY": "your-api-key-here"
      }
    }
  }
}
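If you prefer to script the manual edit rather than hand-edit the file, the merge can be sketched in Python. This is a minimal sketch, not part of the skill: `add_mcp_server` and `NANO_BANANA_SERVER` are hypothetical names, and it assumes the settings file is plain JSON with a top-level `mcpServers` object as shown above.

```python
import json
from pathlib import Path

# Server entry copied from the manual configuration above.
NANO_BANANA_SERVER = {
    "command": "npx",
    "args": ["-y", "nanobanana-mcp"],
    "env": {"GEMINI_API_KEY": "your-api-key-here"},
}


def add_mcp_server(settings_path: Path, name: str, server: dict) -> dict:
    """Merge one MCP server entry into a JSON settings file and return the result."""
    # Start from the existing settings when the file exists, else from an empty object.
    if settings_path.exists():
        settings = json.loads(settings_path.read_text())
    else:
        settings = {}
    # Keep any servers that are already configured; add or overwrite only `name`.
    settings.setdefault("mcpServers", {})[name] = server
    settings_path.parent.mkdir(parents=True, exist_ok=True)
    settings_path.write_text(json.dumps(settings, indent=2) + "\n")
    return settings
```

Calling `add_mcp_server(Path.home() / ".claude" / "settings.json", "nano-banana", NANO_BANANA_SERVER)` would target the path mentioned above. Parsing the file with `json.loads` first also surfaces syntax errors before anything is written.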
Alternative: Nano-Banana-MCP by ConechoAI (Gemini 2.0 Flash - faster, lower cost)
{
  "mcpServers": {
    "nano-banana": {
      "command": "npx",
      "args": ["nano-banana-mcp"],
      "env": {
        "GEMINI_API_KEY": "your-api-key-here"
      }
    }
  }
}
Once MCP is configured, these tools become available:
| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| gemini_generate_image | Create new images from text prompts | prompt, model, aspectRatio, imageSize |
| gemini_edit_image | Modify existing images with instructions | imagePath, instructions, model |
| continue_editing | Refine the last generated image | instructions |
| get_image_history | List all generated images in session | - |
| Model ID | Description |
|----------|-------------|
| gemini-3.1-flash-image-preview | Default. Highest quality, 4K support, best text rendering (Nano Banana 2, launched Feb 2026) |
| gemini-2.5-flash-image-preview | Fast generation, 1K resolution, good quality (Nano Banana) |
| gemini-3-pro-image-preview | Previous-gen Pro model, still high quality (Nano Banana Pro) |
| gemini-2.0-flash-exp | Older experimental model; prefer gemini-2.5-flash-image-preview for new work |
| Size | Use Case |
|------|----------|
| 4K | Final assets, print, marketing materials |
| 2K | Balanced quality and speed |
| 1K | Fast iteration, prototyping |
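To make the parameter tables concrete, here is a hedged sketch of how arguments for `gemini_generate_image` might be assembled and checked before a call. The parameter names (`prompt`, `model`, `aspectRatio`, `imageSize`) and the per-model size limits come from the tables in this document. The helper name, the `"16:9"` string format for `aspectRatio`, and the validation itself are assumptions, since the MCP server's own validation is not documented here.

```python
# Maximum output size per model, as listed in the model tables in this document.
MAX_SIZE_BY_MODEL = {
    "gemini-3.1-flash-image-preview": "4K",
    "gemini-2.5-flash-image-preview": "1K",
    "gemini-3-pro-image-preview": "4K",
    "gemini-2.0-flash-exp": "1K",
}
SIZE_ORDER = ["1K", "2K", "4K"]  # smallest to largest


def build_generate_args(prompt, model="gemini-3.1-flash-image-preview",
                        aspect_ratio="16:9", image_size="1K"):
    """Return an arguments dict for gemini_generate_image (hypothetical helper)."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    if model not in MAX_SIZE_BY_MODEL:
        raise ValueError(f"unknown model: {model}")
    if image_size not in SIZE_ORDER:
        raise ValueError(f"image_size must be one of {SIZE_ORDER}")
    # Reject sizes above what the chosen model is listed as supporting.
    max_size = MAX_SIZE_BY_MODEL[model]
    if SIZE_ORDER.index(image_size) > SIZE_ORDER.index(max_size):
        raise ValueError(f"{model} is listed with a maximum size of {max_size}")
    return {
        "prompt": prompt,
        "model": model,
        "aspectRatio": aspect_ratio,  # assumed string format, e.g. "16:9"
        "imageSize": image_size,
    }
```

A typical pattern would be to iterate at `image_size="1K"` and switch to `"4K"` on a 4K-capable model only for the final asset.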
| Feature | Capability |
|---------|------------|
| 4K Output | Up to 5632×3072 pixels |
| Text Rendering | Accurate text in images (signs, labels, UI) |
| Multi-Image Composition | Combine up to 14 reference images |
| Character Consistency | Maintain same character across 5+ images |
| Google Search Grounding | Real-world accurate imagery |
[Subject] + [Style] + [Details] + [Technical Specs]
Example:
"A cozy coffee shop interior, watercolor illustration style, warm lighting, wooden furniture, steaming cup on table, 4K resolution, soft morning light through windows"
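The four-part structure can be expressed as a small helper that joins the parts in order. This is only an illustrative sketch: `build_prompt` is a hypothetical name, and splitting the example into these particular parts is one reasonable reading of it.

```python
def build_prompt(subject, style, details, technical_specs):
    """Join [Subject] + [Style] + [Details] + [Technical Specs] into one prompt string."""
    parts = [subject, style, *details, *technical_specs]
    # Skip empty parts so optional sections do not leave stray commas.
    return ", ".join(part.strip() for part in parts if part and part.strip())


prompt = build_prompt(
    subject="A cozy coffee shop interior",
    style="watercolor illustration style",
    details=["warm lighting", "wooden furniture", "steaming cup on table"],
    technical_specs=["4K resolution", "soft morning light through windows"],
)
```

Keeping the parts separate makes it easy to hold the subject fixed while trying a different style or a different set of technical specs.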
For high-stakes work requiring exact reproducibility, use structured JSON schemas.
Trigger phrases:
| Type | Use Case | Key Controls |
|------|----------|--------------|
| marketing_image | Product shots, hero images | subject, props, lighting, camera, brand locks |
| ui_builder | App screens, dashboards | tokens, screens, containers, components |
| diagram_spec | Flowcharts, infographics | nodes, edges, data constraints |
User: "I need a hero shot for Aurora Lime seltzer"
Claude asks: "For the Aurora Lime hero shot:
Result: Structured JSON with exact specifications that can be iterated field-by-field.
JSON enables changing ONE thing without regenerating everything:
| Change | What Stays Fixed |
|--------|------------------|
| Swap lighting direction | Subject, props, background |
| Try different camera angle | Lighting, props, environment |
| Change background color | Subject geometry, lighting setup |
| Add/remove props | Everything else |
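As an illustration of field-by-field iteration, the sketch below changes a single nested field in a spec and confirms that nothing else moved. The spec layout and all of its values are hypothetical placeholders loosely based on the `marketing_image` controls listed above; the actual schemas live in `references/schemas/` and may differ.

```python
import copy

# Hypothetical marketing_image spec; field names and values are illustrative only.
base_spec = {
    "type": "marketing_image",
    "subject": "Aurora Lime seltzer can",
    "props": ["lime wedges"],
    "lighting": {"direction": "left", "quality": "soft"},
    "camera": {"angle": "eye-level"},
    "background": {"color": "white"},
}


def with_change(spec, path, value):
    """Return a deep copy of spec with the field at `path` replaced by `value`."""
    updated = copy.deepcopy(spec)
    node = updated
    for key in path[:-1]:
        node = node[key]
    node[path[-1]] = value
    return updated


def changed_fields(before, after):
    """List the top-level fields whose values differ between two specs."""
    return sorted(k for k in before.keys() | after.keys() if before.get(k) != after.get(k))


# Swap the lighting direction; subject, props, camera, and background stay fixed.
variant = with_change(base_spec, ["lighting", "direction"], "right")
```

Because `with_change` copies the spec, the base version stays available for the next variation, and `changed_fields(base_spec, variant)` gives a quick check that only the intended field differs.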
- references/json-prompting.md - Full JSON prompting guide
- references/translator-prompt.md - Translator system prompt
- references/schemas/ - Template schemas for each type
- references/examples-json.md - Filled-out examples

Nano Banana 2 excels at text rendering:
"A vintage movie poster for 'COSMIC ADVENTURE' with bold retro typography, starfield background, astronaut silhouette, 1970s sci-fi aesthetic"
For consistent characters across images:
- Use a history:0 reference in subsequent prompts

First: "A young woman with red curly hair, freckles, green eyes, wearing a blue jacket"
Then: "The same woman from history:0, now sitting at a café, reading a book"
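The two-step pattern above can be generated for several scenes at once. This is a minimal sketch with a hypothetical helper name; it only builds prompt strings in the same shape as the example and assumes nothing about how the server resolves `history:0`.

```python
def consistency_prompts(character, scenes, noun="character", ref="history:0"):
    """Return the initial character prompt followed by one follow-up prompt per scene."""
    # Each follow-up points back at the first generated image via the history reference.
    followups = [f"The same {noun} from {ref}, now {scene}" for scene in scenes]
    return [character, *followups]


prompts = consistency_prompts(
    "A young woman with red curly hair, freckles, green eyes, wearing a blue jacket",
    ["sitting at a café, reading a book", "walking through a rainy street at night"],
    noun="woman",
)
```

The first prompt would go to `gemini_generate_image`, and the remaining ones would follow as separate requests in the same session.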
User: "Create an image of a futuristic city at sunset"
Claude uses: gemini_generate_image
Prompt: "Futuristic cityscape at golden hour sunset, towering glass skyscrapers with holographic advertisements, flying vehicles, warm orange and purple sky, photorealistic, 4K resolution, cinematic lighting"
User: "Edit this photo to make it look like winter"
Claude uses: gemini_edit_image
Input: [user's image path]
Instructions: "Transform to winter scene: add snow on ground and surfaces, frost on windows, visible breath, overcast sky, cool blue color grading"
User: "Make the lighting warmer"
Claude uses: continue_editing
Instructions: "Adjust lighting to warmer tones, add golden hour glow, enhance orange/yellow highlights, softer shadows"
Images save to: ~/Documents/nanobanana_generated/
Naming format: generated-[timestamp]-[id].png
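To locate recent output from a script, something like the following may help. It is a sketch under stated assumptions: it relies only on the directory and the `generated-` prefix given above, sorts by file modification time rather than parsing the timestamp (whose exact format is not specified here), and `latest_images` is a hypothetical helper name.

```python
from pathlib import Path

# Default output directory as stated above.
OUTPUT_DIR = Path.home() / "Documents" / "nanobanana_generated"


def latest_images(directory=OUTPUT_DIR, limit=5):
    """Return up to `limit` generated PNG files, newest first by modification time."""
    if not directory.is_dir():
        return []
    files = sorted(
        directory.glob("generated-*.png"),
        key=lambda path: path.stat().st_mtime,
        reverse=True,
    )
    return files[:limit]
```

An empty result for the default directory may simply mean that nothing has been generated yet or that the server was configured to write elsewhere.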
- Use .env files for key management in projects

| Model | Speed | Quality | Max Res | Cost | Best For |
|-------|-------|---------|---------|------|----------|
| gemini-3.1-flash-image-preview | Fast | Highest | 4K | ~$0.045/img (1K) | Final assets, print, marketing |
| gemini-2.5-flash-image-preview | Fastest | Good | 1K | ~$0.039/img | Prototyping, iteration, drafts |
| gemini-3-pro-image-preview | Slower | High | 4K | ~$0.134/img | Previous-gen Pro fallback |
| gemini-2.0-flash-exp | Fast | Good | 1K | Low | Legacy; use 2.5-flash-image-preview instead |
Pricing as of March 2026. Free tier (~500 req/day) available via Google AI Studio for all models.
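For budgeting, the per-image figures in the table can be turned into a rough estimate. The prices below are the approximate values listed above (the gemini-3.1-flash-image-preview figure applies to 1K output only), the helper name is hypothetical, and free-tier usage is ignored.

```python
# Approximate per-image prices in USD, copied from the comparison table above.
PRICE_PER_IMAGE_USD = {
    "gemini-3.1-flash-image-preview": 0.045,  # listed for 1K output
    "gemini-2.5-flash-image-preview": 0.039,
    "gemini-3-pro-image-preview": 0.134,
}


def estimate_cost(model, image_count):
    """Estimate the cost in USD of generating `image_count` images with `model`."""
    if model not in PRICE_PER_IMAGE_USD:
        raise ValueError(f"no numeric price listed for {model}")
    return round(PRICE_PER_IMAGE_USD[model] * image_count, 3)


# Example workflow: 20 drafts on the cheaper model, then 2 finals on the default model.
total = (estimate_cost("gemini-2.5-flash-image-preview", 20)
         + estimate_cost("gemini-3.1-flash-image-preview", 2))
```

With these figures the example workflow comes to roughly $0.87: 20 × $0.039 = $0.78 for drafts plus 2 × $0.045 = $0.09 for finals at 1K.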
| Issue | Solution |
|-------|----------|
| "API key invalid" | Verify key at AI Studio |
| "Rate limited" | Wait 60s, or upgrade API tier |
| "MCP not connected" | Restart Claude Code, check config syntax |
| "Image not saving" | Check write permissions on output directory |
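The "wait 60s" advice for rate limiting can be automated in scripts that drive generation. The sketch below is generic: `RateLimitedError` is a hypothetical stand-in for however a rate-limit response surfaces in your client, and `call_with_retry` is not part of the MCP server.

```python
import time


class RateLimitedError(Exception):
    """Hypothetical error type standing in for a "Rate limited" response."""


def call_with_retry(fn, attempts=3, wait_seconds=60.0, sleep=time.sleep):
    """Call fn(), waiting `wait_seconds` after each rate-limit error, up to `attempts` tries."""
    for attempt in range(1, attempts + 1):
        try:
            return fn()
        except RateLimitedError:
            if attempt == attempts:
                raise  # out of attempts: let the caller see the error
            sleep(wait_seconds)
```

The `sleep` parameter exists so the wait can be replaced in tests. If rate limiting persists across several attempts, upgrading the API tier is the other option listed in the table.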
Works well with:
- references/prompting-guide.md - Detailed prompting techniques
- references/examples.md - Sample prompts by category