.cursor/skills/nano-banana/SKILL.md
Generate and edit high-quality AI images using Google's Gemini 3 Pro Image model (Nano Banana Pro) via MCP. Use when user wants to create images, edit photos, generate graphics, or needs visual content with text rendering.
npx skillsauth add nilecui/SkillsBase nano-bananaInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Generate stunning 4K images, edit photos, and create graphics with perfect text rendering using Google's latest Gemini 3 Pro Image model via MCP.
Invoke when user:
Get a free API key from Google AI Studio:
Recommended: NanoBanana-MCP (uses Gemini 3 Pro for highest quality)
# Quick install via Claude Code CLI
claude mcp add nano-banana --env GEMINI_API_KEY=your-key-here -- npx -y nanobanana-mcp
Or add to ~/.claude/settings.json manually:
{
"mcpServers": {
"nano-banana": {
"command": "npx",
"args": ["-y", "nanobanana-mcp"],
"env": {
"GEMINI_API_KEY": "your-api-key-here"
}
}
}
}
Alternative: Nano-Banana-MCP by ConechoAI (Gemini 2.5 Flash - faster, lower cost)
{
"mcpServers": {
"nano-banana": {
"command": "npx",
"args": ["nano-banana-mcp"],
"env": {
"GEMINI_API_KEY": "your-api-key-here"
}
}
}
}
Once MCP is configured, these tools become available:
| Tool | Purpose | Key Parameters |
|------|---------|----------------|
| gemini_generate_image | Create new images from text prompts | prompt, model, aspectRatio, imageSize |
| gemini_edit_image | Modify existing images with instructions | imagePath, instructions, model |
| continue_editing | Refine the last generated image | instructions |
| get_image_history | List all generated images in session | - |
| Model ID | Description |
|----------|-------------|
| gemini-3-pro-image-preview | Default. Highest quality, 4K support, best text rendering |
| gemini-2.0-flash-exp | Faster generation, good quality, lower cost |
| gemini-2.0-flash-preview-image-generation | Alternative 2.0 model |
| Size | Use Case |
|------|----------|
| 4K | Final assets, print, marketing materials |
| 2K | Balanced quality and speed |
| 1K | Fast iteration, prototyping |
| Feature | Capability | |---------|------------| | 4K Output | Up to 5632×3072 pixels | | Text Rendering | Accurate text in images (signs, labels, UI) | | Multi-Image Composition | Combine up to 14 reference images | | Character Consistency | Maintain same character across 5+ images | | Google Search Grounding | Real-world accurate imagery |
[Subject] + [Style] + [Details] + [Technical Specs]
Example:
"A cozy coffee shop interior, watercolor illustration style, warm lighting, wooden furniture, steaming cup on table, 4K resolution, soft morning light through windows"
For high-stakes work requiring exact reproducibility, use structured JSON schemas.
Trigger phrases:
| Type | Use Case | Key Controls |
|------|----------|--------------|
| marketing_image | Product shots, hero images | subject, props, lighting, camera, brand locks |
| ui_builder | App screens, dashboards | tokens, screens, containers, components |
| diagram_spec | Flowcharts, infographics | nodes, edges, data constraints |
User: "I need a hero shot for Aurora Lime seltzer"
Claude asks: "For the Aurora Lime hero shot:
Result: Structured JSON with exact specifications that can be iterated field-by-field.
JSON enables changing ONE thing without regenerating everything:
| Change | What Stays Fixed | |--------|------------------| | Swap lighting direction | Subject, props, background | | Try different camera angle | Lighting, props, environment | | Change background color | Subject geometry, lighting setup | | Add/remove props | Everything else |
references/json-prompting.md - Full JSON prompting guidereferences/translator-prompt.md - Translator system promptreferences/schemas/ - Template schemas for each typereferences/examples-json.md - Filled-out examplesNano Banana Pro excels at text rendering:
"A vintage movie poster for 'COSMIC ADVENTURE' with bold retro typography, starfield background, astronaut silhouette, 1970s sci-fi aesthetic"
For consistent characters across images:
history:0 reference in subsequent promptsFirst: "A young woman with red curly hair, freckles, green eyes, wearing a blue jacket"
Then: "The same woman from history:0, now sitting at a café, reading a book"
User: "Create an image of a futuristic city at sunset"
Claude uses: gemini_generate_image
Prompt: "Futuristic cityscape at golden hour sunset, towering glass skyscrapers with holographic advertisements, flying vehicles, warm orange and purple sky, photorealistic, 4K resolution, cinematic lighting"
User: "Edit this photo to make it look like winter"
Claude uses: gemini_edit_image
Input: [user's image path]
Instructions: "Transform to winter scene: add snow on ground and surfaces, frost on windows, visible breath, overcast sky, cool blue color grading"
User: "Make the lighting warmer"
Claude uses: continue_editing
Instructions: "Adjust lighting to warmer tones, add golden hour glow, enhance orange/yellow highlights, softer shadows"
Images save to: ~/Documents/nanobanana_generated/
Naming format: generated-[timestamp]-[id].png
.env files for key management in projects| Model | Speed | Quality | Cost | Best For |
|-------|-------|---------|------|----------|
| gemini-3-pro-image-preview | Slower | Highest (4K) | Higher | Final assets, print, marketing |
| gemini-2.0-flash-exp | Fast | Good | Lower | Prototyping, iteration, drafts |
| Issue | Solution | |-------|----------| | "API key invalid" | Verify key at AI Studio | | "Rate limited" | Wait 60s, or upgrade API tier | | "MCP not connected" | Restart Claude Code, check config syntax | | "Image not saving" | Check write permissions on output directory |
Works well with:
references/prompting-guide.md - Detailed prompting techniquesreferences/examples.md - Sample prompts by categorydocumentation
Presentation creation, editing, and analysis. When Claude needs to work with presentations (.pptx files) for: (1) Creating new presentations, (2) Modifying or editing content, (3) Working with layouts, (4) Adding comments or speaker notes, or any other presentation tasks
testing
Design a PostgreSQL-specific schema. Covers best-practices, data types, indexing, constraints, performance patterns, and advanced features
content-media
Production-ready PDF processing with forms, tables, OCR, validation, and batch operations. Use when working with complex PDF workflows in production environments, processing large volumes of PDFs, or requiring robust error handling and validation.
development
Design microservices architectures with service boundaries, event-driven communication, and resilience patterns. Use when building distributed systems, decomposing monoliths, or implementing microservices.