tools/image/nano-banana/SKILL.md
Generate images with Google Gemini native image models via inference.sh CLI. Models: Gemini 3 Pro Image, Gemini 2.5 Flash Image. Capabilities: text-to-image, image editing, multi-image input. Triggers: nano banana, gemini image, gemini 3 pro image, gemini 2.5 flash image, google image generation, native image generation, gemini native image
npx skillsauth add inference-sh/agent-skills-registry nano-bananaInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Generate images with Google Gemini native image models via inference.sh CLI.

Requires inference.sh CLI (
belt). Install instructions
belt login
belt app run google/gemini-3-pro-image-preview --input '{"prompt": "a banana in space, photorealistic"}'
| Model | App ID | Speed | Quality |
|-------|--------|-------|---------|
| Gemini 3 Pro Image | google/gemini-3-pro-image-preview | Slower | Best |
| Gemini 2.5 Flash Image | google/gemini-2-5-flash-image | Fast | Excellent |
belt app list --search "gemini image"
belt app run google/gemini-3-pro-image-preview --input '{
"prompt": "A futuristic cityscape at sunset with flying cars"
}'
belt app run google/gemini-2-5-flash-image --input '{
"prompt": "Minimalist logo design for a coffee shop",
"num_images": 4
}'
belt app run google/gemini-3-pro-image-preview --input '{
"prompt": "Panoramic mountain landscape with northern lights",
"aspect_ratio": "16:9"
}'
belt app run google/gemini-2-5-flash-image --input '{
"prompt": "Add a rainbow in the sky",
"images": ["https://example.com/landscape.jpg"]
}'
belt app run google/gemini-3-pro-image-preview --input '{
"prompt": "Detailed illustration of a medieval castle",
"resolution": "4K"
}'
belt app run google/gemini-3-pro-image-preview --input '{
"prompt": "Current weather in Tokyo visualized as an artistic scene",
"enable_google_search": true
}'
| Parameter | Type | Description |
|-----------|------|-------------|
| prompt | string | Required. What to generate or change |
| images | array | Input images for editing (up to 14) |
| num_images | integer | Number of images to generate |
| aspect_ratio | string | Output ratio: "1:1", "16:9", "9:16", "4:3", "3:4", "auto" |
| resolution | string | "1K", "2K", "4K" (Gemini 3 Pro only) |
| output_format | string | Output format for images |
| enable_google_search | boolean | Enable real-time info grounding |
Styles: photorealistic, illustration, watercolor, oil painting, digital art, anime, 3D render
Composition: close-up, wide shot, aerial view, macro, portrait, landscape
Lighting: natural light, studio lighting, golden hour, dramatic shadows, neon
Details: add specific details about textures, colors, mood, atmosphere
# 1. Generate sample input to see all options
belt app sample google/gemini-3-pro-image-preview --save input.json
# 2. Edit the prompt
# 3. Run
belt app run google/gemini-3-pro-image-preview --input input.json
# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli
# All image generation models
npx skills add inference-sh/skills@ai-image-generation
# Video generation (for image-to-video)
npx skills add inference-sh/skills@ai-video-generation
Browse all image apps: belt app list --category image
development
Declarative UI widgets from JSON for React/Next.js from ui.inference.sh. Render rich interactive UIs from structured agent responses. Capabilities: forms, buttons, cards, layouts, inputs, selects, checkboxes. Use for: agent-generated UIs, dynamic forms, data display, interactive cards. Triggers: widgets, declarative ui, json ui, widget renderer, agent widgets, dynamic ui, form widgets, card widgets, shadcn widgets, structured output ui
tools
Tool lifecycle UI components for React/Next.js from ui.inference.sh. Display tool calls: pending, progress, approval required, results. Capabilities: tool status, progress indicators, approval flows, results display. Use for: showing agent tool calls, human-in-the-loop approvals, tool output. Triggers: tool ui, tool calls, tool status, tool approval, tool results, agent tools, mcp tools ui, function calling ui, tool lifecycle, tool pending
development
Chat UI building blocks for React/Next.js from ui.inference.sh. Components: container, messages, input, typing indicators, avatars. Capabilities: chat interfaces, message lists, input handling, streaming. Use for: building custom chat UIs, messaging interfaces, AI assistants. Triggers: chat ui, chat component, message list, chat input, shadcn chat, react chat, chat interface, messaging ui, conversation ui, chat building blocks
tools
Batteries-included agent component for React/Next.js from ui.inference.sh. One component with runtime, tools, streaming, approvals, and widgets built in. Capabilities: drop-in agent, human-in-the-loop, client-side tools, form filling. Use for: building AI chat interfaces, agentic UIs, SaaS copilots, assistants. Triggers: agent component, agent ui, chat agent, shadcn agent, react agent, agentic ui, ai assistant ui, copilot ui, inference ui, human in the loop