tools/image/nano-banana/SKILL.md
Generate images with Google Gemini native image models via inference.sh CLI. Models: Gemini 3 Pro Image, Gemini 2.5 Flash Image. Capabilities: text-to-image, image editing, multi-image input. Triggers: nano banana, gemini image, gemini 3 pro image, gemini 2.5 flash image, google image generation, native image generation, gemini native image
Install this skill globally with one command: `npx skillsauth add inferencesh/skills nano-banana`. Works with Claude Code, Cursor, and Windsurf.

Requires the inference.sh CLI (`belt`). See the install instructions.
```bash
belt login
belt app run google/gemini-3-pro-image-preview --input '{"prompt": "a banana in space, photorealistic"}'
```
| Model | App ID | Speed | Quality |
|-------|--------|-------|---------|
| Gemini 3 Pro Image | google/gemini-3-pro-image-preview | Slower | Best |
| Gemini 2.5 Flash Image | google/gemini-2-5-flash-image | Fast | Excellent |
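
The speed and quality trade-off in the table can be wrapped in a small script. The following is a minimal sketch, not part of the belt CLI: `pick_app` is a hypothetical helper, and the "draft" versus "final" labels are assumptions introduced only for illustration. It calls `belt` only when the CLI is on the PATH.

```bash
set -eu

# pick_app is a hypothetical helper (not a belt command).
# "draft" favors speed (Gemini 2.5 Flash Image); anything else
# favors quality (Gemini 3 Pro Image).
pick_app() {
  case "${1:-final}" in
    draft) echo "google/gemini-2-5-flash-image" ;;
    *)     echo "google/gemini-3-pro-image-preview" ;;
  esac
}

app="$(pick_app draft)"
echo "Selected app: $app"

# Run only if the belt CLI is installed, so the script is safe to dry-run.
if command -v belt >/dev/null 2>&1; then
  belt app run "$app" --input '{"prompt": "a banana in space, photorealistic"}'
fi
```

Iterating on a prompt with the faster model and switching to the Pro model for the final render is one plausible way to use this, though the right split depends on the workload.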
```bash
belt app list --search "gemini image"
```
```bash
# Text-to-image
belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "A futuristic cityscape at sunset with flying cars"
}'

# Multiple images
belt app run google/gemini-2-5-flash-image --input '{
  "prompt": "Minimalist logo design for a coffee shop",
  "num_images": 4
}'

# Custom aspect ratio
belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "Panoramic mountain landscape with northern lights",
  "aspect_ratio": "16:9"
}'

# Image editing
belt app run google/gemini-2-5-flash-image --input '{
  "prompt": "Add a rainbow in the sky",
  "images": ["https://example.com/landscape.jpg"]
}'

# High resolution (Gemini 3 Pro only)
belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "Detailed illustration of a medieval castle",
  "resolution": "4K"
}'

# Google Search grounding
belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "Current weather in Tokyo visualized as an artistic scene",
  "enable_google_search": true
}'
```
| Parameter | Type | Description |
|-----------|------|-------------|
| prompt | string | Required. What to generate or change |
| images | array | Input images for editing (up to 14) |
| num_images | integer | Number of images to generate |
| aspect_ratio | string | Output ratio: "1:1", "16:9", "9:16", "4:3", "3:4", "auto" |
| resolution | string | "1K", "2K", "4K" (Gemini 3 Pro only) |
| output_format | string | Output format for images |
| enable_google_search | boolean | Enable real-time info grounding |
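
The parameters in the table can presumably be set together in one request. The sketch below assumes the app accepts these fields in combination (the examples in this document set them one at a time) and that `--input` takes a file path, as in the sample-input workflow. It writes `input.json` and calls `belt` only when the CLI is on the PATH.

```bash
set -eu

# Write a reusable input file that sets several parameters at once.
# Assumption: the app accepts these fields together in a single request.
cat > input.json <<'EOF'
{
  "prompt": "Panoramic mountain landscape with northern lights",
  "aspect_ratio": "16:9",
  "resolution": "2K",
  "num_images": 2
}
EOF

# "resolution" is listed as Gemini 3 Pro only, so the Pro app is used here.
if command -v belt >/dev/null 2>&1; then
  belt app run google/gemini-3-pro-image-preview --input input.json
else
  echo "belt not found; wrote input.json only"
fi
```

Keeping the input in a file makes it easier to review or version a request before running it.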
Styles: photorealistic, illustration, watercolor, oil painting, digital art, anime, 3D render
Composition: close-up, wide shot, aerial view, macro, portrait, landscape
Lighting: natural light, studio lighting, golden hour, dramatic shadows, neon
Details: add specific details about textures, colors, mood, atmosphere
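
One way to apply these tips is to assemble the prompt from separate parts. This is a minimal sketch: `build_prompt` is a hypothetical helper, and the subject, style, and other values are illustrative only. The JSON step assumes the prompt contains no double quotes or backslashes, since it does no escaping.

```bash
set -eu

# build_prompt is a hypothetical helper: it joins its non-empty
# arguments with ", ". The body runs in a subshell (parentheses),
# so its variables do not leak into the caller.
build_prompt() (
  out=""
  for part in "$@"; do
    [ -n "$part" ] || continue
    if [ -n "$out" ]; then
      out="$out, $part"
    else
      out="$part"
    fi
  done
  printf '%s\n' "$out"
)

# Subject, then style, composition, lighting, and details.
prompt="$(build_prompt \
  "a lighthouse on a rocky coast" \
  "watercolor" \
  "wide shot" \
  "golden hour" \
  "soft textures, calm mood")"

# Assumption: no quotes or backslashes in the prompt (no JSON escaping here).
payload="$(printf '{"prompt": "%s"}' "$prompt")"
echo "$payload"

# Run only if the belt CLI is installed.
if command -v belt >/dev/null 2>&1; then
  belt app run google/gemini-2-5-flash-image --input "$payload"
fi
```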
```bash
# 1. Generate sample input to see all options
belt app sample google/gemini-3-pro-image-preview --save input.json

# 2. Edit the prompt in input.json

# 3. Run
belt app run google/gemini-3-pro-image-preview --input input.json
```
```bash
# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli

# All image generation models
npx skills add inference-sh/skills@ai-image-generation

# Video generation (for image-to-video)
npx skills add inference-sh/skills@ai-video-generation
```
Browse all image apps: `belt app list --category image`