Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

inferencesh/nano-banana

Name: nano-banana
Author: inferencesh

tools/image/nano-banana/SKILL.md

npx skillsauth add inferencesh/skills nano-banana

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Nano Banana - Gemini Native Image Generation

Generate images with Google Gemini native image models via inference.sh CLI.

Nano Banana

Quick Start

Requires inference.sh CLI (belt). Install instructions

belt login

belt app run google/gemini-3-pro-image-preview --input '{"prompt": "a banana in space, photorealistic"}'

Models

| Model | App ID | Speed | Quality | |-------|--------|-------|---------| | Gemini 3 Pro Image | google/gemini-3-pro-image-preview | Slower | Best | | Gemini 2.5 Flash Image | google/gemini-2-5-flash-image | Fast | Excellent |

Search Gemini Image Apps

belt app list --search "gemini image"

Examples

Basic Text-to-Image

belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "A futuristic cityscape at sunset with flying cars"
}'

Multiple Images

belt app run google/gemini-2-5-flash-image --input '{
  "prompt": "Minimalist logo design for a coffee shop",
  "num_images": 4
}'

Custom Aspect Ratio

belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "Panoramic mountain landscape with northern lights",
  "aspect_ratio": "16:9"
}'

Image Editing (with input image)

belt app run google/gemini-2-5-flash-image --input '{
  "prompt": "Add a rainbow in the sky",
  "images": ["https://example.com/landscape.jpg"]
}'

High Resolution (4K)

belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "Detailed illustration of a medieval castle",
  "resolution": "4K"
}'

With Google Search Grounding

belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "Current weather in Tokyo visualized as an artistic scene",
  "enable_google_search": true
}'

Input Options

| Parameter | Type | Description | |-----------|------|-------------| | prompt | string | Required. What to generate or change | | images | array | Input images for editing (up to 14) | | num_images | integer | Number of images to generate | | aspect_ratio | string | Output ratio: "1:1", "16:9", "9:16", "4:3", "3:4", "auto" | | resolution | string | "1K", "2K", "4K" (Gemini 3 Pro only) | | output_format | string | Output format for images | | enable_google_search | boolean | Enable real-time info grounding |

Prompt Tips

Styles: photorealistic, illustration, watercolor, oil painting, digital art, anime, 3D render

Composition: close-up, wide shot, aerial view, macro, portrait, landscape

Lighting: natural light, studio lighting, golden hour, dramatic shadows, neon

Details: add specific details about textures, colors, mood, atmosphere

Sample Workflow

# 1. Generate sample input to see all options
belt app sample google/gemini-3-pro-image-preview --save input.json

# 2. Edit the prompt
# 3. Run
belt app run google/gemini-3-pro-image-preview --input input.json

Related Skills

# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli

# All image generation models
npx skills add inference-sh/skills@ai-image-generation

# Video generation (for image-to-video)
npx skills add inference-sh/skills@ai-video-generation

Browse all image apps: belt app list --category image

Documentation

Running Apps - How to run apps via CLI
Streaming Results - Real-time progress updates
File Handling - Working with images

inferencesh/nano-banana

tools/image/nano-banana/SKILL.md

Generate images with Google Gemini native image models via inference.sh CLI. Models: Gemini 3 Pro Image, Gemini 2.5 Flash Image. Capabilities: text-to-image, image editing, multi-image input. Triggers: nano banana, gemini image, gemini 3 pro image, gemini 2.5 flash image, google image generation, native image generation, gemini native image

363 stars

tools

Updated Apr 23, 2026

$ install --global

skillsauth

npx skillsauth add inferencesh/skills nano-banana

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 30, 2026, 8:25 PM96.1s1 file scanned

SKILL.md

name:: nano-banana
description:: Generate images with Google Gemini native image models via inference.sh CLI. Models: Gemini 3 Pro Image, Gemini 2.5 Flash Image. Capabilities: text-to-image, image editing, multi-image input. Triggers: nano banana, gemini image, gemini 3 pro image, gemini 2.5 flash image, google image generation, native image generation, gemini native image
allowed-tools:: Bash(belt *)

Nano Banana - Gemini Native Image Generation

Generate images with Google Gemini native image models via inference.sh CLI.

Nano Banana

Quick Start

Requires inference.sh CLI (belt). Install instructions

belt login

belt app run google/gemini-3-pro-image-preview --input '{"prompt": "a banana in space, photorealistic"}'

Models

Search Gemini Image Apps

belt app list --search "gemini image"

Examples

Basic Text-to-Image

belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "A futuristic cityscape at sunset with flying cars"
}'

Multiple Images

belt app run google/gemini-2-5-flash-image --input '{
  "prompt": "Minimalist logo design for a coffee shop",
  "num_images": 4
}'

Custom Aspect Ratio

belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "Panoramic mountain landscape with northern lights",
  "aspect_ratio": "16:9"
}'

Image Editing (with input image)

belt app run google/gemini-2-5-flash-image --input '{
  "prompt": "Add a rainbow in the sky",
  "images": ["https://example.com/landscape.jpg"]
}'

High Resolution (4K)

belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "Detailed illustration of a medieval castle",
  "resolution": "4K"
}'

With Google Search Grounding

belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "Current weather in Tokyo visualized as an artistic scene",
  "enable_google_search": true
}'

Input Options

Prompt Tips

Styles: photorealistic, illustration, watercolor, oil painting, digital art, anime, 3D render

Composition: close-up, wide shot, aerial view, macro, portrait, landscape

Lighting: natural light, studio lighting, golden hour, dramatic shadows, neon

Details: add specific details about textures, colors, mood, atmosphere

Sample Workflow

# 1. Generate sample input to see all options
belt app sample google/gemini-3-pro-image-preview --save input.json

# 2. Edit the prompt
# 3. Run
belt app run google/gemini-3-pro-image-preview --input input.json

Related Skills

# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli

# All image generation models
npx skills add inference-sh/skills@ai-image-generation

# Video generation (for image-to-video)
npx skills add inference-sh/skills@ai-video-generation

Browse all image apps: belt app list --category image

Documentation

Running Apps - How to run apps via CLI
Streaming Results - Real-time progress updates
File Handling - Working with images

Related Skills

inferencesh/ai-content-pipeline

tools

Community

Build multi-step AI content creation pipelines combining image, video, audio, and text. Workflow examples: generate image -> animate -> add voiceover -> merge with music. Tools: FLUX, Veo, Kokoro TTS, OmniHuman, media merger, upscaling. Use for: YouTube videos, social media content, marketing materials, automated content. Triggers: content pipeline, ai workflow, content creation, multi-step ai, content automation, ai video workflow, generate and edit, ai content factory, automated content creation, ai production pipeline, media pipeline, content at scale

457SKILL.mdUpdated Apr 23, 2026

inferencesh/ai-content-pipeline

Security Scans

mcp-scan — Pending Scan

Semgrep — Pending Scan

Trivy — Pending Scan

OWASP — Pending Scan

VirusTotal — Pending Scan

inferencesh/ai-automation-workflows

tools

VerifiedTrustedCommunity

Build automated AI workflows combining multiple models and services. Patterns: batch processing, scheduled tasks, event-driven pipelines, agent loops. Tools: inference.sh CLI, bash scripting, Python SDK, webhook integration. Use for: content automation, data processing, monitoring, scheduled generation. Triggers: ai automation, workflow automation, batch processing, ai pipeline, automated content, scheduled ai, ai cron, ai batch job, automated generation, ai workflow, content at scale, automation script, ai orchestration

450SKILL.mdUpdated Apr 21, 2026

inferencesh/ai-automation-workflows

inferencesh/ai-podcast

data-ai

VerifiedTrustedCommunity

Generate multi-person talking head podcast videos from scratch using AI — character creation, TTS, avatar animation, and video stitching. Use when the user wants to create a podcast, talking head video, or multi-speaker conversation video.

442SKILL.mdUpdated May 17, 2026

inferencesh/ai-podcast

inferencesh/ai-image-generation

tools

VerifiedTrustedCommunity

Generate AI images with GPT-Image-2, FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI. Models: GPT-Image-2, FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, ai image, text to image, stable diffusion, generate image, ai art, midjourney alternative, dall-e alternative, text2img, t2i, image generator, ai picture, create image with ai, generative ai, ai illustration, grok image, gemini image, gpt image, openai image, chatgpt image

414SKILL.mdUpdated Apr 23, 2026

inferencesh/ai-image-generation

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/inferencesh/skills.git

# Copy into Claude Code skills folder (global)
cp -r skills/tools/image/nano-banana ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

inferencesh/skills

363 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT