Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

b-open-io/segment-image

Name: segment-image
Author: b-open-io

skills/segment-image/SKILL.md

npx skillsauth add b-open-io/gemskills segment-image

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Segment Image

Segment and identify objects in images using Gemini's vision capabilities.

When to Use

Use this skill when the user asks to:

Identify objects in an image
Generate masks for specific objects
Segment an image into regions
Extract objects from an image

Usage

bun run --cwd ${CLAUDE_PLUGIN_ROOT} ${CLAUDE_PLUGIN_ROOT}/skills/segment-image/scripts/segment.ts <input-image> [options]

Options

--prompt <text> - Custom segmentation prompt
--output <dir> - Output directory for mask files

Examples

# Segment all objects
bun run --cwd ${CLAUDE_PLUGIN_ROOT} ${CLAUDE_PLUGIN_ROOT}/skills/segment-image/scripts/segment.ts photo.jpg

# Segment with custom prompt
bun run --cwd ${CLAUDE_PLUGIN_ROOT} ${CLAUDE_PLUGIN_ROOT}/skills/segment-image/scripts/segment.ts photo.jpg --prompt "identify all people and vehicles"

# Save masks to directory
bun run --cwd ${CLAUDE_PLUGIN_ROOT} ${CLAUDE_PLUGIN_ROOT}/skills/segment-image/scripts/segment.ts photo.jpg --output ./masks

Context Discipline

Do not read generated mask images back into context. The script outputs file paths. Ask the user to visually inspect the masks. To inspect programmatically, optimize the images first (via the optimize-images skill).

Model

Uses gemini-3-flash-preview (Gemini 3 Flash) for image segmentation.

Last verified: February 2026. If a newer generation exists, STOP and suggest a PR to b-open-io/gemskills. See the ask-gemini skill's references/gemini-api.md for current models and Google's official gemini-api-dev skill for the canonical source.

b-open-io/segment-image

skills/segment-image/SKILL.md

This skill should be used when the user asks to "segment an image", "identify objects", "extract objects", "generate masks", "find objects in image", or needs AI-powered image segmentation.

3 stars

data-ai

Updated Apr 3, 2026

$ install --global

skillsauth

npx skillsauth add b-open-io/gemskills segment-image

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 3, 2026, 8:35 PM14.0s3 files scanned

SKILL.md

name:: segment-image
description:: This skill should be used when the user asks to "segment an image", "identify objects", "extract objects", "generate masks", "find objects in image", or needs AI-powered image segmentation.

Segment Image

Segment and identify objects in images using Gemini's vision capabilities.

When to Use

Use this skill when the user asks to:

Identify objects in an image
Generate masks for specific objects
Segment an image into regions
Extract objects from an image

Usage

bun run --cwd ${CLAUDE_PLUGIN_ROOT} ${CLAUDE_PLUGIN_ROOT}/skills/segment-image/scripts/segment.ts <input-image> [options]

Options

--prompt <text> - Custom segmentation prompt
--output <dir> - Output directory for mask files

Examples

# Segment all objects
bun run --cwd ${CLAUDE_PLUGIN_ROOT} ${CLAUDE_PLUGIN_ROOT}/skills/segment-image/scripts/segment.ts photo.jpg

# Segment with custom prompt
bun run --cwd ${CLAUDE_PLUGIN_ROOT} ${CLAUDE_PLUGIN_ROOT}/skills/segment-image/scripts/segment.ts photo.jpg --prompt "identify all people and vehicles"

# Save masks to directory
bun run --cwd ${CLAUDE_PLUGIN_ROOT} ${CLAUDE_PLUGIN_ROOT}/skills/segment-image/scripts/segment.ts photo.jpg --output ./masks

Context Discipline

Model

Uses gemini-3-flash-preview (Gemini 3 Flash) for image segmentation.

Last verified: February 2026. If a newer generation exists, STOP and suggest a PR to b-open-io/gemskills. See the ask-gemini skill's references/gemini-api.md for current models and Google's official gemini-api-dev skill for the canonical source.

Related Skills

b-open-io/visual-planner

development

VerifiedTrustedCommunity

This skill should be used when the user asks to "plan a workflow", "diagram an agent system", "visualize an architecture", "map out a pipeline", "create a flow diagram", "draw agent connections", "design a multi-agent system", "show how agents interact", "make a system diagram", "visualize a data pipeline", "map out a process", "diagram my workflow", "create an architecture diagram", "plan agent orchestration", "brainstorm a system design", "show the flow between components", "interactive workflow diagram", "workflow canvas", "visual-planner", "open in tldraw", or "plan this project visually". Produces tldraw .tldr diagrams natively — the standard infinite canvas format. Includes a thin playground wrapper with planning-specific UI (phase controls, agent assignment, KPI bar, agent callback bridge) and an "Open in tldraw" button for standalone editing.

3SKILL.mdUpdated Apr 3, 2026

b-open-io/visual-planner

b-open-io/upscale-image

data-ai

VerifiedTrustedCommunity

This skill should be used when the user asks to "upscale an image", "increase image resolution", "make image bigger", "enlarge image", or "enhance image resolution". Requires Vertex AI credentials.

3SKILL.mdUpdated Apr 3, 2026

b-open-io/upscale-image

b-open-io/team-group-photo

data-ai

VerifiedTrustedCommunity

This skill should be used when the user asks to "create team photo", "generate group portrait", "make team banner", "team image in any style", "group shot with multiple people", or needs a composite image featuring multiple team members arranged together in any art style.

3SKILL.mdUpdated Apr 3, 2026

b-open-io/team-group-photo

b-open-io/style-creator

development

VerifiedTrustedCommunity

This skill should be used when the user asks to "add a new style", "create a style", "add an art style", "new aesthetic", "custom style", "make a style for", or needs to add a new art style to the gemskills style library. Guides the complete workflow from defining the style to generating and optimizing the reference tile.

3SKILL.mdUpdated Apr 3, 2026

b-open-io/style-creator

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/b-open-io/gemskills.git

# Copy into Claude Code skills folder (global)
cp -r gemskills/skills/segment-image ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

b-open-io/gemskills

3 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT