describe-image/SKILL.md
Generates a short text description of an image file using AI.
npx skillsauth add supercorks/agent-skills describe-imageInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Use this skill when you need to understand the content of an image file in the project, or when you need to generate a description for an image (e.g. for alt text or filename generation).
Script location: .github/skills/describe-image/
@google/genai and dotenv npm packagesFor --google:
GOOGLE_AI_API_KEY environment variable setzsh -ilc)For --llava:
ollama installed and available in PATHllava).jpg, .jpeg).png).webp).heic, .heif)To describe an image, run the following command:
node .github/skills/describe-image/index.js [options] <image_path>
--google Use Google Gemini (default provider)--llava Use local Ollama model--prompt "..." Custom prompt text--model <name> Override model (gemini-3-flash-preview for Google, llava for Ollama)-h, --help Show usageExample:
node .github/skills/describe-image/index.js ./public/properties/shared/image_01.jpg
node .github/skills/describe-image/index.js --llava ./public/properties/shared/image_01.jpg
node .github/skills/describe-image/index.js --google --prompt "List visible buttons" ./public/properties/shared/image_01.jpg
If --llava is used and ollama is not installed, the script prints installation steps (including Homebrew commands) and exits.
Output:
A modern living room with a white sectional sofa and large windows overlooking a city skyline.
The script outputs a concise 10-20 word description of the image content, focusing on the main subject and key features.
tools
Read local coding-agent thread/session history for Codex, Codex CLI, Copilot CLI, Copilot Chat local VS Code threads, and Claude Code. Use when asked to find, inventory, summarize, search, or inspect local agent conversations, session subjects, transcript files, or agent thread storage.
tools
Operate Google Workspace (Drive, Gmail, Calendar, Sheets, Docs, Slides, Chat, Admin, Forms, Tasks…) via the @googleworkspace/cli (`gws`) Rust CLI distributed on npm. Use for any read/write task across Workspace APIs — list/search/upload Drive files (incl. shared drives), send/read mail, manage calendar events, sheets cell ops, etc.
development
Find useful Agent Skills in curated public skill repositories for a user query. Searches official and community skill repos, ranks matching SKILL.md files, and recommends candidates with source trust notes.
tools
Use Browserbase and the browse CLI for browser automation, Fetch/Search API work, authenticated remote browsing, UI QA, debugging, tracing, and Browserbase platform or Functions workflows with multi-account support.