plugins/gemini-bridge/skills/vision/SKILL.md
Transforms visual inputs (screenshots, UI mockups, architecture diagrams, whiteboard photos, PDFs) into code, documentation, or structured data using Gemini 2.5 Pro's multimodal capabilities. Activate when the user shares an image file and asks to "convert this mockup to code", "generate a component from this screenshot", "what does this diagram show", "extract this architecture", or "turn this whiteboard into code".
npx skillsauth add talent-factory/claude-plugins vision
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Transform visual inputs into code, documentation, or structured data using Gemini 2.5 Pro's multimodal capabilities.
| Format | Typical use case |
|---|---|
| PNG / JPG / WEBP | UI screenshots, mockups, whiteboard photos |
| GIF | Animated UI flows |
| PDF | Architecture documents, specifications, diagrams |
Verify the file exists and is readable:
ls -lh <image_path>
If the user has not provided a path, ask for it before proceeding.
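The same pre-flight check can be scripted when a shell is not convenient. The following is a minimal sketch, not part of the skill itself: `check_image_path` is a hypothetical helper, and the extension set is an assumption derived from the supported-formats table (with `.jpeg` added as a common alias for JPG).

```python
import os
from pathlib import Path

# Assumption: extensions derived from the supported-formats table;
# ".jpeg" is added here as a common alias for JPG.
SUPPORTED_EXTENSIONS = {".png", ".jpg", ".jpeg", ".webp", ".gif", ".pdf"}


def check_image_path(image_path: str) -> Path:
    """Hypothetical helper: return the resolved path or raise if unusable."""
    path = Path(image_path).expanduser()
    if not path.is_file():
        raise FileNotFoundError(f"No such file: {path}")
    if not os.access(path, os.R_OK):
        raise PermissionError(f"File is not readable: {path}")
    if path.suffix.lower() not in SUPPORTED_EXTENSIONS:
        raise ValueError(f"Unsupported format: {path.suffix or '(none)'}")
    return path.resolve()
```

A failure at this point is a signal to go back to the user and ask for a valid path rather than to call the analysis tool.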
gemini_status()
gemini_analyze_image(
image_path=<absolute_or_relative_path>,
question=<task_description>
)
Craft the question to be as specific as possible. Examples:
- "Generate a React TypeScript component matching this UI mockup exactly. Include all visible states and props."
- "Convert this UML class diagram to a Python dataclass hierarchy."
- "Extract the algorithm described on this whiteboard and implement it in Python with type hints."
- "Identify all UI components visible in this screenshot and list their props and interactions."

Use Gemini's output as a scaffold. Claude then:
Always attribute the initial analysis:
**Initial analysis by Gemini 2.5 Pro**
[Gemini's output]
**Refined implementation**
[Claude's refined version]
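If the attribution block is assembled programmatically, it might look like the sketch below. `format_attributed_result` is a hypothetical name, and the exact spacing between the two sections is an assumption; only the two bold headings come from the template above.

```python
def format_attributed_result(gemini_output: str, refined_output: str) -> str:
    """Hypothetical helper: wrap both outputs in the attribution template."""
    initial = "**Initial analysis by Gemini 2.5 Pro**\n" + gemini_output.strip()
    refined = "**Refined implementation**\n" + refined_output.strip()
    # Assumption: one blank line separates the two sections.
    return initial + "\n\n" + refined
```

Keeping the Gemini section first preserves the order of the template, so a reader can see the scaffold before the refinement.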
| Input | Output |
|---|---|
| UI mockup | React / Vue component with props |
| Architecture diagram | PlantUML / Mermaid diagram source |
| Whiteboard algorithm | Implemented function with tests |
| Legacy screenshot | Migration planning document |
| PDF specification | Structured data model or IaC template |
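One way to turn the input/output pairs above into a specific question for `gemini_analyze_image` is a simple lookup. This is an illustrative sketch only: `OUTPUT_FOR_INPUT` and `build_question` are hypothetical names, and the sentence template is an assumption rather than wording the skill prescribes.

```python
# Pairs copied from the input/output table above.
OUTPUT_FOR_INPUT = {
    "UI mockup": "React / Vue component with props",
    "Architecture diagram": "PlantUML / Mermaid diagram source",
    "Whiteboard algorithm": "Implemented function with tests",
    "Legacy screenshot": "Migration planning document",
    "PDF specification": "Structured data model or IaC template",
}


def build_question(input_kind: str, detail: str = "") -> str:
    """Hypothetical helper: compose a question string for a known input kind."""
    if input_kind not in OUTPUT_FOR_INPUT:
        raise ValueError(f"Unknown input kind: {input_kind!r}")
    question = (
        f"Input type: {input_kind}. "
        f"Expected output: {OUTPUT_FOR_INPUT[input_kind]}."
    )
    # Optional extra constraints keep the question as specific as possible.
    if detail.strip():
        question += " " + detail.strip()
    return question
```

For example, `build_question("UI mockup", "Include all visible states.")` yields a question that names both the input type and the expected artefact before the extra constraint.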