Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

acnlabs/layers/faculties/vision

Name: layers/faculties/vision
Author: acnlabs

layers/faculties/vision/SKILL.md

npx skillsauth add acnlabs/openpersona layers/faculties/vision

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Vision Faculty — Sense

Perceive and interpret visual content natively through your model's vision capability. You can receive images, screenshots, diagrams, charts, and video frames as part of a conversation — treat them as a natural input channel, not an exception.

When to Engage Vision

Always engage when the user shares an image — do not ask for a text description if you can perceive the image directly.

Proactively describe relevant visual content when it materially affects your response:

A screenshot showing an error → identify the error, not just acknowledge the image
A diagram of a system → explain what the diagram shows before answering questions about it
A photo of a person or scene → describe what you perceive, then respond to the user's actual question

Do not narrate your own perception process ("I am now analyzing the image..."). Engage with the content directly.

Perception Principles

Accuracy over confidence

Describe what you can see clearly. Acknowledge ambiguity when present ("the text in the bottom-right is partially cut off").
Do not fabricate details that are not visible. If something is unclear, say so.

Context-first interpretation

Read the image in context of the conversation. A photo in a health conversation has different weight than the same photo in a creative writing session.
Align visual interpretation with your persona's role and domain.

Privacy by default

Do not retain, memorize, or reference image content in future conversations unless the user explicitly asks you to remember it.
If an image contains identifiable faces or personal data, engage with the user's actual question — do not gratuitously describe personal identifying details beyond what the task requires.
If an image appears to contain sensitive personal, medical, or financial information, acknowledge what the user is asking about without quoting sensitive data back verbatim.

Graceful Degradation

When vision is unavailable (model does not support vision, image failed to load, or no image was shared):

Do not pretend to see — never hallucinate image content.
Inform briefly and continue: "I can't see the image in this context — could you describe what you're looking at?" Keep it conversational, not technical.
Emit a signal if vision is expected but unavailable in your environment:

node scripts/state-sync.js signal capability_gap '{"need":"vision","reason":"image shared but model cannot process it","priority":"high"}'

Interaction Patterns

| Scenario | Behavior | |---|---| | User shares image with no text | Describe what you perceive, then invite the user's question | | User shares image with a question | Answer the question using the visual content | | User asks about an image you cannot see | Acknowledge the limitation, ask for description | | Multiple images in one message | Address each one, or focus on the one most relevant to the question | | Image contains text (OCR use case) | Read and use the text; note if portions are illegible | | Chart or diagram | Interpret the data/structure, not just the visual layout |

Provider Notes

Vision capability is declared in body.runtime.modalities (e.g. { "type": "vision", "provider": "claude-vision" }). The provider determines what image formats and sizes are accepted. No separate script is required — vision is a native model capability. If the declared provider differs from your active model, emit a capability_gap signal.

acnlabs/layers/faculties/vision

layers/faculties/vision/SKILL.md

# Vision Faculty — Sense Perceive and interpret visual content natively through your model's vision capability. You can receive images, screenshots, diagrams, charts, and video frames as part of a conversation — treat them as a natural input channel, not an exception. --- ## When to Engage Vision **Always engage** when the user shares an image — do not ask for a text description if you can perceive the image directly. **Proactively describe** relevant visual content when it materially affec

18 stars

data-ai

Updated Apr 18, 2026

$ install --global

skillsauth

npx skillsauth add acnlabs/openpersona layers/faculties/vision

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 18, 2026, 8:16 AM41.2s2 files scanned

SKILL.md

Vision Faculty — Sense

When to Engage Vision

Always engage when the user shares an image — do not ask for a text description if you can perceive the image directly.

Proactively describe relevant visual content when it materially affects your response:

A screenshot showing an error → identify the error, not just acknowledge the image
A diagram of a system → explain what the diagram shows before answering questions about it
A photo of a person or scene → describe what you perceive, then respond to the user's actual question

Do not narrate your own perception process ("I am now analyzing the image..."). Engage with the content directly.

Perception Principles

Accuracy over confidence

Describe what you can see clearly. Acknowledge ambiguity when present ("the text in the bottom-right is partially cut off").
Do not fabricate details that are not visible. If something is unclear, say so.

Context-first interpretation

Read the image in context of the conversation. A photo in a health conversation has different weight than the same photo in a creative writing session.
Align visual interpretation with your persona's role and domain.

Privacy by default

Do not retain, memorize, or reference image content in future conversations unless the user explicitly asks you to remember it.
If an image contains identifiable faces or personal data, engage with the user's actual question — do not gratuitously describe personal identifying details beyond what the task requires.
If an image appears to contain sensitive personal, medical, or financial information, acknowledge what the user is asking about without quoting sensitive data back verbatim.

Graceful Degradation

When vision is unavailable (model does not support vision, image failed to load, or no image was shared):

Do not pretend to see — never hallucinate image content.
Inform briefly and continue: "I can't see the image in this context — could you describe what you're looking at?" Keep it conversational, not technical.
Emit a signal if vision is expected but unavailable in your environment:

node scripts/state-sync.js signal capability_gap '{"need":"vision","reason":"image shared but model cannot process it","priority":"high"}'

Interaction Patterns

Provider Notes

Related Skills

acnlabs/persona-evaluator

tools

VerifiedTrustedCommunity

Audit any OpenPersona (or peer LLM-agent) persona in three complementary modes: structural (CLI, deterministic, CI-friendly: 4 Layers × 5 Systemic Concepts × Constitution gate with role-aware severity), semantic white-box (LLM reads pack-content JSON and scores Soul-narrative quality via rubrics), and semantic black-box (LLM evaluates a remote agent it cannot read on disk, via A2A handshake / consent-probe / passive observation, with confidence caps). Produces quality reports with dimension scores, strengths, and actionable improvements. Use when asked to evaluate, audit, score, review, self-review, peer-review, or black-box review an agent.

21SKILL.mdUpdated Apr 27, 2026

acnlabs/persona-evaluator

acnlabs/brand-persona-skill

tools

VerifiedTrustedCommunity

Distill any commercial entity into a personalized brand agent — a living brand persona with authentic voice, declared service capabilities, and a standard service contract. Every commercial entity has a brand: a name, a style, a way of showing up in the world. This skill exists so that a street vendor, a family clinic, and a global chain can all have their own agent on equal footing. Supports both distillation from existing brand content and declaration from scratch.

21SKILL.mdUpdated Apr 20, 2026

acnlabs/brand-persona-skill

acnlabs/persona-secondme-skill

development

VerifiedTrustedCommunity

A local-first personal AI double framework that helps users build, govern, and evolve their own digital self with clear

21SKILL.mdUpdated Apr 18, 2026

acnlabs/persona-secondme-skill

acnlabs/secondme-skill

development

VerifiedTrustedCommunity

A complete pipeline to build your AI Second Me: distill your identity from personal data, grow a private knowledge base, train a local model, and govern what gets shared.

21SKILL.mdUpdated Apr 18, 2026

acnlabs/secondme-skill

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/acnlabs/openpersona.git

# Copy into Claude Code skills folder (global)
cp -r openpersona/layers/faculties/vision ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

acnlabs/openpersona

18 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT