Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

omidzamani/dspy-adapters-multimodal

Name: dspy-adapters-multimodal
Author: omidzamani

skills/dspy-adapters-multimodal/SKILL.md

npx skillsauth add omidzamani/dspy-skills dspy-adapters-multimodal

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

DSPy Adapters and Multimodal I/O

Goal

Choose an adapter deliberately and model image, audio, and file inputs with DSPy's typed primitives.

Adapter Selection

| Adapter | Use it for | |---------|------------| | dspy.ChatAdapter() | Default, human-readable field markers, broad model compatibility | | dspy.JSONAdapter() | Structured JSON output and native function calling where supported | | dspy.XMLAdapter() | XML-tagged fields when XML is easier for the target LM to follow | | dspy.TwoStepAdapter() | A separate extraction pass when parsing needs extra help |

Configure globally or for a limited scope:

import dspy

dspy.configure(
    lm=dspy.LM("openai/gpt-4o-mini"),
    adapter=dspy.JSONAdapter(),
)

with dspy.context(adapter=dspy.XMLAdapter()):
    result = dspy.Predict("question -> answer")(question="What is DSPy?")

Native Function Calling

JSONAdapter enables native function calling by default. ChatAdapter keeps text parsing by default. Override either behavior explicitly:

chat_native = dspy.ChatAdapter(use_native_function_calling=True)
json_manual = dspy.JSONAdapter(use_native_function_calling=False)

DSPy falls back to manual parsing when the configured LM does not support native function calling.

Image Inputs

class DescribeImage(dspy.Signature):
    image: dspy.Image = dspy.InputField()
    description: str = dspy.OutputField()

describe = dspy.Predict(DescribeImage)
result = describe(image=dspy.Image("./diagram.png"))

Pass a local path, HTTP URL, bytes, PIL image, or existing data URI directly to dspy.Image(...).

Audio and File Inputs

class SummarizeAudio(dspy.Signature):
    audio: dspy.Audio = dspy.InputField()
    summary: str = dspy.OutputField()

audio = dspy.Audio.from_file("./meeting.wav")
summary = dspy.Predict(SummarizeAudio)(audio=audio)

class SummarizeFile(dspy.Signature):
    file: dspy.File = dspy.InputField()
    summary: str = dspy.OutputField()

document = dspy.File.from_path("./research.pdf")
summary = dspy.Predict(SummarizeFile)(file=document)

Provider capabilities vary. Verify that the selected model accepts the media type before deployment.

Best Practices

Start with ChatAdapter; switch only for a measured reason.
Use typed signatures for structured output.
Test adapter behavior against the exact production model.
Avoid deprecated Image.from_file() and Image.from_url() helpers; call dspy.Image(...).
Keep local file handling and uploaded file IDs within provider policy.

Related Skills

Design signatures: dspy-signature-designer
Build tool agents: dspy-react-agent-builder

Official Documentation

Adapters guide: https://dspy.ai/learn/programming/adapters/
Tools guide: https://dspy.ai/learn/programming/tools/
XMLAdapter API: https://dspy.ai/api/adapters/XMLAdapter/
Image API: https://dspy.ai/api/primitives/Image/
Audio API: https://dspy.ai/api/primitives/Audio/

omidzamani/dspy-adapters-multimodal

skills/dspy-adapters-multimodal/SKILL.md

This skill should be used when the user asks to "choose a DSPy adapter", "use JSONAdapter", "use XMLAdapter", "enable native function calling", "send images, audio, or files to DSPy", mentions `dspy.ChatAdapter`, `dspy.JSONAdapter`, `dspy.XMLAdapter`, `dspy.Image`, `dspy.Audio`, `dspy.File`, structured outputs, or multimodal DSPy signatures.

78 stars

data-ai

Updated Jun 3, 2026

$ install --global

skillsauth

npx skillsauth add omidzamani/dspy-skills dspy-adapters-multimodal

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Jun 3, 2026, 3:10 AM24.9s1 file scanned

SKILL.md

name:: dspy-adapters-multimodal
version:: 2.0.0
dspy-compatibility:: 3.2.1
description:: This skill should be used when the user asks to "choose a DSPy adapter", "use JSONAdapter", "use XMLAdapter", "enable native function calling", "send images, audio, or files to DSPy", mentions `dspy.ChatAdapter`, `dspy.JSONAdapter`, `dspy.XMLAdapter`, `dspy.Image`, `dspy.Audio`, `dspy.File`, structured outputs, or multimodal DSPy signatures.

DSPy Adapters and Multimodal I/O

Goal

Choose an adapter deliberately and model image, audio, and file inputs with DSPy's typed primitives.

Adapter Selection

Configure globally or for a limited scope:

import dspy

dspy.configure(
    lm=dspy.LM("openai/gpt-4o-mini"),
    adapter=dspy.JSONAdapter(),
)

with dspy.context(adapter=dspy.XMLAdapter()):
    result = dspy.Predict("question -> answer")(question="What is DSPy?")

Native Function Calling

JSONAdapter enables native function calling by default. ChatAdapter keeps text parsing by default. Override either behavior explicitly:

chat_native = dspy.ChatAdapter(use_native_function_calling=True)
json_manual = dspy.JSONAdapter(use_native_function_calling=False)

DSPy falls back to manual parsing when the configured LM does not support native function calling.

Image Inputs

class DescribeImage(dspy.Signature):
    image: dspy.Image = dspy.InputField()
    description: str = dspy.OutputField()

describe = dspy.Predict(DescribeImage)
result = describe(image=dspy.Image("./diagram.png"))

Pass a local path, HTTP URL, bytes, PIL image, or existing data URI directly to dspy.Image(...).

Audio and File Inputs

class SummarizeAudio(dspy.Signature):
    audio: dspy.Audio = dspy.InputField()
    summary: str = dspy.OutputField()

audio = dspy.Audio.from_file("./meeting.wav")
summary = dspy.Predict(SummarizeAudio)(audio=audio)

class SummarizeFile(dspy.Signature):
    file: dspy.File = dspy.InputField()
    summary: str = dspy.OutputField()

document = dspy.File.from_path("./research.pdf")
summary = dspy.Predict(SummarizeFile)(file=document)

Provider capabilities vary. Verify that the selected model accepts the media type before deployment.

Best Practices

Start with ChatAdapter; switch only for a measured reason.
Use typed signatures for structured output.
Test adapter behavior against the exact production model.
Avoid deprecated Image.from_file() and Image.from_url() helpers; call dspy.Image(...).
Keep local file handling and uploaded file IDs within provider policy.

Related Skills

Design signatures: dspy-signature-designer
Build tool agents: dspy-react-agent-builder

Official Documentation

Adapters guide: https://dspy.ai/learn/programming/adapters/
Tools guide: https://dspy.ai/learn/programming/tools/
XMLAdapter API: https://dspy.ai/api/adapters/XMLAdapter/
Image API: https://dspy.ai/api/primitives/Image/
Audio API: https://dspy.ai/api/primitives/Audio/

Related Skills

omidzamani/dspy-simba-optimizer

tools

VerifiedTrustedCommunity

This skill should be used when the user asks to "optimize with SIMBA", "use mini-batch introspective optimization", "generate self-reflective rules", mentions "SIMBA optimizer", "stochastic mini-batch ascent", "output variability", or needs an alternative to MIPROv2/GEPA that evolves rules and demonstrations from numeric metrics.

78SKILL.mdUpdated Jun 3, 2026

omidzamani/dspy-simba-optimizer

omidzamani/dspy-signature-designer

data-ai

VerifiedTrustedCommunity

This skill should be used when the user asks to "create a DSPy signature", "define inputs and outputs", "design a signature", "use InputField or OutputField", "add type hints to DSPy", mentions "signature class", "type-safe DSPy", "Pydantic models in DSPy", or needs to define what a DSPy module should do with structured inputs and outputs.

78SKILL.mdUpdated Jun 3, 2026

omidzamani/dspy-signature-designer

omidzamani/dspy-reasoning-modules

development

VerifiedTrustedCommunity

This skill should be used when the user asks to "use DSPy RLM", "process a very long context", "use ProgramOfThought", "use CodeAct", "run DSPy modules in parallel", mentions Recursive Language Models, sandboxed Python execution, Deno, `dspy.RLM`, `dspy.ProgramOfThought`, `dspy.CodeAct`, or `dspy.Parallel`, or needs to choose a DSPy reasoning module beyond Predict, ChainOfThought, and ReAct.

78SKILL.mdUpdated Jun 3, 2026

omidzamani/dspy-reasoning-modules

omidzamani/dspy-react-agent-builder

tools

VerifiedTrustedCommunity

This skill should be used when the user asks to "create a ReAct agent", "build an agent with tools", "implement tool-calling agent", "use dspy.ReAct", mentions "agent with tools", "reasoning and acting", "multi-step agent", "agent optimization with GEPA", or needs to build production agents that use tools to solve complex tasks.

78SKILL.mdUpdated Jun 3, 2026

omidzamani/dspy-react-agent-builder

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/omidzamani/dspy-skills.git

# Copy into Claude Code skills folder (global)
cp -r dspy-skills/skills/dspy-adapters-multimodal ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

omidzamani/dspy-skills

78 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT