
hasna/extract

Name: extract
Author: hasna

skills/extract/SKILL.md

npx skillsauth add hasna/skills extract


Extraction Skill

Extract text, data, and structured content from images and PDF documents using OpenAI Vision.

Capabilities

Image OCR: Extract text from images using GPT-4 Vision
PDF Text Extraction: Parse text from PDF documents
Structured Output: Output as plain text, Markdown, or JSON
Custom Prompts: Direct the extraction with specific instructions
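The skill's source is not reproduced on this page, so the following is only a minimal sketch of how an image and a custom prompt could be packaged for the OpenAI Chat Completions API. The helper name `buildVisionRequest`, its parameters, and the `auto` default for detail are assumptions made for illustration; only the `gpt-4o` default model and the `low`/`high`/`auto` detail values come from this document.

```typescript
// Hypothetical helper: not taken from the skill's source code.
type ImageDetail = "low" | "high" | "auto";

type ContentPart =
  | { type: "text"; text: string }
  | { type: "image_url"; image_url: { url: string; detail: ImageDetail } };

interface VisionRequest {
  model: string;
  messages: Array<{ role: "user"; content: ContentPart[] }>;
}

// Builds a Chat Completions request body with one text part (the prompt)
// and one image part (a base64 data URL).
function buildVisionRequest(
  imageBase64: string,
  mimeType: string,
  prompt: string,
  model: string = "gpt-4o",
  detail: ImageDetail = "auto" // assumed default; the document does not state one
): VisionRequest {
  return {
    model,
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: prompt },
          {
            type: "image_url",
            image_url: { url: `data:${mimeType};base64,${imageBase64}`, detail },
          },
        ],
      },
    ],
  };
}
```

The returned object would be serialized with `JSON.stringify` and sent as the request body; reading the file and encoding it to base64 is left out to keep the sketch free of runtime-specific APIs.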

Supported Formats

Input

Images: PNG, JPG, JPEG, GIF, WEBP, BMP, TIFF
Documents: PDF
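As a rough illustration of the input list above, a tool could decide between the image path and the PDF path by file extension. This is a sketch under that assumption; `classifyInput` is a hypothetical name, and the skill may detect file types differently.

```typescript
// Extensions taken from the "Supported Formats" list; the function is hypothetical.
const IMAGE_EXTENSIONS: readonly string[] = [
  "png", "jpg", "jpeg", "gif", "webp", "bmp", "tiff",
];

type InputKind = "image" | "pdf";

// Classifies a path by its (case-insensitive) extension and rejects anything
// that is not in the supported list.
function classifyInput(path: string): InputKind {
  const dot = path.lastIndexOf(".");
  const ext = dot === -1 ? "" : path.slice(dot + 1).toLowerCase();
  if (ext === "pdf") return "pdf";
  if (IMAGE_EXTENSIONS.indexOf(ext) !== -1) return "image";
  throw new Error(`Unsupported input format: ${path}`);
}
```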

Output

text: Clean, readable plain text
markdown: Structured Markdown with headings, lists, and tables
json: Structured JSON with sections, tables, and metadata

Usage

# Extract text from an image
bun run src/index.ts extract --input ./receipt.png --output ./receipt.txt

# Extract as Markdown from a PDF
bun run src/index.ts extract -i ./document.pdf -o ./document.md -f markdown

# Extract with custom prompt
bun run src/index.ts extract \
  --input ./invoice.png \
  --format json \
  --prompt "Extract invoice number, date, total amount, and line items"

# High-detail extraction for small text
bun run src/index.ts extract \
  --input ./handwriting.jpg \
  --detail high \
  --format text

Options

| Option | Short | Description |
|--------|-------|-------------|
| --input | -i | Input file path (required) |
| --output | -o | Output file path (optional) |
| --format | -f | Output format: text, markdown, json |
| --prompt | -p | Custom extraction prompt |
| --model | -m | OpenAI model (default: gpt-4o) |
| --detail | -d | Image detail: low, high, auto |
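To make the option rules concrete, here is a minimal sketch of how these flags might be parsed and validated. The flag names, allowed values, and the `gpt-4o` default come from the options listed above; `parseExtractArgs` and the parsing logic itself are hypothetical and assume every flag is followed by exactly one value.

```typescript
// Hypothetical sketch: not the skill's actual argument parser.
const SHORT_TO_LONG: Record<string, string> = {
  "-i": "input", "-o": "output", "-f": "format",
  "-p": "prompt", "-m": "model", "-d": "detail",
};
const KNOWN_OPTIONS: readonly string[] = [
  "input", "output", "format", "prompt", "model", "detail",
];
const FORMATS: readonly string[] = ["text", "markdown", "json"];
const DETAIL_LEVELS: readonly string[] = ["low", "high", "auto"];

interface ExtractOptions {
  input: string;
  output?: string;
  format?: string;
  prompt?: string;
  model: string;
  detail?: string;
}

function parseExtractArgs(argv: string[]): ExtractOptions {
  const values: Record<string, string> = {};
  // Walk the arguments as flag/value pairs.
  for (let i = 0; i < argv.length; i += 2) {
    const flag = argv[i];
    const value: string | undefined = argv[i + 1];
    const name = flag.indexOf("--") === 0 ? flag.slice(2) : SHORT_TO_LONG[flag];
    if (name === undefined || KNOWN_OPTIONS.indexOf(name) === -1) {
      throw new Error(`Unknown option: ${flag}`);
    }
    if (value === undefined) throw new Error(`Missing value for ${flag}`);
    values[name] = value;
  }
  if (values["input"] === undefined) throw new Error("--input is required");
  if (values["format"] !== undefined && FORMATS.indexOf(values["format"]) === -1) {
    throw new Error(`--format must be one of: ${FORMATS.join(", ")}`);
  }
  if (values["detail"] !== undefined && DETAIL_LEVELS.indexOf(values["detail"]) === -1) {
    throw new Error(`--detail must be one of: ${DETAIL_LEVELS.join(", ")}`);
  }
  return {
    input: values["input"],
    output: values["output"],
    format: values["format"],
    prompt: values["prompt"],
    model: values["model"] || "gpt-4o", // documented default
    detail: values["detail"],
  };
}
```

Rejecting unknown formats and detail levels before any API call keeps a typo from costing a request.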

Environment Variables

export OPENAI_API_KEY="your-openai-key"
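A tool that depends on this variable would typically fail fast when it is missing. The sketch below shows one way to do that; `requireApiKey` is a hypothetical helper, and it takes the environment as a parameter so it stays independent of any particular runtime.

```typescript
// Hypothetical helper: returns the key or throws a descriptive error.
function requireApiKey(env: Record<string, string | undefined>): string {
  const key = (env["OPENAI_API_KEY"] || "").trim();
  if (key === "") {
    throw new Error("OPENAI_API_KEY is not set; export it before running extract");
  }
  return key;
}
```

Under Bun or Node this would be called as `requireApiKey(process.env)`.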

Examples

Receipt Extraction

bun run src/index.ts extract \
  --input ./receipt.jpg \
  --format json \
  --prompt "Extract store name, date, items with prices, subtotal, tax, and total"

Document to Markdown

bun run src/index.ts extract \
  --input ./report.pdf \
  --format markdown \
  --output ./report.md

Handwritten Notes

bun run src/index.ts extract \
  --input ./notes.jpg \
  --detail high \
  --prompt "Transcribe the handwritten text, preserving the structure"

Description: Extract text and structured data from images and PDFs using OpenAI Vision
Stars: 8
Category: data-ai
Updated: Jun 7, 2026

Install this skill globally with one command (works with Claude Code, Cursor, and Windsurf):

npx skillsauth add hasna/skills extract

Security Scan Results

3 of 9 scanners reported clean. Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status | Reported value |
|---------|-------------|--------|----------------|
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: Jun 7, 2026, 2:38 AM (19.6s, 10 files scanned)


Related Skills

hasna/merge-pr (testing)
Merge a GitHub pull request, merge when green, use a merge queue, or decide whether a pull request is mergeable. Use only for explicit merge intent, not ordinary review.
Updated Jul 24, 2026

hasna/performance-audit-report (development)
Generate premium performance audit reports for web apps, APIs, or SaaS surfaces with metrics, findings, budgets, remediation plans, and manifest metadata.
Updated May 15, 2026

hasna/customer-feedback-report (data-ai)
Generate premium customer feedback reports from reviews, support tickets, surveys, call notes, or raw feedback with clusters, sentiment, root causes, roadmap recommendations, evidence, and manifest metadata.
Updated May 15, 2026

hasna/pdf-generate (development)
Generate high-quality PDF documents from markdown, HTML, or templates
Updated May 12, 2026

Install manually

# Clone the repo
git clone https://github.com/hasna/skills.git

# Copy into the Claude Code skills folder (global), creating it if needed
mkdir -p ~/.claude/skills
cp -r skills/skills/extract ~/.claude/skills/

See the official Claude Code Skills docs for the skills folder path.

Repository: hasna/skills (8 stars)

Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT