.claude/skills/document-extract/SKILL.md
--- context: fork --- # /document-extract Extract and analyse content from scanned documents, PDFs, and document images using Sonnet sub-agents. ## Usage ``` /document-extract <file-path> /document-extract +Attachments/meeting-whiteboard.jpg /document-extract +Attachments/spec-document.pdf /document-extract +Attachments/handwritten-notes.png --type "meeting notes" ``` ## Instructions This skill uses **Sonnet model sub-agents** for comprehensive document analysis and text extraction. ### P
npx skillsauth add DavidROliverBA/ArchitectKB .claude/skills/document-extractInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Extract and analyse content from scanned documents, PDFs, and document images using Sonnet sub-agents.
/document-extract <file-path>
/document-extract +Attachments/meeting-whiteboard.jpg
/document-extract +Attachments/spec-document.pdf
/document-extract +Attachments/handwritten-notes.png --type "meeting notes"
This skill uses Sonnet model sub-agents for comprehensive document analysis and text extraction.
Launch these sub-agents using model: "sonnet":
Agent 1: Text Extraction (Sonnet)
Task: Extract all text content from the document
- Read the file using the Read tool
- Perform comprehensive OCR on all visible text
- Preserve structure (headings, paragraphs, lists)
- Handle multiple columns if present
- Extract text from tables maintaining structure
- Note any text that is unclear or uncertain
Return: Complete text extraction with structure preserved
Agent 2: Structure Analysis (Sonnet)
Task: Analyse document structure and formatting
- Read the file
- Identify document type (letter, form, spec, notes, etc.)
- Map section hierarchy
- Identify tables, lists, and special formatting
- Note headers, footers, page numbers
- Identify logos, stamps, signatures
Return: Document structure map
Agent 3: Content Classification (Sonnet)
Task: Classify and categorise the content
- Read the file
- Determine document purpose
- Identify key entities (people, projects, dates, systems)
- Extract action items or tasks
- Find decisions or commitments
- Note any deadlines or dates mentioned
- Identify references to YourOrg projects or systems
Return: Classified content with entity extraction
Agent 4: Quality Assessment (Sonnet)
Task: Assess extraction quality and completeness
- Read the file
- Evaluate image/scan quality
- Identify areas with low confidence extraction
- Note any missing or obscured content
- Assess if re-scan might be needed
- Check for multiple pages
Return: Quality report with confidence scores
# Document Extraction
**Source**: {{filename}}
**Extracted**: {{DATE}}
**Document Type**: {{type}}
**Quality Score**: {{High/Medium/Low}}
## Summary
{{Brief description of document content and purpose}}
## Extracted Content
### Full Text
{{complete extracted text with formatting preserved}}
---
## Structured Data
### Key Information
| Field | Value |
|-------|-------|
| Date | {{if found}} |
| Author | {{if found}} |
| Subject | {{if found}} |
| Reference | {{if found}} |
### People Mentioned
{{list of names with context}}
### Dates & Deadlines
| Date | Context |
|------|---------|
{{dates found}}
### Projects/Systems Referenced
{{links to matching vault notes}}
## Tables Extracted
{{any tables found, formatted as markdown}}
## Action Items Found
- [ ] {{action items extracted from document}}
## Decisions/Commitments
{{any decisions or commitments mentioned}}
## Extraction Notes
### Confidence Assessment
- Overall quality: {{assessment}}
- Unclear sections: {{list any problematic areas}}
- Recommendations: {{re-scan suggestions if needed}}
### Areas of Uncertainty
{{text that couldn't be confidently extracted}}
## Suggested Actions
1. **Create Note**: {{suggest note type and title}}
2. **Link to**: {{suggest related notes}}
3. **Follow up**: {{any actions needed}}
After extraction, offer to:
tools
--- context: fork --- # /youtube Save a YouTube video as both a Weblink (quick reference) and a detailed Page (full analysis). ## Usage ``` /youtube <url> /youtube <url> <optional title override> ``` ## Examples ``` /youtube https://www.youtube.com/watch?v=0TpON5T-Sw4 /youtube https://youtu.be/abc123 AWS re:Invent Keynote ``` ## Prerequisites This skill uses the MCP Docker YouTube tools: - `mcp__MCP_DOCKER__get_video_info` - Video metadata - `mcp__MCP_DOCKER__get_transcript` - Full trans
data-ai
Create and manage git worktrees for parallel agent sessions
testing
--- context: fork --- # /wipe Generate a context handoff summary, clear the session, and resume in a fresh conversation. Detects environment and provides automated (tmux) or manual workflow. ## Usage ``` /wipe /wipe quick # Minimal handoff, just essentials /wipe detailed # Comprehensive handoff with full context ``` ## Instructions When the user invokes `/wipe`: ### Phase 1: Detect Environment First, check the terminal environment: ```bash echo "Environment Detection:"
data-ai
--- context: fork --- # /weekly-summary Generate comprehensive weekly summary from daily notes, meetings, tasks, and project updates using parallel sub-agents. ## Usage ``` /weekly-summary /weekly-summary --last-week /weekly-summary --from 2026-01-01 --to 2026-01-07 /weekly-summary --output page # Create Page note instead of just outputting ``` ## Instructions This skill uses **5 parallel sub-agents** to gather data concurrently from different vault areas, then synthesizes a comprehensi