.claude/skills/datasource/SKILL.md
--- context: fork --- # /datasource Create a DataSource note documenting a specific database, table, API endpoint, dataset, or data entity with schema, quality metrics, and access information. ## Usage ``` /datasource <name> /datasource "SAP Invoices" /datasource "DataPlatform Revenue Fact Table" /datasource "Snowflake Customers" ``` ## Instructions ### Phase 1: Parse Input & Link to System 1. Extract data source name 2. Ask which system owns this data: ``` Which system owns this da
npx skillsauth add DavidROliverBA/ArchitectKB .claude/skills/datasourceInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Create a DataSource note documenting a specific database, table, API endpoint, dataset, or data entity with schema, quality metrics, and access information.
/datasource <name>
/datasource "SAP Invoices"
/datasource "DataPlatform Revenue Fact Table"
/datasource "Snowflake Customers"
Which system owns this data source?
Search: [user searches for System]
Or create new System? (Y/n)
Creating DataSource: {{name}} (owned by {{system}})
1️⃣ Data Type:
- database-table (relational table)
- database-view (virtual table)
- api-endpoint (REST/GraphQL data)
- kafka-topic (event stream)
- data-warehouse-table (Snowflake/BigQuery)
- data-lake (file-based storage)
- cache (Redis/Memcached)
Default: database-table
User input: [selection]
2️⃣ Record Count (approximate):
Default: null
User input: [number, e.g., 5000000]
3️⃣ Data Volume per Day:
Default: null
User input: [e.g., "2.5GB", "500K records"]
4️⃣ Refresh Frequency:
- real-time
- hourly
- daily
- weekly
- on-demand
Default: daily
User input: [selection]
5️⃣ Classification:
- public
- internal
- confidential
- secret
Default: internal
User input: [selection]
Ask: "Add data quality metrics? (Y/n)"
If YES:
- Completeness (%): 98.5
- Uniqueness (%): 99.9
- Accuracy: high | medium | low
- Timeliness: how fresh (< 5 minutes, < 1 hour, etc.)
Key Fields (comma-separated):
invoice_id, vendor_id, amount
Then ask:
Schema details needed? (Y/n)
- Parent entities (sources of this data)
- Child entities (what feeds from this)
- Related tables
How is this data accessed?
- REST API
- GraphQL
- Direct database query
- Kafka topic
- Batch export / S3
- Other
Default: [based on data type]
Which systems consume this data?
Search: [[System - DataPlatform]]
Add: [[System - Analytics]]
type: DataSource
title: "{{name}}"
sourceId: "{{sourceId}}"
sourceSystem: "[[System - {{system}}]]"
owner: "[[{{person}}]]"
dataType: {{type}}
recordCount: {{count}}
volumePerDay: "{{volume}}"
refreshFrequency: {{frequency}}
classification: {{classification}}
gdprApplicable: {{gdpr}}
piiFields: [{{pii}}]
completeness: {{completeness}}
uniqueness: {{uniqueness}}
accuracy: {{accuracy}}
timeliness: {{timeliness}}
exposedVia: [rest-api, kafka-topic]
consumerCount: {{consumer_count}}
criticalConsumers: [{{critical_systems}}]
confidence: medium
freshness: current
verified: false
created: 2026-01-14
tags: [type/data-source, {{sourceSystem|lower}}]
Filename: DataSource - {{name}}.md
Location: Vault root
Output:
✅ Created: DataSource - {{name}}.md
Linked to:
- [[System - {{owning system}}]]
Next steps:
1. Create integration: /integration {{source}} {{target}}
2. Document consumer systems
3. Add to architecture diagram
User: /datasource "SAP Invoices"
System: Which system owns this data?
> SAP S/4HANA
Data Type:
> database-table
Record Count:
> 200000000
Volume per Day:
> 5GB
Refresh Frequency:
> hourly
Classification:
> confidential
Data Quality:
Completeness: 98.5
Uniqueness: 99.9
Accuracy: high
Timeliness: < 5 minutes
Key Fields:
> invoice_id, vendor_id, company_code, amount, invoice_date
Consumers (search for Systems):
> DataPlatform
> Snowflake
> Analytics
✅ Created: DataSource - SAP Invoices.md
Updated:
- [[System - SAP S/4HANA]] added to owns/exposes
- [[System - DataPlatform]] added to consumers
- [[System - Snowflake]] added to consumers
tools
--- context: fork --- # /youtube Save a YouTube video as both a Weblink (quick reference) and a detailed Page (full analysis). ## Usage ``` /youtube <url> /youtube <url> <optional title override> ``` ## Examples ``` /youtube https://www.youtube.com/watch?v=0TpON5T-Sw4 /youtube https://youtu.be/abc123 AWS re:Invent Keynote ``` ## Prerequisites This skill uses the MCP Docker YouTube tools: - `mcp__MCP_DOCKER__get_video_info` - Video metadata - `mcp__MCP_DOCKER__get_transcript` - Full trans
data-ai
Create and manage git worktrees for parallel agent sessions
testing
--- context: fork --- # /wipe Generate a context handoff summary, clear the session, and resume in a fresh conversation. Detects environment and provides automated (tmux) or manual workflow. ## Usage ``` /wipe /wipe quick # Minimal handoff, just essentials /wipe detailed # Comprehensive handoff with full context ``` ## Instructions When the user invokes `/wipe`: ### Phase 1: Detect Environment First, check the terminal environment: ```bash echo "Environment Detection:"
data-ai
--- context: fork --- # /weekly-summary Generate comprehensive weekly summary from daily notes, meetings, tasks, and project updates using parallel sub-agents. ## Usage ``` /weekly-summary /weekly-summary --last-week /weekly-summary --from 2026-01-01 --to 2026-01-07 /weekly-summary --output page # Create Page note instead of just outputting ``` ## Instructions This skill uses **5 parallel sub-agents** to gather data concurrently from different vault areas, then synthesizes a comprehensi