Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

starlake-ai/starflow-domain-discovery

Name: starflow-domain-discovery
Author: starlake-ai

.agents/starflow/skills/starflow-domain-discovery/SKILL.md

npx skillsauth add starlake-ai/starlake-skills starflow-domain-discovery

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Data Domain Discovery

Overview

Guides the user through identifying and documenting all data domains in their organization, mapping data sources to domains, establishing ownership, and defining the boundaries of the data landscape. This produces a domain map that serves as the foundation for all subsequent pipeline design.

Role Guidance: Act as a Business Data Analyst with expertise in data governance and domain-driven design.

Design Rationale: Domain discovery must happen before any pipeline work. Without clear domain boundaries and ownership, pipelines become tangled and ungovernable. This workflow follows Starlake's domain-based organization where each domain maps to a database schema/namespace.

Steps

Step 1: Context Gathering

Ask the user about their organization's data landscape:
- What business functions generate or consume data?
- What existing databases, data warehouses, or data lakes exist?
- What are the key business processes that depend on data?
Document initial understanding.

Step 2: Domain Identification

Group related data entities into logical domains (e.g., sales, inventory, customers, finance).
For each domain, identify:
- Name: kebab-case identifier (maps to Starlake domain directory)
- Description: Business purpose of this domain
- Owner: Team or person responsible
- Sources: Where data originates (databases, APIs, files, streams)
- Consumers: Who/what uses this data downstream
Present domain map for review.

Step 3: Source Cataloging

For each identified source within each domain, document: | Field | Description | |-------|-------------| | Source name | Unique identifier | | Source type | JDBC, file (CSV/JSON/XML/Parquet), API, stream (Kafka) | | Connection | Database/endpoint details | | Format | DSV, JSON, XML, POSITION, Parquet, Avro | | Refresh frequency | Real-time, hourly, daily, weekly, on-demand | | Volume | Approximate row count and growth rate | | Schema stability | Stable, evolving, unpredictable |

Step 4: Dependency Mapping

Map data flows between domains (which domains feed into which).
Identify shared reference data (e.g., country codes, product catalogs).
Flag circular dependencies or tight coupling.
Document the resulting dependency graph.

Step 5: Output Generation

Generate the domain discovery document and save to {planning_artifacts}/domain-discovery-{{project_name}}.md using the template structure.

Outcome

A comprehensive domain discovery document that maps all data domains, sources, ownership, and dependencies — ready to inform data architecture design and Starlake domain configuration.

starlake-ai/starflow-domain-discovery

.agents/starflow/skills/starflow-domain-discovery/SKILL.md

Discover and document data domains, sources, and ownership. Use when the user says "discover data domains" or "map data sources".

1 stars

documentation

Updated Apr 16, 2026

$ install --global

skillsauth

npx skillsauth add starlake-ai/starlake-skills starflow-domain-discovery

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 16, 2026, 3:37 AM7.4s1 file scanned

SKILL.md

name:: starflow-domain-discovery
description:: Discover and document data domains, sources, and ownership. Use when the user says "discover data domains" or "map data sources".

Data Domain Discovery

Overview

Role Guidance: Act as a Business Data Analyst with expertise in data governance and domain-driven design.

Steps

Step 1: Context Gathering

Ask the user about their organization's data landscape:
- What business functions generate or consume data?
- What existing databases, data warehouses, or data lakes exist?
- What are the key business processes that depend on data?
Document initial understanding.

Step 2: Domain Identification

Group related data entities into logical domains (e.g., sales, inventory, customers, finance).
For each domain, identify:
- Name: kebab-case identifier (maps to Starlake domain directory)
- Description: Business purpose of this domain
- Owner: Team or person responsible
- Sources: Where data originates (databases, APIs, files, streams)
- Consumers: Who/what uses this data downstream
Present domain map for review.

Step 3: Source Cataloging

Step 4: Dependency Mapping

Map data flows between domains (which domains feed into which).
Identify shared reference data (e.g., country codes, product catalogs).
Flag circular dependencies or tight coupling.
Document the resulting dependency graph.

Step 5: Output Generation

Generate the domain discovery document and save to {planning_artifacts}/domain-discovery-{{project_name}}.md using the template structure.

Outcome

A comprehensive domain discovery document that maps all data domains, sources, ownership, and dependencies — ready to inform data architecture design and Starlake domain configuration.

Related Skills

starlake-ai/starflow-transform-design

development

VerifiedTrustedCommunity

Design SQL transformations for data pipelines with quality checks and dependency management. Use when the user says "design transforms" or "create SQL transformations".

1SKILL.mdUpdated Apr 16, 2026

starlake-ai/starflow-transform-design

starlake-ai/starflow-sprint-planning

devops

VerifiedTrustedCommunity

Plan and track sprint progress for data pipeline implementation. Use when the user says "sprint planning" or "plan data sprint".

1SKILL.mdUpdated Apr 16, 2026

starlake-ai/starflow-sprint-planning

starlake-ai/starflow-source-analysis

testing

VerifiedTrustedCommunity

Analyze data sources in depth: schema, quality, volume, and extraction strategy. Use when the user says "analyze data source" or "profile this data source".

1SKILL.mdUpdated Apr 16, 2026

starlake-ai/starflow-source-analysis

starlake-ai/starflow-schema-design

data-ai

VerifiedTrustedCommunity

Design Starlake-compatible table schemas with types, constraints, privacy, and expectations. Use when the user says "design schema" or "create table definition".

1SKILL.mdUpdated Apr 16, 2026

starlake-ai/starflow-schema-design

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/starlake-ai/starlake-skills.git

# Copy into Claude Code skills folder (global)
cp -r starlake-skills/.agents/starflow/skills/starflow-domain-discovery ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

starlake-ai/starlake-skills

1 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT