starlake-ai/starflow-dev-pipeline

Name: starflow-dev-pipeline
Author: starlake-ai

.agents/starflow/skills/starflow-dev-pipeline/SKILL.md

npx skillsauth add starlake-ai/starlake-skills starflow-dev-pipeline

Pipeline Implementation

Overview

Implements a data pipeline by generating all necessary Starlake configuration files (YAML + SQL) from a pipeline specification. Sets up the Starlake metadata directory structure, creates load schemas, transform queries, extraction configs, and DAG definitions. Validates the implementation locally with DuckDB.

Role Guidance: Act as a Data Engineer building production-grade data pipelines using Starlake's declarative configuration.

Design Rationale: Implementing a pipeline with Starlake means creating YAML and SQL files, not writing application code. The framework handles execution mechanics, so the focus is on correct configuration, comprehensive schemas, and testable SQL.

Steps

Step 1: Load Specification

Load the pipeline specification from {implementation_artifacts}/pipeline-spec-*.md.
Load related artifacts:
- {planning_artifacts}/data-architecture-*.md
- {planning_artifacts}/schema-design-*.md
- {implementation_artifacts}/transform-design-*.md
- {implementation_artifacts}/orchestration-design-*.md
Confirm scope: which pipeline tasks to implement?
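
Before confirming scope, it can help to see which of the artifacts listed above are actually present. The following is a minimal sketch, not part of the skill itself: the function name list_artifacts and its two directory arguments are hypothetical stand-ins for the {planning_artifacts} and {implementation_artifacts} placeholders, and it assumes the directory paths contain no whitespace.

```shell
# Sketch: report which Step 1 artifacts exist.
# list_artifacts, planning_dir and impl_dir are hypothetical names;
# the file patterns are the ones listed in Step 1.
list_artifacts() {
  local planning_dir="$1" impl_dir="$2"
  local pattern file found
  for pattern in \
    "$impl_dir/pipeline-spec-*.md" \
    "$planning_dir/data-architecture-*.md" \
    "$planning_dir/schema-design-*.md" \
    "$impl_dir/transform-design-*.md" \
    "$impl_dir/orchestration-design-*.md"
  do
    found=0
    # Unquoted on purpose so the shell expands the glob
    # (assumes no whitespace in the directory paths).
    for file in $pattern; do
      if [ -f "$file" ]; then
        echo "found: $file"
        found=1
      fi
    done
    if [ "$found" -eq 0 ]; then
      echo "missing: $pattern"
    fi
  done
}
```

Calling list_artifacts with the two artifact directories prints one "found:" line per matching file and one "missing:" line per pattern with no match, which gives a quick basis for the scope discussion.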

Step 2: Project Bootstrap

If the Starlake project doesn't exist yet:

starlake bootstrap

This creates the base metadata/ directory structure. Then configure:

metadata/application.sl.yml — global settings and connections
metadata/env.sl.yml — base environment variables
metadata/types/ — custom type definitions

Step 3: Implement Load Configuration

For each table in the pipeline:

Create domain directory: metadata/load/{domain}/
Create domain config: metadata/load/{domain}/_config.sl.yml
Create table schema: metadata/load/{domain}/{table}.sl.yml
Validate schema: starlake validate
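
The layout described above can be scaffolded before any schema is written. This is a sketch under stated assumptions: scaffold_load is a hypothetical helper, the files it creates are empty placeholders, and their contents still have to be written according to Starlake's configuration format before starlake validate will pass.

```shell
# Sketch: create the Step 3 file layout for one domain and its tables.
# scaffold_load is a hypothetical helper; only the paths come from Step 3.
scaffold_load() {
  local root="$1" domain="$2" table
  shift 2
  mkdir -p "$root/metadata/load/$domain"
  # touch leaves existing files untouched, so re-running is safe.
  touch "$root/metadata/load/$domain/_config.sl.yml"
  for table in "$@"; do
    touch "$root/metadata/load/$domain/$table.sl.yml"
  done
}
```

For example, scaffold_load . sales orders customers would create metadata/load/sales/ with _config.sl.yml, orders.sl.yml and customers.sl.yml under the current directory (the domain and table names here are illustrative).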

Step 4: Implement Transforms

For each transformation task:

Create transform directory: metadata/transform/{domain}/
Create SQL file: metadata/transform/{domain}/{task}.sql
Create task config: metadata/transform/{domain}/{task}.sl.yml
Create expectations macros: metadata/expectations/{domain}.j2
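
Since each task above gets both a SQL file and a task config, a quick consistency check can catch a forgotten file. The sketch below assumes this pairing is expected for every task in the pipeline; check_transform_pairs is a hypothetical helper, not a Starlake command.

```shell
# Sketch: flag SQL files under metadata/transform/{domain}/ that have
# no sibling {task}.sl.yml. check_transform_pairs is a hypothetical name.
check_transform_pairs() {
  local transform_dir="$1" sql missing=0
  for sql in "$transform_dir"/*/*.sql; do
    # Skip the literal pattern when no SQL file exists yet.
    [ -f "$sql" ] || continue
    if [ ! -f "${sql%.sql}.sl.yml" ]; then
      echo "missing task config for: $sql"
      missing=1
    fi
  done
  # Exit status 0 when every SQL file has a config, 1 otherwise.
  return "$missing"
}
```

Running check_transform_pairs metadata/transform from the project root prints one line per unpaired SQL file and returns a non-zero status if any were found.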

Step 5: Implement Extraction (if applicable)

For JDBC sources:

Create extract config: metadata/extract/{source}.sl.yml
Test extraction: starlake extract-data
Generate schemas from extracted data: starlake infer-schema

Step 6: Implement Orchestration

Create DAG configs: metadata/dags/{dag_name}.sl.yml
Generate DAG code: starlake dag-generate
Verify generated DAG files

Step 7: Local Validation

Run the full pipeline locally with DuckDB:

# Stage incoming files
starlake stage

# Load data
starlake load

# Run transforms (with dependencies)
starlake transform --name {domain}.{task} --recursive

# Check lineage
starlake lineage --task {domain}.{task}

# Validate all configs
starlake validate
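
The sequence above can be wrapped so that it stops at the first failing command. This is a sketch only: run_step, validate_locally and the DRY_RUN variable are hypothetical names introduced here, while the starlake commands and flags are exactly the ones listed in this step.

```shell
# Sketch: run the Step 7 commands in order, stopping at the first failure.
# run_step, validate_locally and DRY_RUN are hypothetical, not Starlake features.
run_step() {
  echo "+ $*"
  # With DRY_RUN=1 the command is only printed, not executed.
  if [ "${DRY_RUN:-0}" != "1" ]; then
    "$@"
  fi
}

validate_locally() {
  local domain="$1" task="$2"
  run_step starlake stage &&
  run_step starlake load &&
  run_step starlake transform --name "$domain.$task" --recursive &&
  run_step starlake lineage --task "$domain.$task" &&
  run_step starlake validate
}
```

Setting DRY_RUN=1 before calling validate_locally with a domain and task name prints the five commands without running them, which is a cheap way to review the plan before touching any data.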

Step 8: Documentation

Generate implementation summary to {implementation_artifacts}/pipeline-impl-{{pipeline_name}}.md covering:

Files created and their purpose
How to run the pipeline locally
Known limitations or TODOs
Deployment instructions
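
One way to start the summary is to generate an empty skeleton with the four topics above as headings. The skill does not prescribe a layout for this file, so the heading structure below is an assumption, and write_summary_skeleton is a hypothetical helper.

```shell
# Sketch: write an empty summary skeleton named pipeline-impl-<pipeline>.md.
# write_summary_skeleton and the heading layout are assumptions; only the
# file name pattern and the four topics come from Step 8.
write_summary_skeleton() {
  local out_dir="$1" pipeline="$2"
  local out_file="$out_dir/pipeline-impl-$pipeline.md"
  mkdir -p "$out_dir"
  cat > "$out_file" <<EOF
# Pipeline implementation: $pipeline

## Files created and their purpose

## How to run the pipeline locally

## Known limitations or TODOs

## Deployment instructions
EOF
  # Print the path so callers can open or extend the file.
  echo "$out_file"
}
```

The function overwrites an existing file of the same name, so it is best run once at the start of documentation and filled in by hand afterwards.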

Related Starlake Skills

Use the bootstrap skill to initialize a new Starlake project
Use the load skill for write strategy and sink configuration details
Use the transform skill for SQL transformation execution options
Use the extract-schema skill for JDBC schema extraction
Use the extract-data skill for data extraction to files
Use the dag-generate skill for DAG generation options and templates
Use the validate skill to check all configuration files
Use the config skill for environment variables and connection setup

Outcome

A fully implemented, locally validated Starlake pipeline with all YAML configuration, SQL transforms, and orchestration DAGs — ready for deployment.

Implement a data pipeline from a pipeline specification, generating Starlake configuration files. Use when the user says "implement pipeline" or "dev this pipeline".

Stars: 1
Category: development
Updated: Apr 16, 2026

npx skillsauth add starlake-ai/starlake-skills starflow-dev-pipeline

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

- Trivy (container and dependency vulnerability scanner): Clean, 95%
- Semgrep (static code analysis for vulnerabilities): Clean, 95%
- mcp-scan (Snyk) (Model Context Protocol security validation): Clean, 95%
- Snyk (dep) (open source security scanning): Skipped, 50%
- Socket.dev (supply chain security analysis): Skipped, 50%
- VirusTotal (multi-engine malware detection): Skipped, 50%
- CrowdStrike (advanced threat intelligence): Skipped, 50%
- OSV-Scanner (Open Source Vulnerability database check): Skipped, 50%
- OWASP Dep-Check: Skipped, 50%

Last scanned: Apr 16, 2026, 3:37 AM (6.5s, 1 file scanned)

Related Skills

starlake-ai/starflow-transform-design
Category: development
Verified, Trusted, Community
Design SQL transformations for data pipelines with quality checks and dependency management. Use when the user says "design transforms" or "create SQL transformations".
1 SKILL.md, updated Apr 16, 2026

starlake-ai/starflow-sprint-planning
Category: devops
Verified, Trusted, Community
Plan and track sprint progress for data pipeline implementation. Use when the user says "sprint planning" or "plan data sprint".
1 SKILL.md, updated Apr 16, 2026

starlake-ai/starflow-source-analysis
Category: testing
Verified, Trusted, Community
Analyze data sources in depth: schema, quality, volume, and extraction strategy. Use when the user says "analyze data source" or "profile this data source".
1 SKILL.md, updated Apr 16, 2026

starlake-ai/starflow-schema-design
Category: data-ai
Verified, Trusted, Community
Design Starlake-compatible table schemas with types, constraints, privacy, and expectations. Use when the user says "design schema" or "create table definition".
1 SKILL.md, updated Apr 16, 2026

Install manually

# Clone the repo
git clone https://github.com/starlake-ai/starlake-skills.git

# Make sure the global Claude Code skills folder exists
mkdir -p ~/.claude/skills

# Copy the skill into the Claude Code skills folder (global)
cp -r starlake-skills/.agents/starflow/skills/starflow-dev-pipeline ~/.claude/skills/

Repository: starlake-ai/starlake-skills (1 star)

Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT