.agents/starflow/skills/starflow-orchestration-design/SKILL.md
Design orchestration DAGs for scheduling and managing data pipeline execution. Use when the user says "design orchestration" or "create DAG configuration".
npx skillsauth add starlake-ai/starlake-skills starflow-orchestration-designInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Guides the design of orchestration workflows that schedule and manage pipeline execution. Produces Starlake DAG configuration files that generate Airflow or Dagster Python code, along with scheduling, dependency, retry, and alerting strategies.
Role Guidance: Act as a Platform Engineer with expertise in workflow orchestration, Airflow/Dagster, and Starlake's DAG generation system.
Design Rationale: Orchestration code should be generated, not hand-written. Starlake derives task dependencies from SQL references and generates DAG code from YAML configuration + Jinja2 templates. This approach eliminates orchestration bugs caused by manually maintained dependency lists.
{implementation_artifacts}/pipeline-spec-*.md{implementation_artifacts}/transform-design-*.mdGroup pipeline tasks into logical DAGs:
| DAG Name | Schedule | Trigger | Tasks | SLA |
|----------|----------|---------|-------|-----|
| dag_daily_sales | 0 6 * * * | Cron | extract_orders, load_orders, transform_daily_agg | 8 AM |
| dag_realtime_events | N/A | File sensor | load_events | 15 min |
For each DAG, define:
Create Starlake DAG definitions:
version: 1
dag:
comment: "Daily sales pipeline - extracts, loads, and transforms sales data"
template: "dag_standard.py.j2"
filename: "dag_daily_sales"
schedule: "0 6 * * *"
options:
catchup: false
dagrun_timeout: 7200
start_date: "2024-01-01"
default_args:
retries: 2
retry_delay: 300
Document the DAG assignment hierarchy:
application.sl.yml → dagRef (applies to all tasks){domain}/_config.sl.yml → dagRef (applies to domain tasks){task}.sl.yml → dagRef (applies to specific task)Generate:
{implementation_artifacts}/orchestration-design-{{project_name}}.md{implementation_artifacts}/dags/Run starlake dag-generate to produce Airflow/Dagster Python files from the YAML configurations.
dag-generate skill for Airflow/Dagster DAG template optionsdag-deploy skill for DAG deployment proceduresComplete orchestration design with DAG configurations, scheduling strategies, and monitoring plans — ready for deployment to Airflow or Dagster.
development
Design SQL transformations for data pipelines with quality checks and dependency management. Use when the user says "design transforms" or "create SQL transformations".
devops
Plan and track sprint progress for data pipeline implementation. Use when the user says "sprint planning" or "plan data sprint".
testing
Analyze data sources in depth: schema, quality, volume, and extraction strategy. Use when the user says "analyze data source" or "profile this data source".
data-ai
Design Starlake-compatible table schemas with types, constraints, privacy, and expectations. Use when the user says "design schema" or "create table definition".