Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

starlake-ai/expectations

Name: expectations
Author: starlake-ai

.agents/skills/expectations/SKILL.md

npx skillsauth add starlake-ai/starlake-skills expectations

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Expectations Skill

Define and enforce data quality checks on loaded and transformed data. Expectations are SQL-based conditions evaluated after data processing — they can warn or fail the pipeline based on configurable thresholds.

Expectation Syntax

expectations:
  - expect: "<query_name>(<params>) => <condition>"
    failOnError: true # or false to continue with warnings

Expectations are defined in table.sl.yml (load) or task.sl.yml (transform).

Built-in Expectation Macros

Macros are defined as Jinja2 templates in metadata/expectations/default.j2:

is_col_value_not_unique

Check that a column has unique values:

{% macro is_col_value_not_unique(col, table='SL_THIS') %}
    SELECT max(cnt)
    FROM (SELECT {{ col }}, count(*) as cnt FROM {{ table }}
    GROUP BY {{ col }}
    HAVING cnt > 1)
{% endmacro %}

is_row_count_to_be_between

Check that row count falls within a range:

{% macro is_row_count_to_be_between(min_value, max_value, table_name = 'SL_THIS') -%}
    SELECT
        CASE
            WHEN count(*) BETWEEN {{min_value}} AND {{max_value}} THEN 1
        ELSE
            0
        END
    FROM {{table_name}}
{%- endmacro %}

count_by_value

Count rows matching a specific value:

{% macro count_by_value(col, value, table='SL_THIS') %}
    SELECT count(*)
    FROM {{ table }}
    WHERE {{ col }} LIKE '{{ value }}'
{% endmacro %}

Available Variables in Conditions

| Variable | Type | Description | |---|---|---| | count | Long | Number of rows in query result | | result | Seq[Any] | First row values (0-indexed) | | results | Seq[Seq[Any]] | All rows (for multi-row results) |

Examples

expectations:
  # Uniqueness: order_id must be unique
  - expect: "is_col_value_not_unique('order_id') => result(0) == 1"
    failOnError: true

  # Row count: between 100 and 1 million rows
  - expect: "is_row_count_to_be_between(100, 1000000) => result(0) == 1"
    failOnError: false

  # Value count: at least 10 USA records
  - expect: "count_by_value('country', 'USA') => result(0) >= 10"
    failOnError: false

  # Custom SQL: no negative amounts
  - expect: "SELECT COUNT(*) FROM SL_THIS WHERE amount < 0 => count == 0"
    failOnError: true

  # Null check: email not null
  - expect: "SELECT COUNT(*) FROM SL_THIS WHERE email IS NULL => count == 0"
    failOnError: true

Custom Macros

Create custom Jinja2 macros in metadata/expectations/:

{# metadata/expectations/custom.j2 #}

{% macro is_valid_email(col, table='SL_THIS') %}
    SELECT COUNT(*)
    FROM {{ table }}
    WHERE {{ col }} NOT LIKE '%@%.%'
{% endmacro %}

Usage:

expectations:
  - expect: "is_valid_email('email') => count == 0"
    failOnError: true

Related Skills

load - Expectations applied during data loading
transform - Expectations applied to transform outputs
validate - Validate project configuration
config - Configuration reference

starlake-ai/expectations

.agents/skills/expectations/SKILL.md

Data quality expectations syntax, built-in macros, and validation patterns

1 stars

testing

Updated Apr 16, 2026

$ install --global

skillsauth

npx skillsauth add starlake-ai/starlake-skills expectations

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 16, 2026, 3:36 AM12.5s1 file scanned

SKILL.md

name:: expectations
description:: Data quality expectations syntax, built-in macros, and validation patterns

Expectations Skill

Expectation Syntax

expectations:
  - expect: "<query_name>(<params>) => <condition>"
    failOnError: true # or false to continue with warnings

Expectations are defined in table.sl.yml (load) or task.sl.yml (transform).

Built-in Expectation Macros

Macros are defined as Jinja2 templates in metadata/expectations/default.j2:

is_col_value_not_unique

Check that a column has unique values:

{% macro is_col_value_not_unique(col, table='SL_THIS') %}
    SELECT max(cnt)
    FROM (SELECT {{ col }}, count(*) as cnt FROM {{ table }}
    GROUP BY {{ col }}
    HAVING cnt > 1)
{% endmacro %}

is_row_count_to_be_between

Check that row count falls within a range:

{% macro is_row_count_to_be_between(min_value, max_value, table_name = 'SL_THIS') -%}
    SELECT
        CASE
            WHEN count(*) BETWEEN {{min_value}} AND {{max_value}} THEN 1
        ELSE
            0
        END
    FROM {{table_name}}
{%- endmacro %}

count_by_value

Count rows matching a specific value:

{% macro count_by_value(col, value, table='SL_THIS') %}
    SELECT count(*)
    FROM {{ table }}
    WHERE {{ col }} LIKE '{{ value }}'
{% endmacro %}

Available Variables in Conditions

Examples

expectations:
  # Uniqueness: order_id must be unique
  - expect: "is_col_value_not_unique('order_id') => result(0) == 1"
    failOnError: true

  # Row count: between 100 and 1 million rows
  - expect: "is_row_count_to_be_between(100, 1000000) => result(0) == 1"
    failOnError: false

  # Value count: at least 10 USA records
  - expect: "count_by_value('country', 'USA') => result(0) >= 10"
    failOnError: false

  # Custom SQL: no negative amounts
  - expect: "SELECT COUNT(*) FROM SL_THIS WHERE amount < 0 => count == 0"
    failOnError: true

  # Null check: email not null
  - expect: "SELECT COUNT(*) FROM SL_THIS WHERE email IS NULL => count == 0"
    failOnError: true

Custom Macros

Create custom Jinja2 macros in metadata/expectations/:

{# metadata/expectations/custom.j2 #}

{% macro is_valid_email(col, table='SL_THIS') %}
    SELECT COUNT(*)
    FROM {{ table }}
    WHERE {{ col }} NOT LIKE '%@%.%'
{% endmacro %}

Usage:

expectations:
  - expect: "is_valid_email('email') => count == 0"
    failOnError: true

Related Skills

load - Expectations applied during data loading
transform - Expectations applied to transform outputs
validate - Validate project configuration
config - Configuration reference

Related Skills

starlake-ai/starflow-transform-design

development

VerifiedTrustedCommunity

Design SQL transformations for data pipelines with quality checks and dependency management. Use when the user says "design transforms" or "create SQL transformations".

1SKILL.mdUpdated Apr 16, 2026

starlake-ai/starflow-transform-design

starlake-ai/starflow-sprint-planning

devops

VerifiedTrustedCommunity

Plan and track sprint progress for data pipeline implementation. Use when the user says "sprint planning" or "plan data sprint".

1SKILL.mdUpdated Apr 16, 2026

starlake-ai/starflow-sprint-planning

starlake-ai/starflow-source-analysis

testing

VerifiedTrustedCommunity

Analyze data sources in depth: schema, quality, volume, and extraction strategy. Use when the user says "analyze data source" or "profile this data source".

1SKILL.mdUpdated Apr 16, 2026

starlake-ai/starflow-source-analysis

starlake-ai/starflow-schema-design

data-ai

VerifiedTrustedCommunity

Design Starlake-compatible table schemas with types, constraints, privacy, and expectations. Use when the user says "design schema" or "create table definition".

1SKILL.mdUpdated Apr 16, 2026

starlake-ai/starflow-schema-design

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/starlake-ai/starlake-skills.git

# Copy into Claude Code skills folder (global)
cp -r starlake-skills/.agents/skills/expectations ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

starlake-ai/starlake-skills

1 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT