skills/dagster/SKILL.md
Asset-centric data orchestration platform for building, testing, and deploying data pipelines. MANDATORY TRIGGERS: dagster, software-defined assets, @asset, @op, @job, data orchestration, data pipeline, dagster-dbt, dagster pipes, asset materialization, dagster cloud, dagster partitions. Also trigger when user wants to build data pipelines, orchestrate ETL/ELT workflows, schedule data transformations, manage data assets with lineage, or integrate dbt/Snowflake/BigQuery into a pipeline. When in doubt about whether to use this skill for data engineering or pipeline orchestration tasks, use it.
npx skillsauth add abhisheksharma-17/skills-graph dagsterInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Asset-centric data orchestration: build, test, and deploy data pipelines with software-defined assets, declarative automation, and 87+ integrations.
Source: docs.dagster.io | Version: 1.13.x | Python: ≥3.9, <3.15 | License: Apache 2.0
| Reference | File | Read When |
|-----------|------|-----------|
| Overview & Quickstart | references/00-overview.md | Getting started, installation, core concepts, project scaffolding, architecture |
| Software-Defined Assets | references/01-assets.md | @asset, @multi_asset, AssetSpec, asset checks, asset groups, dependencies |
| Resources & I/O Managers | references/02-resources-io-managers.md | ConfigurableResource, ConfigurableIOManager, EnvVar, dependency injection |
| Ops, Jobs & Graphs | references/03-ops-jobs-graphs.md | @op, @job, @graph, graph-backed assets, RetryPolicy, Nothing type |
| Schedules & Sensors | references/04-schedules-sensors.md | Cron schedules, event-driven sensors, @asset_sensor, run_status_sensor |
| Partitions & Backfills | references/05-partitions-backfills.md | Daily/hourly/weekly partitions, static, dynamic, multi-dimensional, backfill policies |
| Declarative Automation | references/06-automation.md | AutomationCondition, on_cron, eager, on_missing, custom conditions |
| Testing | references/07-testing.md | Unit testing assets/ops, mock resources, build_asset_context, validate_loadable |
| Dagster Pipes | references/08-dagster-pipes.md | External process execution: subprocess, Kubernetes, Databricks, Docker |
| Integrations | references/09-integrations.md | dbt, Snowflake, BigQuery, DuckDB, Polars, S3, GCS, Airbyte, Fivetran |
| Deployment | references/10-deployment.md | Docker Compose, Kubernetes/Helm, Dagster Cloud (serverless/hybrid), dagster.yaml |
| Project Structure | references/11-project-structure.md | Scaffolding, code locations, workspace.yaml, multi-team organization |
| AI/ML Pipelines | references/12-ai-ml-pipelines.md | OpenAI integration, LLM fine-tuning, ML training pipelines, AI workflows |
# Install core + UI
pip install dagster dagster-webserver
# or with uv
uv add dagster dagster-webserver
# Scaffold a new project
uvx create-dagster@latest project my-project
# Start dev server
cd my-project && dg dev
tools
Open-source AI-native vector database for semantic search, hybrid search, RAG, and agent-driven workflows with multi-tenancy and model provider integrations. MANDATORY TRIGGERS: weaviate, Weaviate, weaviate-client, near_text, near_vector, hybrid search vector database, WeaviateClient, weaviate.connect, collections.create, query.near_text, query.hybrid, query.bm25. Also trigger when user wants to build semantic search, store vector embeddings, implement RAG pipelines, combine keyword and vector search, set up multi-tenant vector storage, use named vectors, or integrate vector DB with LLM providers. When in doubt about whether to use this skill for vector database or semantic search tasks, use it.
tools
End-to-end testing and browser automation framework by Microsoft — locators, assertions, fixtures, network mocking, visual testing, tracing, codegen, API testing, and CI/CD integration. MANDATORY TRIGGERS: playwright, Playwright, @playwright/test, page.goto, page.click, page.locator, getByRole, getByText, getByTestId, expect(page), toHaveScreenshot, playwright.config, npx playwright. Also trigger when user wants to write E2E tests, automate browser interactions, test across Chromium/Firefox/WebKit, mock network requests in tests, do visual regression testing, generate tests with codegen, or set up CI test pipelines. When in doubt about whether to use this skill for browser testing or E2E automation tasks, use it.
tools
OpenAI Agents SDK for building multi-agent workflows with tools, handoffs, guardrails, streaming, MCP, sessions, and tracing. MANDATORY TRIGGERS: openai-agents, openai agents sdk, openai-agents-python, openai agents, Runner.run, function_tool, handoff, guardrail, MCPServerStdio, MCPServerStreamableHttp, HostedMCPTool, RunContextWrapper, AgentHooks. Also trigger when user wants to build multi-agent systems with OpenAI models, create agent orchestration with handoffs, add guardrails to LLM applications, integrate MCP servers with agents, implement streaming agent responses, or use OpenAI's official agent framework. When in doubt about whether to use this skill for OpenAI agent tasks, use it.
tools
AI orchestration framework for building production-ready RAG applications, autonomous agents, and multimodal search systems. MANDATORY TRIGGERS: haystack, deepset, haystack-ai, haystack pipeline, haystack agent. Also trigger when the user wants to build RAG pipelines with modular components, create tool-calling agents with Haystack, orchestrate retrieval-augmented generation, build semantic search systems, or evaluate LLM pipelines. When in doubt about whether to use this skill for RAG orchestration or AI pipeline tasks, use it.