.agents/skills/kafkaload/SKILL.md
Load or offload data to/from Kafka topics
npx skillsauth add starlake-ai/starlake-skills kafkaloadInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Loads data from files to Kafka topics or offloads data from Kafka topics to files. Supports both batch and streaming modes with optional transformations.
starlake kafkaload [options]
--config <value>: Kafka topic name (from connection configuration)--connectionRef <value>: Connection reference for the Kafka cluster--format <value>: Read format: parquet, json, csv, etc. (default: parquet)--path <value>: Source file path (for loading to Kafka) or target path (for offloading)--options k1=v1,k2=v2: Spark Reader options--write-config <value>: Write topic name--write-path <value>: Target file path for offloading--write-mode <value>: Write mode for file output--write-options k1=v1,k2=v2: Spark Writer options--write-format <value>: Streaming output format: kafka, console, etc.--write-coalesce <value>: Number of output partitions--transform <value>: SQL transformation to apply to messages before loading/offloading--stream: Enable streaming mode (continuous processing)--streaming-trigger <value>: Trigger type: Once, Continuous, ProcessingTime--streaming-trigger-option <value>: Trigger interval (e.g. 10 seconds)--streaming-to-table <value>: Sink to a table instead of files--streaming-partition-by <value>: Partition output by these columns--reportFormat <value>: Report output format: console, json, or htmlstarlake kafkaload --config orders_topic --path /data/output/orders --format parquet
starlake kafkaload --config orders_topic --path /data/input/orders.json --format json --write-config orders_output
starlake kafkaload --config orders_topic --stream --write-format console
starlake kafkaload --config orders_topic --stream --streaming-trigger ProcessingTime --streaming-trigger-option "10 seconds"
Apply a SQL transformation before saving:
starlake kafkaload --config orders_topic --path /data/output --transform "SELECT order_id, total FROM SL_THIS WHERE total > 100"
development
Design SQL transformations for data pipelines with quality checks and dependency management. Use when the user says "design transforms" or "create SQL transformations".
devops
Plan and track sprint progress for data pipeline implementation. Use when the user says "sprint planning" or "plan data sprint".
testing
Analyze data sources in depth: schema, quality, volume, and extraction strategy. Use when the user says "analyze data source" or "profile this data source".
data-ai
Design Starlake-compatible table schemas with types, constraints, privacy, and expectations. Use when the user says "design schema" or "create table definition".