kafka-engineer-skill/SKILL.md
Expert in Apache Kafka, Event Streaming, and Real-time Data Pipelines. Specializes in Kafka Connect, KSQL, and Schema Registry.
npx skillsauth add 404kidwiz/claude-supercode-skills kafka-engineerInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Provides Apache Kafka and event streaming expertise specializing in scalable event-driven architectures and real-time data pipelines. Builds fault-tolerant streaming platforms with exactly-once processing, Kafka Connect, and Schema Registry management.
What is the use case?
│
├─ **Data Integration (ETL)**
│ ├─ DB to DB/Data Lake? → **Kafka Connect** (Zero code)
│ └─ Complex transformations? → **Kafka Streams**
│
├─ **Real-Time Analytics**
│ ├─ SQL-like queries? → **ksqlDB** (Quick aggregation)
│ └─ Complex stateful logic? → **Kafka Streams / Flink**
│
└─ **Microservices Comm**
├─ Event Notification? → **Standard Producer/Consumer**
└─ Event Sourcing? → **State Stores (RocksDB)**
batch.size, linger.ms, compression.type=lz4.linger.ms=0, acks=1.acks=all, min.insync.replicas=2, replication.factor=3.Red Flags → Escalate to sre-engineer:
Goal: Stream changes from PostgreSQL to S3.
Steps:
Source Config (postgres-source.json)
{
"name": "postgres-source",
"config": {
"connector.class": "io.debezium.connector.postgresql.PostgresConnector",
"database.hostname": "db-host",
"database.dbname": "mydb",
"database.user": "kafka",
"plugin.name": "pgoutput"
}
}
Sink Config (s3-sink.json)
{
"name": "s3-sink",
"config": {
"connector.class": "io.confluent.connect.s3.S3SinkConnector",
"s3.bucket.name": "my-datalake",
"format.class": "io.confluent.connect.s3.format.parquet.ParquetFormat",
"flush.size": "1000"
}
}
Deploy
curl -X POST -d @postgres-source.json http://connect:8083/connectorsGoal: Enforce schema compatibility.
Steps:
Define Schema (user.avsc)
{
"type": "record",
"name": "User",
"fields": [
{"name": "id", "type": "int"},
{"name": "name", "type": "string"}
]
}
Producer (Java)
KafkaAvroSerializer.http://schema-registry:8081.What it looks like:
Why it fails:
Correct approach:
What it looks like:
Why it fails:
Correct approach:
What it looks like:
Why it fails:
Correct approach:
consumer.pause() if buffer is full.Configuration:
Observability:
Scenario: A financial services company needs real-time fraud detection using Kafka streaming.
Architecture Implementation:
Pipeline Configuration: | Component | Configuration | Purpose | |-----------|---------------|---------| | Topics | 3 (transactions, alerts, enriched) | Data organization | | Partitions | 12 (3 brokers × 4) | Parallelism | | Replication | 3 | High availability | | Compression | LZ4 | Throughput optimization |
Key Logic:
Results:
Scenario: Build a resilient order processing system with Kafka for high reliability.
System Design:
Resilience Patterns:
Configuration:
# Producer Configuration
acks: all
retries: 3
enable.idempotence: true
# Consumer Configuration
auto.offset.reset: earliest
enable.auto.commit: false
max.poll.records: 500
Results:
Scenario: Process millions of IoT device telemetry messages with Kafka.
Platform Architecture:
Scalability Configuration:
Performance Metrics: | Metric | Value | |--------|-------| | Throughput | 500,000 messages/sec | | Latency (P99) | 50ms | | Consumer lag | < 1 second | | Storage efficiency | 60% reduction with compression |
Security:
development
Expert in automating Excel workflows using Node.js (ExcelJS, SheetJS) and Python (pandas, openpyxl).
content-media
Expert in designing durable, scalable workflow systems using Temporal, Camunda, and Event-Driven Architectures.
tools
Use when user needs WordPress development, theme or plugin creation, site optimization, security hardening, multisite management, or scaling WordPress from small sites to enterprise platforms.
tools
Expert in Windows Server, Active Directory (AD DS), Hybrid Identity (Entra ID), and PowerShell automation.