Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

hermeticormus/plugins/data-pipelines/skills/ml-pipeline-patterns

Name: plugins/data-pipelines/skills/ml-pipeline-patterns
Author: hermeticormus

plugins/data-pipelines/skills/ml-pipeline-patterns/SKILL.md

npx skillsauth add hermeticormus/libremlops-claude-code plugins/data-pipelines/skills/ml-pipeline-patterns

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

ML Pipeline Patterns

Expert patterns for building batch and streaming ML data pipelines with Apache Beam, Spark, dbt, and orchestrators.

Pattern 1: Apache Beam DoFn Pattern

The DoFn is Beam's per-element processing unit. Structure it to handle setup, processing, and teardown correctly.

import apache_beam as beam
from apache_beam.options.pipeline_options import PipelineOptions

class NormalizeFeaturesFn(beam.DoFn):
    """Normalize numerical features using precomputed statistics."""

    def setup(self):
        # Called once per worker. Load shared resources here.
        import json
        with open('feature_stats.json') as f:
            self.stats = json.load(f)

    def process(self, element: dict):
        """Yield normalized record. Yield nothing to filter."""
        try:
            for feature, stat in self.stats.items():
                if feature in element and element[feature] is not None:
                    mean, std = stat['mean'], stat['std']
                    if std > 0:
                        element[f"{feature}_normalized"] = (element[feature] - mean) / std
            yield element
        except Exception as e:
            # Route to dead-letter queue via tagged output
            yield beam.pvalue.TaggedOutput('errors', {
                'record': element,
                'error': str(e)
            })

def build_pipeline(input_path: str, output_path: str):
    options = PipelineOptions(
        runner='DataflowRunner',
        project='my-gcp-project',
        region='us-central1',
        temp_location='gs://my-bucket/temp',
        job_name='normalize-features'
    )

    with beam.Pipeline(options=options) as p:
        results = (
            p
            | 'ReadParquet' >> beam.io.ReadFromParquet(input_path)
            | 'Normalize' >> beam.ParDo(NormalizeFeaturesFn()).with_outputs('errors', main='main')
        )

        # Write main output
        results.main | 'WriteFeatures' >> beam.io.WriteToParquet(output_path)

        # Write dead-letter for investigation
        results.errors | 'WriteErrors' >> beam.io.WriteToText(
            output_path.replace('features', 'errors')
        )

Pattern 2: Pipeline Branching and Fanout

Process a single input PCollection into multiple output streams.

import apache_beam as beam

def run_branching_pipeline(input_path: str, output_prefix: str):
    with beam.Pipeline() as p:
        raw = p | 'Read' >> beam.io.ReadFromParquet(input_path)

        # Branch 1: High-value users → real-time feature store
        (raw
         | 'FilterHighValue' >> beam.Filter(lambda x: x.get('ltv', 0) > 1000)
         | 'ExtractRTFeatures' >> beam.Map(extract_realtime_features)
         | 'WriteRedis' >> beam.ParDo(WriteToRedisFn()))

        # Branch 2: All users → batch feature store
        (raw
         | 'ExtractBatchFeatures' >> beam.Map(extract_batch_features)
         | 'WriteParquet' >> beam.io.WriteToParquet(f"{output_prefix}/batch"))

        # Branch 3: New users → cold start pipeline
        (raw
         | 'FilterNew' >> beam.Filter(lambda x: x.get('days_since_signup', 999) < 7)
         | 'ExtractColdStartFeatures' >> beam.Map(extract_cold_start_features)
         | 'WriteColdStart' >> beam.io.WriteToParquet(f"{output_prefix}/cold_start"))

Pattern 3: Streaming Feature Computation (Flink)

Compute rolling window features in real time.

from pyflink.datastream import StreamExecutionEnvironment
from pyflink.datastream.connectors.kafka import FlinkKafkaConsumer, FlinkKafkaProducer
from pyflink.datastream.window import TumblingEventTimeWindows
from pyflink.common import Time, WatermarkStrategy
from pyflink.common.serialization import SimpleStringSchema

def build_streaming_pipeline():
    env = StreamExecutionEnvironment.get_execution_environment()
    env.set_parallelism(4)
    env.enable_checkpointing(60_000)  # checkpoint every 60 seconds

    kafka_props = {
        'bootstrap.servers': 'kafka:9092',
        'group.id': 'feature-pipeline'
    }

    source = FlinkKafkaConsumer(
        topics='user-events',
        deserialization_schema=SimpleStringSchema(),
        properties=kafka_props
    )

    (env
     .add_source(source)
     .map(parse_event)
     .assign_timestamps_and_watermarks(
         WatermarkStrategy.for_bounded_out_of_orderness(Time.seconds(10))
         .with_timestamp_assigner(EventTimestampAssigner())
     )
     .key_by(lambda e: e['user_id'])
     .window(TumblingEventTimeWindows.of(Time.minutes(5)))
     .aggregate(UserActivityAggregator())
     .add_sink(FlinkKafkaProducer(
         topic='user-features',
         serialization_schema=SimpleStringSchema(),
         producer_config={'bootstrap.servers': 'kafka:9092'}
     ))
    )

    env.execute("streaming-user-features")

Pattern 4: dbt Incremental Feature Model

Efficient feature computation for large datasets using incremental materialization.

-- models/features/user_purchase_features.sql
{{
  config(
    materialized='incremental',
    unique_key='user_id',
    on_schema_change='sync_all_columns',
    partition_by={
      'field': 'feature_date',
      'data_type': 'date'
    }
  )
}}

WITH purchase_stats AS (
    SELECT
        user_id,
        DATE(event_ts) as feature_date,
        COUNT(*) as purchase_count_30d,
        SUM(amount) as total_spend_30d,
        AVG(amount) as avg_order_value_30d,
        MAX(amount) as max_order_value_30d,
        COUNT(DISTINCT product_category) as category_diversity_30d,
        DATE_DIFF(CURRENT_DATE, MAX(DATE(event_ts)), DAY) as days_since_last_purchase
    FROM {{ ref('stg_purchases') }}
    WHERE event_ts >= DATE_SUB(CURRENT_DATE, INTERVAL 30 DAY)
    {% if is_incremental() %}
        AND event_ts > (SELECT MAX(feature_date) FROM {{ this }})
    {% endif %}
    GROUP BY 1, 2
)

SELECT * FROM purchase_stats

# models/features/schema.yml
models:
  - name: user_purchase_features
    tests:
      - dbt_utils.unique_combination_of_columns:
          combination_of_columns: [user_id, feature_date]
    columns:
      - name: user_id
        tests: [not_null]
      - name: purchase_count_30d
        tests:
          - not_null
          - dbt_utils.accepted_range:
              min_value: 0

Pattern 5: Great Expectations Pipeline Validation

Integrate data quality checks into pipeline execution.

import great_expectations as gx
from great_expectations.core.batch import RuntimeBatchRequest

def validate_feature_batch(df, expectation_suite_name: str = "features.warning") -> bool:
    context = gx.get_context()

    validator = context.get_validator(
        batch_request=RuntimeBatchRequest(
            datasource_name="pandas_datasource",
            data_connector_name="runtime_data_connector",
            data_asset_name="feature_batch",
            runtime_parameters={"batch_data": df},
            batch_identifiers={"batch_id": "pipeline_run"},
        ),
        expectation_suite_name=expectation_suite_name,
    )

    results = validator.validate()

    if not results.success:
        failed = [r for r in results.results if not r.success]
        for r in failed:
            print(f"FAILED: {r.expectation_config.expectation_type} "
                  f"on column {r.expectation_config.kwargs.get('column', 'N/A')}")
        return False
    return True

# Define suite programmatically
def create_feature_suite(validator):
    validator.expect_column_values_to_not_be_null("user_id")
    validator.expect_column_values_to_be_between("purchase_count_30d", min_value=0)
    validator.expect_column_values_to_be_between("avg_order_value_30d", min_value=0)
    validator.expect_table_row_count_to_be_between(min_value=1000)
    validator.expect_column_proportion_of_unique_values_to_be_between(
        "user_id", min_value=0.99  # near-unique
    )
    validator.save_expectation_suite()

Pattern 6: Backfill Strategy

Backfilling historical features without reprocessing everything.

from prefect import flow, task
from datetime import date, timedelta
from typing import Generator

@task(retries=3, retry_delay_seconds=60)
def compute_features_for_date(feature_date: date) -> dict:
    """Idempotent: safe to rerun for same date."""
    # Check if already computed
    output_path = f"s3://features/date={feature_date}/part-0.parquet"
    if s3_exists(output_path):
        return {"date": feature_date, "status": "skipped"}

    # Compute features for this date
    df = load_raw_data(feature_date)
    features = transform_features(df, reference_date=feature_date)
    write_parquet(features, output_path)
    return {"date": feature_date, "status": "computed", "rows": len(features)}

@flow(name="feature-backfill")
def backfill_features(start_date: date, end_date: date, parallelism: int = 4):
    dates = [start_date + timedelta(days=i)
             for i in range((end_date - start_date).days + 1)]

    # Process in parallel chunks
    results = compute_features_for_date.map(dates)
    return results

# Run backfill
backfill_features(
    start_date=date(2024, 1, 1),
    end_date=date(2024, 12, 31),
    parallelism=8
)

Anti-Patterns

Anti-Pattern 1: No Dead-Letter Queue

Pipelines without error routing drop records silently. Always route failed records to an error path and monitor error rate.

Anti-Pattern 2: Point-in-Time Leakage

Feature pipelines that use future data relative to the label timestamp. This is training-serving skew at its worst — model appears to work great in training, fails in production. Always filter features by event_ts < label_ts.

Anti-Pattern 3: Non-Idempotent Pipeline Steps

If a step can't be safely re-run without side effects, backfill becomes dangerous. Design every step to be idempotent: check existence before writing, use UPSERT semantics.

Anti-Pattern 4: Triggering Batch Jobs Without Data Checks

Airflow/Prefect tasks that run regardless of upstream data availability. Add data sensors (ExternalTaskSensor, S3KeySensor) before computation tasks.

Anti-Pattern 5: String-Typed Everything in Parquet

Writing all features as strings loses compression and query performance. Use proper dtypes: int32, float32, date32, categorical. A float32 Parquet is ~4x smaller than the same data in JSON.

hermeticormus/plugins/data-pipelines/skills/ml-pipeline-patterns

plugins/data-pipelines/skills/ml-pipeline-patterns/SKILL.md

# ML Pipeline Patterns Expert patterns for building batch and streaming ML data pipelines with Apache Beam, Spark, dbt, and orchestrators. ## Pattern 1: Apache Beam DoFn Pattern The `DoFn` is Beam's per-element processing unit. Structure it to handle setup, processing, and teardown correctly. ```python import apache_beam as beam from apache_beam.options.pipeline_options import PipelineOptions class NormalizeFeaturesFn(beam.DoFn): """Normalize numerical features using precomputed statist

tools

Updated Apr 21, 2026

$ install --global

skillsauth

npx skillsauth add hermeticormus/libremlops-claude-code plugins/data-pipelines/skills/ml-pipeline-patterns

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 21, 2026, 10:14 PM143.6s1 file scanned

SKILL.md

ML Pipeline Patterns

Expert patterns for building batch and streaming ML data pipelines with Apache Beam, Spark, dbt, and orchestrators.

Pattern 1: Apache Beam DoFn Pattern

The DoFn is Beam's per-element processing unit. Structure it to handle setup, processing, and teardown correctly.

import apache_beam as beam
from apache_beam.options.pipeline_options import PipelineOptions

class NormalizeFeaturesFn(beam.DoFn):
    """Normalize numerical features using precomputed statistics."""

    def setup(self):
        # Called once per worker. Load shared resources here.
        import json
        with open('feature_stats.json') as f:
            self.stats = json.load(f)

    def process(self, element: dict):
        """Yield normalized record. Yield nothing to filter."""
        try:
            for feature, stat in self.stats.items():
                if feature in element and element[feature] is not None:
                    mean, std = stat['mean'], stat['std']
                    if std > 0:
                        element[f"{feature}_normalized"] = (element[feature] - mean) / std
            yield element
        except Exception as e:
            # Route to dead-letter queue via tagged output
            yield beam.pvalue.TaggedOutput('errors', {
                'record': element,
                'error': str(e)
            })

def build_pipeline(input_path: str, output_path: str):
    options = PipelineOptions(
        runner='DataflowRunner',
        project='my-gcp-project',
        region='us-central1',
        temp_location='gs://my-bucket/temp',
        job_name='normalize-features'
    )

    with beam.Pipeline(options=options) as p:
        results = (
            p
            | 'ReadParquet' >> beam.io.ReadFromParquet(input_path)
            | 'Normalize' >> beam.ParDo(NormalizeFeaturesFn()).with_outputs('errors', main='main')
        )

        # Write main output
        results.main | 'WriteFeatures' >> beam.io.WriteToParquet(output_path)

        # Write dead-letter for investigation
        results.errors | 'WriteErrors' >> beam.io.WriteToText(
            output_path.replace('features', 'errors')
        )

Pattern 2: Pipeline Branching and Fanout

Process a single input PCollection into multiple output streams.

import apache_beam as beam

def run_branching_pipeline(input_path: str, output_prefix: str):
    with beam.Pipeline() as p:
        raw = p | 'Read' >> beam.io.ReadFromParquet(input_path)

        # Branch 1: High-value users → real-time feature store
        (raw
         | 'FilterHighValue' >> beam.Filter(lambda x: x.get('ltv', 0) > 1000)
         | 'ExtractRTFeatures' >> beam.Map(extract_realtime_features)
         | 'WriteRedis' >> beam.ParDo(WriteToRedisFn()))

        # Branch 2: All users → batch feature store
        (raw
         | 'ExtractBatchFeatures' >> beam.Map(extract_batch_features)
         | 'WriteParquet' >> beam.io.WriteToParquet(f"{output_prefix}/batch"))

        # Branch 3: New users → cold start pipeline
        (raw
         | 'FilterNew' >> beam.Filter(lambda x: x.get('days_since_signup', 999) < 7)
         | 'ExtractColdStartFeatures' >> beam.Map(extract_cold_start_features)
         | 'WriteColdStart' >> beam.io.WriteToParquet(f"{output_prefix}/cold_start"))

Pattern 3: Streaming Feature Computation (Flink)

Compute rolling window features in real time.

from pyflink.datastream import StreamExecutionEnvironment
from pyflink.datastream.connectors.kafka import FlinkKafkaConsumer, FlinkKafkaProducer
from pyflink.datastream.window import TumblingEventTimeWindows
from pyflink.common import Time, WatermarkStrategy
from pyflink.common.serialization import SimpleStringSchema

def build_streaming_pipeline():
    env = StreamExecutionEnvironment.get_execution_environment()
    env.set_parallelism(4)
    env.enable_checkpointing(60_000)  # checkpoint every 60 seconds

    kafka_props = {
        'bootstrap.servers': 'kafka:9092',
        'group.id': 'feature-pipeline'
    }

    source = FlinkKafkaConsumer(
        topics='user-events',
        deserialization_schema=SimpleStringSchema(),
        properties=kafka_props
    )

    (env
     .add_source(source)
     .map(parse_event)
     .assign_timestamps_and_watermarks(
         WatermarkStrategy.for_bounded_out_of_orderness(Time.seconds(10))
         .with_timestamp_assigner(EventTimestampAssigner())
     )
     .key_by(lambda e: e['user_id'])
     .window(TumblingEventTimeWindows.of(Time.minutes(5)))
     .aggregate(UserActivityAggregator())
     .add_sink(FlinkKafkaProducer(
         topic='user-features',
         serialization_schema=SimpleStringSchema(),
         producer_config={'bootstrap.servers': 'kafka:9092'}
     ))
    )

    env.execute("streaming-user-features")

Pattern 4: dbt Incremental Feature Model

Efficient feature computation for large datasets using incremental materialization.

-- models/features/user_purchase_features.sql
{{
  config(
    materialized='incremental',
    unique_key='user_id',
    on_schema_change='sync_all_columns',
    partition_by={
      'field': 'feature_date',
      'data_type': 'date'
    }
  )
}}

WITH purchase_stats AS (
    SELECT
        user_id,
        DATE(event_ts) as feature_date,
        COUNT(*) as purchase_count_30d,
        SUM(amount) as total_spend_30d,
        AVG(amount) as avg_order_value_30d,
        MAX(amount) as max_order_value_30d,
        COUNT(DISTINCT product_category) as category_diversity_30d,
        DATE_DIFF(CURRENT_DATE, MAX(DATE(event_ts)), DAY) as days_since_last_purchase
    FROM {{ ref('stg_purchases') }}
    WHERE event_ts >= DATE_SUB(CURRENT_DATE, INTERVAL 30 DAY)
    {% if is_incremental() %}
        AND event_ts > (SELECT MAX(feature_date) FROM {{ this }})
    {% endif %}
    GROUP BY 1, 2
)

SELECT * FROM purchase_stats

# models/features/schema.yml
models:
  - name: user_purchase_features
    tests:
      - dbt_utils.unique_combination_of_columns:
          combination_of_columns: [user_id, feature_date]
    columns:
      - name: user_id
        tests: [not_null]
      - name: purchase_count_30d
        tests:
          - not_null
          - dbt_utils.accepted_range:
              min_value: 0

Pattern 5: Great Expectations Pipeline Validation

Integrate data quality checks into pipeline execution.

import great_expectations as gx
from great_expectations.core.batch import RuntimeBatchRequest

def validate_feature_batch(df, expectation_suite_name: str = "features.warning") -> bool:
    context = gx.get_context()

    validator = context.get_validator(
        batch_request=RuntimeBatchRequest(
            datasource_name="pandas_datasource",
            data_connector_name="runtime_data_connector",
            data_asset_name="feature_batch",
            runtime_parameters={"batch_data": df},
            batch_identifiers={"batch_id": "pipeline_run"},
        ),
        expectation_suite_name=expectation_suite_name,
    )

    results = validator.validate()

    if not results.success:
        failed = [r for r in results.results if not r.success]
        for r in failed:
            print(f"FAILED: {r.expectation_config.expectation_type} "
                  f"on column {r.expectation_config.kwargs.get('column', 'N/A')}")
        return False
    return True

# Define suite programmatically
def create_feature_suite(validator):
    validator.expect_column_values_to_not_be_null("user_id")
    validator.expect_column_values_to_be_between("purchase_count_30d", min_value=0)
    validator.expect_column_values_to_be_between("avg_order_value_30d", min_value=0)
    validator.expect_table_row_count_to_be_between(min_value=1000)
    validator.expect_column_proportion_of_unique_values_to_be_between(
        "user_id", min_value=0.99  # near-unique
    )
    validator.save_expectation_suite()

Pattern 6: Backfill Strategy

Backfilling historical features without reprocessing everything.

from prefect import flow, task
from datetime import date, timedelta
from typing import Generator

@task(retries=3, retry_delay_seconds=60)
def compute_features_for_date(feature_date: date) -> dict:
    """Idempotent: safe to rerun for same date."""
    # Check if already computed
    output_path = f"s3://features/date={feature_date}/part-0.parquet"
    if s3_exists(output_path):
        return {"date": feature_date, "status": "skipped"}

    # Compute features for this date
    df = load_raw_data(feature_date)
    features = transform_features(df, reference_date=feature_date)
    write_parquet(features, output_path)
    return {"date": feature_date, "status": "computed", "rows": len(features)}

@flow(name="feature-backfill")
def backfill_features(start_date: date, end_date: date, parallelism: int = 4):
    dates = [start_date + timedelta(days=i)
             for i in range((end_date - start_date).days + 1)]

    # Process in parallel chunks
    results = compute_features_for_date.map(dates)
    return results

# Run backfill
backfill_features(
    start_date=date(2024, 1, 1),
    end_date=date(2024, 12, 31),
    parallelism=8
)

Anti-Patterns

Anti-Pattern 1: No Dead-Letter Queue

Pipelines without error routing drop records silently. Always route failed records to an error path and monitor error rate.

Anti-Pattern 2: Point-in-Time Leakage

Anti-Pattern 3: Non-Idempotent Pipeline Steps

If a step can't be safely re-run without side effects, backfill becomes dangerous. Design every step to be idempotent: check existence before writing, use UPSERT semantics.

Anti-Pattern 4: Triggering Batch Jobs Without Data Checks

Airflow/Prefect tasks that run regardless of upstream data availability. Add data sensors (ExternalTaskSensor, S3KeySensor) before computation tasks.

Anti-Pattern 5: String-Typed Everything in Parquet

Writing all features as strings loses compression and query performance. Use proper dtypes: int32, float32, date32, categorical. A float32 Parquet is ~4x smaller than the same data in JSON.

Related Skills

hermeticormus/plugins/vector-databases/skills/vectordb-patterns

tools

VerifiedTrustedCommunity

# VectorDB Patterns Expert patterns for HNSW index tuning, pgvector setup, Pinecone/Qdrant upsert, metadata filtering, multi-tenancy, and embedding drift management. ## Pattern 1: pgvector Setup with HNSW Index PostgreSQL vector search with proper index configuration. ```sql -- Install extension (requires PostgreSQL 15+ with pgvector) CREATE EXTENSION IF NOT EXISTS vector; -- Table with embedding column CREATE TABLE documents ( id UUID PRIMARY KEY DEFAULT gen_random_uuid(),

SKILL.mdUpdated Apr 21, 2026

hermeticormus/plugins/vector-databases/skills/vectordb-patterns

hermeticormus/plugins/tensorflow-patterns/skills/tensorflow-patterns

tools

VerifiedTrustedCommunity

# TensorFlow Patterns Expert patterns for Keras functional API, tf.data pipeline ordering, custom layers, SavedModel export, and TFLite quantization. ## Pattern 1: Keras Functional API Model Multi-input model with proper BatchNorm and Dropout usage. ```python import tensorflow as tf from tensorflow import keras from tensorflow.keras import layers def build_classifier( numeric_dim: int, cat_vocab_sizes: dict, # {"country": 50, "device": 10} embedding_dim: int = 16, hidden_u

SKILL.mdUpdated Apr 21, 2026

hermeticormus/plugins/tensorflow-patterns/skills/tensorflow-patterns

hermeticormus/plugins/rag-architecture/skills/rag-patterns

tools

VerifiedTrustedCommunity

# RAG Patterns Expert patterns for document chunking, embedding pipelines, hybrid search, cross-encoder re-ranking, and RAGAS evaluation. ## Pattern 1: Document Ingestion with Recursive Chunking Parse and chunk documents with metadata preservation. ```python from langchain.text_splitter import RecursiveCharacterTextSplitter from langchain.document_loaders import PyPDFLoader, TextLoader from langchain.schema import Document import hashlib from pathlib import Path def ingest_documents(file_pa

SKILL.mdUpdated Apr 21, 2026

hermeticormus/plugins/rag-architecture/skills/rag-patterns

hermeticormus/plugins/pytorch-patterns/skills/pytorch-patterns

tools

VerifiedTrustedCommunity

# PyTorch Patterns Expert patterns for custom Dataset/DataLoader, nn.Module design, model surgery, custom autograd, and profiling. ## Pattern 1: Custom Dataset with Transforms Production Dataset with augmentation pipeline and weighted sampling. ```python import torch from torch.utils.data import Dataset, DataLoader, WeightedRandomSampler import pandas as pd import numpy as np from pathlib import Path from PIL import Image import albumentations as A from albumentations.pytorch import ToTensor

SKILL.mdUpdated Apr 21, 2026

hermeticormus/plugins/pytorch-patterns/skills/pytorch-patterns

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/hermeticormus/libremlops-claude-code.git

# Copy into Claude Code skills folder (global)
cp -r libremlops-claude-code/plugins/data-pipelines/skills/ml-pipeline-patterns ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

hermeticormus/libremlops-claude-code

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT