Adoption

Agent Skills are supported by leading AI development tools.

VS Code, Gemini CLI, GitHub, Goose, Amp, Cursor, Claude Code, Letta, OpenCode, Claude, OpenAI Codex, Factory

ViggyV/model-trainer

Name: model-trainer
Author: ViggyV

.claude/skills/ai-ml-development/model-trainer/SKILL.md

npx skillsauth add ViggyV/claude-skills model-trainer

Model Trainer

You are an expert at training, fine-tuning, and evaluating machine learning models.

Activation

This skill activates when the user needs help with:

Training ML models from scratch
Fine-tuning pre-trained models
Hyperparameter optimization
Model evaluation and metrics
Training pipeline setup
Handling training issues

Process

1. Training Assessment

Ask about:

Problem type (classification, regression, NLP, CV)
Dataset size and quality
Available compute resources
Time constraints
Target metrics

2. Training Pipeline Template

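import torch
from torch.utils.data import DataLoader
from transformers import Trainer, TrainingArguments  # Trainer API, used in section 3

# Standard training loop structure. Assumes each batch is a dict of tensors that
# includes 'labels', and that the model returns an object with .loss and .logits
# (as Hugging Face models do).
class ModelTrainer:
    def __init__(self, model, train_data, val_data, config):
        self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
        self.model = model.to(self.device)
        # Shuffle only the training data; keep validation order fixed
        self.train_loader = DataLoader(train_data, batch_size=config.batch_size, shuffle=True)
        self.val_loader = DataLoader(val_data, batch_size=config.batch_size)
        self.optimizer = torch.optim.AdamW(self.model.parameters(), lr=config.lr)
        # The scheduler is stepped once per epoch, so T_max is the epoch count
        self.scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(
            self.optimizer, T_max=config.epochs
        )

    def _to_device(self, batch):
        # Move every tensor in the batch to the model's device
        return {k: v.to(self.device) for k, v in batch.items()}

    def train_epoch(self):
        self.model.train()
        total_loss = 0.0
        for batch in self.train_loader:
            batch = self._to_device(batch)
            self.optimizer.zero_grad()
            outputs = self.model(**batch)
            loss = outputs.loss
            loss.backward()
            # Clip the global gradient norm to 1.0 to guard against exploding gradients
            torch.nn.utils.clip_grad_norm_(self.model.parameters(), 1.0)
            self.optimizer.step()
            total_loss += loss.item()
        self.scheduler.step()
        return total_loss / len(self.train_loader)

    def evaluate(self):
        self.model.eval()
        metrics = {'loss': 0.0, 'predictions': [], 'labels': []}
        with torch.no_grad():
            for batch in self.val_loader:
                batch = self._to_device(batch)
                outputs = self.model(**batch)
                metrics['loss'] += outputs.loss.item()
                metrics['predictions'].extend(outputs.logits.argmax(-1).tolist())
                metrics['labels'].extend(batch['labels'].tolist())
        return self.compute_metrics(metrics)

    def compute_metrics(self, metrics):
        # Average the summed loss over batches and compute simple accuracy
        preds, labels = metrics['predictions'], metrics['labels']
        correct = sum(p == l for p, l in zip(preds, labels))
        return {
            'loss': metrics['loss'] / max(len(self.val_loader), 1),
            'accuracy': correct / max(len(labels), 1),
        }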
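
The template steps `CosineAnnealingLR` once per epoch with `T_max=config.epochs`. As a rough illustration of what that schedule does to the learning rate, the closed-form cosine curve can be sketched in plain Python. The base rate of 2e-5, the 10 epochs, and `eta_min=0.0` are assumptions chosen for illustration, and the function name is hypothetical.

```python
import math

def cosine_annealing_lr(base_lr, epoch, t_max, eta_min=0.0):
    """Closed-form cosine annealing: base_lr at epoch 0, eta_min at epoch t_max."""
    return eta_min + 0.5 * (base_lr - eta_min) * (1 + math.cos(math.pi * epoch / t_max))

# Hypothetical values for illustration
schedule = [cosine_annealing_lr(2e-5, epoch, t_max=10) for epoch in range(11)]
for epoch, lr in enumerate(schedule):
    print(f"epoch {epoch:2d}: lr = {lr:.2e}")
```

The rate falls slowly at first, fastest around the midpoint, and flattens out again as it approaches `eta_min`.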

3. Fine-Tuning Best Practices

LLM Fine-tuning (LoRA/QLoRA):

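from peft import LoraConfig, get_peft_model

# base_model is assumed to be an already loaded causal language model
# (for QLoRA, one loaded in 4-bit precision).
lora_config = LoraConfig(
    r=16,                      # Rank of the low-rank update matrices
    lora_alpha=32,             # Scaling; the update is multiplied by lora_alpha / r
    target_modules=["q_proj", "v_proj"],  # Attention projections; names vary by architecture
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM"
)

model = get_peft_model(base_model, lora_config)
# print_trainable_parameters() prints its own summary and returns None,
# so call it directly instead of embedding it in an f-string
model.print_trainable_parameters()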
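
To see why LoRA trains so few parameters, it can help to work through the arithmetic. For a frozen weight matrix of shape `d_out x d_in`, an adapter of rank `r` adds two small matrices, `A` (`r x d_in`) and `B` (`d_out x r`). The sketch below counts those parameters; the hidden size of 4096, the 32 layers, and the assumption that `q_proj` and `v_proj` are both square are hypothetical and will not match every model.

```python
def lora_added_params(d_in, d_out, r):
    """Parameters added by one LoRA adapter: A is (r x d_in), B is (d_out x r)."""
    return r * (d_in + d_out)

# Hypothetical model shape, for illustration only
hidden_size = 4096
num_layers = 32
targets_per_layer = 2      # q_proj and v_proj, assumed square here
r, lora_alpha = 16, 32     # same rank and alpha as the LoraConfig example

per_adapter = lora_added_params(hidden_size, hidden_size, r)
total_added = per_adapter * targets_per_layer * num_layers
scaling = lora_alpha / r   # factor applied to the low-rank update

print(f"per adapter: {per_adapter:,}")
print(f"total added: {total_added:,}")
print(f"scaling: {scaling}")
```

Under these assumptions the adapters add about 8.4 million trainable parameters, a small fraction of a model with billions of weights.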

Training Arguments:

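from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="./results",
    num_train_epochs=3,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=16,
    warmup_ratio=0.1,                  # Fraction of total steps spent warming up the LR
    learning_rate=2e-5,
    weight_decay=0.01,
    logging_steps=100,
    eval_strategy="steps",             # Older transformers releases use evaluation_strategy
    eval_steps=500,
    save_strategy="steps",
    save_steps=500,                    # Must be a multiple of eval_steps for load_best_model_at_end
    load_best_model_at_end=True,
    metric_for_best_model="eval_loss",
    fp16=True,                         # Mixed precision; requires a CUDA GPU
    gradient_accumulation_steps=4,     # Effective batch per device: 8 x 4 = 32
)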
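
The batch size, accumulation, and warmup settings interact, so a quick back-of-the-envelope calculation is useful before launching a run. The sketch below estimates the effective batch size, total optimizer steps, and warmup steps for these arguments. The dataset size of 10,000 examples and the single device are assumptions, the function name is hypothetical, and the rounding only approximates what `Trainer` does internally.

```python
import math

def training_schedule(num_examples, per_device_batch, grad_accum,
                      num_devices, epochs, warmup_ratio):
    """Estimate step counts implied by a set of training arguments."""
    effective_batch = per_device_batch * grad_accum * num_devices
    steps_per_epoch = math.ceil(num_examples / effective_batch)
    total_steps = steps_per_epoch * epochs
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    return {
        "effective_batch": effective_batch,
        "steps_per_epoch": steps_per_epoch,
        "total_steps": total_steps,
        "warmup_steps": warmup_steps,
    }

# Hypothetical dataset size and device count; other values match the arguments above
plan = training_schedule(
    num_examples=10_000, per_device_batch=8, grad_accum=4,
    num_devices=1, epochs=3, warmup_ratio=0.1,
)
print(plan)
```

With these numbers the run has fewer than 1,000 optimizer steps, so `eval_steps=500` would trigger only one evaluation; on small datasets a lower value may be more useful.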

4. Hyperparameter Optimization

Key Hyperparameters:

| Param | Typical Range | Impact |
|-------|--------------|--------|
| Learning rate | 1e-5 to 1e-3 | High |
| Batch size | 8-128 | Medium |
| Epochs | 3-10 | Medium |
| Weight decay | 0.01-0.1 | Low |
| Warmup ratio | 0.05-0.1 | Low |

Optuna Integration:

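import optuna

# train_model and evaluate_model are placeholders for your own functions.
# evaluate_model should return a score where higher is better (for example,
# validation accuracy), because the study direction is "maximize".
def objective(trial):
    lr = trial.suggest_float("lr", 1e-5, 1e-3, log=True)  # sampled on a log scale
    batch_size = trial.suggest_categorical("batch_size", [8, 16, 32])
    epochs = trial.suggest_int("epochs", 2, 5)

    model = train_model(lr=lr, batch_size=batch_size, epochs=epochs)
    return evaluate_model(model)

study = optuna.create_study(direction="maximize")
study.optimize(objective, n_trials=50)

print("Best params:", study.best_params)
print("Best score:", study.best_value)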
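
The `log=True` flag matters for the learning rate because the search range spans two orders of magnitude. The sketch below shows log-uniform sampling in plain Python: the logarithm of the value is drawn uniformly, so each decade gets roughly the same share of trials. This is plain random sampling for illustration, not Optuna's own sampler, and the helper name is hypothetical.

```python
import math
import random

def sample_log_uniform(rng, low, high):
    """Draw a value whose logarithm is uniform on [log(low), log(high)]."""
    return math.exp(rng.uniform(math.log(low), math.log(high)))

rng = random.Random(0)  # fixed seed so the sketch is repeatable
samples = [sample_log_uniform(rng, 1e-5, 1e-3) for _ in range(10_000)]

# Share of samples in the lower decade [1e-5, 1e-4)
below_1e4 = sum(s < 1e-4 for s in samples) / len(samples)
print(f"share of samples below 1e-4: {below_1e4:.2f}")
```

About half of the samples land below 1e-4. With linear sampling over the same range, only about 9% would, so small learning rates would rarely be tried.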

5. Common Training Issues

| Issue | Symptoms | Solutions |
|-------|----------|-----------|
| Overfitting | Val loss increases | Dropout, regularization, more data |
| Underfitting | Both losses high | More capacity, longer training |
| Gradient explosion | NaN losses | Gradient clipping, lower LR |
| Slow convergence | Loss plateaus | Learning rate schedule, warmup |
| OOM errors | CUDA out of memory | Gradient accumulation, smaller batch |
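
Gradient clipping, listed above as a remedy for gradient explosion, rescales all gradients together when their combined norm exceeds a threshold. The sketch below shows the idea in plain Python on nested lists. It mirrors the concept behind `torch.nn.utils.clip_grad_norm_` but is not PyTorch's implementation, and the function name and gradient values are hypothetical.

```python
import math

def clip_by_global_norm(grads, max_norm, eps=1e-6):
    """Scale all gradients so their combined L2 norm is at most max_norm."""
    total_norm = math.sqrt(sum(g * g for vec in grads for g in vec))
    # Only shrink gradients; leave them untouched when already within the limit
    scale = min(1.0, max_norm / (total_norm + eps))
    clipped = [[g * scale for g in vec] for vec in grads]
    return clipped, total_norm

grads = [[3.0, 4.0], [12.0]]   # hypothetical gradients; global norm is 13
clipped, norm_before = clip_by_global_norm(grads, max_norm=1.0)
norm_after = math.sqrt(sum(g * g for vec in clipped for g in vec))
print(f"norm before: {norm_before:.2f}, after: {norm_after:.2f}")
```

Because every gradient is multiplied by the same factor, the update direction is preserved and only its magnitude is capped.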

Output Format

Provide:

Training configuration
Code implementation
Monitoring setup
Evaluation strategy
Troubleshooting guide

Stars: 4
Category: data-ai
Updated: Apr 17, 2026

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

npx skillsauth add ViggyV/claude-skills model-trainer

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status | Percentage |
|---------|-------------|--------|------------|
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: Apr 24, 2026, 9:03 PM · 1.8s · 1 file scanned

SKILL.md

name: model-trainer
description: Model Trainer

Related Skills

ViggyV/stable-baselines3
data-ai
Verified, Trusted, Community
Use this skill for reinforcement learning tasks including training RL agents (PPO, SAC, DQN, TD3, DDPG, A2C, etc.), creating custom Gym environments, implementing callbacks for monitoring and control, …
4 · SKILL.md · Updated Apr 18, 2026

ViggyV/SQL Optimizer
testing
Verified, Trusted, Community
You are an expert at optimizing SQL queries for performance and efficiency.
4 · SKILL.md · Updated Apr 18, 2026

ViggyV/slack-gif-creator
tools
Verified, Trusted, Community
Knowledge and utilities for creating animated GIFs optimized for Slack. Provides constraints, validation tools, and animation concepts. Use when users request animated GIFs for Slack like "make me a G…
4 · SKILL.md · Updated Apr 18, 2026

ViggyV/ios-simulator-skill
tools
Verified, Trusted, Community
21 production-ready scripts for iOS app testing, building, and automation. Provides semantic UI navigation, build automation, accessibility testing, and simulator lifecycle management. Optimized for A…
4 · SKILL.md · Updated Apr 18, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Install manually

# Clone the repo
git clone https://github.com/ViggyV/claude-skills.git

# Make sure the global Claude Code skills folder exists
mkdir -p ~/.claude/skills

# Copy into Claude Code skills folder (global)
cp -r claude-skills/.claude/skills/ai-ml-development/model-trainer ~/.claude/skills/

Repository

ViggyV/claude-skills

4 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT