Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

awslabs/finetuning

Name: finetuning
Author: awslabs

plugins/sagemaker-ai/skills/finetuning/SKILL.md

npx skillsauth add awslabs/agent-plugins finetuning

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Prerequisites

Before starting this workflow, verify:

A use_case_spec.md file exists
- If missing: Activate the use-case-specification skill first, then resume
- DON'T EVER offer to create a use case spec without activating the use-case-specification skill.
A fine-tuning technique (SFT, DPO, or RLVR) and base model have already been selected
- If missing: Activate the finetuning-setup skill to collect what's missing, then resume
- Don't make recommendations on the spot. You MUST activate the finetuning-setup skill.
A base model name available on SageMakerHub has been identified
- If missing: Activate the finetuning-setup skill to get it
- Important: Only use the model name that finetuning-setup retrieves, as it may differ from other commonly used names for the same model

Critical Rules

Code Generation Rules

✅ Use EXACTLY the imports shown in each cell template
❌ Do NOT add additional imports even if they seem helpful
❌ Do NOT create variables before they're needed in that cell
📋 Copy the code structure precisely - no improvisation
🎯 Follow the minimal code principle strictly
✅ When writing a notebook cell, make sure the indentation and f strings are correct
✅ Write notebooks using your standard file write tool to create the .ipynb file with the complete notebook JSON, OR use notebook MCP tools (e.g., create_notebook, add_cell) if available
❌ Do NOT use bash commands, shell scripts, or echo/cat piping to generate notebooks

User Communication Rules

❌ NEVER offer to run the notebook for the user (you don't have the tools)
❌ NEVER offer to move on to a downstream skill while training is in progress (logically impossible)
❌ NEVER set ACCEPT_EULA to True yourself for Meta/Llama models (user must read and agree)
✅ Always mention both the number AND title of cells you reference
✅ If user asks how to run: Tell them to run cells one by one, mention ipykernel requirement

Workflow

1. Notebook Setup

1.1 Directory Setup

Identify project directory from conversation context
- If unclear (multiple relevant directories exist) → Ask user which folder to use
- If no project directory exists → activate the directory-management skill to set one up
Check if the project notebook already exists at <project-dir>/notebooks/<project-name>.ipynb
- If it exists → ask: "Would you like me to append the fine-tuning cells to the existing notebook, or create a new one?"
- If it doesn't exist → create it
When appending, add a markdown header cell ## Fine-Tuning as a section divider before the new cells

⏸ Wait for user.

1.2 Select Reference Template

Read the example notebook matching the finetuning strategy:

SFT → references/sft_example.md
DPO → references/dpo_example.md
RLVR → references/rlvr_example.md

1.3 Copy Notebook Structure

Write the exact cells from the example to the project notebook
Use same order, dependencies, and imports as the example
DO NOT improvise or add extra code
If the model is NOT a Meta/Llama model (model ID does NOT start with meta-):
- Omit the ACCEPT_EULA = False line from the config cell
- Omit the accept_eula=ACCEPT_EULA, line from the trainer call
If the model is in the Nova family, exclude print and override statements for the following hyperparameters: max_epochs and lr_warmup_ratio

1.4 Auto-Generate Configuration Values

In the 'Setup & Credentials' cell, populate:

BASE_MODEL
- Use the exact SageMakerHub model name from context
MODEL_PACKAGE_GROUP_NAME
- Generate from use case (read use_case_spec.md if needed)
- Format rules:
  - Lowercase, alphanumeric with hyphens only
  - 1-63 characters
  - Pattern: [a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}
  - Example: "Customer Support Chatbot" → customer-support-chatbot-v1
Save notebook

2. RLVR Reward Function (for RLVR only, skip this section if technique is SFT or DPO)

2.1 Check Reward Function Status

Ask if user has a reward function already, or would like help creating one.
- If user says they have one → Ask for the SageMaker Hub Evaluator ARN. Only proceed to Section 2.3 once the user provides a valid Evaluator ARN. If they don't have it registered as a SageMaker Hub Evaluator, continue to 2.2.
- If user says they do not have one → Continue to 2.2

2.2 Generate Reward Function From Template

Follow workflow in references/rlvr_reward_function.md section "Helping Users Create Lambda Functions"

2.3 Set CUSTOM_REWARD_FUNCTION value

Set the value for CUSTOM_REWARD_FUNCTION in the Notebook with the ARN of the reward function (either given directly by the user, or from the function generation code as evaluator.arn).

3. EULA review and acceptance

Look up the official license link for the selected base model from references/eula_links.md
Display the license to the user following the phrasing in references/eula_links.md. For OSS models: "This model is licensed under {License}. Please review the license terms here: {URL}." For Nova models: "This model is subject to the AWS Service Terms: {URL}."
Check if the selected base model is a Meta/Llama model (model ID starts with meta-)
- If Meta/Llama: Tell the user they must read and agree to the EULA before using this model. Ask them to manually change ACCEPT_EULA to True in the notebook after reviewing the license. NEVER set ACCEPT_EULA to True yourself for Meta/Llama models.
- If non-Meta: Inform the user of the license for their awareness. No code-level action needed — the ACCEPT_EULA variable and accept_eula parameter should already be omitted from the notebook (see Step 1.3).

4. Notebook Execution

Display the following to the user:: I have updated your Jupyter Notebook with the finetuning code. If you run it cell by cell, you should be able to launch your SageMaker Training job. Training takes a while. Please monitor the progress and let me know when it's complete so I can help you get to the next step in your plan.
Wait for user's confirmation about training completion. Once the user has confirmed, you are free to move to the next step of the plan.

CRITICAL:

DON'T suggest moving to next steps before training completes
DON'T elaborate on the next steps unless the user specifically asks you about them.

5. Continuous Customization

If the user wants to finetune a model they had already customized, follow the instructions in references/continuous_customization.md

References

rlvr_reward_function.md - Lambda reward function creation guide (RLVR only)
templates/rlvr_reward_function_source_template.py - Lambda reward function source template for open-weights models (RLVR only)
templates/nova_rlvr_reward_function_source_template.py - Lambda reward function source template for Nova 2.0 Lite (RLVR only)
sft_example.md - Complete notebook template for Supervised Fine-Tuning
dpo_example.md - Complete notebook template for Direct Preference Optimization
rlvr_example.md - Complete notebook template for Reinforcement Learning from Verifiable Rewards
continuous_customization.md - Instructions on fine-tuning an already fine-tuned model.

awslabs/finetuning

plugins/sagemaker-ai/skills/finetuning/SKILL.md

Generates a Jupyter notebook that fine-tunes a base model using SageMaker serverless training jobs. Use when the user says "start training", "fine-tune my model", "I'm ready to train", or when the plan reaches the finetuning step. Supports SFT, DPO, and RLVR trainers, including RLVR Lambda reward function creation.

611 stars

data-ai

Updated Apr 24, 2026

$ install --global

skillsauth

npx skillsauth add awslabs/agent-plugins finetuning

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 24, 2026, 9:49 AM43.1s9 files scanned

SKILL.md

name:: finetuning
description:: Generates a Jupyter notebook that fine-tunes a base model using SageMaker serverless training jobs. Use when the user says "start training", "fine-tune my model", "I'm ready to train", or when the plan reaches the finetuning step. Supports SFT, DPO, and RLVR trainers, including RLVR Lambda reward function creation.
version:: 1.0.0

Prerequisites

Before starting this workflow, verify:

A use_case_spec.md file exists
- If missing: Activate the use-case-specification skill first, then resume
- DON'T EVER offer to create a use case spec without activating the use-case-specification skill.
A fine-tuning technique (SFT, DPO, or RLVR) and base model have already been selected
- If missing: Activate the finetuning-setup skill to collect what's missing, then resume
- Don't make recommendations on the spot. You MUST activate the finetuning-setup skill.
A base model name available on SageMakerHub has been identified
- If missing: Activate the finetuning-setup skill to get it
- Important: Only use the model name that finetuning-setup retrieves, as it may differ from other commonly used names for the same model

Critical Rules

Code Generation Rules

✅ Use EXACTLY the imports shown in each cell template
❌ Do NOT add additional imports even if they seem helpful
❌ Do NOT create variables before they're needed in that cell
📋 Copy the code structure precisely - no improvisation
🎯 Follow the minimal code principle strictly
✅ When writing a notebook cell, make sure the indentation and f strings are correct
✅ Write notebooks using your standard file write tool to create the .ipynb file with the complete notebook JSON, OR use notebook MCP tools (e.g., create_notebook, add_cell) if available
❌ Do NOT use bash commands, shell scripts, or echo/cat piping to generate notebooks

User Communication Rules

❌ NEVER offer to run the notebook for the user (you don't have the tools)
❌ NEVER offer to move on to a downstream skill while training is in progress (logically impossible)
❌ NEVER set ACCEPT_EULA to True yourself for Meta/Llama models (user must read and agree)
✅ Always mention both the number AND title of cells you reference
✅ If user asks how to run: Tell them to run cells one by one, mention ipykernel requirement

Workflow

1. Notebook Setup

1.1 Directory Setup

Identify project directory from conversation context
- If unclear (multiple relevant directories exist) → Ask user which folder to use
- If no project directory exists → activate the directory-management skill to set one up
Check if the project notebook already exists at <project-dir>/notebooks/<project-name>.ipynb
- If it exists → ask: "Would you like me to append the fine-tuning cells to the existing notebook, or create a new one?"
- If it doesn't exist → create it
When appending, add a markdown header cell ## Fine-Tuning as a section divider before the new cells

⏸ Wait for user.

1.2 Select Reference Template

Read the example notebook matching the finetuning strategy:

SFT → references/sft_example.md
DPO → references/dpo_example.md
RLVR → references/rlvr_example.md

1.3 Copy Notebook Structure

Write the exact cells from the example to the project notebook
Use same order, dependencies, and imports as the example
DO NOT improvise or add extra code
If the model is NOT a Meta/Llama model (model ID does NOT start with meta-):
- Omit the ACCEPT_EULA = False line from the config cell
- Omit the accept_eula=ACCEPT_EULA, line from the trainer call
If the model is in the Nova family, exclude print and override statements for the following hyperparameters: max_epochs and lr_warmup_ratio

1.4 Auto-Generate Configuration Values

In the 'Setup & Credentials' cell, populate:

BASE_MODEL
- Use the exact SageMakerHub model name from context
MODEL_PACKAGE_GROUP_NAME
- Generate from use case (read use_case_spec.md if needed)
- Format rules:
  - Lowercase, alphanumeric with hyphens only
  - 1-63 characters
  - Pattern: [a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}
  - Example: "Customer Support Chatbot" → customer-support-chatbot-v1
Save notebook

2. RLVR Reward Function (for RLVR only, skip this section if technique is SFT or DPO)

2.1 Check Reward Function Status

Ask if user has a reward function already, or would like help creating one.
- If user says they have one → Ask for the SageMaker Hub Evaluator ARN. Only proceed to Section 2.3 once the user provides a valid Evaluator ARN. If they don't have it registered as a SageMaker Hub Evaluator, continue to 2.2.
- If user says they do not have one → Continue to 2.2

2.2 Generate Reward Function From Template

Follow workflow in references/rlvr_reward_function.md section "Helping Users Create Lambda Functions"

2.3 Set CUSTOM_REWARD_FUNCTION value

Set the value for CUSTOM_REWARD_FUNCTION in the Notebook with the ARN of the reward function (either given directly by the user, or from the function generation code as evaluator.arn).

3. EULA review and acceptance

Look up the official license link for the selected base model from references/eula_links.md
Display the license to the user following the phrasing in references/eula_links.md. For OSS models: "This model is licensed under {License}. Please review the license terms here: {URL}." For Nova models: "This model is subject to the AWS Service Terms: {URL}."
Check if the selected base model is a Meta/Llama model (model ID starts with meta-)
- If Meta/Llama: Tell the user they must read and agree to the EULA before using this model. Ask them to manually change ACCEPT_EULA to True in the notebook after reviewing the license. NEVER set ACCEPT_EULA to True yourself for Meta/Llama models.
- If non-Meta: Inform the user of the license for their awareness. No code-level action needed — the ACCEPT_EULA variable and accept_eula parameter should already be omitted from the notebook (see Step 1.3).

4. Notebook Execution

Display the following to the user:: I have updated your Jupyter Notebook with the finetuning code. If you run it cell by cell, you should be able to launch your SageMaker Training job. Training takes a while. Please monitor the progress and let me know when it's complete so I can help you get to the next step in your plan.
Wait for user's confirmation about training completion. Once the user has confirmed, you are free to move to the next step of the plan.

CRITICAL:

DON'T suggest moving to next steps before training completes
DON'T elaborate on the next steps unless the user specifically asks you about them.

5. Continuous Customization

If the user wants to finetune a model they had already customized, follow the instructions in references/continuous_customization.md

References

rlvr_reward_function.md - Lambda reward function creation guide (RLVR only)
templates/rlvr_reward_function_source_template.py - Lambda reward function source template for open-weights models (RLVR only)
templates/nova_rlvr_reward_function_source_template.py - Lambda reward function source template for Nova 2.0 Lite (RLVR only)
sft_example.md - Complete notebook template for Supervised Fine-Tuning
dpo_example.md - Complete notebook template for Direct Preference Optimization
rlvr_example.md - Complete notebook template for Reinforcement Learning from Verifiable Rewards
continuous_customization.md - Instructions on fine-tuning an already fine-tuned model.

Related Skills

awslabs/elastic-beanstalk

development

VerifiedTrustedCommunity

Deploy to AWS Elastic Beanstalk. Triggers on: elastic beanstalk, EB, managed EC2 platform, web app with managed patching, worker on EC2, Heroku alternative, don't want to manage servers or containers, migrate from Heroku, managed operational lifecycle. Covers Elastic Beanstalk on EC2 for web and worker applications.

772SKILL.mdUpdated Jun 4, 2026

awslabs/elastic-beanstalk

awslabs/aws-lambda-managed-instances

testing

VerifiedTrustedCommunity

Evaluate, configure, and migrate workloads to AWS Lambda Managed Instances (LMI). Triggers on: Lambda Managed Instances, LMI, capacity provider, multi-concurrency Lambda, dedicated instance Lambda, EC2-backed Lambda, cold start elimination, Graviton Lambda, instance type for Lambda, Lambda cost optimization with Reserved Instances or Savings Plans. Also trigger when users describe high-volume predictable workloads seeking cost savings, or compare Lambda vs EC2 for steady-state traffic. For standard Lambda without LMI, use the aws-lambda skill instead.

772SKILL.mdUpdated Jun 4, 2026

awslabs/aws-lambda-managed-instances

awslabs/deploy

development

VerifiedTrustedCommunity

Deploy applications to AWS. Triggers on phrases like: deploy to AWS, host on AWS, run this on AWS, AWS architecture, estimate AWS cost, generate infrastructure. Analyzes any codebase and deploys to optimal AWS services.

772SKILL.mdUpdated Apr 3, 2026

awslabs/dsql

development

VerifiedTrustedCommunity

Build with Aurora DSQL — manage schemas, execute queries, handle migrations, diagnose query plans, load data, and develop applications with a serverless, distributed SQL database. Covers IAM auth, multi-tenant patterns, MySQL-to-DSQL migration, DDL operations, query plan explainability, SQL compatibility validation, and bulk data loading. Triggers on phrases like: DSQL, Aurora DSQL, create DSQL table, DSQL schema, migrate to DSQL, distributed SQL database, serverless PostgreSQL-compatible database, DSQL query plan, DSQL EXPLAIN ANALYZE, why is my DSQL query slow, aurora-dsql-loader, load CSV into DSQL.

772SKILL.mdUpdated Apr 3, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/awslabs/agent-plugins.git

# Copy into Claude Code skills folder (global)
cp -r agent-plugins/plugins/sagemaker-ai/skills/finetuning ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

awslabs/agent-plugins

611 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT