Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

abelrguezr/llm-instruction-finetuning

Name: llm-instruction-finetuning
Author: abelrguezr

skills/AI/AI-llm-architecture/7.2.-fine-tuning-to-follow-instructions/SKILL.md

npx skillsauth add abelrguezr/hacktricks-skills llm-instruction-finetuning

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

LLM Instruction Fine-Tuning

This skill guides you through fine-tuning a pre-trained LLM to follow instructions and respond to tasks like a chatbot, rather than just generating text.

When to Use This Skill

Use this skill when:

You have a dataset of instruction-response pairs and want to fine-tune an LLM
You need to format data in Alpaca or Phi-3 style for instruction tuning
You want to evaluate a fine-tuned model's response quality
You're building a chatbot or task-oriented AI assistant
You need to understand the complete instruction fine-tuning workflow

Overview

Instruction fine-tuning transforms a pre-trained language model into one that:

Understands it should respond to specific prompts
Follows the format and style of instruction-response pairs
Generates appropriate responses to user queries

Step 1: Prepare Your Dataset

Dataset Format

You need a dataset with instructions and responses. Common formats:

Alpaca Style:

Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
Calculate the area of a circle with a radius of 5 units.

### Response:
The area of a circle is calculated using the formula A = πr². Plugging in the radius of 5 units:
A = π(5)² = π × 25 = 25π square units.

Phi-3 Style:

<|User|>
Can you explain what gravity is in simple terms?

<|Assistant|>
Absolutely! Gravity is a force that pulls objects toward each other.

Format Your Data

Use the format_instruction_data.py script (in scripts/) to convert your raw instruction-response pairs into the desired format. This handles:

Adding instruction/response templates
Handling optional input fields
Creating training/validation/test splits

Step 2: Create Data Loaders

Key Processing Steps

Tokenize all texts using your tokenizer
Pad samples to the same length (typically your model's context length)
Create targets by shifting inputs by 1 token (for next-token prediction)
Mask padding tokens with -100 to exclude them from loss calculation
Optionally mask instruction tokens so the model only learns to generate responses

Why Mask with -100?

Using cross_entropy(..., ignore_index=-100) tells PyTorch to ignore targets with -100. This means:

Padding tokens don't contribute to loss
You can optionally mask the instruction portion so the model focuses on learning responses

Step 3: Load Pre-trained Model & Fine-Tune

Loading the Model

Load your pre-trained LLM (this was covered in previous training steps). Then use your existing training function to fine-tune.

Monitoring Training

Watch both training and validation loss:

Training loss decreasing: Model is learning
Validation loss decreasing: Model is generalizing
Validation loss increasing while training loss decreases: Overfitting is occurring

Action: Stop training at the epoch where validation loss starts increasing to avoid overfitting.

Step 4: Evaluate Response Quality

Why Manual Evaluation Matters

Unlike classification tasks, you can't trust loss alone for instruction tuning. The model might:

Generate correct format and syntax
But give completely wrong answers

Loss won't catch this. You must evaluate response quality.

Evaluation Methods

1. Manual Review

Generate responses on your test set
Review them manually for correctness
Check for hallucinations, wrong answers, or format issues

2. LLM-as-Judge

Pass generated responses and expected responses to another LLM
Ask it to evaluate quality, correctness, and helpfulness

3. Standardized Benchmarks

| Benchmark | What It Tests | Link | |-----------|---------------|------| | MMLU | Knowledge across 57 subjects (humanities, sciences, etc.) | https://arxiv.org/abs/2009.03300 | | LMSYS Chatbot Arena | Side-by-side chatbot comparison | https://arena.lmsys.org | | AlpacaEval | GPT-4 evaluates model responses | https://github.com/tatsu-lab/alpaca_eval | | GLUE | 9 NLU tasks (sentiment, entailment, QA) | https://gluebenchmark.com | | SuperGLUE | Harder version of GLUE | https://super.gluebenchmark.com | | BIG-bench | 200+ tasks (reasoning, translation, QA) | https://github.com/google/BIG-bench | | HELM | Comprehensive evaluation (accuracy, robustness, fairness) | https://crfm.stanford.edu/helm | | HumanEval | Code generation problems | https://github.com/openai/human-eval | | SQuAD | Question answering on Wikipedia | https://rajpurkar.github.io/SQuAD-explorer | | TriviaQA | Trivia questions with evidence | https://nlp.cs.washington.edu/triviaqa |

Common Issues & Solutions

Issue: Model Ignores Instructions

Cause: Not enough instruction-response examples, or poor formatting
Solution: Ensure consistent format, increase training data, verify masking is correct

Issue: Model Overfits

Cause: Training too long, dataset too small
Solution: Use early stopping on validation loss, add more diverse data

Issue: Model Generates Wrong Answers

Cause: Model memorized format but not content
Solution: Manual evaluation, use LLM-as-judge, check for hallucinations

Issue: Loss Doesn't Decrease

Cause: Learning rate too high/low, data formatting issues
Solution: Adjust learning rate, verify tokenization and masking

Best Practices

Start small: Fine-tune on a subset first to verify your pipeline works
Monitor both losses: Training AND validation loss tell different stories
Evaluate qualitatively: Don't trust loss alone for instruction tasks
Use early stopping: Prevent overfitting by watching validation loss
Test on diverse prompts: Ensure generalization beyond training distribution

References

Build a Large Language Model from Scratch
LLMs-from-scratch - Instruction Fine-Tuning Code

abelrguezr/llm-instruction-finetuning

skills/AI/AI-llm-architecture/7.2.-fine-tuning-to-follow-instructions/SKILL.md

How to fine-tune a pre-trained LLM to follow instructions and respond to tasks like a chatbot. Use this skill whenever the user wants to train an LLM on instruction-response pairs, format datasets for instruction tuning, evaluate fine-tuned model responses, or understand the complete instruction fine-tuning workflow. Make sure to use this skill when users mention instruction tuning, chatbot training, Alpaca format, Phi-3 format, or any scenario where they need to make an LLM respond to specific prompts rather than just generate text.

5 stars

development

Updated Apr 16, 2026

$ install --global

skillsauth

npx skillsauth add abelrguezr/hacktricks-skills llm-instruction-finetuning

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 16, 2026, 2:05 AM15.3s2 files scanned

SKILL.md

name:: llm-instruction-finetuning
description:: How to fine-tune a pre-trained LLM to follow instructions and respond to tasks like a chatbot. Use this skill whenever the user wants to train an LLM on instruction-response pairs, format datasets for instruction tuning, evaluate fine-tuned model responses, or understand the complete instruction fine-tuning workflow. Make sure to use this skill when users mention instruction tuning, chatbot training, Alpaca format, Phi-3 format, or any scenario where they need to make an LLM respond to specific prompts rather than just generate text.

LLM Instruction Fine-Tuning

This skill guides you through fine-tuning a pre-trained LLM to follow instructions and respond to tasks like a chatbot, rather than just generating text.

When to Use This Skill

Use this skill when:

You have a dataset of instruction-response pairs and want to fine-tune an LLM
You need to format data in Alpaca or Phi-3 style for instruction tuning
You want to evaluate a fine-tuned model's response quality
You're building a chatbot or task-oriented AI assistant
You need to understand the complete instruction fine-tuning workflow

Overview

Instruction fine-tuning transforms a pre-trained language model into one that:

Understands it should respond to specific prompts
Follows the format and style of instruction-response pairs
Generates appropriate responses to user queries

Step 1: Prepare Your Dataset

Dataset Format

You need a dataset with instructions and responses. Common formats:

Alpaca Style:

Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
Calculate the area of a circle with a radius of 5 units.

### Response:
The area of a circle is calculated using the formula A = πr². Plugging in the radius of 5 units:
A = π(5)² = π × 25 = 25π square units.

Phi-3 Style:

<|User|>
Can you explain what gravity is in simple terms?

<|Assistant|>
Absolutely! Gravity is a force that pulls objects toward each other.

Format Your Data

Use the format_instruction_data.py script (in scripts/) to convert your raw instruction-response pairs into the desired format. This handles:

Adding instruction/response templates
Handling optional input fields
Creating training/validation/test splits

Step 2: Create Data Loaders

Key Processing Steps

Tokenize all texts using your tokenizer
Pad samples to the same length (typically your model's context length)
Create targets by shifting inputs by 1 token (for next-token prediction)
Mask padding tokens with -100 to exclude them from loss calculation
Optionally mask instruction tokens so the model only learns to generate responses

Why Mask with -100?

Using cross_entropy(..., ignore_index=-100) tells PyTorch to ignore targets with -100. This means:

Padding tokens don't contribute to loss
You can optionally mask the instruction portion so the model focuses on learning responses

Step 3: Load Pre-trained Model & Fine-Tune

Loading the Model

Load your pre-trained LLM (this was covered in previous training steps). Then use your existing training function to fine-tune.

Monitoring Training

Watch both training and validation loss:

Training loss decreasing: Model is learning
Validation loss decreasing: Model is generalizing
Validation loss increasing while training loss decreases: Overfitting is occurring

Action: Stop training at the epoch where validation loss starts increasing to avoid overfitting.

Step 4: Evaluate Response Quality

Why Manual Evaluation Matters

Unlike classification tasks, you can't trust loss alone for instruction tuning. The model might:

Generate correct format and syntax
But give completely wrong answers

Loss won't catch this. You must evaluate response quality.

Evaluation Methods

1. Manual Review

Generate responses on your test set
Review them manually for correctness
Check for hallucinations, wrong answers, or format issues

2. LLM-as-Judge

Pass generated responses and expected responses to another LLM
Ask it to evaluate quality, correctness, and helpfulness

3. Standardized Benchmarks

Common Issues & Solutions

Issue: Model Ignores Instructions

Cause: Not enough instruction-response examples, or poor formatting
Solution: Ensure consistent format, increase training data, verify masking is correct

Issue: Model Overfits

Cause: Training too long, dataset too small
Solution: Use early stopping on validation loss, add more diverse data

Issue: Model Generates Wrong Answers

Cause: Model memorized format but not content
Solution: Manual evaluation, use LLM-as-judge, check for hallucinations

Issue: Loss Doesn't Decrease

Cause: Learning rate too high/low, data formatting issues
Solution: Adjust learning rate, verify tokenization and masking

Best Practices

Start small: Fine-tune on a subset first to verify your pipeline works
Monitor both losses: Training AND validation loss tell different stories
Evaluate qualitatively: Don't trust loss alone for instruction tasks
Use early stopping: Prevent overfitting by watching validation loss
Test on diverse prompts: Ensure generalization beyond training distribution

References

Build a Large Language Model from Scratch
LLMs-from-scratch - Instruction Fine-Tuning Code

Related Skills

abelrguezr/house-of-lore-exploit

testing

VerifiedTrustedCommunity

How to perform a House of Lore (small bin attack) heap exploitation. Use this skill whenever the user mentions heap exploitation, small bin attacks, fake chunks, glibc heap vulnerabilities, or needs to insert fake chunks into small bins for arbitrary read/write. Trigger for CTF challenges involving heap corruption, glibc 2.31+ exploitation, or when the user needs to bypass malloc sanity checks using fake chunk linking.

5SKILL.mdUpdated Apr 16, 2026

abelrguezr/house-of-lore-exploit

abelrguezr/house-of-force-exploit

testing

VerifiedTrustedCommunity

How to perform House of Force heap exploitation attacks. Use this skill whenever the user mentions heap exploitation, House of Force, top chunk manipulation, arbitrary memory allocation, malloc manipulation, or wants to allocate chunks at specific addresses. Also trigger for CTF challenges involving heap overflows, top chunk size overwrites, or when the user needs to calculate evil_size for heap attacks. Make sure to use this skill for any binary exploitation task involving glibc heap manipulation, even if they don't explicitly say "House of Force".

5SKILL.mdUpdated Apr 16, 2026

abelrguezr/house-of-force-exploit

abelrguezr/house-of-einherjar

tools

VerifiedTrustedCommunity

How to perform House of Einherjar heap exploitation to allocate memory at arbitrary addresses. Use this skill whenever the user mentions heap exploitation, glibc heap attacks, arbitrary memory allocation, off-by-one overflow exploitation, tcache poisoning, fast bin attacks, or any CTF challenge involving heap manipulation. This is essential for binary exploitation tasks where you need to control malloc() return addresses.

5SKILL.mdUpdated Apr 16, 2026

abelrguezr/house-of-einherjar

abelrguezr/heap-overflow-exploitation

testing

VerifiedTrustedCommunity

How to identify, analyze, and exploit heap overflow vulnerabilities in binary exploitation challenges and real-world scenarios. Use this skill whenever the user mentions heap overflows, memory corruption, heap grooming, tcache poisoning, fast-bin attacks, or any heap-related vulnerability in CTF challenges, binary analysis, or security research. This skill covers heap overflow fundamentals, exploitation techniques, heap grooming strategies, and real-world CVE analysis.

5SKILL.mdUpdated Apr 16, 2026

abelrguezr/heap-overflow-exploitation

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/abelrguezr/hacktricks-skills.git

# Copy into Claude Code skills folder (global)
cp -r hacktricks-skills/skills/AI/AI-llm-architecture/7.2.-fine-tuning-to-follow-instructions ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

abelrguezr/hacktricks-skills

5 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT