Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

abelrguezr/lora-fine-tuning

Name: lora-fine-tuning
Author: abelrguezr

skills/AI/AI-llm-architecture/7.0.-lora-improvements-in-fine-tuning/SKILL.md

npx skillsauth add abelrguezr/hacktricks-skills lora-fine-tuning

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

LoRA Fine-Tuning Implementation

This skill helps you implement LoRA (Low-Rank Adaptation) for efficient fine-tuning of large language models. LoRA reduces training costs by only updating small adapter matrices instead of the entire model.

When to Use LoRA

Use LoRA when you need to:

Fine-tune a large model on limited hardware (less GPU memory)
Adapt a pre-trained model to a new task without full retraining
Store multiple task-specific adaptations efficiently
Reduce training time and computational costs

How LoRA Works

LoRA replaces standard linear layers with a combination of:

Original frozen weights (preserved from pre-training)
Two small trainable matrices (A and B) that approximate weight updates

The forward pass becomes: output = original_linear(x) + alpha * (x @ A @ B)

Key Benefits

Fewer trainable parameters: Only matrices A and B are updated
Preserved knowledge: Original model weights stay frozen
Storage efficiency: Save only small LoRA matrices per task
Faster training: Less computation per gradient update

Implementation

Step 1: Define LoRA Components

Use the scripts/lora_layers.py module which provides:

LoRALayer: The low-rank adapter with matrices A and B
LinearWithLoRA: Wrapper combining original linear layer with LoRA
replace_linear_with_lora(): Recursive function to convert all linear layers

Step 2: Apply LoRA to Your Model

from scripts.lora_layers import replace_linear_with_lora

# Choose rank and alpha (typical values)
rank = 8  # Lower rank = fewer parameters, higher compression
alpha = 16  # Scaling factor (often 2x rank)

# Apply LoRA to your model
model = replace_linear_with_lora(model, rank=rank, alpha=alpha)

Step 3: Configure Training

Only the LoRA parameters need gradients:

# Freeze original model parameters
for param in model.parameters():
    param.requires_grad = False

# Enable gradients only for LoRA matrices
for name, param in model.named_parameters():
    if 'lora' in name:
        param.requires_grad = True

Step 4: Train as Usual

Your training loop remains the same. The optimizer will only update LoRA parameters.

Parameter Selection Guide

| Rank | Use Case | Trainable Params (approx) | |------|----------|---------------------------| | 4-8 | Small tasks, limited data | ~0.1% of model | | 8-16 | Standard fine-tuning | ~0.2-0.4% of model | | 16-32 | Complex tasks, more data | ~0.4-0.8% of model |

Alpha: Typically set to 2x rank (e.g., rank=8, alpha=16)

Example: Fine-Tuning a Transformer

import torch
from transformers import AutoModelForCausalLM
from scripts.lora_layers import replace_linear_with_lora

# Load pre-trained model
model = AutoModelForCausalLM.from_pretrained("your-model")

# Apply LoRA
rank = 8
alpha = 16
model = replace_linear_with_lora(model, rank, alpha)

# Freeze and enable LoRA gradients
for param in model.parameters():
    param.requires_grad = False

for name, param in model.named_parameters():
    if 'lora' in name:
        param.requires_grad = True

# Count trainable parameters
total_params = sum(p.numel() for p in model.parameters())
trainable_params = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"Trainable: {trainable_params:,} ({100*trainable_params/total_params:.2f}%)")

# Train normally
optimizer = torch.optim.AdamW(filter(lambda p: p.requires_grad, model.parameters()), lr=1e-4)

Merging LoRA Weights (Optional)

After training, you can merge LoRA weights back into the original model for inference:

def merge_lora_weights(model):
    for name, module in model.named_modules():
        if isinstance(module, LinearWithLoRA):
            # Merge: W_merged = W_original + alpha * B @ A
            with torch.no_grad():
                merged_weight = module.linear.weight + module.lora.alpha * (module.lora.B @ module.lora.A)
                module.linear.weight = torch.nn.Parameter(merged_weight)
            # Replace with plain linear
            setattr(model, name.split('.')[-1], module.linear)

Common Issues and Solutions

Issue: Out of memory during training

Solution: Reduce rank (try 4 or 8), use gradient accumulation, or enable mixed precision

Issue: Poor fine-tuning quality

Solution: Increase rank, check learning rate, ensure sufficient training data

Issue: LoRA not being applied

Solution: Verify replace_linear_with_lora was called before training, check that LoRA parameters have requires_grad=True

References

Build a Large Language Model from Scratch
LoRA Implementation Example

abelrguezr/lora-fine-tuning

skills/AI/AI-llm-architecture/7.0.-lora-improvements-in-fine-tuning/SKILL.md

How to implement LoRA (Low-Rank Adaptation) for efficient fine-tuning of large language models. Use this skill whenever the user wants to fine-tune an LLM, reduce training memory/compute requirements, implement parameter-efficient fine-tuning, or adapt a pre-trained model to a new task without retraining all parameters. Make sure to use this skill when users mention fine-tuning, LoRA, PEFT, parameter-efficient training, or want to train on limited hardware.

5 stars

testing

Updated Apr 16, 2026

$ install --global

skillsauth

npx skillsauth add abelrguezr/hacktricks-skills lora-fine-tuning

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 16, 2026, 2:05 AM32.8s2 files scanned

SKILL.md

name:: lora-fine-tuning
description:: How to implement LoRA (Low-Rank Adaptation) for efficient fine-tuning of large language models. Use this skill whenever the user wants to fine-tune an LLM, reduce training memory/compute requirements, implement parameter-efficient fine-tuning, or adapt a pre-trained model to a new task without retraining all parameters. Make sure to use this skill when users mention fine-tuning, LoRA, PEFT, parameter-efficient training, or want to train on limited hardware.

LoRA Fine-Tuning Implementation

When to Use LoRA

Use LoRA when you need to:

Fine-tune a large model on limited hardware (less GPU memory)
Adapt a pre-trained model to a new task without full retraining
Store multiple task-specific adaptations efficiently
Reduce training time and computational costs

How LoRA Works

LoRA replaces standard linear layers with a combination of:

Original frozen weights (preserved from pre-training)
Two small trainable matrices (A and B) that approximate weight updates

The forward pass becomes: output = original_linear(x) + alpha * (x @ A @ B)

Key Benefits

Fewer trainable parameters: Only matrices A and B are updated
Preserved knowledge: Original model weights stay frozen
Storage efficiency: Save only small LoRA matrices per task
Faster training: Less computation per gradient update

Implementation

Step 1: Define LoRA Components

Use the scripts/lora_layers.py module which provides:

LoRALayer: The low-rank adapter with matrices A and B
LinearWithLoRA: Wrapper combining original linear layer with LoRA
replace_linear_with_lora(): Recursive function to convert all linear layers

Step 2: Apply LoRA to Your Model

from scripts.lora_layers import replace_linear_with_lora

# Choose rank and alpha (typical values)
rank = 8  # Lower rank = fewer parameters, higher compression
alpha = 16  # Scaling factor (often 2x rank)

# Apply LoRA to your model
model = replace_linear_with_lora(model, rank=rank, alpha=alpha)

Step 3: Configure Training

Only the LoRA parameters need gradients:

# Freeze original model parameters
for param in model.parameters():
    param.requires_grad = False

# Enable gradients only for LoRA matrices
for name, param in model.named_parameters():
    if 'lora' in name:
        param.requires_grad = True

Step 4: Train as Usual

Your training loop remains the same. The optimizer will only update LoRA parameters.

Parameter Selection Guide

Alpha: Typically set to 2x rank (e.g., rank=8, alpha=16)

Example: Fine-Tuning a Transformer

import torch
from transformers import AutoModelForCausalLM
from scripts.lora_layers import replace_linear_with_lora

# Load pre-trained model
model = AutoModelForCausalLM.from_pretrained("your-model")

# Apply LoRA
rank = 8
alpha = 16
model = replace_linear_with_lora(model, rank, alpha)

# Freeze and enable LoRA gradients
for param in model.parameters():
    param.requires_grad = False

for name, param in model.named_parameters():
    if 'lora' in name:
        param.requires_grad = True

# Count trainable parameters
total_params = sum(p.numel() for p in model.parameters())
trainable_params = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"Trainable: {trainable_params:,} ({100*trainable_params/total_params:.2f}%)")

# Train normally
optimizer = torch.optim.AdamW(filter(lambda p: p.requires_grad, model.parameters()), lr=1e-4)

Merging LoRA Weights (Optional)

After training, you can merge LoRA weights back into the original model for inference:

def merge_lora_weights(model):
    for name, module in model.named_modules():
        if isinstance(module, LinearWithLoRA):
            # Merge: W_merged = W_original + alpha * B @ A
            with torch.no_grad():
                merged_weight = module.linear.weight + module.lora.alpha * (module.lora.B @ module.lora.A)
                module.linear.weight = torch.nn.Parameter(merged_weight)
            # Replace with plain linear
            setattr(model, name.split('.')[-1], module.linear)

Common Issues and Solutions

Issue: Out of memory during training

Solution: Reduce rank (try 4 or 8), use gradient accumulation, or enable mixed precision

Issue: Poor fine-tuning quality

Solution: Increase rank, check learning rate, ensure sufficient training data

Issue: LoRA not being applied

Solution: Verify replace_linear_with_lora was called before training, check that LoRA parameters have requires_grad=True

References

Build a Large Language Model from Scratch
LoRA Implementation Example

Related Skills

abelrguezr/house-of-lore-exploit

testing

VerifiedTrustedCommunity

How to perform a House of Lore (small bin attack) heap exploitation. Use this skill whenever the user mentions heap exploitation, small bin attacks, fake chunks, glibc heap vulnerabilities, or needs to insert fake chunks into small bins for arbitrary read/write. Trigger for CTF challenges involving heap corruption, glibc 2.31+ exploitation, or when the user needs to bypass malloc sanity checks using fake chunk linking.

5SKILL.mdUpdated Apr 16, 2026

abelrguezr/house-of-lore-exploit

abelrguezr/house-of-force-exploit

testing

VerifiedTrustedCommunity

How to perform House of Force heap exploitation attacks. Use this skill whenever the user mentions heap exploitation, House of Force, top chunk manipulation, arbitrary memory allocation, malloc manipulation, or wants to allocate chunks at specific addresses. Also trigger for CTF challenges involving heap overflows, top chunk size overwrites, or when the user needs to calculate evil_size for heap attacks. Make sure to use this skill for any binary exploitation task involving glibc heap manipulation, even if they don't explicitly say "House of Force".

5SKILL.mdUpdated Apr 16, 2026

abelrguezr/house-of-force-exploit

abelrguezr/house-of-einherjar

tools

VerifiedTrustedCommunity

How to perform House of Einherjar heap exploitation to allocate memory at arbitrary addresses. Use this skill whenever the user mentions heap exploitation, glibc heap attacks, arbitrary memory allocation, off-by-one overflow exploitation, tcache poisoning, fast bin attacks, or any CTF challenge involving heap manipulation. This is essential for binary exploitation tasks where you need to control malloc() return addresses.

5SKILL.mdUpdated Apr 16, 2026

abelrguezr/house-of-einherjar

abelrguezr/heap-overflow-exploitation

testing

VerifiedTrustedCommunity

How to identify, analyze, and exploit heap overflow vulnerabilities in binary exploitation challenges and real-world scenarios. Use this skill whenever the user mentions heap overflows, memory corruption, heap grooming, tcache poisoning, fast-bin attacks, or any heap-related vulnerability in CTF challenges, binary analysis, or security research. This skill covers heap overflow fundamentals, exploitation techniques, heap grooming strategies, and real-world CVE analysis.

5SKILL.mdUpdated Apr 16, 2026

abelrguezr/heap-overflow-exploitation

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/abelrguezr/hacktricks-skills.git

# Copy into Claude Code skills folder (global)
cp -r hacktricks-skills/skills/AI/AI-llm-architecture/7.0.-lora-improvements-in-fine-tuning ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

abelrguezr/hacktricks-skills

5 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT