jupyter/skills/notebook-finetuning/SKILL.md
Unsloth fine-tuning notebook collection provisioned into the workspace volume at deploy time. Data-only candy — no packages, no services, no dependencies. Use when working with notebook-finetuning, Unsloth training notebooks, or unsloth-studio data provisioning.
npx skillsauth add overthinkos/overthink-plugins notebook-finetuningInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
| Property | Value |
|----------|-------|
| Dependencies | (none) |
| Packages | (none) |
| Services | (none) |
| Volumes | workspace -> /workspace (from unsloth-studio) |
| Data | data/finetuning -> workspace volume, dest: finetuning |
| Install files | (none) |
This is a data candy — it uses the data: field in charly.yml to map a directory of notebooks to a named volume with a subdirectory destination:
info: "Unsloth fine-tuning notebook collection for unsloth-studio"
data:
- src: data/finetuning
volume: workspace
dest: finetuning
At build time, the contents of data/finetuning/ are staged into /data/workspace/finetuning/ inside the box.
At deploy time, when the workspace volume is configured as a bind mount (charly config --bind workspace), charly config copies the staged data into the host-backed volume directory at <workspace>/finetuning/. This seeds the volume with ready-to-use training notebooks.
The dest: finetuning field places the notebooks in a subdirectory rather than the volume root, keeping the workspace organized alongside other content.
37 Jupyter notebooks + 1 manifest, organized by training category:
| Category | Notebooks | Models |
|----------|-----------|--------|
| 00-Setup | 00_Unsloth_Setup.ipynb | (general) |
| 01-FastInference | 3 notebooks | Llama, Qwen, Qwen_Think |
| 02-Vision Training | 2 notebooks | Ministral, Pixtral |
| 03-SFT Training | 5 notebooks | Ministral (text + vision), Pixtral, Qwen, Qwen_Think |
| 04-GRPO Training | 5 notebooks | Ministral (text + vision), Pixtral, Qwen, Qwen_Think |
| 05-DPO Training | 2 notebooks | Qwen, Qwen_Think |
| 06-Reward Training | 2 notebooks | Qwen, Qwen_Think |
| 07-RLOO Training | 5 notebooks | Ministral (text + vision), Pixtral, Qwen, Qwen_Think |
| 08-QLoRA | 12 notebooks | Ministral, Qwen_Think (alpha scaling, continual learning, rank comparison, multi-adapter, quantization comparison, target modules) |
Manifest: notebooks.yaml — structured catalog of all notebooks with metadata.
# charly.yml
unsloth-studio:
candy:
- unsloth-studio
- notebook-finetuning
# ... other candies
# Deploy with bind-backed workspace volume
charly config unsloth-studio --bind workspace
# Notebooks are seeded at <workspace>/finetuning/ on first config
charly start unsloth-studio
# Open http://localhost:8888 → navigate to finetuning/
The notebooks include several workarounds for upstream library changes:
packing=True in SFTConfig (19 notebooks) -- Required for TRL 1.0 compatibility. Without it, SFT training fails with the updated TRL APIos.environ["UNSLOTH_ENABLE_FLEX_ATTENTION"] = "0" (16 Ministral/Pixtral notebooks) -- Disables flex_attention to work around a transformers 5.5 bug that crashes these model architecturesmax_memory={0: "14GB"} in model loading (3 Pixtral-12B notebooks) -- Fixes accelerate device_map estimation for Pixtral-12B models that would otherwise OOMmax_prompt_length removed from DPOConfig (2 DPO notebooks) -- Parameter deprecated and removed in TRL 1.0/charly-jupyter:unsloth-studio/charly-jupyter:jupyter-ml-notebook/charly-image:layer -- data field documentation and candy authoring rules/charly-core:charly-config -- data provisioning during charly config setup/charly-core:deploy -- volume backing configuration (bind, named, encrypted)/charly-jupyter:unsloth-studio -- the Tier 2 meta-layer that owns the workspace volume/charly-jupyter:notebook-templates -- sibling data candy pattern (starter notebooks for jupyter)/charly-jupyter:unsloth-studio -- the box that includes this candyUse when the user asks about:
data: field with dest: subdirectory mapping/charly-eval:eval — declarative testing (eval: block, charly eval box, charly eval live)tools
OpenCharly CLI (charly) binary installed into container/VM images for in-container use. Use when working with charly binary deployment inside containers, native D-Bus support, or the full charly toolchain (charly binary + virtualization + gocryptfs + socat).
development
Operator CachyOS workstation profile — a kind:local template + target:local deploy that installs the full dev stack (30 candies) onto a CachyOS host via ShellExecutor. Lives in the overthinkos/cachyos submodule. MUST be invoked before editing or applying the charly-cachyos workstation profile.
tools
Fedora box with the full charly toolchain using shared candies. Rootless-first — runs as uid=1000 with passwordless sudo (no root, no cap_add: ALL). Same candy list as charly-arch. Includes NVIDIA GPU runtime. MUST be invoked before building, deploying, configuring, or troubleshooting the charly-fedora box.
tools
Arch Linux box with the full charly toolchain. Rootless-first — runs as uid=1000 with passwordless sudo (no root, no cap_add: ALL). Composes /charly-coder:charly-mcp so the box is reachable as an MCP gateway on port 18765. NVIDIA GPU runtime composed in. MUST be invoked before building, deploying, configuring, or troubleshooting the charly-arch box.