ov-jupyter/skills/unsloth-studio-layer/SKILL.md
Unsloth Studio fine-tuning web UI on ports 8888/8000 with vLLM inference. Tier 2 environment-owner meta-layer composing llama-cpp + unsloth, owns pixi.toml. Use when working with Unsloth Studio, the fine-tuning web UI, or the unsloth-studio image.
npx skillsauth add overthinkos/overthink-plugins unsloth-studioInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
| Property | Value |
|----------|-------|
| Dependencies | cuda, supervisord |
| Sub-layers | llama-cpp, unsloth |
| Ports | 8888 (Studio UI), 8000 (vLLM API) |
| Volumes | workspace -> ~/workspace |
| Service | unsloth-studio (supervisord) |
| Install files | layer.yml, pixi.toml |
This layer owns the pixi.toml for the fine-tuning environment and composes two Tier 1 layers via layers: [llama-cpp, unsloth]. Build order: pixi environment → llama-cpp (binaries) → unsloth (vLLM wheel + unsloth pip + patch) → supervisord config.
| Variable | Value |
|----------|-------|
| NVIDIA_PYTHON_PROJECT | ~/.pixi |
| LD_LIBRARY_PATH | /usr/lib64:$HOME/llama.cpp |
Plus from sub-layers: LLAMA_CPP_PATH, UNSLOTH_SKIP_LLAMA_CPP_INSTALL, HF_HOME
Fine-tuning focused ML stack: PyTorch (CUDA 13.0), xformers, transformers, accelerate, vLLM runtime deps, HuggingFace (datasets, tokenizers, sentencepiece), fine-tuning (peft, trl, bitsandbytes, liger-kernel), GGUF tools
Runs pixi run start-studio which executes unsloth studio -H 0.0.0.0 -p 8888. The Studio launches its own vLLM API server on port 8000 for inference and synthetic data generation.
/ov-jupyter:unsloth-studio/ov-jupyter:llama-cpp — Sub-layer: llama.cpp binaries/ov-jupyter:unsloth — Sub-layer: vLLM + unsloth pip install + patch/ov-foundation:supervisord — Process manager dependency/ov-jupyter:jupyter-ml — Alternative: ML Jupyter with MCP (same Tier 1 sub-layers)/ov-foundation:python-ml — Alternative: core ML without UIUse when the user asks about:
/ov-build:layer — layer authoring reference (layer.yml schema, task verbs, service declarations)/ov-build:eval — declarative testing (eval: block, ov eval image, ov eval live)development
Claude Code multi-agent support in Overthink — sub-agents, dynamic workflows, and agent teams, and how each drives the existing `ov eval` disposable beds to test and verify. MUST be invoked before authoring or invoking an ov sub-agent / dynamic workflow / agent team, wiring agent-lifecycle hooks, or asking "which primitive should drive the R10 beds?".
tools
Mounts a virtiofs share tagged `workspace` at /workspace inside a VM guest via a systemd .mount unit. Use when a kind:vm entity shares a host directory into the guest and you need it auto-mounted (and re-mounted at every boot).
development
MUST be invoked before any work involving: the `kind: android` schema kind, a `target: android` deploy, the `apk:` layer package format (installing Android apps declaratively), AndroidDeployTarget, an in-pod emulator OR a remote/physical adb-endpoint device, or nested `pod → android` deployment. The first-class Android device + app surface that sits above `ov eval adb`/`appium`.
tools
Use when committing, branching, pushing, merging, tagging, creating PRs, or approving/merging PRs with gh — the feat/-branch, R10-gated, never-force-push landing workflow across the main repo + the plugins submodule + image/<distro> submodules. Covers sync-to-upstream, branch/worktree pruning, the fork+PR path for contributors without write access, and cross-repo @github landing order.