ov-jupyter/skills/notebook-llm-on-supercomputers/SKILL.md
LLMs on Supercomputers course notebook collection (TU Wien AI Factory Austria). 15 Jupyter notebooks covering prompt engineering, RAG, and fine-tuning. Data-only layer — no packages, no services, no dependencies. Use when working with the LLM course notebooks, LangChain tutorials, or RAG examples.
npx skillsauth add overthinkos/overthink-plugins notebook-llm-on-supercomputersInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
| Property | Value |
|----------|-------|
| Dependencies | (none) |
| Packages | (none) |
| Services | (none) |
| Volumes | workspace -> ~/workspace (from jupyter) |
| Data | data/llms_on_supercomputers -> workspace volume, dest: llms_on_supercomputers |
| Install files | (none) |
This is a data layer — it uses the data: field in layer.yml to map a directory of notebooks to a named volume with a subdirectory destination:
info: "LLMs on Supercomputers course notebooks (TU Wien AI Factory Austria)"
data:
- src: data/llms_on_supercomputers
volume: workspace
dest: llms_on_supercomputers
At build time, the contents are staged into /data/workspace/llms_on_supercomputers/ inside the image. At deploy time, ov config or ov update provisions them into the workspace volume.
15 Jupyter notebooks organized in 4 categories:
| Notebook | Topic |
|----------|-------|
| D0_00_Bazzite_AI_Setup.ipynb | Environment setup, GPU verification, Ollama connectivity |
| Notebook | Topic | Libraries |
|----------|-------|-----------|
| D1_01_Prompting_with_LangChain.ipynb | LangChain basics, local + Ollama models | LangChain, OpenAI, HuggingFace |
| D1_02_Prompt_templates_and_parsing.ipynb | Prompt templates, few-shot, structured output | LangChain, OpenAI |
| D1_05_Chaining.ipynb | Chain composition and routing | LangChain |
| D1_08_LLM_Evaluation.ipynb | LLM evaluation with evidently.ai | ollama, evidently |
| D1_09_LLM_as_a_Judge.ipynb | LLM-as-a-Judge evaluation pattern | OpenAI, evidently |
| D1_10_Prompt_Optimization.ipynb | Prompt optimization techniques | OpenAI, evidently |
| Notebook | Topic | Libraries |
|----------|-------|-----------|
| D2_01_rag_with_basic_tools.ipynb | RAG with pandas DataFrame as vector store | OpenAI, numpy, pandas |
| D2_02_rag_with_langchain_and_chromadb.ipynb | RAG pipeline with ChromaDB | LangChain, ChromaDB, OllamaEmbeddings |
| Notebook | Topic | Libraries |
|----------|-------|-----------|
| D3_01_Transformer_Architecture.ipynb | Transformer theory (no LLM calls) | PyTorch |
| D3_02_Finetuning_LLM_with_PyTorch.ipynb | Manual fine-tuning loop | PyTorch, transformers |
| D3_03_Finetuning_LLM_with_Huggingface.ipynb | Fine-tuning with HF Trainer | transformers, datasets |
| D3_04_Quantization.ipynb | Model quantization techniques | bitsandbytes, transformers |
| D3_05_PEFT.ipynb | Parameter-Efficient Fine-Tuning (LoRA) | peft, transformers |
| D3_06_Unsloth.ipynb | Fine-tuning with Unsloth | unsloth |
| File | Purpose |
|------|---------|
| notebooks.yaml | Structured catalog with subcategories |
| LICENSE | Apache 2.0 (TU Wien AI Factory Austria) |
| simplified_output.json | Sample data for D1_02 prompt templates |
| datasets/booking_queries_dataset.csv | Booking classification data |
| datasets/code_review_dataset.csv | Code review data |
| datasets/health_and_fitness_qna.csv | Health Q&A data |
9 notebooks (D0, D1, D2 series) connect to Ollama via the OLLAMA_HOST environment variable (default: http://localhost:11434). When Ollama is deployed via ov config ollama --update-all, the OLLAMA_HOST env var is auto-injected via env_provides to http://ov-ollama:11434 — both containers must be on the same ov Podman network. The D3 fine-tuning notebooks work locally with GPU — no Ollama needed.
The same importlib.reload(ollama) pattern from /ov-jupyter:notebook-ollama is applied to all cleanup cells that use import ollama for model unloading.
https://app.evidently.cloud for LLM evaluation dashboards (optional, notebooks work without it)hf.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO-GGUF:Q4_K_Mllama3.2:latest/ov-jupyter:jupyter-ml-notebook/ov-build:layer — data field documentation and layer authoring rules/ov-core:config — data provisioning during ov config setup/ov-jupyter:notebook-finetuning — sibling data layer (Unsloth fine-tuning notebooks)/ov-jupyter:notebook-ollama — sibling data layer (Ollama API tutorials)/ov-jupyter:notebook-templates — sibling data layer (starter notebooks)/ov-ollama:ollama — the Ollama server image (must be running for D0-D2 notebooks)/ov-jupyter:jupyter-ml-notebook — the image that includes this layerUse when the user asks about:
/ov-build:eval — declarative testing (eval: block, ov eval image, ov eval live)development
Claude Code multi-agent support in Overthink — sub-agents, dynamic workflows, and agent teams, and how each drives the existing `ov eval` disposable beds to test and verify. MUST be invoked before authoring or invoking an ov sub-agent / dynamic workflow / agent team, wiring agent-lifecycle hooks, or asking "which primitive should drive the R10 beds?".
tools
Mounts a virtiofs share tagged `workspace` at /workspace inside a VM guest via a systemd .mount unit. Use when a kind:vm entity shares a host directory into the guest and you need it auto-mounted (and re-mounted at every boot).
development
MUST be invoked before any work involving: the `kind: android` schema kind, a `target: android` deploy, the `apk:` layer package format (installing Android apps declaratively), AndroidDeployTarget, an in-pod emulator OR a remote/physical adb-endpoint device, or nested `pod → android` deployment. The first-class Android device + app surface that sits above `ov eval adb`/`appium`.
tools
Use when committing, branching, pushing, merging, tagging, creating PRs, or approving/merging PRs with gh — the feat/-branch, R10-gated, never-force-push landing workflow across the main repo + the plugins submodule + image/<distro> submodules. Covers sync-to-upstream, branch/worktree pruning, the fork+PR path for contributors without write access, and cross-repo @github landing order.