ov-jupyter/skills/llama-cpp/SKILL.md
llama.cpp prebuilt binaries and GGUF conversion tools. Use when working with llama.cpp, GGUF model conversion, or llama-quantize/llama-cli.
npx skillsauth add overthinkos/overthink-plugins llama-cppInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
| Property | Value |
|----------|-------|
| Dependencies | None |
| Ports | — |
| Service | — |
| Install files | layer.yml, tasks: |
Downloads the latest llama.cpp release from GitHub into ~/llama.cpp:
Binaries: llama-quantize, llama-cli, shared libraries (lib*.so*)
Python tools: convert_hf_to_gguf.py, gguf-py package (from source tarball)
| Variable | Value | Purpose |
|----------|-------|---------|
| LLAMA_CPP_PATH | ~/llama.cpp | Location of llama.cpp binaries |
| PATH (appended) | ~/llama.cpp | Makes llama-quantize/llama-cli available |
This layer has no pixi.toml and no depends. It downloads prebuilt binaries and sets environment variables. It is designed to be composed into environment-owning layers (Tier 2) via the layers: field.
The user-phase tasks run after the pixi environment is established by the parent layer. The gguf Python package (for programmatic GGUF access) is declared in the parent layer's pixi.toml, not here.
/ov-foundation:python-ml — via layers: [llama-cpp]/ov-jupyter:jupyter-ml — via layers: [llama-cpp, unsloth]/ov-jupyter:unsloth-studio — via layers: [llama-cpp, unsloth]/ov-jupyter:unsloth — Fine-tuning (depends on llama.cpp for GGUF conversion)/ov-foundation:python-ml (via python-ml metalayer)/ov-immich:immich-ml (via python-ml metalayer)/ov-jupyter:jupyter-ml (via jupyter-ml metalayer)/ov-jupyter:jupyter-ml-notebook (via jupyter-ml metalayer)/ov-jupyter:unsloth-studio (via unsloth-studio metalayer)Use when the user asks about:
LLAMA_CPP_PATH environment variable/ov-build:layer — layer authoring reference (layer.yml schema, task verbs, service declarations)/ov-build:eval — declarative testing (eval: block, ov eval image, ov eval live)tools
OpenCharly CLI (charly) binary installed into container/VM images for in-container use. Use when working with charly binary deployment inside containers, native D-Bus support, or the full charly toolchain (charly binary + virtualization + gocryptfs + socat).
development
Operator CachyOS workstation profile — a kind:local template + target:local deploy that installs the full dev stack (30 candies) onto a CachyOS host via ShellExecutor. Lives in the overthinkos/cachyos submodule. MUST be invoked before editing or applying the charly-cachyos workstation profile.
tools
Fedora box with the full charly toolchain using shared candies. Rootless-first — runs as uid=1000 with passwordless sudo (no root, no cap_add: ALL). Same candy list as charly-arch. Includes NVIDIA GPU runtime. MUST be invoked before building, deploying, configuring, or troubleshooting the charly-fedora box.
tools
Arch Linux box with the full charly toolchain. Rootless-first — runs as uid=1000 with passwordless sudo (no root, no cap_add: ALL). Composes /charly-coder:charly-mcp so the box is reachable as an MCP gateway on port 18765. NVIDIA GPU runtime composed in. MUST be invoked before building, deploying, configuring, or troubleshooting the charly-arch box.