ov-layers/skills/ollama/SKILL.md
Ollama LLM server on port 11434 with CUDA GPU support and model persistence. Use when working with Ollama, LLM serving, or local AI model inference.
npx skillsauth add overthinkos/overthink-plugins ollamaInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
| Property | Value |
|----------|-------|
| Dependencies | cuda, supervisord |
| Ports | 11434 |
| Volumes | models -> ~/.ollama |
| Aliases | ollama -> ollama |
| Service | ollama (supervisord) |
| Install files | tasks: |
| Variable | Value |
|----------|-------|
| OLLAMA_HOST | 0.0.0.0 |
| OLLAMA_MODELS | ~/.ollama/models |
| Variable | Template Value | Resolved Example |
|----------|---------------|-----------------|
| OLLAMA_HOST | http://{{.ContainerName}}:11434 | http://ov-ollama:11434 |
Pod-aware: same-container consumers receive http://localhost:11434, cross-container consumers receive http://ov-ollama:11434. When ov config ollama runs, OLLAMA_HOST is automatically injected into the global deploy.yml env. Use ov config ollama --update-all to propagate to already-deployed services immediately.
See /ov:layer for env_provides field docs and /ov:config for --update-all.
# image.yml
ollama:
layers:
- ollama
ov alias install ollama # install host 'ollama' command
ollama run llama3 # uses the alias
/ov-images:ollama/ov-images:openclaw-ollama/ov-images:openclaw-ollama-sway-browserThe env_provides mechanism makes OLLAMA_HOST available to all containers. The hermes layer auto-detects this variable and configures itself to use local Ollama as its LLM provider (highest priority in the auto-detection chain: OLLAMA_HOST > OLLAMA_API_KEY > OPENROUTER_API_KEY). See /ov-layers:hermes for details on the auto-provider-configuration.
The layer ships 3 declarative checks embedded in the org.overthinkos.tests
OCI label (see /ov:test for the full schema):
ov image test):
ollama-binary — /usr/bin/ollama existsov test against a live service; uses
${HOST_PORT:11434} / ${CONTAINER_IP} so deploy-time port remapping
works unchanged):
ollama-tags-api — GET http://${CONTAINER_IP}:${HOST_PORT:11434}/api/tags returns 200ollama-version — ollama --version stdout matches ^ollama version/ov-layers:cuda -- CUDA toolkit dependency/ov-layers:supervisord -- process manager dependency/ov-layers:openclaw -- AI gateway that can use Ollama as backend/ov-layers:hermes -- AI agent that auto-detects OLLAMA_HOST for local Ollama provider/ov:config — Deploy with quadlet (secrets, volumes, env_provides injection)/ov:start — Start the Ollama service/ov:service — Manage Ollama service inside containerUse when the user asks about:
ollama host aliastools
OpenCharly CLI (charly) binary installed into container/VM images for in-container use. Use when working with charly binary deployment inside containers, native D-Bus support, or the full charly toolchain (charly binary + virtualization + gocryptfs + socat).
development
Operator CachyOS workstation profile — a kind:local template + target:local deploy that installs the full dev stack (30 candies) onto a CachyOS host via ShellExecutor. Lives in the overthinkos/cachyos submodule. MUST be invoked before editing or applying the charly-cachyos workstation profile.
tools
Fedora box with the full charly toolchain using shared candies. Rootless-first — runs as uid=1000 with passwordless sudo (no root, no cap_add: ALL). Same candy list as charly-arch. Includes NVIDIA GPU runtime. MUST be invoked before building, deploying, configuring, or troubleshooting the charly-fedora box.
tools
Arch Linux box with the full charly toolchain. Rootless-first — runs as uid=1000 with passwordless sudo (no root, no cap_add: ALL). Composes /charly-coder:charly-mcp so the box is reachable as an MCP gateway on port 18765. NVIDIA GPU runtime composed in. MUST be invoked before building, deploying, configuring, or troubleshooting the charly-arch box.