Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

ariffazil/ollama-on-vps

Name: ollama-on-vps
Author: ariffazil

hermes-skills/ai-providers/ollama-on-vps/SKILL.md

npx skillsauth add ariffazil/openclaw-workspace ollama-on-vps

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Ollama on VPS — arifOS Federation LLM Fallback

Running Container

Container: ollama-engine-prod
Image: ollama/ollama:latest
Network: docker bridge / compose network

Models Loaded (verified 2026-05-06)

qwen2.5:7b        — chat model (used by arifOS call_llm Tier 2 fallback)
bge-m3:latest     — embedding model (used by A-FORGE LongTermMemory)

Endpoints

Chat API (Ollama native)

POST http://127.0.0.1:11434/api/generate
{
  "model": "qwen2.5:7b",
  "prompt": "...",
  "stream": false,
  "temperature": 0.3,
  "options": {"num_predict": 1200}
}

Response: {"response": "...", "done": true}

Embeddings API

POST http://127.0.0.1:11434/api/embeddings
{
  "model": "bge-m3:latest",
  "prompt": "..."
}

Response: {"embedding": [...], "done": true}

Model List

GET http://127.0.0.1:11434/api/tags

Response: {"models": [{"name": "qwen2.5:7b"}, {"name": "bge-m3:latest"}]}

arifOS Usage (Tier 2 Fallback)

File: /root/arifOS/arifosmcp/runtime/llm_client.py

OLLAMA_BASE_URL = os.getenv("OLLAMA_BASE_URL") or os.getenv("OLLAMA_URL", "http://ollama:11434")
OLLAMA_MODEL = os.getenv("OLLAMA_MODEL", "qwen2.5:7b")

async def _call_ollama(system, user, response_schema, temperature, max_tokens=1200):
    prompt = f"{system}\n\n{user}"
    payload = {
        "model": OLLAMA_MODEL,
        "prompt": prompt,
        "stream": False,
        "temperature": temperature,
        "options": {"num_predict": max_tokens},
    }
    if response_schema:
        payload["format"] = "json"
    # ... httpx POST to /api/generate

arifOS calls Ollama when SEA-LION Tier 1 fails. All 3 LLM tools (mind_reason, heart_critique, reply_compose) fall back here.

A-FORGE Usage (Embedding)

File: /root/A-FORGE/src/memory/LongTermMemory.ts

const OLLAMA_URL = process.env.OLLAMA_URL ?? "http://localhost:11434";
const response = await fetch(`${OLLAMA_URL}/api/embeddings`, {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({ model: "bge-m3:latest", prompt }),
});

Docker Network DNS

From inside other containers (arifOS MCP, A-FORGE):

http://ollama:11434  (Docker compose service name)

From VPS host:

http://127.0.0.1:11434

Troubleshooting

Ollama not responding:

curl -s http://127.0.0.1:11434/api/tags
# Empty = container down
docker ps | grep ollama

Model not loaded:

"model 'qwen2.5:7b' not found"

→ docker exec ollama-engine-prod ollama pull qwen2.5:7b

arifOS still trying SEA-LION: Check logs — if you see "SEA-LION HTTP 401" repeatedly, Tier 1 is failing and falling through to Ollama. This is EXPECTED behavior when SEA-LION key is invalid.

Adding New Models

docker exec -it ollama-engine-prod ollama pull <model>
# e.g.:
docker exec -it ollama-engine-prod ollama pull llama3:8b
docker exec -it ollama-engine-prod ollama pull nomic-embed-text

Update arifOS to use new model:

# In .env
OLLAMA_MODEL=llama3:8b

Update A-FORGE embedding model:

# In A-FORGE .env
OLLAMA_EMBEDDING_MODEL=nomic-embed-text

ariffazil/ollama-on-vps

hermes-skills/ai-providers/ollama-on-vps/SKILL.md

Ollama LLM running on VPS as arifOS/A-FORGE fallback — models, endpoints, embedding setup

data-ai

Updated May 12, 2026

$ install --global

skillsauth

npx skillsauth add ariffazil/openclaw-workspace ollama-on-vps

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 12, 2026, 3:24 AM122.7s1 file scanned

SKILL.md

name:: ollama-on-vps
description:: Ollama LLM running on VPS as arifOS/A-FORGE fallback — models, endpoints, embedding setup
triggers:: ollama, qwen, local LLM, LLM fallback
category:: ai-providers
last_updated:: 2026-05-06

Ollama on VPS — arifOS Federation LLM Fallback

Running Container

Container: ollama-engine-prod
Image: ollama/ollama:latest
Network: docker bridge / compose network

Models Loaded (verified 2026-05-06)

qwen2.5:7b        — chat model (used by arifOS call_llm Tier 2 fallback)
bge-m3:latest     — embedding model (used by A-FORGE LongTermMemory)

Endpoints

Chat API (Ollama native)

POST http://127.0.0.1:11434/api/generate
{
  "model": "qwen2.5:7b",
  "prompt": "...",
  "stream": false,
  "temperature": 0.3,
  "options": {"num_predict": 1200}
}

Response: {"response": "...", "done": true}

Embeddings API

POST http://127.0.0.1:11434/api/embeddings
{
  "model": "bge-m3:latest",
  "prompt": "..."
}

Response: {"embedding": [...], "done": true}

Model List

GET http://127.0.0.1:11434/api/tags

Response: {"models": [{"name": "qwen2.5:7b"}, {"name": "bge-m3:latest"}]}

arifOS Usage (Tier 2 Fallback)

File: /root/arifOS/arifosmcp/runtime/llm_client.py

OLLAMA_BASE_URL = os.getenv("OLLAMA_BASE_URL") or os.getenv("OLLAMA_URL", "http://ollama:11434")
OLLAMA_MODEL = os.getenv("OLLAMA_MODEL", "qwen2.5:7b")

async def _call_ollama(system, user, response_schema, temperature, max_tokens=1200):
    prompt = f"{system}\n\n{user}"
    payload = {
        "model": OLLAMA_MODEL,
        "prompt": prompt,
        "stream": False,
        "temperature": temperature,
        "options": {"num_predict": max_tokens},
    }
    if response_schema:
        payload["format"] = "json"
    # ... httpx POST to /api/generate

arifOS calls Ollama when SEA-LION Tier 1 fails. All 3 LLM tools (mind_reason, heart_critique, reply_compose) fall back here.

A-FORGE Usage (Embedding)

File: /root/A-FORGE/src/memory/LongTermMemory.ts

const OLLAMA_URL = process.env.OLLAMA_URL ?? "http://localhost:11434";
const response = await fetch(`${OLLAMA_URL}/api/embeddings`, {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({ model: "bge-m3:latest", prompt }),
});

Docker Network DNS

From inside other containers (arifOS MCP, A-FORGE):

http://ollama:11434  (Docker compose service name)

From VPS host:

http://127.0.0.1:11434

Troubleshooting

Ollama not responding:

curl -s http://127.0.0.1:11434/api/tags
# Empty = container down
docker ps | grep ollama

Model not loaded:

"model 'qwen2.5:7b' not found"

→ docker exec ollama-engine-prod ollama pull qwen2.5:7b

arifOS still trying SEA-LION: Check logs — if you see "SEA-LION HTTP 401" repeatedly, Tier 1 is failing and falling through to Ollama. This is EXPECTED behavior when SEA-LION key is invalid.

Adding New Models

docker exec -it ollama-engine-prod ollama pull <model>
# e.g.:
docker exec -it ollama-engine-prod ollama pull llama3:8b
docker exec -it ollama-engine-prod ollama pull nomic-embed-text

Update arifOS to use new model:

# In .env
OLLAMA_MODEL=llama3:8b

Update A-FORGE embedding model:

# In A-FORGE .env
OLLAMA_EMBEDDING_MODEL=nomic-embed-text

Related Skills

ariffazil/XAUUSD-trading-stack

development

VerifiedTrustedCommunity

Federation-wide gold (XAUUSD) trading capability. Python stack, OANDA broker, backtesting, macro signals, RSI strategy. Every organ has a role.

2SKILL.mdUpdated Jul 24, 2026

ariffazil/XAUUSD-trading-stack

ariffazil/wealth-claim-state

development

VerifiedTrustedCommunity

Capital claim state management — tracks claim lifecycle across WEALTH organ.

2SKILL.mdUpdated Jul 24, 2026

ariffazil/wealth-claim-state

ariffazil/warga-constitutional

development

VerifiedTrustedCommunity

Archived constitutional warga placeholder retained only for audit provenance. Do not use for active work; use the live arifOS governance and constitutional skills instead.

2SKILL.mdUpdated Jul 24, 2026

ariffazil/warga-constitutional

ariffazil/warga

testing

VerifiedTrustedCommunity

Warga (citizen) agent skills for AAA federation members. See subdirectories for specialized warga skills.

2SKILL.mdUpdated Jul 24, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/ariffazil/openclaw-workspace.git

# Copy into Claude Code skills folder (global)
cp -r openclaw-workspace/hermes-skills/ai-providers/ollama-on-vps ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

ariffazil/openclaw-workspace

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT