Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

Heldinhow/ollama-local-ai

Name: ollama-local-ai
Author: Heldinhow

ollama-local-ai/SKILL.md

npx skillsauth add Heldinhow/awesome-opencode-dev-skills ollama-local-ai

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Ollama — Local AI

Installation & Setup

# macOS
brew install ollama

# Start the server (runs on http://localhost:11434)
ollama serve

# Pull a model
ollama pull llama3.2          # 3B — fast, general purpose
ollama pull llama3.2:1b       # 1B — ultra-fast, low RAM
ollama pull mistral           # 7B — strong reasoning
ollama pull codellama         # code-specialized
ollama pull nomic-embed-text  # embeddings
ollama pull phi4-mini         # Microsoft, small and capable

Model Management

ollama list                   # list installed models
ollama show llama3.2          # model info
ollama rm llama3.2            # remove model
ollama run llama3.2           # interactive chat in terminal

REST API

# Chat completion
curl http://localhost:11434/api/chat -d '{
  "model": "llama3.2",
  "messages": [{ "role": "user", "content": "Hello!" }],
  "stream": false
}'

# Generate (single turn)
curl http://localhost:11434/api/generate -d '{
  "model": "llama3.2",
  "prompt": "Why is the sky blue?",
  "stream": false
}'

# Embeddings
curl http://localhost:11434/api/embed -d '{
  "model": "nomic-embed-text",
  "input": "Text to embed"
}'

Integration with Vercel AI SDK

import { ollama } from 'ollama-ai-provider'
import { generateText, streamText } from 'ai'

// Text generation
const { text } = await generateText({
  model: ollama('llama3.2'),
  prompt: 'Explain quantum computing in one paragraph',
})

// Streaming
const stream = streamText({
  model: ollama('llama3.2'),
  messages: [
    { role: 'system', content: 'You are a helpful assistant.' },
    { role: 'user', content: 'Write a haiku about TypeScript.' },
  ],
})

for await (const chunk of stream.textStream) {
  process.stdout.write(chunk)
}

// Embeddings
import { embed } from 'ai'

const { embedding } = await embed({
  model: ollama.embedding('nomic-embed-text'),
  value: 'Some text to embed',
})

bun add ollama-ai-provider ai

Direct Ollama SDK

import { Ollama } from 'ollama'

const ollama = new Ollama({ host: 'http://localhost:11434' })

// Chat
const response = await ollama.chat({
  model: 'llama3.2',
  messages: [{ role: 'user', content: 'Hello' }],
})
console.log(response.message.content)

// Streaming
const stream = await ollama.chat({
  model: 'llama3.2',
  messages: [{ role: 'user', content: 'Tell me a story' }],
  stream: true,
})
for await (const chunk of stream) {
  process.stdout.write(chunk.message.content)
}

// Embeddings
const { embedding } = await ollama.embed({
  model: 'nomic-embed-text',
  input: 'text to embed',
})

Modelfile (Custom Models)

# Modelfile
FROM llama3.2

SYSTEM """
You are a senior TypeScript developer.
Always return typed, idiomatic TypeScript.
Prefer functional patterns and avoid classes.
"""

PARAMETER temperature 0.3
PARAMETER num_ctx 8192

ollama create my-ts-assistant -f Modelfile
ollama run my-ts-assistant

Model Selection Guide

| Use case | Recommended model | |---|---| | General chat | llama3.2 (3B) or mistral (7B) | | Fast/low RAM | llama3.2:1b or phi4-mini | | Code generation | codellama or qwen2.5-coder | | Embeddings | nomic-embed-text or mxbai-embed-large | | Long context | mistral with num_ctx 32768 | | Math/reasoning | phi4 or qwen2.5 |

Privacy & Performance Notes

All inference runs locally — no data leaves your machine
GPU acceleration: Ollama auto-detects Metal (macOS), CUDA, ROCm
RAM requirements: 4GB for 3B models, 8GB for 7B, 16GB+ for 13B+
Set OLLAMA_NUM_PARALLEL env var to serve multiple requests

Heldinhow/ollama-local-ai

ollama-local-ai/SKILL.md

Use when running LLMs locally with Ollama, integrating local models into apps, or building AI features without sending data to external APIs. Covers model management, API usage, and integration with AI SDK.

2 stars

development

Updated Apr 22, 2026

$ install --global

skillsauth

npx skillsauth add Heldinhow/awesome-opencode-dev-skills ollama-local-ai

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 22, 2026, 7:36 AM122.7s1 file scanned

SKILL.md

name:: ollama-local-ai
description:: Use when running LLMs locally with Ollama, integrating local models into apps, or building AI features without sending data to external APIs. Covers model management, API usage, and integration with AI SDK.

Ollama — Local AI

Installation & Setup

# macOS
brew install ollama

# Start the server (runs on http://localhost:11434)
ollama serve

# Pull a model
ollama pull llama3.2          # 3B — fast, general purpose
ollama pull llama3.2:1b       # 1B — ultra-fast, low RAM
ollama pull mistral           # 7B — strong reasoning
ollama pull codellama         # code-specialized
ollama pull nomic-embed-text  # embeddings
ollama pull phi4-mini         # Microsoft, small and capable

Model Management

ollama list                   # list installed models
ollama show llama3.2          # model info
ollama rm llama3.2            # remove model
ollama run llama3.2           # interactive chat in terminal

REST API

# Chat completion
curl http://localhost:11434/api/chat -d '{
  "model": "llama3.2",
  "messages": [{ "role": "user", "content": "Hello!" }],
  "stream": false
}'

# Generate (single turn)
curl http://localhost:11434/api/generate -d '{
  "model": "llama3.2",
  "prompt": "Why is the sky blue?",
  "stream": false
}'

# Embeddings
curl http://localhost:11434/api/embed -d '{
  "model": "nomic-embed-text",
  "input": "Text to embed"
}'

Integration with Vercel AI SDK

import { ollama } from 'ollama-ai-provider'
import { generateText, streamText } from 'ai'

// Text generation
const { text } = await generateText({
  model: ollama('llama3.2'),
  prompt: 'Explain quantum computing in one paragraph',
})

// Streaming
const stream = streamText({
  model: ollama('llama3.2'),
  messages: [
    { role: 'system', content: 'You are a helpful assistant.' },
    { role: 'user', content: 'Write a haiku about TypeScript.' },
  ],
})

for await (const chunk of stream.textStream) {
  process.stdout.write(chunk)
}

// Embeddings
import { embed } from 'ai'

const { embedding } = await embed({
  model: ollama.embedding('nomic-embed-text'),
  value: 'Some text to embed',
})

bun add ollama-ai-provider ai

Direct Ollama SDK

import { Ollama } from 'ollama'

const ollama = new Ollama({ host: 'http://localhost:11434' })

// Chat
const response = await ollama.chat({
  model: 'llama3.2',
  messages: [{ role: 'user', content: 'Hello' }],
})
console.log(response.message.content)

// Streaming
const stream = await ollama.chat({
  model: 'llama3.2',
  messages: [{ role: 'user', content: 'Tell me a story' }],
  stream: true,
})
for await (const chunk of stream) {
  process.stdout.write(chunk.message.content)
}

// Embeddings
const { embedding } = await ollama.embed({
  model: 'nomic-embed-text',
  input: 'text to embed',
})

Modelfile (Custom Models)

# Modelfile
FROM llama3.2

SYSTEM """
You are a senior TypeScript developer.
Always return typed, idiomatic TypeScript.
Prefer functional patterns and avoid classes.
"""

PARAMETER temperature 0.3
PARAMETER num_ctx 8192

ollama create my-ts-assistant -f Modelfile
ollama run my-ts-assistant

Model Selection Guide

Privacy & Performance Notes

All inference runs locally — no data leaves your machine
GPU acceleration: Ollama auto-detects Metal (macOS), CUDA, ROCm
RAM requirements: 4GB for 3B models, 8GB for 7B, 16GB+ for 13B+
Set OLLAMA_NUM_PARALLEL env var to serve multiple requests

Related Skills

Heldinhow/websocket-real-time

tools

VerifiedTrustedCommunity

Implement WebSocket communication for real-time bidirectional client-server communication.

2SKILL.mdUpdated Apr 24, 2026

Heldinhow/websocket-real-time

Heldinhow/webhook-handler

development

VerifiedTrustedCommunity

Implement webhook handlers for processing incoming events from external services.

2SKILL.mdUpdated Apr 24, 2026

Heldinhow/webhook-handler

Heldinhow/webapp-testing

development

VerifiedTrustedCommunity

Test web applications using Playwright for end-to-end browser testing.

2SKILL.mdUpdated Apr 24, 2026

Heldinhow/webapp-testing

Heldinhow/web-artifacts-builder

development

VerifiedTrustedCommunity

Build production-quality HTML artifacts using React, Tailwind CSS, and shadcn/ui.

2SKILL.mdUpdated Apr 24, 2026

Heldinhow/web-artifacts-builder

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/Heldinhow/awesome-opencode-dev-skills.git

# Copy into Claude Code skills folder (global)
cp -r awesome-opencode-dev-skills/ollama-local-ai ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

Heldinhow/awesome-opencode-dev-skills

2 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT