toolchains/ai/ops/local-llm-ops/SKILL.md
Local LLM operations with Ollama on Apple Silicon, including setup, model pulls, chat launchers, benchmarks, and diagnostics.
Your localLLM repo provides a full local LLM toolchain on Apple Silicon: setup scripts, a rich CLI chat launcher, benchmarks, and diagnostics. The operational path is: install Ollama, ensure the service is running, initialize the venv, pull models, then launch chat or benchmarks.
./setup_chatbot.sh
./chatllm
If no models are present:
ollama pull mistral
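The "pull only if nothing is installed" step can be scripted. The following is a minimal sketch, not part of the repo: the function names `count_models` and `ensure_model` are hypothetical, and it assumes `ollama list` prints one header line followed by one line per installed model.

```shell
# Count installed models from `ollama list` output read on stdin.
# Assumption: line 1 is a column header; each later non-empty line is one model.
count_models() {
  awk 'NR > 1 && NF > 0 { n++ } END { print n + 0 }'
}

# Pull a model only when no models are installed.
# Defaults to "mistral", the model named in this guide; pass another name as $1.
ensure_model() {
  ensure_model_name="${1:-mistral}"
  ensure_model_count="$(ollama list | count_models)"
  if [ "$ensure_model_count" -eq 0 ]; then
    ollama pull "$ensure_model_name"
  fi
}
```

Calling `ensure_model` before `./chatllm` would skip the download whenever at least one model is already present.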
brew install ollama
brew services start ollama
./setup_chatbot.sh
curl http://localhost:11434/api/version
./chatllm (primary launcher)
./chat or ./chat.py (alternate launchers)
./install_aliases.sh then llm, llm-code, llm-fast
Task modes:
./chat -t coding -m codellama:70b
./chat -t creative -m llama3.1:70b
./chat -t analytical
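If the same task and model pairings are used often, a small helper can build the command line. This is only a sketch: `chat_command_for_task` is a hypothetical name, the pairings are copied from the three examples above rather than from the launcher's own defaults, and it assumes `./chat` picks a model itself when `-m` is omitted, as the `analytical` example suggests.

```shell
# Print the ./chat command for a task mode, using the pairings from the examples.
# Tasks without a listed model fall through to "./chat -t <task>" with no -m flag.
chat_command_for_task() {
  chat_task="$1"
  case "$chat_task" in
    coding)   echo "./chat -t coding -m codellama:70b" ;;
    creative) echo "./chat -t creative -m llama3.1:70b" ;;
    *)        echo "./chat -t $chat_task" ;;
  esac
}
```

The function only prints the command, so it can be inspected before running it, for example with `chat_command_for_task coding`.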
Benchmarks are scripted in scripts/run_benchmarks.sh:
./scripts/run_benchmarks.sh
This runs bench_ollama.py with:
benchmarks/prompts.yaml
benchmarks/models.yaml
Run the built-in diagnostic script when setup fails:
./diagnose.sh
Common fixes:
./setup_chatbot.sh
ollama is in PATH
ollama pull mistral
.venv
~/.localllm/
http://localhost:11434
toolchains/universal/infrastructure/docker
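Several of the items listed above can be checked by hand before reaching for a fix. The sketch below is an illustration, not a description of what `./diagnose.sh` does: the function names are hypothetical, it assumes `./setup_chatbot.sh` is what creates `.venv`, and it does not inspect `~/.localllm/` because this guide does not say what that directory should contain.

```shell
# Return 0 when a command is on PATH.
have_cmd() {
  command -v "$1" >/dev/null 2>&1
}

# Return 0 when the Ollama HTTP endpoint answers within 2 seconds.
# $1 is the base URL; defaults to the address used in this guide.
service_up() {
  curl -fsS --max-time 2 "${1:-http://localhost:11434}/api/version" >/dev/null 2>&1
}

# Print one line per failed check; return non-zero if any check failed.
# Run from the repo root so the relative .venv path resolves.
preflight() {
  preflight_failed=0
  have_cmd ollama || { echo "ollama not found in PATH"; preflight_failed=1; }
  # Assumption: .venv is created by ./setup_chatbot.sh.
  [ -d .venv ] || { echo ".venv missing (try ./setup_chatbot.sh)"; preflight_failed=1; }
  service_up || { echo "no response from http://localhost:11434"; preflight_failed=1; }
  return "$preflight_failed"
}
```

Running `preflight` from the repo root prints nothing and returns 0 when all three checks pass.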