.agents/skills/cercano-local/SKILL.md
Run prompts against local AI models via Cercano and Ollama. Use this for local inference — faster, private, and zero cost. Handles chat-style queries and agentic code generation with automatic validation. Offload summarization, explanation, code writing, and general LLM tasks to a local model instead of sending them to the cloud.
npx skillsauth add GoogleDevRelExplorations/cercano cercano-localInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Run prompts against local AI models through Cercano's MCP interface. Cercano routes requests to Ollama for local inference.
Tool name: cercano_local
| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| prompt | string | Yes | The prompt to run against local models. |
| file_path | string | No | Target file path for code changes. When provided with work_dir, enables the agentic code generation loop with validation. |
| work_dir | string | No | Working directory for code validation (go build/test). When provided with file_path, enables the agentic code generation loop. |
| context | string | No | Additional context such as existing code or file contents. |
| conversation_id | string | No | Conversation ID for multi-turn support across calls. |
Provide only prompt (and optionally context) for a direct LLM call. The response is the model's text output.
Provide prompt, file_path, and work_dir to enable a generate-validate loop. Cercano will:
Chat query:
{
"prompt": "What are the SOLID principles in software design?"
}
Code generation with context:
{
"prompt": "Add error handling to this function",
"file_path": "internal/handler/auth.go",
"work_dir": "/project",
"context": "func Login(w http.ResponseWriter, r *http.Request) { ... }"
}
Multi-turn conversation:
{
"prompt": "Now refactor that to use the repository pattern",
"conversation_id": "conv-abc123"
}
devops
Summarize text or files using local AI via Cercano without sending content to the cloud. Supports brief, medium, and detailed summary lengths.
devops
Submit cloud token usage data to Cercano (opt-in). This sends data, not a report — use cercano_stats to view usage. Helps track cloud tokens alongside local inference for accurate local-vs-cloud comparison.
tools
View Cercano usage statistics and cloud token savings. Shows total requests, tokens processed locally, cloud tokens reported by the host, percentage kept local, and breakdowns by tool, model, and day.
development
Research a question using DuckDuckGo search and local AI analysis. Crafts search queries, fetches top results, and synthesizes a sourced answer — all locally.