.claude/skills/savia-dual/SKILL.md
Inference sovereignty — transparent failover from Anthropic to local gemma4 when the cloud is slow, failing, rate-limited, or unreachable
npx skillsauth add gonzalezpazmonica/pm-workspace savia-dualInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Soberanía de inferencia dual. Cuando la nube va bien, calidad máxima. Cuando la nube falla, Savia sigue funcionando en local.
Esta skill se activa cuando el usuario necesita:
scripts/savia-dual-proxy.pyscripts/setup-savia-dual.shscripts/setup-savia-dual.ps1docs/rules/domain/savia-dual.md/savia-dual {install|start|stop|status|test}docs/savia-dual.md./scripts/setup-savia-dual.sh # Linux/macOS
pwsh .\scripts\setup-savia-dual.ps1 # Windows
El installer:
~/.savia/dual/config.json y ~/.savia/dual/env# Terminal 1: arrancar proxy
python3 scripts/savia-dual-proxy.py
# Terminal 2: cargar env y arrancar Claude Code
source ~/.savia/dual/env
claude
A partir de ese momento, Claude Code envía peticiones al proxy, que las
enruta según configuración. El usuario NO percibe la diferencia cuando
la nube responde bien. Cuando hay fallback, puede ver el motivo en
~/.savia/dual/events.jsonl.
ANTHROPIC_BASE_URL.gemma4 local NO es equivalente a Opus/Sonnet. Usar con expectativas:
| Tarea | Cloud | Local gemma4 | |---|---|---| | Lectura de memoria, /help, /sprint-status | ✅ | ✅ aceptable | | Conversación operativa | ✅ | 🟡 usable, más lento | | Specs SDD, code review | ✅ | ❌ calidad insuficiente | | Orquestación multi-agente | ✅ | ❌ pierde contexto |
Para uso ofimático de Savia (status, memoria, comandos simples) el modo fallback es perfectamente viable. Para trabajo profundo de ingeniería, reconectar a la nube antes de continuar.
testing
Create new skills, modify and improve existing skills, and measure skill performance. Use when users want to create a skill from scratch, edit, or optimize an existing skill, run evals to test a skill, benchmark skill performance with variance analysis, or optimize a skill's description for better triggering accuracy.
tools
Guide for creating high-quality MCP (Model Context Protocol) servers that enable LLMs to interact with external services through well-designed tools. Use when building MCP servers to integrate external APIs or services, whether in Python (FastMCP) or Node/TypeScript (MCP SDK).
tools
Sistema proactivo de bienestar individual
development
Search the web to resolve context gaps — documentation, versions, CVEs, best practices. Auto-starts SearxNG Docker if available, falls back to WebSearch.