stevengonsalvez

116 verified skills1,130 total stars

reflect:cost

Report reflect drain spend over a time window — tokens split by cached (cache_read), uncached writes (cache_creation), and io (input+output), with a $ estimate, grouped by day / outcome / model / transcript. Reads the drainer's cost log and surfaces outlier runs and cache-reuse health (the 41.5M-token failure mode = low cache reuse + high cache writes). Use to answer "what is reflection costing me" for the last day / week.

documentation12

reflect:recall

Retrieve relevant prior learnings from the global knowledge base. Hybrid vector + graph search over 170+ indexed learnings, reranked by confidence, recency, and tag overlap. Use when starting work, debugging a recurring problem, or before implementing a feature that may have prior art.

development10

commit

Create well-formatted git commits for changes made during the session

development10

reflect

Full conversation scan for self-improvement. Detects behavioral corrections and knowledge signals, classifies them, proposes agent updates and knowledge notes with entity sidecars for GraphRAG indexing. Correct once, never again.

data-ai10

reflect

Unified learning capture from conversation analysis. Extracts behavioral corrections AND knowledge learnings (solved problems, patterns, decisions). Dual-indexes into QMD + GraphRAG for future retrieval. Philosophy: "Correct once, never again. Solve once, never re-research." Use when: (1) User corrects behavior, (2) Problem solved after debugging, (3) Session ending or context compaction, (4) User requests /reflect, (5) Pattern discovered worth preserving.

development10

ainb-fleet:broadcast

Fan out a single prompt to selected claude sessions across the fleet. Use when you need to apply the same instruction (e.g. `/clear`, `git pull`, `remote-control disconnect`) to many sessions at once. Routing: peers-first (broker HTTP) when peer registered, tmux send-keys fallback otherwise. Refuses to run without an explicit targeting flag (--all, --filter <regex>, or --cwd <substring>) — no implicit fan-out.

testing10

plan-gh

Create a detailed development plan and corresponding GitHub issues

development10

git-history-surgery

Safely split, reorder, or rewrite already-pushed commits using a throwaway worktree. Isolates the surgery from any existing checkout, verifies byte-identical tree before force-pushing, and uses --force-with-lease to refuse silent clobbers from concurrent work. Use when asked to split a pushed commit into atomic pieces, fix a bulked commit after the fact, or reshape recent history on a shared branch.

testing10

tui-style-guide

TUI style guide for consistent terminal interface design

documentation10

interview

Interview the user about a plan file to extract detailed requirements, clarify ambiguities, and uncover edge cases. Uses iterative questioning to produce a comprehensive specification.

testing10

instincts

Atomic behavioral instincts system. Captures micro-learnings as lightweight YAML entries with confidence scoring (0.3-0.9). Project-scoped instincts are stored locally; universal instincts feed into the reflect-kb GraphRAG knowledge base for cross-project retrieval. Use when: (1) A behavioral pattern should be remembered but is too small for a full learning note, (2) Building up project-specific conventions, (3) User wants quick lightweight corrections captured, (4) Accumulating micro-patterns during a session, (5) User requests /instincts.

development10

research

Conduct comprehensive research across multiple sources - codebase, web, and documentation - by spawning parallel sub-agents and synthesizing findings. Searches past learnings first, then codebase, docs, and optionally web.

development10

reflect-status

Show reflection metrics, pending reviews, sidecar coverage, and GraphRAG health. Read-only views into the reflect system state. Can also approve/reject pending low-confidence items.

data-ai10

critique

Get a Distinguished Engineer level technical critique of the current approach

tools10

ainb-fleet:sequence

Ordered multi-step prompts to fleet targets, ack-gated between steps via JSONL assistant-turn-end detection. Use for cycles like disconnect→reconnect→verify, or any flow where step N+1 requires step N to have completed first. The skill BLOCKS until each target's transcript shows the next assistant turn finishing OR per-step timeout fires (default 300s).

testing10

ainb-fleet:daemon

Long-running watcher that scans every claude session every 5s and auto-sends `continue` to any session whose recent tmux pane buffer matches a known API-error regex (rate_limited, overloaded_error, internal_server_error, request_timeout, socket_hang_up, fetch_failed, ECONNRESET). Use this when you want unattended recovery from transient API failures across the fleet.

development10

ainb-fleet:fleet-needs

Workflow-backed Jarvis control panel. Runs the deterministic `hangar` workflow with verb=needs (discover → enrich → prioritize), renders the Jarvis HUD from its render-ready cards, fires AskUserQuestion per blocked session, and routes each answer back via tmux send-keys (broker fallback only). Requires the workflow gate (CLAUDE_CODE_WORKFLOWS=1). If the gate is off, fall back to the prompt-driven `/ainb-fleet:needs` skill.

development10

ainb-fleet:needs

Center control panel — enumerate every claude session that is blocked waiting on something: a user answer (AskUserQuestion fired), an API error retry, an idle assistant turn-end with no follow-up, or an explicit WAITING: marker. Returns rich JSON with signal kind + context per session. Use this when you've stepped away from the fleet and want one place to see everything that wants your attention and answer it.

development10

shadcn-ui

Expert guidance for integrating and building applications with shadcn/ui components, including component discovery, installation, customization, and best practices.

development10

ainb-fleet:standup

Show fleet status — every claude session running on the host, merged across ainb + claude-peers broker + background jobs. Use when you need to enumerate sessions before composing an action, see which sessions have a peer registered (broker-routable) vs tmux-only, check the `summary` of each session, or pipe the list into jq for filtering. Default output: text table. Pass --format json for LLM consumption.

development10

handover

Generate a handover document for transferring work to another developer or spawning an async agent

development10

explain-to-me

Produce a self-contained, richly styled HTML explainer for any topic the user asks about. Picks the right template from a bundled set of 22 visual patterns (feature explainer, concept explainer, module map, PR review, ADR, options paper / trade-off analysis, system diagram, flow- chart, status report, slide deck, prototype, editor, etc.), fills it with real content, augments with inline diagrams via sister skills (/fireworks-tech-graph for architecture / flow / sequence diagrams, /graphify for knowledge graphs), applies a Claude-brand polish layer, and publishes to here.now at a topic-slug URL so the link is shareable immediately. Local-only output is available with --local. Use when Stevie says "/explain-to-me", "explain-to-me X", "make me an explainer for X", "give me an HTML explainer", "render this as a webpage", "ADR for X", "options paper for X", or asks for a rich visual writeup. The skill picks the template, names the choice up-front, and reaches for diagrams whenever the content shape needs them.

development10

start-local

Start local development environment with auto-detected services in a persistent tmux session

development10

validate

Verify implementation against specifications

testing10

reflect:recall

development10

reflect

data-ai10

standup

Read-only branch-scoped situation report for Stevie. Surfaces beads in-flight, what's been done since last invocation, what's pending, active swarms/coding-agents/subagents in this worktree, and — most importantly — what specifically needs Stevie's input (decisions, PR merges, AskUserQuestion threads, blocked beads). Output is progressive tables, one shape per stage, address the user as Stevie. Trigger on /standup, "give me a standup", "what's the state of this branch", "what do you need from me". Read-only — never claims beads, never merges, never posts anywhere.

testing10

reflect:errors-ack

Triage and acknowledge entries in the reflect errors sink (~/.reflect/errors.json). Invoked from the statusline ⚠N badge when pipeline errors accumulate (drain poison, parser crashes, ingest failures, hook timeouts).

devops10

sync-learnings

Sync user-level agent config changes back to toolkit repository (works for Claude, Codex, Copilot)

tools10

brainstorm

Turn an idea into an approved design spec. Brainstorm is the orchestrator - it scaffolds a topic-stub with subject-type detection and ASCII preview hypotheses, then DELEGATES the iterative Q&A to /interview. After /interview returns a spec, brainstorm self-reviews, gates on user approval, and hands off to /plan. No HTML, no browser - terminal + ASCII previews only.

development10

tmux-message

Reliable peer-to-peer message delivery to other Claude Code instances via tmux send-keys. Use as a fallback when claude-peers MCP send_message fails to surface in the receiver's inbox (delivered server-side but receiver never picks it up — observed behaviour). Also use when sending a directive to a known Claude Code TUI session by tmux session name or fuzzy hint, or when injecting a multi-line directive into a peer's prompt and submitting it. Trigger phrases — "claude-peers fallback", "tmux send-keys", "send to peer via tmux", "inject directive", "deliver to nanoclaw/hermes peer", "peer message". Tmux-only — won't reach peers running outside tmux.

tools10

swarm-create

Create a new swarm team from a Beads epic with N worker agents

data-ai10

reflect:ingest

The global knowledge indexer. Harvests ALL memory sources across all tools (Claude, Codex, Copilot, Gemini) and all project types into the unified GraphRAG + QMD knowledge base. Archives originals, generates entity sidecars, and dual-indexes for future retrieval. This is THE command that makes the knowledge base comprehensive.

tools10

agent-ops

Enterprise SRE patterns for AI agent operations. Provides cost caps, circuit breakers, stall detection, observability, and runbook-driven incident response for autonomous agent workflows. Use when: (1) Running long autonomous agent sessions, (2) Managing multi-agent swarms, (3) Monitoring agent costs and performance, (4) Debugging stuck or expensive agent loops, (5) Setting up agent observability.

development10

reflect

Self-improvement through conversation analysis. Extracts learnings from corrections and success patterns, permanently encoding them into agent definitions. Philosophy - Correct once, never again.

data-ai10

skill-creator

Create or update AgentSkills. Use when designing, structuring, or packaging skills with scripts, references, and assets.

data-ai10

ainb-fleet

Fleet orchestration overview — the `ainb fleet ...` Rust subcommand namespace for driving every claude session on the host. Routes to one of five sub-skills (standup / broadcast / sequence / needs / daemon). Invoke this for an at-a-glance map of what fleet can do; reach for the specific sub-skill for the verb you want.

development10

session-info

Display current session information and context

development10

start-android

Start Android development with Emulator, dev server, and optional Poltergeist auto-rebuild

development10

stitch-design

Unified entry point for Stitch design work. Handles prompt enhancement (UI/UX keywords, atmosphere), design system synthesis (.stitch/DESIGN.md), and high-fidelity screen generation/editing via Stitch MCP.

tools10

spawn-agent

Spawn Claude Agent in tmux Session

data-ai10

reflect:consolidate

Project-level memory consolidation. Merges orphaned worktree memory directories into a single .agents/MEMORY.md for the current project. Deduplicates, sections, and proposes cleanup of orphan dirs. Does NOT index into the global knowledge base — use reflect:ingest for that.

data-ai10

compound-docs

Internal module for knowledge capture. Called by /reflect for generating structured learning documents with YAML frontmatter and entity sidecars. Not user-invocable — use /reflect or /reflect --knowledge instead.

documentation10

reflect-status

Show reflection metrics, pending reviews, sidecar coverage, and GraphRAG health. Read-only views into the reflect system state. Can also approve/reject pending low-confidence items.

data-ai10

implement

Execute implementation plan step-by-step

tools10

find-missing-tests

Analyze codebase and identify missing test cases, then create GitHub issues for each gap

development10

gh-issue

Analyze and fix the specified GitHub issue

testing10

reflect:ingest

tools10

plan

Create a detailed implementation plan

data-ai10

plan-tdd

Create a test-driven development plan for the project

development10

scrapling-official

Scrape web pages using Scrapling with anti-bot bypass (like Cloudflare Turnstile), stealth headless browsing, spiders framework, adaptive scraping, and JavaScript rendering. Use when asked to scrape, crawl, or extract data from websites; web_fetch fails; the site has anti-bot protections; write Python code to scrape/crawl; or write spiders.

development10

security-audit

Three-agent adversarial security audit pipeline. Runs red team (attacker), blue team (defender), and auditor agents in sequence to find vulnerabilities, propose mitigations, and produce a final severity-ranked report. Use when: (1) Before deploying to production, (2) After adding auth/payment/data handling, (3) Periodic security review, (4) User requests /security-audit, (5) Code touches sensitive areas (credentials, encryption, user data).

development10

security-scan

Scan Claude Code configuration (.claude/ directory) for security vulnerabilities, misconfigurations, and injection risks using AgentShield. Checks CLAUDE.md, settings.json, MCP servers, hooks, and agent definitions. Use when: (1) Setting up a new project, (2) After modifying .claude/ configs, (3) Before committing config changes, (4) Periodic security hygiene, (5) User requests /security-scan.

tools10

toolkit/packages/skills/sentry-cli

--- name: sentry-cli description: Guide for using the Sentry CLI to interact with Sentry from the command line. Use when the user asks about viewing issues, events, projects, organizations, making API calls, or authenticating with Sentry via CLI. user-invocable: true--- # Sentry CLI Usage Guide Help users interact with Sentry from the command line using the `sentry` CLI. ## Prerequisites The CLI must be installed and authenticated before use. ### Installation ```bash curl https://cli.sentr

tools10

session-summary

Generate a comprehensive session summary

tools10

stitch-loop

Teaches agents to iteratively build websites using Stitch with an autonomous baton-passing loop pattern

development10

test-driven-development

Use when implementing any feature or bugfix, before writing implementation code. Enforces RED-GREEN-REFACTOR cycle with test-first approach.

development10

ui-ux-pro-max

UI/UX design intelligence. 67 styles, 96 palettes, 57 font pairings, 25 charts, 13 stacks (React, Next.js, Vue, Svelte, Astro, Nuxt, SwiftUI, React Native, Flutter, Tailwind, shadcn/ui, Jetpack Compose). Actions: plan, build, create, design, implement, review, fix, improve, optimize, enhance, refactor, check UI/UX code. Projects: website, landing page, dashboard, admin panel, e-commerce, SaaS, portfolio, blog, mobile app, .html, .tsx, .vue, .svelte. Elements: button, modal, navbar, sidebar, card, table, form, chart. Styles: glassmorphism, claymorphism, minimalism, brutalism, neumorphism, bento grid, dark mode, responsive, skeuomorphism, flat design. Topics: color palette, accessibility, animation, layout, typography, font pairing, spacing, hover, shadow, gradient.

development10

workflow

Guide through structured delivery workflow with plan, implement, validate phases

testing10

make-a-goal

Produce a fully-filled autonomous-run mega-prompt at .agents/goals/<slug>.md by interviewing the user for the missing context. The output file is the "set the goal · walk away · come back to shipped work" prompt — drop it into a fresh Claude Code / Codex session to run end-to-end without hand- holding. Use when Stevie says "/make-a-goal", "/make-a-goal <outcome>", "make a goal", "set a goal", "build me a goal prompt", or similar. The skill does NOT execute the goal — it only produces the artifact. Named `make-a-goal` (not `goal`) because `/goal` is a reserved system slash.

development10

expect-test

AI-powered browser testing using expect-cli. Auto-detects dev server, reads git diff, generates and executes browser tests via Playwright. Use when you need to verify UI changes in a real browser, test user flows, or validate fixes visually. Trigger on "test this in the browser", "verify the UI", "run expect", "browser test", or after completing UI fixes.

tools10

plugins

Manage and track installed plugins, skills, and extensions across all sources

tools10

coding-agent

Delegate coding tasks to subagents or run Claude Code/Codex in tmux sessions. Use the Task tool for focused multi-step coding work. Use tmux for long-running interactive sessions, parallel worktree-based fixes, and PR reviews. NOT for simple single-file edits — do those directly.

tools10

expose

Expose a local service on Tailscale with a unique path

data-ai10

cost-aware-pipeline

Cost-aware LLM pipeline patterns for optimal model routing, narrow retry strategies, and prompt caching. Reduces API costs 40-70% through intelligent model selection, targeted retries, and cache-friendly prompt structures. Use when: (1) Building multi-model pipelines, (2) Optimizing API costs, (3) Designing retry strategies for LLM calls, (4) Implementing prompt caching, (5) Choosing between haiku/sonnet/opus for sub-tasks.

development10

posthog-replay-analysis

Decode and analyse PostHog session replay recordings programmatically via the API. Use when investigating a user-reported incident (crash, infinite loop, unexpected navigation) where a PostHog session replay exists, and you want to extract the URL sequence, network request timeline, console logs, or DOM state WITHOUT opening the replay UI. Especially valuable when diagnosing iOS Safari/WebKit crashes ("A problem repeatedly occurred"), request storms, or any bug where the replay UI can't be scripted. Bypasses the broken `rrvideo` npm packages with a direct decode-and-chart approach.

development10

media-processing

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration) and ImageMagick (image manipulation, format conversion, batch processing, effects, composition). Use when converting media formats, encoding videos with specific codecs (H.264, H.265, VP9), resizing/cropping images, extracting audio from video, applying filters and effects, optimizing file sizes, creating streaming manifests (HLS/DASH), generating thumbnails, batch processing images, creating composite images, or implementing media processing pipelines. Supports 100+ formats, hardware acceleration (NVENC, QSV), and complex filtergraphs.

development10

tmux-monitor

Monitor and report status of all tmux sessions including dev environments, spawned agents, and running processes. Uses tmuxwatch for enhanced visibility.

development10

swarm-status

Display comprehensive status dashboard for a swarm team

tools10

global-learnings

Manage the global learnings knowledge base - an agent-agnostic system for storing, searching, and retrieving cross-project learnings. Uses GraphRAG for vector + graph-based retrieval and QMD for fast local semantic search. Works with any AI coding agent (Claude, Cursor, Copilot, Codex, etc.).

development10

research-cache

Manage and search the research cache for previously analyzed repositories

research10

webapp-testing

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs.

tools10

token-usage

Show Claude Code token usage across sessions — daily, weekly, per-project, and per-session breakdowns. Parses {{HOME_TOOL_DIR}}/projects/**/*.jsonl for consumption data. Use when the user asks about token usage, costs, how many tokens were used, session statistics, or wants a usage report.

tools10

swarm-shutdown

Gracefully shutdown a swarm team

tools10

swarm-join

Join an existing swarm team as a worker agent

data-ai10

tmux-status

Show status of all tmux sessions including dev environments, spawned agents, and running processes

development10

swarm-orchestration

A tmux-based persistent multi-agent swarm system with file-based inter-agent messaging

data-ai10

swarm-inbox

Read and send inter-agent messages within a swarm team

data-ai10

swarm-agent-troubleshooting

Diagnose and fix swarm agent spawn failures when agents don't start processing tasks

data-ai10

start-ios

Start iOS development with Simulator, dev server, and optional Poltergeist auto-rebuild

development10

session-metrics

View detailed metrics about the current session

data-ai10

langfuse-setup

Set up or disable Langfuse observability for Claude Code sessions. Manages hook configuration, credential verification, and connection testing.

development10

mobile-e2e-mcp

End-to-end mobile testing of Expo/React Native apps via claude-in-mobile MCP + mcporter. Android emulator preferred (iOS needs WebDriverAgent). Covers full setup: emulator boot, Metro start, Firebase auth, MCP tool usage, tap/screenshot patterns. Use when asked to "test functionality" on a mobile app, "walk through the app", or run E2E validation of user journeys on an Expo/React Native project.

tools10

claude-langfuse

Claude Code observability skill: analyze session traces stored in Langfuse, extract learnings from corrections, identify success patterns, and propose agent/skill improvements based on historical data. Powers self-improvement through trace analysis of Claude Code sessions.

development10

remotion

Generate walkthrough videos from Stitch projects using Remotion with smooth transitions, zooming, and text overlays

content-media9

debug-bridge

Browser automation and inspection for AI agents via WebSocket

tools9

liquid-glass

iOS 26 Liquid Glass design system for SwiftUI. Implements Apple's glassmorphism material effects, depth-based layering, and adaptive tinting. Follows Apple Human Interface Guidelines for glass materials. Use when: (1) Building iOS 26+ SwiftUI interfaces, (2) Implementing glassmorphism effects, (3) Creating translucent/frosted UI elements, (4) Designing with Apple's Liquid Glass aesthetic, (5) User mentions liquid glass, glassmorphism, or frosted glass UI.

development9

make-github-issues

Review codebase and create GitHub issues for identified problems

development9

oracle

Best practices for using the oracle CLI (prompt + file bundling, engines, sessions, and file attachment patterns).

tools9

cleanup-agent-worktree

Remove Agent Worktree

data-ai9

do-issues

Work on GitHub issues systematically with proper development workflow

development9

reflect:status

Show reflection metrics, pending reviews, sidecar coverage, and GraphRAG health. Read-only views into the reflect system state. Can also approve/reject pending low-confidence items.

data-ai9

react:components

Converts Stitch designs into modular Vite and React components using system-level networking and AST-based validation.

development9

reflect

Self-improvement through conversation analysis. Extracts learnings from corrections and success patterns, permanently encoding them into agent definitions. Philosophy - Correct once, never again.

data-ai9

reflect

development9

prime

Load context for a new agent session by analyzing codebase structure and README

development9

nano-banana-pro

Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).

content-media9

merge-agent-work

Merge Agent Branch

data-ai9

frontend-design

Frontend design skill for UI/UX implementation - generates distinctive, production-grade interfaces

development9

design-md

Analyze Stitch projects and synthesize a semantic design system into DESIGN.md files

content-media9

retro-pdf

Convert markdown documents to professional, retro LaTeX-style PDFs with academic formatting, clickable TOC, and proper citations.

tools9

frontend-slides

Generate zero-dependency HTML presentations with keyboard navigation, print CSS, responsive design, and speaker notes. Creates a single self-contained HTML file — no build step, no framework, no CDN. Use when: (1) Creating slide decks or presentations, (2) Building pitch decks, (3) Making technical talks, (4) User asks for slides or a presentation, (5) Quick visual content for meetings.

development9

recover-sessions

Recover crashed or orphaned agent sessions

data-ai9

crypto-research

Comprehensive cryptocurrency market research and analysis using specialized AI agents. Analyzes market data, price trends, news sentiment, technical indicators, macro correlations, and investment opportunities. Use when researching cryptocurrencies, analyzing crypto markets, evaluating digital assets, or investigating blockchain projects like Bitcoin, Ethereum, Solana, etc.

testing9

ats-resume-matcher

Professional ATS (Applicant Tracking System) resume matching and scoring tool that operates with the precision of enterprise systems like Greenhouse, Lever, Workday, and Breezy HR. Use this skill when: (1) Matching a resume against a job description to calculate fit scores (2) Analyzing resume-JD alignment with detailed category breakdowns (3) Identifying gaps between candidate qualifications and job requirements (4) Getting actionable suggestions to improve resume match percentage (5) Preparing a resume for ATS optimization before job applications Supports PDF, DOCX, Markdown, and plain text inputs for both resumes and job descriptions.

tools9

attach-agent-worktree

Attach to Agent Session

data-ai9

autonomous-loops

Six proven autonomous agent loop patterns with guard rails. Provides reusable patterns for generate->validate->fix, explore->hypothesize->test, and other autonomous workflows. Includes the reviewer-never-authored principle for quality assurance. Use when: (1) Building autonomous agent workflows, (2) Designing self-correcting pipelines, (3) Implementing agent retry/fix loops, (4) Setting up multi-agent review processes, (5) User asks about agent loop patterns.

development9

browser-verify

Combined AI browser testing + visual inspection. Runs expect-cli for automated test generation/execution, then debug-bridge for screenshots and DOM inspection. Use when you need to verify UI changes with both automated tests AND visual evidence. Trigger on "verify in browser", "browser verify", "visual test", "test and screenshot", or after completing UI fixes that need proof.

tools9

reflect:consolidate

data-ai9

discuss

Discuss and explore ideas before committing to an approach

tools9

notebooklm

Use this skill to query your Google NotebookLM notebooks directly from Claude Code for source-grounded, citation-backed answers from Gemini. Browser automation, library management, persistent auth. Drastically reduced hallucinations through document-only responses.

tools9

enhance-prompt

Transforms vague UI ideas into polished, Stitch-optimized prompts. Enhances specificity, adds UI/UX keywords, injects design system context, and structures output for better generation results.

testing9

resume-formatter

Professional resume formatting and PDF generation tool. Use this skill when: (1) Creating a new resume tailored to a specific job description (2) Converting an existing resume to the standard HTML template (3) Generating a PDF from an HTML resume (4) Updating resume content while maintaining consistent formatting Produces professional, ATS-friendly resumes with consistent blue-themed styling.

tools9