
Design engineering principles for making interfaces feel polished. Use when building UI components, reviewing frontend code, implementing animations, hover states, shadows, borders, typography, micro-interactions, enter/exit animations, or any visual detail work. Triggers on UI polish, design details, "make it feel better", "feels off", stagger animations, border radius, optical alignment, font smoothing, tabular numbers, image outlines, box shadows.
Agentic memory system for writers - track characters, relationships, scenes, and themes
Verify that a change really works before you claim completion
Improves typography by fixing font choices, hierarchy, sizing, weight, and readability so text feels intentional. Use when the user mentions fonts, type, readability, text hierarchy, sizing looks off, or wants more polished, intentional typography.
Reliable agent spawning in tmux with load-wait and verification
Monitor social media platforms (Reddit, Twitter/X, LinkedIn, Hacker News) for startup mentions, competitor activity, and market sentiment using Browser Use API for authenticated scraping.
Detect user feedback in Slack threads, classify intent, and propagate direction changes to specs, Issues, and agents.
Query, store, and loop on persistent research knowledge. Check prior experiments before building.
Auto-generate a README.md with product overview, tech stack, setup instructions, and architecture. Use when the user needs a README for their startup repository, wants to document the project for new developers, or needs to keep the README in sync with the actual project state.
Generic release assistant — analyzes repo release rules, caches them in .omc/RELEASE_RULE.md, then guides the release
Self-referential loop until task completion with configurable verification reviewer
Tones down visually aggressive or overstimulating designs, reducing intensity while preserving quality. Use when the user mentions too bold, too loud, overwhelming, aggressive, garish, or wants a calmer, more refined aesthetic.
Configure popular MCP servers for enhanced agent capabilities
Structured milestone progress reports posted to Slack with metrics from GitHub, CI, and cost telemetry.
Create distinctive, production-grade frontend interfaces with high design quality. Generates creative, polished code that avoids generic AI aesthetics. Use when the user asks to build web components, pages, artifacts, posters, or applications, or when any design skill requires project context. Call with 'craft' for shape-then-build, 'teach' for design context setup, or 'extract' to pull reusable components and tokens into the design system.
Automated incident lifecycle from detection through diagnosis, fix, deploy, verification, and post-mortem with budget-guarded escalation.
Integrate error tracking with Sentry for frontend and backend applications. Capture, classify, deduplicate, and alert on errors with deploy-version tagging.
Classify errors as FATAL, TRANSIENT, or UNKNOWN and transform raw stack traces into actionable agent instructions.
Evaluation framework using pass@k metrics to measure agent reliability with diff-based eval selection.
Prevent feature creep when building software, apps, and AI-powered products. Use this skill when planning features, reviewing scope, building MVPs, managing backlogs, or when a user says "just one more feature." Helps developers and AI agents stay focused, ship faster, and avoid bloated products.
2-stage pipeline: trace (causal investigation) -> deep-interview (requirements crystallization) with 3-point injection
Automated deploy pipeline for Vercel (frontend), Railway (backend), and Convex (database) with pre/post checks and rollback. Use when deploying services to production, configuring deploy ordering and health checks, setting up rollback procedures, or gating deploys on test and security audit results.
Create blog posts powered by unique data the SaaS generates. Use instead of generic AI content. Posts include proprietary benchmarks, user-aggregated insights, and original analysis that no competitor can replicate. Query SEO Chat API for topic validation.
Per-agent per-session cost tracking with configurable ceilings and model tier optimization.
Diagnose the current OMC session or repo state using logs, traces, state, and focused reproduction
Detect context window limits and perform clean resets with structured handoff documents for session continuity.
Deep security review patterns for authorization logic, data access boundaries, action isolation, rate limiting, and protecting sensitive operations
Writing queries, mutations, actions, and HTTP actions with proper argument validation, error handling, internal functions, and runtime considerations
Deep competitor research using browser agents to visit actual competitor sites. Captures screenshots, extracts real pricing, scores UX friction, evaluates API/agent experience, and produces a comparison matrix. Use when analyzing a competitive landscape before product planning.
Full autonomous execution from idea to working code
Claude-Codex-Gemini tri-model orchestration via /ask codex + /ask gemini, then Claude synthesizes results
Extract and codify brand guidelines from design assets into a reusable brand configuration. Use when the user needs to establish brand consistency, extract color palettes and typography from design tokens or CSS, document voice and tone guidelines, create a brand configuration file for other agents to reference, or audit existing content for brand compliance.
Clean AI-generated code slop with a regression-safe, deletion-first workflow and optional reviewer-only mode
Create new agent definitions with specific instruction sets, skill assignments, and behavioral rules. Use when the user wants a new type of agent (e.g., research-papers agent, customer-support agent, data-pipeline agent) or wants to modify an existing agent's behavior.
Adapt designs to work across different screen sizes, devices, contexts, or platforms. Implements breakpoints, fluid layouts, and touch targets. Use when the user mentions responsive design, mobile layouts, breakpoints, viewport adaptation, or cross-device compatibility.
Detect and eliminate signs of AI-generated writing. Use when producing any user-facing text — blog posts, landing pages, docs, emails, social media, README, marketing copy. Enforces human-quality prose by flagging Wikipedia's documented AI writing tells.
Run technical quality checks across accessibility, performance, theming, responsive design, and anti-patterns. Generates a scored report with P0-P3 severity ratings and actionable plan. Use when the user wants an accessibility check, performance audit, or technical quality review.
Amplify safe or boring designs to make them more visually interesting and stimulating. Increases impact while maintaining usability. Use when the user says the design looks bland, generic, too safe, lacks personality, or wants more visual impact and character.
Diagnose and fix oh-my-claudecode installation issues
Pushes interfaces past conventional limits with technically ambitious implementations — shaders, spring physics, scroll-driven reveals, 60fps animations. Use when the user wants to wow, impress, go all-out, or make something that feels extraordinary.
Plan the UX and UI for a feature before writing code. Runs a structured discovery interview, then produces a design brief that guides implementation. Use during the planning phase to establish design direction, constraints, and strategy before any code is written.
Use first for install/update routing — sends setup, doctor, or MCP requests to the correct OMC setup flow
Schema migration strategies for evolving applications including adding new fields, backfilling data, removing deprecated fields, index migrations, and zero-downtime migration patterns
Autonomous startup builder — idea to running company in 11 phases
Quick security audit checklist covering authentication, function exposure, argument validation, row-level access control, and environment variable handling
Extract a learned skill from the current conversation
Improve layout, spacing, and visual rhythm. Fixes monotonous grids, inconsistent spacing, and weak visual hierarchy. Use when the user mentions layout feeling off, spacing issues, visual hierarchy, crowded UI, alignment problems, or wanting better composition.
Auto-generate Vitest unit tests and Playwright e2e tests from product spec acceptance criteria using TDD. Use when writing tests before implementation, generating test stubs from acceptance criteria, enforcing TDD red-green workflow, evaluating test quality for brittleness or looseness, or gating features on code coverage thresholds.
Auto-generate a CONTRIBUTING.md covering dev setup, coding standards, and PR process. Use when the user needs a contributing guide for their repository, wants to document the development workflow for new contributors, or needs to keep contribution docs in sync with actual project configuration.
Review reusable project knowledge and decide what belongs in project memory, notepad, or durable docs
Turn a repeatable workflow from the current session into a reusable OMC skill draft
Process-first advisor routing for Claude, Codex, or Gemini via `omc ask`, with artifact capture and no raw CLI assembly
Generate hero images, illustrations, and visual assets for SaaS websites using AI image APIs. Every page MUST have a visual centerpiece — never just text. Use Fal.ai with Flux 2 Pro for best quality. Triggers when building landing pages, hero sections, or any page that needs custom imagery.
Scaffold SEO-optimized blog post systems derived from product specs and competitor research. Use when the user needs a blog section for their startup website, wants to generate blog content from product specs, needs SEO-optimized posts with proper metadata and heading structure, or wants a complete blog infrastructure with RSS, sitemap, and pagination.
OMC agent catalog, available tools, team pipeline routing, commit protocol, and skills registry. Auto-loads when delegating to agents, using OMC tools, orchestrating teams, making commits, or invoking skills.
Integrate PostHog product analytics into a Next.js application with page views, event tracking, conversion funnels, A/B testing via feature flags, and API access for programmatic dashboard queries. Use when setting up product analytics, tracking user actions, defining conversion funnels, or enabling A/B experiments.
Review a feature and enhance it with purposeful animations, micro-interactions, and motion effects that improve usability and delight. Use when the user mentions adding animation, transitions, micro-interactions, motion design, hover effects, or making the UI feel more alive.
Run automated WCAG 2.1 AA accessibility audits using axe-core via Playwright. Use when implementing or reviewing UI features to enforce accessibility compliance, catch color contrast failures, validate keyboard navigation, check screen reader landmarks, verify alt text and ARIA attributes, or gate feature completion on accessibility standards.
Configure notification integrations (Telegram, Discord, Slack) via natural language
Umbrella skill for all Convex development patterns. Routes to specific skills like convex-functions, convex-realtime, convex-agents, etc.
Cancel any active OMC mode (autopilot, ralph, ultrawork, ultraqa, swarm, ultrapilot, pipeline, team)
Install or refresh oh-my-claudecode for plugin, npm, and local-dev setups from the canonical setup flow
CLI-team runtime for claude, codex, or gemini workers in tmux panes when you need process-based parallel execution
Set up and maintain a GitHub Actions CI/CD pipeline with parallel lint/typecheck/test jobs, staging deploys on merge to main, production deploys on release tags, health checks, and automatic rollback on failure. Use when configuring CI/CD, adding deployment workflows, setting up branch protection, or implementing automatic rollback.
Improve unclear UX copy, error messages, microcopy, labels, and instructions to make interfaces easier to understand. Use when the user mentions confusing text, unclear labels, bad error messages, hard-to-follow instructions, or wanting better UX writing.
Add strategic color to features that are too monochromatic or lack visual interest, making interfaces more engaging and expressive. Use when the user mentions the design looking gray, dull, lacking warmth, needing more color, or wanting a more vibrant or expressive palette.
Building AI agents with the Convex Agent component including thread management, tool integration, streaming responses, RAG patterns, and workflow orchestration
Guidelines for building production-ready Convex apps covering function organization, query patterns, validation, TypeScript usage, error handling, and the Zen of Convex design philosophy
How to create, structure, and publish self-contained Convex components with proper isolation, exports, and dependency management
Patterns for building reactive apps including subscription management, optimistic updates, cache behavior, and paginated queries with cursor-based loading
Defining and validating database schemas with proper typing, index configuration, optional fields, unions, and migration strategies for schema changes
Evaluate design from a UX perspective, assessing visual hierarchy, information architecture, emotional resonance, cognitive load, and overall quality with quantitative scoring, persona-based testing, automated anti-pattern detection, and actionable feedback. Use when the user asks to review, critique, evaluate, or give feedback on a design or component.
Run periodic full-codebase security and quality scans via Cubic with automated GitHub Issue creation. Use when setting up scheduled codebase-wide scanning, detecting architectural drift or accumulated tech debt, catching cross-cutting security vulnerabilities, or configuring automated issue triage from scan findings.
Scheduled function patterns for background tasks including interval scheduling, cron expressions, job monitoring, retry strategies, and best practices for long-running tasks
Complete file handling including upload flows, serving files via URL, storing generated files from actions, deletion, and accessing file metadata from system tables
External API integration and webhook handling including HTTP endpoint routing, request/response handling, authentication, CORS configuration, and webhook signature validation
Automate dependency management with scheduled security audits, auto-created PRs for safe updates, human review for breaking changes, freshness scoring, and license compliance checks. Use when running npm audit, updating dependencies, checking for vulnerabilities, or tracking dependency freshness.
Strip designs to their essence by removing unnecessary complexity. Great design is simple, powerful, and clean. Use when the user asks to simplify, declutter, reduce noise, remove elements, or make a UI cleaner and more focused.
Socratic deep interview with mathematical ambiguity gating before autonomous execution
Add moments of joy, personality, and unexpected touches that make interfaces memorable and enjoyable to use. Elevates functional to delightful. Use when the user asks to add polish, personality, animations, micro-interactions, delight, or make an interface feel fun or memorable.
Generate optimized loop prompts for agents that run continuously. Use when spawning persistent agents (harness-researcher, slop-cleaner, growth monitor, ops monitor) or any agent that should NEVER STOP until manually interrupted. Encodes patterns from autoresearch, ui-loop, ralph, and Karpathy.
Auto-generate user-facing documentation including API reference, user guides, and changelog. Use when the user needs a documentation site, wants API docs generated from code comments and type definitions, needs user guides derived from a product spec, or wants a changelog generated from git history.
Invoke parallel document-specialist agents for external web searches and documentation lookup
Compare a harness-built product against a reference product in the same category. Scores feature count, code depth, test coverage, page count, and UX complexity. Use after building a product to identify what's missing vs production-grade open source.
Track all task state via GitHub Issues and Project boards with automated column transitions and audit trail comments.
Configure HUD display options (layout, presets, display elements)
Turn a one-line request into a schema-compliant GitHub issue draft or creation command. Use when the user gives a short bug report, feature request, refactor idea, or task summary and wants a normalized issue with title, type, severity, description, acceptance criteria, and verification steps.
Run hypothesis-driven A/B tests on the landing page, measure conversion improvements via PostHog feature flags, and report outcomes with statistical significance. Use when optimizing landing page conversion, testing CTA changes, running experiments on page elements, or analyzing funnel drop-offs.
Generate Terms of Service, Privacy Policy, and Cookie Policy from startup type and jurisdiction. Use before any public launch. Produces legally-aware templates that cover standard requirements (GDPR, CCPA, SOC2 basics) without being actual legal advice.
Aggregate and search logs from Vercel and Railway with structured format, ring buffer storage, and agent-queryable search interface.
Third-party review of finished Claude sessions. Reads completion signals written by the plugin's Stop hook, analyzes each session's transcript for patterns (skill-heavy/code-light loops, duplicate skill invocations, context compaction thrash), writes verdicts to .harness/metrics/, and appends learnings to .harness/learnings/knowledge.md. Use this when you want to audit what happened in recent sessions and feed learnings back into chain config.
Orchestrate parallel scientist agents for comprehensive analysis with AUTO mode
Multi-layered security scanning for dependency vulnerabilities, secret detection, and OWASP top 10 compliance. Use when auditing dependencies for CVEs, scanning code for hardcoded secrets, setting up pre-commit hooks for secret detection, reviewing API routes for injection or auth flaws, or gating PRs and deploys on security findings.
Diagnoses and fixes UI performance across loading speed, rendering, animations, images, and bundle size. Use when the user mentions slow, laggy, janky, performance, bundle size, load time, or wants a faster, smoother experience.
Monitor Core Web Vitals (LCP, INP, CLS) using Lighthouse CI with budget thresholds and deploy gating. Use when setting up performance monitoring, defining performance budgets, tracking bundle sizes, gating deploys on performance regressions, or auditing route-level performance over time.
Strategic planning with optional interview workflow
Performs a final quality pass fixing alignment, spacing, consistency, and micro-detail issues before shipping. Use when the user mentions polish, finishing touches, pre-launch review, something looks off, or wants to go from good to great.
Continuous post-deploy monitoring and growth loop. Use after deploying a startup to keep it running, growing, and improving 24/7. Combines monitoring, growth, content, and maintenance into one persistent loop.
Generate programmatic SEO pages at scale from structured data. Use when a SaaS has a natural "matrix" of variations (format conversions, city pages, tool comparisons). Creates unique, data-enriched pages that survive Google's "zero information gain" filter. Query SEO Chat API for validation.
Worktree-first dev environment manager for issues, PRs, and features with optional tmux sessions
Consensus planning entrypoint that auto-gates vague ralph/autopilot/team requests before execution
Query SEO Chat API for sourced SEO answers, URL audits, and content strategy. Use when optimizing pages for search, planning content, auditing URLs, or researching SEO best practices. Backed by 500+ curated SEO sources with inline citations.
Autonomous evolutionary code improvement engine with tournament selection
Generate comprehensive SEO assets from the product spec including sitemap.xml, robots.txt, meta tags, Open Graph tags, JSON-LD structured data, and canonical URLs. Use when setting up SEO for a new site, adding meta tags or structured data, generating sitemaps, integrating Lighthouse SEO audits into CI, or ensuring social sharing previews work correctly.
Manage local skills - list, add, remove, search, edit, setup wizard
Clean AI-generated code slop with a regression-safe, deletion-first workflow
Generate and schedule platform-adapted social media posts with brand-consistent voice. Use when the user needs social media content for Twitter/X or LinkedIn, wants to promote product announcements or blog posts on social platforms, needs a consistent posting cadence, or wants to track and analyze social engagement metrics.
Negotiate structured success criteria contracts between generator and evaluator agents before each implementation sprint. Use when defining acceptance criteria for a feature sprint, establishing machine-verifiable success conditions, setting up generator-evaluator negotiation loops, or tracking contract iteration history for audit.
Add a new tool to the project stack at runtime. Reads the tool catalog for known configs, installs the package, updates stacks.yml and .env, generates boilerplate, creates a tracking Issue, and posts an investor update. Use when an agent discovers it needs a new integration (analytics, payments, auth, email, monitoring, etc.) or when the user requests adding a tool.
N coordinated agents on shared task list using Claude Code native teams
Three-tier memory system (hot, warm, cold) to prevent context pollution with stale state and preserve knowledge across resets.
Evidence-driven tracing lane that orchestrates competing tracer hypotheses in Claude built-in team mode
ATIF trajectory serialization with ring buffer for post-hoc debugging, eval dataset construction, and regression detection.
QA cycling workflow - test, verify, fix, repeat until goal met
Parallel execution engine for high-throughput task completion
Continuous health check polling with failure detection, consecutive-failure thresholds, Slack alerts, incident triggering, and rolling uptime tracking.
Build an in-app feedback widget that collects user feedback, categorizes it by type (bug, feature request, general), routes it to appropriate agents, and converts feature requests into GitHub Issues. Use when adding feedback collection, building feedback widgets, setting up user feedback routing, or aggregating feedback trends.
Screenshot every page with Playwright, feed to visual QA agent for design evaluation, report results to Slack. Use after every build phase to verify UI matches the chosen design preset. Integrates with the Playwright skill and the website-creation preset system.
Structured visual QA verdict for screenshot-to-reference comparisons
Build production-quality SaaS websites with opinionated design presets. Use when creating any startup website. The user MUST pick a design style before building. Enforces shadcn/ui, Figma design principles, specific CSS values per style, and anti-AI-writing. Load alongside anti-ai-writing skill. LIGHT MODE ONLY.
Deep codebase initialization with hierarchical AGENTS.md documentation
LLM Wiki — persistent markdown knowledge base that compounds across sessions (Karpathy model)