
Create, maintain, and run evidence-based benchmarks for AI agents. Use when setting up testing infrastructure, writing new test scenarios, or evaluating agent performance.
Use when the user asks to diagnose a bug, find the root cause of a failing behavior, or run a controlled hypothesis-by-hypothesis investigation with experiments and evidence. Produces a diagnostic report with a recommended fix (but does not apply the fix). Do NOT trigger on "fix this bug" when the cause is obvious, or on simple error-message lookups.
Use when the user asks to review PAST sessions (not the current one), find recurring patterns across multiple sessions, or improve project primitives (rules, skills, hooks, docs) based on historical IDE transcripts. Do NOT trigger on current-session self-review — use reflect instead.
Adapt project-local flowai primitives (skills, agents, AGENTS.md artifacts, hooks) to project specifics. Standalone adaptation — independent of update.
Configure and maintain Deno development commands (check, test, dev, prod). Use when the user wants to set up or update the standard command interface in deno.json and scripts/ directory.
Use when the user asks to review PAST sessions (not the current one), find recurring patterns across multiple sessions, or improve project primitives (rules, skills, hooks, docs) based on historical IDE transcripts. Do NOT trigger on current-session self-review — use reflect instead.
Adapt project-local flowai primitives (skills, agents, AGENTS.md artifacts, hooks) to project specifics. Standalone adaptation — independent of update.
Configure and maintain Deno development commands (check, test, dev, prod). Use when the user wants to set up or update the standard command interface in deno.json and scripts/ directory.
Use ONLY when the user explicitly asks to plan a task, create a task file in documents/tasks/, produce a GODS-format breakdown, or prepare a critiqued implementation strategy before coding. Do NOT trigger on brief design discussion, ad-hoc code suggestions, or casual "how would you approach X?" questions.
Use ONLY when the user explicitly asks to plan a task, create a task file in documents/tasks/, produce a GODS-format breakdown, or prepare a critiqued implementation strategy before coding. Do NOT trigger on brief design discussion, ad-hoc code suggestions, or casual "how would you approach X?" questions.
Use when the user asks to diagnose a bug, find the root cause of a failing behavior, or run a controlled hypothesis-by-hypothesis investigation with experiments and evidence. Produces a diagnostic report with a recommended fix (but does not apply the fix). Do NOT trigger on "fix this bug" when the cause is obvious, or on simple error-message lookups.
Creation and configuration of event hooks/plugins to manage agent behavior, command filtering, auditing, and automation. Works across IDEs (Cursor, Claude Code, OpenCode). Use when you need to: (1) Create a new hook (e.g., for formatting or security checks), (2) Configure hooks/plugins, (3) Implement logic for blocking or modifying agent actions via scripts.
Manage all Deno capabilities via CLI (run, test, fmt, lint, task, add, etc.). Use this skill when you need to execute Deno commands, manage dependencies, run tests, or handle local development workflows.
Manage Deno Deploy cloud services using both `deno deploy` and `deployctl`. Use this skill for deploying projects, managing cloud environments (Build/Dev/Prod contexts), monitoring logs, and troubleshooting deployment issues (like private npm dependencies).
Manage Deno Deploy cloud services using both `deno deploy` and `deployctl`. Use this skill for deploying projects, managing cloud environments (Build/Dev/Prod contexts), monitoring logs, and troubleshooting deployment issues (like private npm dependencies).
Guide for creating effective flowai commands. This skill should be used when users want to create a new command (or update an existing command) that extends flowai's capabilities with specialized knowledge, workflows, or tool integrations. Works across IDEs (Cursor, Claude Code, OpenCode).
Two-phase "Lead Engineer" audit: full scan across 8 categories, then interactive issue-by-issue resolution with the user.
Extract Claude Code system prompt from its compiled binary or JS bundle into a structured template file. Produces a document mirroring the real API request with exact prompt text, ant-only variants, feature-gated sections, and minified-variable mappings. Use when extracting, documenting, or comparing Claude Code prompt versions.
Adds TypeScript/Deno code style rules to project AGENTS.md. Use when setting up new Deno projects or when user needs to establish code style guidelines.
Creation and configuration of event hooks/plugins to manage agent behavior, command filtering, auditing, and automation. Works across IDEs (Cursor, Claude Code, OpenCode). Use when you need to: (1) Create a new hook (e.g., for formatting or security checks), (2) Configure hooks/plugins, (3) Implement logic for blocking or modifying agent actions via scripts.
Analyze total token usage including conversation history, system prompts, and active rules to estimate cost.
How to write tasks using GODS framework
Guidelines for writing comprehensive Product Requirements Documents (PRD)
Creation and configuration of event hooks/plugins to manage agent behavior, command filtering, auditing, and automation. Works across IDEs (Cursor, Claude Code, OpenCode). Use when you need to: (1) Create a new hook (e.g., for formatting or security checks), (2) Configure hooks/plugins, (3) Implement logic for blocking or modifying agent actions via scripts.
Use when the user provides a source (URL, file path, or free text) to save into the project's memex — a long-term knowledge bank for AI agents. Stores the raw source, extracts entities into cross-linked pages, runs a backlink audit, and updates the index and activity log. Do NOT trigger on casual reads; only when the intent is to persist a source into the memex.
How to write tasks using GODS framework
Creates interactive HTML teaching materials with clickable state diagrams. Use when the user asks to produce an explorable tutorial artifact, not to answer a question.
Create and manage GitHub issues using the GODS framework. Use when creating, updating, or triaging GitHub issues.
Guidelines for writing comprehensive Product Requirements Documents (PRD)
Analyze total token usage including conversation history, system prompts, and active rules to estimate cost.
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.
Multi-directional deep research with sub-agents, scoring, source verification, and cited synthesis. Use for comprehensive research, deep analysis, or evidence-backed answers. Produces executive summary + markdown report. Supports search tools, playwright-cli, and MCP.
Scaffold flowai-workflow in a project. Runs `flowai-workflow init` to copy a bundled workflow (github-inbox, autonomous-sdlc, etc.) into .flowai-workflow/<name>/, then adapts workflow.yaml and agents/agent-*.md to project specifics. Use when asked to add flowai-workflow, scaffold a new DAG workflow, or create an agent pipeline.
How to write in informational style
Scaffold flowai-workflow in a project. Runs `flowai-workflow init` to copy a bundled workflow (github-inbox, autonomous-sdlc, etc.) into .flowai-workflow/<name>/, then adapts workflow.yaml and agents/agent-*.md to project specifics. Use when asked to add flowai-workflow, scaffold a new DAG workflow, or create an agent pipeline.
Supervise a flowai-workflow run. Launches `flowai-workflow run <workflow>`, polls runs/<run-id>/state.json every 30 s, and on node failure or stall diagnoses the root cause from artifacts, applies a fix to workflow.yaml or an agent prompt, then resumes via --resume. Use when asked to babysit a flowai-workflow run or iterate fix-and-resume autonomously.
Supervise a flowai-workflow run. Launches `flowai-workflow run <workflow>`, polls runs/<run-id>/state.json every 30 s, and on node failure or stall diagnoses the root cause from artifacts, applies a fix to workflow.yaml or an agent prompt, then resumes via --resume. Use when asked to babysit a flowai-workflow run or iterate fix-and-resume autonomously.
Guide for creating persistent AI rules (coding standards, project conventions, file-specific patterns). Use when users want to create a rule, add coding standards, set up project conventions, configure file-specific patterns, or ask about rules placement. Works across IDEs (Cursor, Claude Code, OpenCode).
Run prompts in Claude Code, OpenCode, Cursor, or Codex CLIs from the current session — pick one IDE, fan out across several, or compare models. You are a courier that relays the other runtime's stdout verbatim, do not synthesise your own answer. Use on "run in <ide>", "compare <ide> vs <ide>", "try on <model>", "which IDE handles X better", "run across models".
Guide for creating effective Agent Skills (SKILL.md packages). Use when users want to create a new skill, write a skill, author a SKILL.md, or ask about skill structure, best practices, or skill file format. Works across IDEs (Cursor, Claude Code, OpenCode).
Guide for writing prompts for reasoning/smart models (Gemini Pro, GPT-4o, Claude 3.5 Sonnet), focused on structure and context.
How to fix tests
Use when the user asks for a JIT review, wants to catch hidden regressions in a staged/unstaged diff or commit range, or says "check my changes for hidden bugs". Synthesizes ephemeral Catching JiTTests (pass on parent, fail on diff). Not for fixing known failing tests (see flowai-fix-tests) or general code review (flowai-review).
Guide for creating effective Subagents (custom agent definitions). Use when users want to create a new subagent, set up task-specific agents, configure code reviewers, debuggers, or domain-specific assistants. Works across IDEs (Cursor, OpenCode).
Guide for writing stable, effective prompts for instant/fast models (Gemini Flash, GPT-4o Mini, Haiku), suitable for beginners.
Draw and edit Mermaid diagrams in Markdown. Use when the user wants to visualize processes, flows, sequences, or asks for diagrams.
Guide for creating persistent AI rules (coding standards, project conventions, file-specific patterns). Use when users want to create a rule, add coding standards, set up project conventions, configure file-specific patterns, or ask about rules placement. Works across IDEs (Cursor, Claude Code, OpenCode).
Guide for creating effective Agent Skills (SKILL.md packages). Use when users want to create a new skill, write a skill, author a SKILL.md, or ask about skill structure, best practices, or skill file format. Works across IDEs (Cursor, Claude Code, OpenCode).
Guide for creating effective Subagents (custom agent definitions). Use when users want to create a new subagent, set up task-specific agents, configure code reviewers, debuggers, or domain-specific assistants. Works across IDEs (Cursor, OpenCode).
Run prompts in Claude Code, OpenCode, Cursor, or Codex CLIs from the current session — pick one IDE, fan out across several, or compare models. You are a courier that relays the other runtime's stdout verbatim, do not synthesise your own answer. Use on "run in <ide>", "compare <ide> vs <ide>", "try on <model>", "which IDE handles X better", "run across models".
Creates interactive HTML teaching materials with clickable state diagrams. Use when the user asks to produce an explorable tutorial artifact, not to answer a question.
How to conduct a Q&A session with the user. Canonical reference for FR-UNIVERSAL.QA-FORMAT.
Draw and edit Mermaid diagrams in Markdown. Use when the user wants to visualize processes, flows, sequences, or asks for diagrams.
Guide for writing stable, effective prompts for instant/fast models (Gemini Flash, GPT-4o Mini, Haiku), suitable for beginners.
Guide for writing prompts for reasoning/smart models (Gemini Pro, GPT-4o, Claude 3.5 Sonnet), focused on structure and context.
How to fix tests
Use when the user asks for a JIT review, wants to catch hidden regressions in a staged/unstaged diff or commit range, or says "check my changes for hidden bugs". Synthesizes ephemeral Catching JiTTests (pass on parent, fail on diff). Not for fixing known failing tests (see flowai-fix-tests) or general code review (flowai-review).
Create and manage GitHub issues using the GODS framework. Use when creating, updating, or triaging GitHub issues.
Multi-directional deep research with sub-agents, scoring, source verification, and cited synthesis. Use for comprehensive research, deep analysis, or evidence-backed answers. Produces executive summary + markdown report. Supports search tools, playwright-cli, and MCP.
How to write in informational style
Use when the user asks to audit a memex (long-term knowledge bank for AI agents) for orphans, dead wikilinks, missing sections, contradictions, or index drift. Runs a deterministic structural check, layers LLM-judgement findings, optionally auto-fixes trivial issues with `--fix`. Do NOT trigger on general code linting.
Use when the user provides a source (URL, file path, or free text) to save into the project's memex — a long-term knowledge bank for AI agents. Stores the raw source, extracts entities into cross-linked pages, runs a backlink audit, and updates the index and activity log. Do NOT trigger on casual reads; only when the intent is to persist a source into the memex.
Writing a Development Enhancement Proposal (DEP) - a document for proposing technical improvements
Use when the user asks a question to be answered from the project's memex (long-term knowledge bank for AI agents) rather than training data or the live web. Reads the index, opens relevant pages, follows one wikilink hop, synthesises a cited answer, files it back. Do NOT trigger on source-code or live-session questions.
Use when a flowai benchmark fails and you need the cause from run artifacts before editing. Reads judge-evidence.md, the sandbox SKILL.md, and scenario mod.ts; classifies the failure against a known taxonomy; produces an evidence-grounded report (no fixes). Do NOT trigger for passing benchmarks or generic skill iteration.
Use when a flowai benchmark fails and you need the cause from run artifacts before editing. Reads judge-evidence.md, the sandbox SKILL.md, and scenario mod.ts; classifies the failure against a known taxonomy; produces an evidence-grounded report (no fixes). Do NOT trigger for passing benchmarks or generic skill iteration.
Use when the user asks to audit a memex (long-term knowledge bank for AI agents) for orphans, dead wikilinks, missing sections, contradictions, or index drift. Runs a deterministic structural check, layers LLM-judgement findings, optionally auto-fixes trivial issues with `--fix`. Do NOT trigger on general code linting.
Use when the user asks a question to be answered from the project's memex (long-term knowledge bank for AI agents) rather than training data or the live web. Reads the index, opens relevant pages, follows one wikilink hop, synthesises a cited answer, files it back. Do NOT trigger on source-code or live-session questions.
Writing a Development Enhancement Proposal (DEP) - a document for proposing technical improvements
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.
Run all framework benchmarks. Default is sequential (single `deno task bench`). Use `--parallel` for 5-subagent parallel mode. Triggers on "run all benchmarks", "bench all", "/bench-all".
Create, maintain, and run evidence-based benchmarks for AI agents. Use when setting up testing infrastructure, writing new test scenarios, or evaluating agent performance.
Streamlined commit workflow — fewer tool calls, targeted doc sync
Manage Django database migrations (makemigrations, migrate, squash).
Analyze previous IDE session history to find recurring patterns in agent behavior, identify systemic issues, and propose improvements to project primitives (rules, skills, hooks). Use when the user asks to review past sessions, find recurring problems, or improve development workflow based on historical data.
Iterative issue investigation with user-controlled hypothesis selection
Manage Django database migrations (makemigrations, migrate, squash).
Scaffold a complete AI agent application skeleton with LLM integration, tool calling, observability, cost tracking, session management, and content fetching using @korchasa/ai-skel-ts. Use when the user asks to create an AI agent, add LLM integration, scaffold an AI application, or mentions "ai-skel", "agent skeleton", or "AI scaffold". Works with any programming language.
Scaffold a complete AI agent application skeleton with LLM integration, tool calling, observability, cost tracking, session management, and content fetching using @korchasa/ai-skel-ts. Use when the user asks to create an AI agent, add LLM integration, scaffold an AI application, or mentions "ai-skel", "agent skeleton", or "AI scaffold". Works with any programming language.
Run all framework benchmarks. Default is sequential (single `deno task bench`). Use `--parallel` for 5-subagent parallel mode. Triggers on "run all benchmarks", "bench all", "/bench-all".
How to conduct a Q&A session with the user
How to conduct a Q&A session with the user
Experimental committed-tasks variant of flowai-plan. User-invoked. Writes tasks at documents/tasks/<YYYY>/<MM>/<slug>.md with extended frontmatter (date, status, implements, tags, related_tasks).
Configure and maintain Deno development commands (check, test, dev, prod). Use when the user wants to set up or update the standard command interface in deno.json and scripts/ directory.
Initialize project with AGENTS.md and rules, handling both Greenfield (new) and Brownfield (existing) projects.
Use when AGENTS.md is out of sync with the installed template (after `flowai sync` reports it changed, or user asks to re-adapt/realign AGENTS.md). Reads {ide}/assets/AGENTS.template.md, proposes a merge preserving project sections, writes on approval. Do NOT trigger on small edits.
Automated commit workflow with atomic grouping
Use when the user asks for a project health audit, maintenance sweep, or multi-category lead-engineer scan followed by interactive issue-by-issue resolution with user approval. Do NOT trigger on routine lint/test runs, single-file cleanups, or standard "check project" requests.
Use when the user asks to review CURRENT uncommitted changes (staged, unstaged, or branch diff) as QA + lead engineer before committing — verdict on task completion, code quality, architecture, cleanup. Requires an existing diff. Do NOT trigger on generic "is this code good?" questions without a diff, or on post-merge code review of historical commits.
Experimental committed-tasks variant of flowai-plan. User-invoked. Writes tasks at documents/tasks/<YYYY>/<MM>/<slug>.md with extended frontmatter (date, status, implements, tags, related_tasks).
Use when the user asks to reflect on the CURRENT session, analyze the agent's own process, audit context usage in this conversation, or extract lessons from how the current task was executed. Do NOT trigger for historical multi-session analysis — use flowai-reflect-by-history for that. Do NOT trigger on plain "what did you do?" style recaps.
Manage all Deno capabilities via CLI (run, test, fmt, lint, task, add, etc.). Use this skill when you need to execute Deno commands, manage dependencies, run tests, or handle local development workflows.
Guide for creating effective flowai commands. This skill should be used when users want to create a new command (or update an existing command) that extends flowai's capabilities with specialized knowledge, workflows, or tool integrations. Works across IDEs (Cursor, Claude Code, OpenCode).
Streamlined two-phase workflow: review, then commit. Verdict gate between phases. Self-contained — execute the inlined steps directly, do NOT invoke other skills via the Skill tool.
Adapt project-local flowai primitives (skills, agents, AGENTS.md artifacts, hooks) to project specifics. Standalone adaptation — independent of flowai-update.
Use ONLY when the user explicitly asks to plan a task, create a task file in documents/tasks/, produce a GODS-format breakdown, or prepare a critiqued implementation strategy before coding. Do NOT trigger on brief design discussion, ad-hoc code suggestions, or casual "how would you approach X?" questions.
Use when the user asks to review PAST sessions (not the current one), find recurring patterns across multiple sessions, or improve project primitives (rules, skills, hooks, docs) based on historical IDE transcripts. Do NOT trigger on current-session self-review — use flowai-reflect instead.
Use when AGENTS.md is out of sync with the installed template (after `flowai sync` reports it changed, or user asks to re-adapt/realign AGENTS.md). Reads {ide}/assets/AGENTS.template.md, proposes a merge preserving project sections, writes on approval. Do NOT trigger on small edits.
Use when the user asks to review CURRENT uncommitted changes (staged, unstaged, or branch diff) as QA + lead engineer before committing — verdict on task completion, code quality, architecture, cleanup. Requires an existing diff. Do NOT trigger on generic "is this code good?" questions without a diff, or on post-merge code review of historical commits.
Use ONLY when the user explicitly asks to plan a task, create a task file in documents/tasks/, produce a GODS-format breakdown, or prepare a critiqued implementation strategy before coding. Do NOT trigger on brief design discussion, ad-hoc code suggestions, or casual "how would you approach X?" questions.
Manage Deno Deploy cloud services using both `deno deploy` and `deployctl`. Use this skill for deploying projects, managing cloud environments (Build/Dev/Prod contexts), monitoring logs, and troubleshooting deployment issues (like private npm dependencies).
Manage Deno Deploy cloud services using both `deno deploy` and `deployctl`. Use this skill for deploying projects, managing cloud environments (Build/Dev/Prod contexts), monitoring logs, and troubleshooting deployment issues (like private npm dependencies).
Streamlined commit workflow — fewer tool calls, targeted doc sync
Self-contained — execute the inlined steps directly. Three-phase full-cycle workflow: write a committed plan, implement under TDD, then review and commit with a verdict gate. Do NOT invoke other skills via the Skill tool.
Streamlined two-phase workflow: review, then commit. Verdict gate between phases. Self-contained — execute the inlined steps directly, do NOT invoke other skills via the Skill tool.
Scaffold a complete AI agent application skeleton with LLM integration, tool calling, observability, cost tracking, session management, and content fetching using @korchasa/ai-skel-ts. Use when the user asks to create an AI agent, add LLM integration, scaffold an AI application, or mentions "ai-skel", "agent skeleton", or "AI scaffold". Works with any programming language.
Scaffold a complete AI agent application skeleton with LLM integration, tool calling, observability, cost tracking, session management, and content fetching using @korchasa/ai-skel-ts. Use when the user asks to create an AI agent, add LLM integration, scaffold an AI application, or mentions "ai-skel", "agent skeleton", or "AI scaffold". Works with any programming language.
Reconcile project-owned AGENTS.md, CLAUDE.md, and scaffolded artifacts with the current flowai framework templates.
Self-contained — execute the inlined steps directly. Three-phase full-cycle workflow: write a committed plan, implement under TDD, then review and commit with a verdict gate. Do NOT invoke other skills via the Skill tool.
Use when the user asks to plan a LARGE feature that spans multiple sessions or phases — produces documents/tasks/<YYYY>/<MM>/epic-<name>.md with dependency-ordered phases, atomic tasks, and per-phase status tracking. Use flowai-plan for single-session tasks. Do NOT trigger on generic planning, roadmap, or brainstorming requests.
Use when AGENTS.md is out of sync with the installed template (after `flowai sync` reports it changed, or user asks to re-adapt/realign AGENTS.md). Reads {ide}/assets/AGENTS.template.md, proposes a merge preserving project sections, writes on approval. Do NOT trigger on small edits.
Configure and maintain Deno development commands (check, test, dev, prod). Use when the user wants to set up or update the standard command interface in deno.json and scripts/ directory.
Set up .devcontainer for AI IDE development. Generates devcontainer.json and optional Dockerfile for project tech stack, AI CLI integration (Claude Code, OpenCode, flowai), skill mounting, and security hardening.
Use when the user asks to plan a LARGE feature that spans multiple sessions or phases — produces documents/tasks/<YYYY>/<MM>/epic-<name>.md with dependency-ordered phases, atomic tasks, and per-phase status tracking. Use flowai-plan for single-session tasks. Do NOT trigger on generic planning, roadmap, or brainstorming requests.
Set up .devcontainer for AI IDE development. Generates devcontainer.json and optional Dockerfile for project tech stack, AI CLI integration (Claude Code, OpenCode, flowai), skill mounting, and security hardening.
Use when the user asks to review PAST sessions (not the current one), find recurring patterns across multiple sessions, or improve project primitives (rules, skills, hooks, docs) based on historical IDE transcripts. Do NOT trigger on current-session self-review — use flowai-reflect instead.
Use when the user asks to reflect on the CURRENT session, analyze the agent's own process, audit context usage in this conversation, or extract lessons from how the current task was executed. Do NOT trigger for historical multi-session analysis — use flowai-reflect-by-history for that. Do NOT trigger on plain "what did you do?" style recaps.
Use when the user asks to diagnose a bug, find the root cause of a failing behavior, or run a controlled hypothesis-by-hypothesis investigation with experiments and evidence. Produces a diagnostic report with a recommended fix (but does not apply the fix). Do NOT trigger on "fix this bug" when the cause is obvious, or on simple error-message lookups.
Use when the user asks to diagnose a bug, find the root cause of a failing behavior, or run a controlled hypothesis-by-hypothesis investigation with experiments and evidence. Produces a diagnostic report with a recommended fix (but does not apply the fix). Do NOT trigger on "fix this bug" when the cause is obvious, or on simple error-message lookups.
Use when the user asks for a project health audit, maintenance sweep, or multi-category lead-engineer scan followed by interactive issue-by-issue resolution with user approval. Do NOT trigger on routine lint/test runs, single-file cleanups, or standard "check project" requests.
Guide for creating effective flowai commands. This skill should be used when users want to create a new command (or update an existing command) that extends flowai's capabilities with specialized knowledge, workflows, or tool integrations. Works across IDEs (Cursor, Claude Code, OpenCode).
Creation and configuration of event hooks/plugins to manage agent behavior, command filtering, auditing, and automation. Works across IDEs (Cursor, Claude Code, OpenCode). Use when you need to: (1) Create a new hook (e.g., for formatting or security checks), (2) Configure hooks/plugins, (3) Implement logic for blocking or modifying agent actions via scripts.
Draw and edit Mermaid diagrams in Markdown. Use when the user wants to visualize processes, flows, sequences, or asks for diagrams.
Manage all Deno capabilities via CLI (run, test, fmt, lint, task, add, etc.). Use this skill when you need to execute Deno commands, manage dependencies, run tests, or handle local development workflows.
Creation and configuration of event hooks/plugins to manage agent behavior, command filtering, auditing, and automation. Works across IDEs (Cursor, Claude Code, OpenCode). Use when you need to: (1) Create a new hook (e.g., for formatting or security checks), (2) Configure hooks/plugins, (3) Implement logic for blocking or modifying agent actions via scripts.
Use when the user asks for a project health audit, maintenance sweep, or multi-category lead-engineer scan followed by interactive issue-by-issue resolution with user approval. Do NOT trigger on routine lint/test runs, single-file cleanups, or standard "check project" requests.
Run prompts in Claude Code, OpenCode, Cursor, or Codex CLIs from the current session — pick one IDE, fan out across several, or compare models. You are a courier that relays the other runtime's stdout verbatim, do not synthesise your own answer. Use on "run in <ide>", "compare <ide> vs <ide>", "try on <model>", "which IDE handles X better", "run across models".
Create, maintain, and run evidence-based benchmarks for AI agents. Use when setting up testing infrastructure, writing new test scenarios, or evaluating agent performance.
Guide for creating effective Agent Skills (SKILL.md packages). Use when users want to create a new skill, write a skill, author a SKILL.md, or ask about skill structure, best practices, or skill file format. Works across IDEs (Cursor, Claude Code, OpenCode).
Two-phase "Lead Engineer" audit: full scan across 8 categories, then interactive issue-by-issue resolution with the user.
Record an architectural decision (chosen path + rejected alternatives + rationale) as a persistent MADR-style ADR in documents/adr/. Triggers — "record this decision", "write an ADR", "capture our rationale for picking X", "запиши ADR". Do NOT use for regular task planning (use flowai-skill-plan), already-decided trivial choices, or simple bug fixes.
Reconcile project-owned AGENTS.md, CLAUDE.md, and scaffolded artifacts with the current flowai framework templates.
Manage all Deno capabilities via CLI (run, test, fmt, lint, task, add, etc.). Use this skill when you need to execute Deno commands, manage dependencies, run tests, or handle local development workflows.
Creation and configuration of event hooks/plugins to manage agent behavior, command filtering, auditing, and automation. Works across IDEs (Cursor, Claude Code, OpenCode). Use when you need to: (1) Create a new hook (e.g., for formatting or security checks), (2) Configure hooks/plugins, (3) Implement logic for blocking or modifying agent actions via scripts.
Draw and edit Mermaid diagrams in Markdown. Use when the user wants to visualize processes, flows, sequences, or asks for diagrams.
Create critiqued plan in documents/tasks/ using GODS framework with proactive uncertainty resolution
Extract Claude Code system prompt from its compiled binary or JS bundle into a structured template file. Produces a document mirroring the real API request with exact prompt text, ant-only variants, feature-gated sections, and minified-variable mappings. Use when extracting, documenting, or comparing Claude Code prompt versions.
Guide for creating effective flowai commands. This skill should be used when users want to create a new command (or update an existing command) that extends flowai's capabilities with specialized knowledge, workflows, or tool integrations. Works across IDEs (Cursor, Claude Code, OpenCode).
Analyze agent's process, logic, technical decisions, context usage, and undocumented discoveries to find behavioral errors, poor engineering choices, inefficiencies, and missing knowledge in project instructions.
Reconcile project-owned AGENTS.md, CLAUDE.md, and scaffolded artifacts with the current flowai framework templates.
Guide for creating persistent AI rules (coding standards, project conventions, file-specific patterns). Use when users want to create a rule, add coding standards, set up project conventions, configure file-specific patterns, or ask about rules placement. Works across IDEs (Cursor, Claude Code, OpenCode).
How to conduct a Q&A session with the user. Canonical reference for FR-UNIVERSAL.QA-FORMAT.
Draw and edit Mermaid diagrams in Markdown. Use when the user wants to visualize processes, flows, sequences, or asks for diagrams.
Guide for writing stable, effective prompts for instant/fast models (Gemini Flash, GPT-4o Mini, Haiku), suitable for beginners.
How to write tasks using GODS framework
Writing a Development Enhancement Proposal (DEP) - a document for proposing technical improvements
Initialize project with AGENTS.md and rules, handling both Greenfield (new) and Brownfield (existing) projects.
Creates interactive HTML teaching materials with clickable state diagrams. Use when the user asks to produce an explorable tutorial artifact, not to answer a question.
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.
Guide for creating effective Agent Skills (SKILL.md packages). Use when users want to create a new skill, write a skill, author a SKILL.md, or ask about skill structure, best practices, or skill file format. Works across IDEs (Cursor, Claude Code, OpenCode).
Guide for creating effective Subagents (custom agent definitions). Use when users want to create a new subagent, set up task-specific agents, configure code reviewers, debuggers, or domain-specific assistants. Works across IDEs (Cursor, OpenCode).
Create, maintain, and run evidence-based benchmarks for AI agents. Use when setting up testing infrastructure, writing new test scenarios, or evaluating agent performance.
Delegate a task to another AI IDE's CLI (codex / claude / opencode / cursor-agent) through an isolated-context subagent. Triggers on "delegate to <ide>", "have <ide> do <task>", "execute <task> in <ide>", "offload to <ide>". For one-shot relay or fan-out comparison use `ai-ide-runner` instead.
Guidelines for writing comprehensive Product Requirements Documents (PRD)
Analyze total token usage including conversation history, system prompts, and active rules to estimate cost.
Use when the user asks to reflect on the CURRENT session, analyze the agent's own process, audit context usage in this conversation, or extract lessons from how the current task was executed. Do NOT trigger for historical multi-session analysis — use reflect-by-history for that. Do NOT trigger on plain "what did you do?" style recaps.
Guide for creating persistent AI rules (coding standards, project conventions, file-specific patterns). Use when users want to create a rule, add coding standards, set up project conventions, configure file-specific patterns, or ask about rules placement. Works across IDEs (Cursor, Claude Code, OpenCode).
Guide for creating effective Subagents (custom agent definitions). Use when users want to create a new subagent, set up task-specific agents, configure code reviewers, debuggers, or domain-specific assistants. Works across IDEs (Cursor, OpenCode).
Analyze total token usage including conversation history, system prompts, and active rules to estimate cost.
Multi-directional deep research with sub-agents, scoring, source verification, and cited synthesis. Use for comprehensive research, deep analysis, or evidence-backed answers. Produces executive summary + markdown report. Supports search tools, playwright-cli, and MCP.
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.
Build AI IDE plugin MCP elements: stdio JSON-RPC servers, tool schemas, host wiring for Claude Code/Codex, event-name mapping, and validation.
Writing a Development Enhancement Proposal (DEP) - a document for proposing technical improvements
Use when a flowai benchmark fails and you need the cause from run artifacts before editing. Reads judge-evidence.md, the sandbox SKILL.md, and scenario mod.ts; classifies the failure against a known taxonomy; produces an evidence-grounded report (no fixes). Do NOT trigger for passing benchmarks or generic skill iteration.
Design or create plugins for one or more AI IDEs, including official-doc lookup, skills, apps, MCP tools, hooks, packaging, assets, manifests, marketplaces, and per-IDE validation.
Use when the user asks to plan a LARGE feature that spans multiple sessions or phases — produces an epic under the `tasks` role from AGENTS.md with dependency-ordered phases, atomic tasks, and per-phase status tracking. Use plan for single-session tasks. Do NOT trigger on generic planning, roadmap, or brainstorming requests.
Build AI IDE plugin hook elements: shared policies plus Claude Code/Codex/OpenCode/Cursor adapters for lifecycle events, blocking, audit, and context injection.
Designs AI IDE plugin marketplaces: requirements, constraints, host-specific payloads, local dogfood, release sync, validation, and related skill routing.
Build AI IDE plugin MCP elements: stdio JSON-RPC servers, tool schemas, host wiring for Claude Code/Codex, event-name mapping, and validation.
How to write in informational style
Create and manage GitHub issues using the GODS framework. Use when creating, updating, or triaging GitHub issues.
Reconcile project-owned AGENTS.md, CLAUDE.md, and scaffolded artifacts with the current flowai framework templates.
Use when the user asks to plan a LARGE feature that spans multiple sessions or phases — produces an epic under the `tasks` role from AGENTS.md with dependency-ordered phases, atomic tasks, and per-phase status tracking. Use plan for single-session tasks. Do NOT trigger on generic planning, roadmap, or brainstorming requests.
Use when the user asks to reflect on the CURRENT session, analyze the agent's own process, audit context usage in this conversation, or extract lessons from how the current task was executed. Do NOT trigger for historical multi-session analysis — use reflect-by-history for that. Do NOT trigger on plain "what did you do?" style recaps.
Guide for creating effective Agent Skills (SKILL.md packages). Use when users want to create a new skill, write a skill, author a SKILL.md, or ask about skill structure, best practices, or skill file format. Works across IDEs (Cursor, Claude Code, OpenCode).
Create, maintain, and run evidence-based benchmarks for AI agents. Use when setting up testing infrastructure, writing new test scenarios, or evaluating agent performance.
Multi-directional deep research with sub-agents, scoring, source verification, and cited synthesis. Use for comprehensive research, deep analysis, or evidence-backed answers. Produces executive summary + markdown report. Supports search tools, playwright-cli, and MCP.
Use when a flowai benchmark fails and you need the cause from run artifacts before editing. Reads judge-evidence.md, the sandbox SKILL.md, and scenario mod.ts; classifies the failure against a known taxonomy; produces an evidence-grounded report (no fixes). Do NOT trigger for passing benchmarks or generic skill iteration.
How to conduct a Q&A session with the user. Canonical reference for FR-UNIVERSAL.QA-FORMAT.
Draw and edit Mermaid diagrams in Markdown. Use when the user wants to visualize processes, flows, sequences, or asks for diagrams.
How to write tasks using GODS framework
Design or create plugins for one or more AI IDEs, including official-doc lookup, skills, apps, MCP tools, hooks, packaging, assets, manifests, marketplaces, and per-IDE validation.
Build AI IDE plugin hook elements: shared policies plus Claude Code/Codex/OpenCode/Cursor adapters for lifecycle events, blocking, audit, and context injection.
Guide for writing stable, effective prompts for instant/fast models (Gemini Flash, GPT-4o Mini, Haiku), suitable for beginners.
Designs AI IDE plugin marketplaces: requirements, constraints, host-specific payloads, local dogfood, release sync, validation, and related skill routing.
Creates interactive HTML teaching materials with clickable state diagrams. Use when the user asks to produce an explorable tutorial artifact, not to answer a question.
Create and manage GitHub issues using the GODS framework. Use when creating, updating, or triaging GitHub issues.
How to write in informational style
Guidelines for writing comprehensive Product Requirements Documents (PRD)
Guide for writing prompts for reasoning/smart models (Gemini Pro, GPT-4o, Claude 3.5 Sonnet), focused on structure and context.
Run prompts in Claude Code, OpenCode, Cursor, or Codex CLIs from the current session — pick one IDE, fan out across several, or compare models. You are a courier that relays the other runtime's stdout verbatim, do not synthesise your own answer. Use on "run in <ide>", "compare <ide> vs <ide>", "try on <model>", "which IDE handles X better", "run across models".
Delegate a task to another AI IDE's CLI (codex / claude / opencode / cursor-agent) through an isolated-context subagent. Triggers on "delegate to <ide>", "have <ide> do <task>", "execute <task> in <ide>", "offload to <ide>". For one-shot relay or fan-out comparison use `ai-ide-runner` instead.
Guide for writing prompts for reasoning/smart models (Gemini Pro, GPT-4o, Claude 3.5 Sonnet), focused on structure and context.
How to fix tests
How to fix tests
Analyze total token usage including conversation history, system prompts, and active rules to estimate cost.
How to write tasks using GODS framework
Adds TypeScript/Deno code style rules to project AGENTS.md. Use when setting up new Deno projects or when user needs to establish code style guidelines.
Adds TypeScript strict mode code style rules to project AGENTS.md. Use when setting up TypeScript projects with strict mode or establishing coding standards.
Adds TypeScript strict mode code style rules to project AGENTS.md. Use when setting up TypeScript projects with strict mode or establishing coding standards.
Manage all Deno capabilities via CLI (run, test, fmt, lint, task, add, etc.). Use this skill when you need to execute Deno commands, manage dependencies, run tests, or handle local development workflows.
Streamlined two-phase workflow: review, then commit. Verdict gate between phases. Self-contained — execute the inlined steps directly, do NOT invoke other skills via the Skill tool.
Run prompts in Claude Code, OpenCode, Cursor, or Codex CLIs from the current session — pick one IDE, fan out across several, or compare models. You are a courier that relays the other runtime's stdout verbatim, do not synthesise your own answer. Use on "run in <ide>", "compare <ide> vs <ide>", "try on <model>", "which IDE handles X better", "run across models".
Automated commit workflow with atomic grouping
Set up .devcontainer for AI IDE development. Generates devcontainer.json and optional Dockerfile for project tech stack, AI CLI integration (Claude Code, OpenCode, flowai), skill mounting, and security hardening.
How to conduct a Q&A session with the user. Canonical reference for FR-UNIVERSAL.QA-FORMAT.
Create and manage GitHub issues using the GODS framework. Use when creating, updating, or triaging GitHub issues.
Use when the user asks to add Deno/TypeScript code-style rules to AGENTS.md, or during initial Deno project setup when code-style guidelines need to be established. Do NOT trigger for non-Deno TypeScript projects (use flowai-setup-agent-code-style-ts-strict), or for runtime-agnostic style advice.
Use when the user asks to add Deno/TypeScript code-style rules to AGENTS.md, or during initial Deno project setup when code-style guidelines need to be established. Do NOT trigger for non-Deno TypeScript projects (use flowai-setup-agent-code-style-ts-strict), or for runtime-agnostic style advice.
Adapt project-local flowai primitives (skills, agents, AGENTS.md artifacts, hooks) to project specifics. Standalone adaptation — independent of flowai-update.
Delegate a task to another AI IDE's CLI (codex / claude / opencode / cursor-agent) through an isolated-context subagent. Triggers on "delegate to <ide>", "have <ide> do <task>", "execute <task> in <ide>", "offload to <ide>". For one-shot relay or fan-out comparison use `flowai-ai-ide-runner` instead.
Create, maintain, and run evidence-based benchmarks for AI agents. Use when setting up testing infrastructure, writing new test scenarios, or evaluating agent performance.
Create, maintain, and run evidence-based benchmarks for AI agents. Use when setting up testing infrastructure, writing new test scenarios, or evaluating agent performance.
Use when the user asks to add TypeScript strict-mode code-style rules to AGENTS.md for a TypeScript project using strict mode. Do NOT trigger for Deno projects (use flowai-setup-agent-code-style-ts-deno) or non-strict TS configurations.
Delegate a task to another AI IDE's CLI (codex / claude / opencode / cursor-agent) through an isolated-context subagent. Triggers on "delegate to <ide>", "have <ide> do <task>", "execute <task> in <ide>", "offload to <ide>". For one-shot relay or fan-out comparison use `flowai-ai-ide-runner` instead.
Use when the user asks to add TypeScript strict-mode code-style rules to AGENTS.md for a TypeScript project using strict mode. Do NOT trigger for Deno projects (use flowai-setup-agent-code-style-ts-deno) or non-strict TS configurations.
Run prompts in Claude Code, OpenCode, Cursor, or Codex CLIs from the current session — pick one IDE, fan out across several, or compare models. You are a courier that relays the other runtime's stdout verbatim, do not synthesise your own answer. Use on "run in <ide>", "compare <ide> vs <ide>", "try on <model>", "which IDE handles X better", "run across models".
Run prompts in Claude Code, OpenCode, Cursor, or Codex CLIs from the current session — pick one IDE, fan out across several, or compare models. You are a courier that relays the other runtime's stdout verbatim, do not synthesise your own answer. Use on "run in <ide>", "compare <ide> vs <ide>", "try on <model>", "which IDE handles X better", "run across models".
Streamlined two-phase workflow: review, then commit. Verdict gate between phases. Self-contained — execute the inlined steps directly, do NOT invoke other skills via the Skill tool.
Use when the user asks to review CURRENT uncommitted changes (staged, unstaged, or branch diff) as QA + lead engineer before committing — verdict on task completion, code quality, architecture, cleanup. Requires an existing diff. Do NOT trigger on generic "is this code good?" questions without a diff, or on post-merge code review of historical commits.
Initialize project with AGENTS.md and rules, handling both Greenfield (new) and Brownfield (existing) projects.
Use when the user asks to audit a memex (long-term knowledge bank for AI agents) for orphans, dead SALP REFs, missing sections, contradictions, or index drift. Runs a deterministic structural check, layers LLM-judgement findings, optionally auto-fixes trivial issues with `--fix`. Do NOT trigger on general code linting.
Use when the user asks to add Deno/TypeScript code-style rules to AGENTS.md, or during initial Deno project setup when code-style guidelines need to be established. Do NOT trigger for non-Deno TypeScript projects (use setup-agent-code-style-strict), or for runtime-agnostic style advice.
Configure and maintain Deno development commands (check, test, dev, prod). Use when the user wants to set up or update the standard command interface in deno.json and scripts/ directory.
Use when the user asks to reflect on the CURRENT session, analyze the agent's own process, audit context usage in this conversation, or extract lessons from how the current task was executed. Do NOT trigger for historical multi-session analysis — use flowai-skill-reflect-by-history for that. Do NOT trigger on plain "what did you do?" style recaps.
Use when the user asks for a project health audit, maintenance sweep, or multi-category lead-engineer scan followed by interactive issue-by-issue resolution with user approval. Do NOT trigger on routine lint/test runs, single-file cleanups, or standard "check project" requests.
Guide for creating effective Subagents (custom agent definitions). Use when users want to create a new subagent, set up task-specific agents, configure code reviewers, debuggers, or domain-specific assistants. Works across IDEs (Cursor, OpenCode).
Guide for creating effective Subagents (custom agent definitions). Use when users want to create a new subagent, set up task-specific agents, configure code reviewers, debuggers, or domain-specific assistants. Works across IDEs (Cursor, OpenCode).
Configure and maintain Deno development commands (check, test, dev, prod). Use when the user wants to set up or update the standard command interface in deno.json and scripts/ directory.
Use when the user asks to audit a memex (long-term knowledge bank for AI agents) for orphans, dead SALP REFs, missing sections, contradictions, or index drift. Runs a deterministic structural check, layers LLM-judgement findings, optionally auto-fixes trivial issues with `--fix`. Do NOT trigger on general code linting.
Use when the user asks to add TypeScript strict-mode code-style rules to AGENTS.md for a TypeScript project using strict mode. Do NOT trigger for Deno projects (use setup-agent-code-style-deno) or non-strict TS configurations.
Set up .devcontainer for AI IDE development. Generates devcontainer.json and optional Dockerfile for project tech stack, AI CLI integration (Claude Code, OpenCode, flowai), skill mounting, and security hardening.
Use when the user provides a source (URL, file path, or free text) to save into the project's memex — a long-term knowledge bank for AI agents. Stores the raw source, extracts entities into cross-linked pages, runs a backlink audit, and updates the index and activity log. Do NOT trigger on casual reads; only when the intent is to persist a source into the memex.
Use when the user asks a question to be answered from the project's memex (long-term knowledge bank for AI agents) rather than training data or the live web. Reads the index, opens relevant pages, follows one SALP-REF hop, synthesises a cited answer, files it back. Do NOT trigger on source-code or live-session questions.
Set up .devcontainer for AI IDE development. Generates devcontainer.json and optional Dockerfile for project tech stack, AI CLI integration (Claude Code, OpenCode, flowai), skill mounting, and security hardening.
Use when the user provides a source (URL, file path, or free text) to save into the project's memex — a long-term knowledge bank for AI agents. Stores the raw source, extracts entities into cross-linked pages, runs a backlink audit, and updates the index and activity log. Do NOT trigger on casual reads; only when the intent is to persist a source into the memex.
Use when the user asks to add Deno/TypeScript code-style rules to AGENTS.md, or during initial Deno project setup when code-style guidelines need to be established. Do NOT trigger for non-Deno TypeScript projects (use setup-agent-code-style-strict), or for runtime-agnostic style advice.
Use when the user asks a question to be answered from the project's memex (long-term knowledge bank for AI agents) rather than training data or the live web. Reads the index, opens relevant pages, follows one SALP-REF hop, synthesises a cited answer, files it back. Do NOT trigger on source-code or live-session questions.
Use when the user asks to add TypeScript strict-mode code-style rules to AGENTS.md for a TypeScript project using strict mode. Do NOT trigger for Deno projects (use setup-agent-code-style-deno) or non-strict TS configurations.
Use when the user asks to review PAST sessions (not the current one), find recurring patterns across multiple sessions, or improve project primitives (rules, skills, hooks, docs) based on historical IDE transcripts. Do NOT trigger on current-session self-review — use flowai-skill-reflect instead.
Use when the user asks to review CURRENT uncommitted changes (staged, unstaged, or branch diff) as QA + lead engineer before committing — verdict on task completion, code quality, architecture, cleanup. Requires an existing diff. Do NOT trigger on generic "is this code good?" questions without a diff, or on post-merge code review of historical commits.
Use when the user asks to diagnose a bug, find the root cause of a failing behavior, or run a controlled hypothesis-by-hypothesis investigation with experiments and evidence. Produces a diagnostic report with a recommended fix (but does not apply the fix). Do NOT trigger on "fix this bug" when the cause is obvious, or on simple error-message lookups.
Manage Deno Deploy cloud services using both `deno deploy` and `deployctl`. Use this skill for deploying projects, managing cloud environments (Build/Dev/Prod contexts), monitoring logs, and troubleshooting deployment issues (like private npm dependencies).
Use when the user asks to review PAST sessions (not the current one), find recurring patterns across multiple sessions, or improve project primitives (rules, skills, hooks, docs) based on historical IDE transcripts. Do NOT trigger on current-session self-review — use flowai-skill-reflect instead.
Manage all Deno capabilities via CLI (run, test, fmt, lint, task, add, etc.). Use this skill when you need to execute Deno commands, manage dependencies, run tests, or handle local development workflows.
Guide for creating persistent AI rules (coding standards, project conventions, file-specific patterns). Use when users want to create a rule, add coding standards, set up project conventions, configure file-specific patterns, or ask about rules placement. Works across IDEs (Cursor, Claude Code, OpenCode).
Guide for creating persistent AI rules (coding standards, project conventions, file-specific patterns). Use when users want to create a rule, add coding standards, set up project conventions, configure file-specific patterns, or ask about rules placement. Works across IDEs (Cursor, Claude Code, OpenCode).
Guide for creating effective flowai commands. This skill should be used when users want to create a new command (or update an existing command) that extends flowai's capabilities with specialized knowledge, workflows, or tool integrations. Works across IDEs (Cursor, Claude Code, OpenCode).
Use when the user asks to plan a LARGE feature that spans multiple sessions or phases — produces documents/tasks/<YYYY>/<MM>/epic-<name>.md with dependency-ordered phases, atomic tasks, and per-phase status tracking. Use flowai-skill-plan for single-session tasks. Do NOT trigger on generic planning, roadmap, or brainstorming requests.
Guide for creating effective flowai commands. This skill should be used when users want to create a new command (or update an existing command) that extends flowai's capabilities with specialized knowledge, workflows, or tool integrations. Works across IDEs (Cursor, Claude Code, OpenCode).
Use when the user asks to diagnose a bug, find the root cause of a failing behavior, or run a controlled hypothesis-by-hypothesis investigation with experiments and evidence. Produces a diagnostic report with a recommended fix (but does not apply the fix). Do NOT trigger on "fix this bug" when the cause is obvious, or on simple error-message lookups.
Use when the user asks to plan a LARGE feature that spans multiple sessions or phases — produces documents/tasks/<YYYY>/<MM>/epic-<name>.md with dependency-ordered phases, atomic tasks, and per-phase status tracking. Use flowai-skill-plan for single-session tasks. Do NOT trigger on generic planning, roadmap, or brainstorming requests.
Run prompts in Claude Code, OpenCode, Cursor, or Codex CLIs from the current session — pick one IDE, fan out across several, or compare models. You are a courier that relays the other runtime's stdout verbatim, do not synthesise your own answer. Use on "run in <ide>", "compare <ide> vs <ide>", "try on <model>", "which IDE handles X better", "run across models".
Use ONLY when the user explicitly asks to plan a task, create a task file in documents/tasks/, produce a GODS-format breakdown, or prepare a critiqued implementation strategy before coding. Do NOT trigger on brief design discussion, ad-hoc code suggestions, or casual "how would you approach X?" questions.
Manage Deno Deploy cloud services using both `deno deploy` and `deployctl`. Use this skill for deploying projects, managing cloud environments (Build/Dev/Prod contexts), monitoring logs, and troubleshooting deployment issues (like private npm dependencies).
Use when the user asks to reflect on the CURRENT session, analyze the agent's own process, audit context usage in this conversation, or extract lessons from how the current task was executed. Do NOT trigger for historical multi-session analysis — use flowai-skill-reflect-by-history for that. Do NOT trigger on plain "what did you do?" style recaps.
Set up .devcontainer for AI IDE development. Generates devcontainer.json and optional Dockerfile for project tech stack, AI CLI integration (Claude Code, OpenCode, flowai), skill mounting, and security hardening.
Guide for creating effective Agent Skills (SKILL.md packages). Use when users want to create a new skill, write a skill, author a SKILL.md, or ask about skill structure, best practices, or skill file format. Works across IDEs (Cursor, Claude Code, OpenCode).
Analyze total token usage including conversation history, system prompts, and active rules to estimate cost.
Creates interactive HTML teaching materials with clickable state diagrams. Use when the user asks to produce an explorable tutorial artifact, not to answer a question.
How to fix tests
Multi-directional deep research with sub-agents, scoring, source verification, and cited synthesis. Use for comprehensive research, deep analysis, or evidence-backed answers. Produces executive summary + markdown report. Supports search tools, playwright-cli, and MCP.
Guide for writing stable, effective prompts for instant/fast models (Gemini Flash, GPT-4o Mini, Haiku), suitable for beginners.
Guide for writing prompts for reasoning/smart models (Gemini Pro, GPT-4o, Claude 3.5 Sonnet), focused on structure and context.
Writing a Development Enhancement Proposal (DEP) - a document for proposing technical improvements
Use when a flowai benchmark fails and you need the cause from run artifacts before editing. Reads judge-evidence.md, the sandbox SKILL.md, and scenario mod.ts; classifies the failure against a known taxonomy; produces an evidence-grounded report (no fixes). Do NOT trigger for passing benchmarks or generic skill iteration.
Writing a Development Enhancement Proposal (DEP) - a document for proposing technical improvements
Guidelines for writing comprehensive Product Requirements Documents (PRD)
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.
Creates interactive HTML teaching materials with clickable state diagrams. Use when the user asks to produce an explorable tutorial artifact, not to answer a question.
How to fix tests
Create structured epic for large features using phased decomposition. Produces documents/tasks/epic-{name}.md with dependency-ordered phases, atomic tasks, explicit boundaries, and per-phase status tracking.
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.
Multi-directional deep research with sub-agents, scoring, source verification, and cited synthesis. Use for comprehensive research, deep analysis, or evidence-backed answers. Produces executive summary + markdown report. Supports search tools, playwright-cli, and MCP.
Use when the user asks for a JIT review, wants to catch hidden regressions in a staged/unstaged diff or commit range, or says "check my changes for hidden bugs". Synthesizes ephemeral Catching JiTTests (pass on parent, fail on diff). Not for fixing known failing tests (see flowai-skill-fix-tests) or general code review (flowai-skill-review).
Guide for writing stable, effective prompts for instant/fast models (Gemini Flash, GPT-4o Mini, Haiku), suitable for beginners.
Guide for writing prompts for reasoning/smart models (Gemini Pro, GPT-4o, Claude 3.5 Sonnet), focused on structure and context.
Use when the user asks for a JIT review, wants to catch hidden regressions in a staged/unstaged diff or commit range, or says "check my changes for hidden bugs". Synthesizes ephemeral Catching JiTTests (pass on parent, fail on diff). Not for fixing known failing tests (see flowai-skill-fix-tests) or general code review (flowai-skill-review).
How to write in informational style
Guidelines for writing comprehensive Product Requirements Documents (PRD)
How to write in informational style
Use when the user asks to add Deno/TypeScript code-style rules to AGENTS.md, or during initial Deno project setup when code-style guidelines need to be established. Do NOT trigger for non-Deno TypeScript projects (use flowai-skill-setup-agent-code-style-ts-strict), or for runtime-agnostic style advice.
Use when the user asks to add Deno/TypeScript code-style rules to AGENTS.md, or during initial Deno project setup when code-style guidelines need to be established. Do NOT trigger for non-Deno TypeScript projects (use flowai-skill-setup-agent-code-style-ts-strict), or for runtime-agnostic style advice.
Use when a flowai benchmark fails and you need the cause from run artifacts before editing. Reads judge-evidence.md, the sandbox SKILL.md, and scenario mod.ts; classifies the failure against a known taxonomy; produces an evidence-grounded report (no fixes). Do NOT trigger for passing benchmarks or generic skill iteration.
Use when the user asks to audit a memex (long-term knowledge bank for AI agents) for orphans, dead wikilinks, missing sections, contradictions, or index drift. Runs a deterministic structural check, layers LLM-judgement findings, optionally auto-fixes trivial issues with `--fix`. Do NOT trigger on general code linting.
Use when the user provides a source (URL, file path, or free text) to save into the project's memex — a long-term knowledge bank for AI agents. Stores the raw source, extracts entities into cross-linked pages, runs a backlink audit, and updates the index and activity log. Do NOT trigger on casual reads; only when the intent is to persist a source into the memex.
Use when the user asks to add TypeScript strict-mode code-style rules to AGENTS.md for a TypeScript project using strict mode. Do NOT trigger for Deno projects (use flowai-skill-setup-agent-code-style-ts-deno) or non-strict TS configurations.
How to write tasks using GODS framework
Create and manage GitHub issues using the GODS framework. Use when creating, updating, or triaging GitHub issues.
Use when the user provides a source (URL, file path, or free text) to save into the project's memex — a long-term knowledge bank for AI agents. Stores the raw source, extracts entities into cross-linked pages, runs a backlink audit, and updates the index and activity log. Do NOT trigger on casual reads; only when the intent is to persist a source into the memex.
Use when the user asks to add TypeScript strict-mode code-style rules to AGENTS.md for a TypeScript project using strict mode. Do NOT trigger for Deno projects (use flowai-skill-setup-agent-code-style-ts-deno) or non-strict TS configurations.
Use when the user asks a question to be answered from the project's memex (long-term knowledge bank for AI agents) rather than training data or the live web. Reads the index, opens relevant pages, follows one wikilink hop, synthesises a cited answer, files it back. Do NOT trigger on source-code or live-session questions.
Use when the user asks to audit a memex (long-term knowledge bank for AI agents) for orphans, dead wikilinks, missing sections, contradictions, or index drift. Runs a deterministic structural check, layers LLM-judgement findings, optionally auto-fixes trivial issues with `--fix`. Do NOT trigger on general code linting.
Initialize project with AGENTS.md and rules, handling both Greenfield (new) and Brownfield (existing) projects.
Use ONLY when the user explicitly asks to plan a task, create a task file in documents/tasks/, produce a GODS-format breakdown, or prepare a critiqued implementation strategy before coding. Do NOT trigger on brief design discussion, ad-hoc code suggestions, or casual "how would you approach X?" questions.
Record an architectural decision (chosen path + rejected alternatives + rationale) as a persistent MADR-style ADR in documents/adr/. Triggers — "record this decision", "write an ADR", "capture our rationale for picking X", "запиши ADR". Do NOT use for regular task planning (use flowai-skill-plan), already-decided trivial choices, or simple bug fixes.
How to conduct a Q&A session with the user. Canonical reference for FR-UNIVERSAL.QA-FORMAT.
Create, maintain, and run evidence-based benchmarks for AI agents. Use when setting up testing infrastructure, writing new test scenarios, or evaluating agent performance.
Create structured epic for large features using phased decomposition. Produces documents/tasks/epic-{name}.md with dependency-ordered phases, atomic tasks, explicit boundaries, and per-phase status tracking.
Iterative issue investigation with user-controlled hypothesis selection
Create critiqued plan in documents/tasks/ using GODS framework with proactive uncertainty resolution
Analyze previous IDE session history to find recurring patterns in agent behavior, identify systemic issues, and propose improvements to project primitives (rules, skills, hooks). Use when the user asks to review past sessions, find recurring problems, or improve development workflow based on historical data.
Review current changes as QA engineer and lead engineer simultaneously: verify task completion, code quality, architecture, and cleanup.
Review current changes as QA engineer and lead engineer simultaneously: verify task completion, code quality, architecture, and cleanup.
Analyze and answer user questions in Autonomous Mode by reading documentation and inspecting codebase
Analyze and answer user questions in Autonomous Mode by reading documentation and inspecting codebase
Analyze agent's process, logic, technical decisions, context usage, and undocumented discoveries to find behavioral errors, poor engineering choices, inefficiencies, and missing knowledge in project instructions.
Use when AGENTS.md is out of sync with the installed template (after `flowai sync` reports it changed, or user asks to re-adapt/realign AGENTS.md). Reads {ide}/assets/AGENTS.template.md, proposes a merge preserving project sections, writes on approval. Do NOT trigger on small edits.
How to conduct a Q&A session with the user. Canonical reference for FR-UNIVERSAL.QA-FORMAT.
Use when the user asks a question to be answered from the project's memex (long-term knowledge bank for AI agents) rather than training data or the live web. Reads the index, opens relevant pages, follows one wikilink hop, synthesises a cited answer, files it back. Do NOT trigger on source-code or live-session questions.
Re-adapt AGENTS.md when the installed template changed. Reads {ide}/assets/AGENTS.template.md, diffs it against the project AGENTS.md, proposes a merge preserving project-specific sections, asks for confirmation, writes on approval. Use after `flowai sync` reports AGENTS.template.md changed.
Re-adapt AGENTS.md when the installed template changed. Reads {ide}/assets/AGENTS.template.md, diffs it against the project AGENTS.md, proposes a merge preserving project-specific sections, asks for confirmation, writes on approval. Use after `flowai sync` reports AGENTS.template.md changed.
Use when the user asks for a project health audit, maintenance sweep, or multi-category lead-engineer scan followed by interactive issue-by-issue resolution with user approval. Do NOT trigger on routine lint/test runs, single-file cleanups, or standard "check project" requests.
Use when the user asks for a project health audit, maintenance sweep, or multi-category lead-engineer scan followed by interactive issue-by-issue resolution with user approval. Do NOT trigger on routine lint/test runs, single-file cleanups, or standard "check project" requests.
Adapt flowai-workflow agents to the current project after a framework update. Merges upstream agent changes with project-specific customizations.
Analyze and answer user questions in Autonomous Mode by reading documentation and inspecting codebase
Iterative issue investigation with user-controlled hypothesis selection
Perform a comprehensive "Lead Engineer" audit: structure, consistency, code quality, technical debt, documentation coverage, and terminology checks.
Adds TypeScript strict mode code style rules to project AGENTS.md. Use when setting up TypeScript projects with strict mode or establishing coding standards.
Scaffold a complete AI agent application skeleton with LLM integration, tool calling, observability, cost tracking, session management, and content fetching. Use when the user asks to create an AI agent, add LLM integration, scaffold an AI application, or mentions "ai-skel", "agent skeleton", or "AI scaffold". Works with any programming language.
Understand and interact with the cursor-agent CLI, including its output formats (JSON, text, stream-json), session management (resume), and integration with benchmarks. Use when working on cursor-agent CLI, benchmark runners, or automated agent interactions.
Multi-directional deep research with sub-agents, scoring, source verification, and cited synthesis. Use for comprehensive research, deep analysis, or evidence-backed answers. Produces executive summary + markdown report. Supports search tools, playwright-cli, and MCP.
Manage Deno Deploy cloud services using both `deno deploy` and `deployctl`. Use this skill for deploying projects, managing cloud environments (Build/Dev/Prod contexts), monitoring logs, and troubleshooting deployment issues (like private npm dependencies).
Guide for creating effective AssistFlow commands. This skill should be used when users want to create a new command (or update an existing command) that extends AssistFlow's capabilities with specialized knowledge, workflows, or tool integrations. Works across IDEs (Cursor, Claude Code, OpenCode).
Guide for creating persistent AI rules (coding standards, project conventions, file-specific patterns). Use when users want to create a rule, add coding standards, set up project conventions, configure file-specific patterns, or ask about rules placement. Works across IDEs (Cursor, Claude Code, OpenCode).
Automates browser interactions for web testing, form filling, screenshots, and data extraction using playwright-cli. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.
Writing a Development Enhancement Proposal (DEP) - a document for proposing technical improvements
How to write in informational style
Guidelines for writing comprehensive Product Requirements Documents (PRD)
Adapt flowai-workflow agents to the current project after a framework update. Merges upstream agent changes with project-specific customizations.
Guide for creating effective Subagents (custom agent definitions). Use when users want to create a new subagent, set up task-specific agents, configure code reviewers, debuggers, or domain-specific assistants. Works across IDEs (Cursor, OpenCode).
Guide for writing prompts for reasoning/smart models (Gemini Pro, GPT-4o, Claude 3.5 Sonnet), focused on structure and context.
Copy a bundled flowai-workflow into the user's project so they can adapt and run it locally. Use when the user wants to start using flowai-workflow in a new repo.
Execute a bundled or project-local flowai-workflow DAG. Use to launch a workflow run from inside the host IDE.
Supervisor for one flowai-workflow run (Codex). Starts or resumes a single workflow, diagnoses failures from run artifacts, patches root causes, resumes the same run. Run inside an isolated Codex worker subagent spawned by the `supervise`/`orchestrate` skill.
Create, maintain, and run evidence-based benchmarks for AI agents. Use when setting up testing infrastructure, writing new test scenarios, or evaluating agent performance.
Creation and configuration of event hooks/plugins to manage agent behavior, command filtering, auditing, and automation. Works across IDEs (Cursor, Claude Code, OpenCode). Use when you need to: (1) Create a new hook (e.g., for formatting or security checks), (2) Configure hooks/plugins, (3) Implement logic for blocking or modifying agent actions via scripts.
Run the flowai-workflow orchestration policy loop. Delegates policy execution to the workflow orchestrator agent, which selects workflows from ORCHESTRATION.md and supervisor summaries.
Live flowai-workflow run supervisor. Use only for requests to run, monitor, continue, or resume a current workflow now. Exclude last-night, finished, post-mortem, root-cause-report, and conceptual questions.
Copy a bundled flowai-workflow into the user's project so they can adapt and run it locally. Use when the user wants to start using flowai-workflow in a new repo.
Scaffold or adapt flowai-workflow DAGs in a project. Use for adding flowai-workflow, adapting an existing .flowai-workflow/<name>, validating workflow.yaml, or wiring agent prompts/scripts.
Initialize a new flowai-workflow project. Analyzes the codebase to detect project settings, asks the user to confirm or adjust, then scaffolds the .flowai-workflow/ directory.
Execute a bundled or project-local flowai-workflow DAG. Use to launch a workflow run from inside Claude Code without installing the CLI separately.
Create GitHub issue as a requirement ticket: problem + requirements only, no implementation details. Use when user asks to create a ticket, issue, or task for a feature/fix/change.
Analyze agent's process, logic, technical decisions, context usage, and undocumented discoveries to find behavioral errors, poor engineering choices, inefficiencies, and missing knowledge in project instructions.
Initialize project with AGENTS.md and rules, handling both Greenfield (new) and Brownfield (existing) projects.
Create critiqued plan in whiteboard.md using GODS framework with proactive uncertainty resolution
How to conduct a Q&A session with the user
Analyze total token usage including conversation history, system prompts, and active rules to estimate cost.
Draw and edit Mermaid diagrams in Markdown. Use when the user wants to visualize processes, flows, sequences, or asks for diagrams.
Guide for writing stable, effective prompts for instant/fast models (Gemini Flash, GPT-4o Mini, Haiku), suitable for beginners.
Adds TypeScript/Deno code style rules to project AGENTS.md. Use when setting up new Deno projects or when user needs to establish code style guidelines.
Set up .devcontainer for AI IDE development. Generates devcontainer.json and optional Dockerfile for project tech stack, AI CLI integration (Claude Code, OpenCode), secrets, skill mounting, and security hardening. Use for AI-assisted devcontainer setup or flow-init delegation.
How to write tasks using GODS framework
Create structured specification for large features using phased decomposition. Produces documents/spec-{name}.md with dependency-ordered phases, atomic tasks, explicit boundaries, and per-phase status tracking.
Review current changes as QA engineer and lead engineer simultaneously: verify task completion, code quality, architecture, and cleanup.
Initialize a new flowai-workflow project. Analyzes the codebase to detect project settings, asks the user to confirm or adjust, then scaffolds the .flowai-workflow/ directory.
Automated commit workflow with atomic grouping
Initialize a new flowai-workflow project. Analyzes the codebase to detect project settings, asks the user to confirm or adjust, then scaffolds the .flowai-workflow/ directory.
Adapt flowai-workflow agents to the current project after a framework update. Merges upstream agent changes with project-specific customizations.
Run the flowai-workflow orchestration policy loop. Delegates policy execution to the workflow orchestrator agent, which selects workflows from ORCHESTRATION.md and supervisor summaries.
Scaffold or adapt flowai-workflow DAGs in a project. Use for adding flowai-workflow, adapting an existing .flowai-workflow/<name>, validating workflow.yaml, or wiring agent prompts/scripts.
Live flowai-workflow run supervisor. Use only for requests to run, monitor, continue, or resume a current workflow now. Exclude last-night, finished, post-mortem, root-cause-report, and conceptual questions.
How to fix tests
Composite command: review changes then commit only if approved. Inlines flow-review and flow-commit with a verdict gate between them.
How to manage GitHub tickets via MCP tools using the GODS framework. Use when creating, updating, or triaging GitHub issues.
Guide for creating effective Agent Skills (SKILL.md packages). Use when users want to create a new skill, write a skill, author a SKILL.md, or ask about skill structure, best practices, or skill file format. Works across IDEs (Cursor, Claude Code, OpenCode).
Configure and maintain Deno development commands (check, test, dev, prod). Use when the user wants to set up or update the standard command interface in deno.json and scripts/ directory.
Scaffold a complete AI agent application skeleton with LLM integration, tool calling, observability, cost tracking, session management, and content fetching using @korchasa/ai-skel-ts. Use when the user asks to create an AI agent, add LLM integration, scaffold an AI application, or mentions "ai-skel", "agent skeleton", or "AI scaffold". Works with any programming language.
Long-cycle flowai-workflow orchestrator (Codex). Reads project orchestration policy, selects the next workflow, and returns a structured supervisor delegation request. Run inside an isolated Codex worker subagent spawned by the `orchestrate` skill.
Manage all Deno capabilities via CLI (run, test, fmt, lint, task, add, etc.). Use this skill when you need to execute Deno commands, manage dependencies, run tests, or handle local development workflows.