Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

sharkitect-solutions/ai-agents-architect

Name: ai-agents-architect
Author: sharkitect-solutions

skills/ai-agents-architect/SKILL.md

npx skillsauth add sharkitect-solutions/sharkitect-claude-toolkit ai-agents-architect

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

AI Agent Architecture

Think like an architect who has shipped agents to production and learned that most agent failures are architecture failures — the wrong pattern for the problem, too many tools, no escape hatches. The hardest decision is usually "should this be an agent at all?"

The Agent Tax — Why Most Tasks Don't Need Agents

Every agent adds cost you must justify:

| Tax | What It Costs | Typical Impact | |-----|--------------|----------------| | Latency | Each reasoning step = 1-5s LLM call | 5-step task = 5-25s minimum | | Token cost | Reasoning + tool descriptions + history per step | 3-10x vs single LLM call | | Unpredictability | Non-deterministic paths through tools | Same input, different results | | Debuggability | Multi-step traces hard to reproduce | 10x debugging time | | Failure surface | Each step can fail, hallucinate, or loop | Compound failure rates |

Should this be an agent?
│
├─ Is the task STATIC (same steps every time)?
│  └─ YES → Use a deterministic pipeline. No agent needed.
│     (ETL, format conversion, template filling)
│
├─ Does it need CONDITIONAL logic but predictable branches?
│  └─ YES → Use a chain/router. Still no agent.
│     (Classify → route to handler, if/else workflows)
│
├─ Does it need to DISCOVER what to do based on results?
│  └─ YES → This is an agent use case.
│     (Research tasks, debugging, multi-step problem solving)
│
└─ Does it need to ADAPT its plan mid-execution?
   └─ YES → This is a strong agent use case.
      (Complex reasoning, open-ended exploration)

The brutal truth: 70% of "agent" projects in production are pipelines with an LLM call in the middle. They don't need ReAct loops, tool registries, or memory systems. They need a well-written prompt and a json.loads().

Architecture Pattern Selection

| Pattern | Best For | Avoid When | Typical Steps | |---------|----------|------------|---------------| | ReAct | Exploratory tasks, tool-heavy work | Deterministic sequences, >10 steps | 3-8 | | Plan-Execute | Complex multi-step tasks with clear subgoals | Simple tasks, rapidly changing context | 5-20 | | Routing | Classification → specialized handler | Tasks needing iteration or discovery | 1-2 | | Multi-Agent | Distinct roles with different tool sets | When single agent with role-switching works | Varies | | OODA | Real-time reactive systems, monitoring | Batch processing, one-shot tasks | Continuous |

When ReAct Breaks Down

ReAct (Reason-Act-Observe) is the default pattern but has specific failure modes:

Long tasks (>8 steps): Context window fills with observation history. The agent "forgets" early steps. Solution: summarize observations, don't accumulate raw output.
High-branching tasks: Too many valid tool choices per step. The agent dithers or picks randomly. Solution: reduce available tools per step via dynamic tool filtering.
Precise multi-step sequences: ReAct's flexibility becomes a liability when steps MUST happen in order. Solution: Plan-Execute with a fixed step list.

When to Use Plan-Execute Over ReAct

Plan-Execute is better when:
- Task has >5 clear sub-steps
- Steps have dependencies (step 3 needs step 1's output)
- You want human review of the plan before execution
- Failure at step N shouldn't require restarting from step 1

ReAct is better when:
- You don't know how many steps are needed
- Each step's action depends on what you discover
- The task is exploratory (research, debugging)
- Speed matters more than predictability

Tool Design — The Most Undervalued Skill

Tool descriptions matter more than the system prompt. The agent reads tool descriptions at every step to decide which tool to call. Bad descriptions = wrong tool selection = agent failure.

What Makes a Good Tool Schema

BAD tool description:
  "search" - "Searches for things"

GOOD tool description:
  "search_knowledge_base" - "Search internal knowledge base for
   product documentation and support articles. Returns top 5 matching
   documents with relevance scores. Use for: customer questions about
   product features, troubleshooting steps, pricing info. Do NOT use
   for: general web search, competitor info, real-time data."

The rules:

Name is a verb phrase — search_knowledge_base not kb or search
Description says WHEN to use it — not just what it does
Description says WHEN NOT to use it — prevents mis-selection
Parameters have examples — the agent sees the schema, not your code
Return format is documented — agent must know what it gets back

The Tool Count Problem

| Tools Available | Selection Accuracy | Impact | |----------------|-------------------|--------| | 1-5 | ~95% correct | Reliable | | 6-15 | ~85% correct | Acceptable | | 16-30 | ~65% correct | Frequent wrong tool | | 30+ | ~40% correct | Agent is guessing |

Solutions when you have many tools:

Dynamic tool filtering: Only show tools relevant to the current step
Tool categories: Group tools, let agent pick category first, then specific tool
Specialized sub-agents: Each sub-agent gets 3-5 tools for its domain

Multi-Agent Decision Framework

Do you need multiple agents?
│
├─ Do different parts need DIFFERENT tool sets?
│  ├─ YES and tools would conflict → Multi-agent
│  └─ YES but tools are compatible → Single agent, more tools (if <15)
│
├─ Do different parts need DIFFERENT system prompts?
│  ├─ YES, fundamentally different personas → Multi-agent
│  └─ YES, minor tone shifts → Single agent with role-switching
│
├─ Do parts need to run in PARALLEL?
│  ├─ YES → Multi-agent (parallel execution)
│  └─ NO → Likely single agent
│
└─ Is the task DECOMPOSABLE into independent subtasks?
   ├─ YES, clean boundaries → Multi-agent with orchestrator
   └─ NO, tightly coupled → Single agent

Multi-Agent Communication Patterns

| Pattern | How It Works | Failure Mode | |---------|-------------|-------------| | Orchestrator | Central agent delegates to specialists | Orchestrator becomes bottleneck; misunderstands specialist output | | Pipeline | Agent A's output feeds Agent B | No feedback loop; error in A propagates silently | | Debate | Multiple agents critique each other | Converges to consensus mush; tokens explode | | Hierarchical | Manager agents supervise worker agents | Over-engineering; each layer adds latency + cost |

Default to orchestrator pattern. It's the simplest to debug, easiest to extend, and has the clearest failure modes. Only use other patterns when orchestrator demonstrably fails.

Agent Failure Modes (What Production Teaches You)

| Failure | Symptom | Root Cause | Fix | |---------|---------|------------|-----| | Infinite loop | Agent repeats same action | No loop detection, bad stop condition | Max iterations + action deduplication | | Hallucinated tool call | Agent fabricates tool output without calling it | Tool description unclear, or model confused | Verify tool was actually called in traces | | Tool selection drift | Agent picks wrong tool increasingly | Context window filling with irrelevant history | Summarize history, filter tools per step | | Plan abandonment | Agent ignores its own plan mid-execution | New observation contradicts plan, no replan logic | Explicit replan trigger when observations diverge | | Graceless failure | Agent errors out with no useful output | No fallback, no partial result handling | Return partial results + clear error context | | Silent wrong answer | Agent confidently returns incorrect result | No verification step, no self-check | Add verification tool, structured self-critique |

The Escape Hatch Pattern

Every agent MUST have a way to gracefully give up:

After N failed attempts at the same sub-task:
1. Return what you HAVE accomplished (partial results)
2. Explain what you COULDN'T do and why
3. Suggest what a human should do next
4. Do NOT retry the same failing action

Without escape hatches, agents loop until they hit token limits, waste money, and return nothing useful.

Rationalization Table

| Rationalization | When It Appears | Why It's Wrong | |----------------|-----------------|----------------| | "Let's build an agent for this" | Starting any LLM task | Most tasks are pipelines. Ask "does this need to discover what to do?" first. | | "More tools = more capable" | Designing agent tool set | More tools = worse selection accuracy. 5-10 focused tools beat 30 unfocused ones. | | "We need multiple agents" | Complex task decomposition | Single agent with role-switching handles most cases. Multi-agent adds communication overhead. | | "ReAct handles everything" | Choosing architecture | ReAct breaks on long tasks, precise sequences, and high-branching decisions. Match pattern to task. | | "The agent will figure it out" | Skipping tool description quality | Tool descriptions are the agent's primary decision input. Vague descriptions = random tool selection. |

NEVER

NEVER build an agent when a deterministic pipeline handles the task — agents add latency, cost, and unpredictability that must be justified by genuinely dynamic reasoning
NEVER give an agent >15 tools without dynamic filtering — selection accuracy drops below useful threshold at ~16+ tools
NEVER skip the escape hatch — agents without graceful failure will loop until token limits, wasting cost and returning nothing
NEVER put "when to use this tool" only in the system prompt — the agent reads tool descriptions at every step; the system prompt fades from attention in long contexts
NEVER assume multi-agent is better than single-agent — each agent boundary is a communication failure point; default to single agent until it demonstrably can't handle the task
NEVER deploy an agent without loop detection — max iterations + action deduplication are non-negotiable production requirements

Red Flags

[ ] Building an agent for a task that follows the same steps every time — this is a pipeline
[ ] Agent has 20+ tools with no filtering strategy — tool selection will be unreliable
[ ] No max iteration limit on the agent loop — will run until token budget exhaustion
[ ] Tool descriptions say what the tool does but not when to use it — agent can't make good selection decisions
[ ] Multi-agent system where agents rarely communicate — probably should be independent pipelines
[ ] No partial result return on failure — agent either succeeds completely or returns nothing
[ ] Agent tested only on happy-path inputs — adversarial and edge-case inputs will reveal architecture gaps

sharkitect-solutions/ai-agents-architect

skills/ai-agents-architect/SKILL.md

Use when deciding WHETHER to build an AI agent (vs pipeline/chain), choosing an agent architecture pattern (ReAct, Plan-Execute, routing, multi-agent), designing tool schemas for agents, or debugging agent failures (loops, hallucinated tool calls, degraded tool selection). Use when the question is about agent DESIGN, not implementation. NEVER for implementing specific agent frameworks (use agent-development, agents-crewai). NEVER for agent memory design (use agent-memory-systems). NEVER for agent evaluation (use agent-evaluation).

tools

Updated Apr 16, 2026

$ install --global

skillsauth

npx skillsauth add sharkitect-solutions/sharkitect-claude-toolkit ai-agents-architect

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 24, 2026, 8:50 PM1.7s1 file scanned

SKILL.md

name:: ai-agents-architect
description:: Use when deciding WHETHER to build an AI agent (vs pipeline/chain), choosing an agent architecture pattern (ReAct, Plan-Execute, routing, multi-agent), designing tool schemas for agents, or debugging agent failures (loops, hallucinated tool calls, degraded tool selection). Use when the question is about agent DESIGN, not implementation. NEVER for implementing specific agent frameworks (use agent-development, agents-crewai). NEVER for agent memory design (use agent-memory-systems). NEVER for agent evaluation (use agent-evaluation).
version:: 2.0
optimized:: true
optimized_date:: 2026-03-10

AI Agent Architecture

The Agent Tax — Why Most Tasks Don't Need Agents

Every agent adds cost you must justify:

Should this be an agent?
│
├─ Is the task STATIC (same steps every time)?
│  └─ YES → Use a deterministic pipeline. No agent needed.
│     (ETL, format conversion, template filling)
│
├─ Does it need CONDITIONAL logic but predictable branches?
│  └─ YES → Use a chain/router. Still no agent.
│     (Classify → route to handler, if/else workflows)
│
├─ Does it need to DISCOVER what to do based on results?
│  └─ YES → This is an agent use case.
│     (Research tasks, debugging, multi-step problem solving)
│
└─ Does it need to ADAPT its plan mid-execution?
   └─ YES → This is a strong agent use case.
      (Complex reasoning, open-ended exploration)

Architecture Pattern Selection

When ReAct Breaks Down

ReAct (Reason-Act-Observe) is the default pattern but has specific failure modes:

Long tasks (>8 steps): Context window fills with observation history. The agent "forgets" early steps. Solution: summarize observations, don't accumulate raw output.
High-branching tasks: Too many valid tool choices per step. The agent dithers or picks randomly. Solution: reduce available tools per step via dynamic tool filtering.
Precise multi-step sequences: ReAct's flexibility becomes a liability when steps MUST happen in order. Solution: Plan-Execute with a fixed step list.

When to Use Plan-Execute Over ReAct

Plan-Execute is better when:
- Task has >5 clear sub-steps
- Steps have dependencies (step 3 needs step 1's output)
- You want human review of the plan before execution
- Failure at step N shouldn't require restarting from step 1

ReAct is better when:
- You don't know how many steps are needed
- Each step's action depends on what you discover
- The task is exploratory (research, debugging)
- Speed matters more than predictability

Tool Design — The Most Undervalued Skill

Tool descriptions matter more than the system prompt. The agent reads tool descriptions at every step to decide which tool to call. Bad descriptions = wrong tool selection = agent failure.

What Makes a Good Tool Schema

BAD tool description:
  "search" - "Searches for things"

GOOD tool description:
  "search_knowledge_base" - "Search internal knowledge base for
   product documentation and support articles. Returns top 5 matching
   documents with relevance scores. Use for: customer questions about
   product features, troubleshooting steps, pricing info. Do NOT use
   for: general web search, competitor info, real-time data."

The rules:

Name is a verb phrase — search_knowledge_base not kb or search
Description says WHEN to use it — not just what it does
Description says WHEN NOT to use it — prevents mis-selection
Parameters have examples — the agent sees the schema, not your code
Return format is documented — agent must know what it gets back

The Tool Count Problem

Solutions when you have many tools:

Dynamic tool filtering: Only show tools relevant to the current step
Tool categories: Group tools, let agent pick category first, then specific tool
Specialized sub-agents: Each sub-agent gets 3-5 tools for its domain

Multi-Agent Decision Framework

Do you need multiple agents?
│
├─ Do different parts need DIFFERENT tool sets?
│  ├─ YES and tools would conflict → Multi-agent
│  └─ YES but tools are compatible → Single agent, more tools (if <15)
│
├─ Do different parts need DIFFERENT system prompts?
│  ├─ YES, fundamentally different personas → Multi-agent
│  └─ YES, minor tone shifts → Single agent with role-switching
│
├─ Do parts need to run in PARALLEL?
│  ├─ YES → Multi-agent (parallel execution)
│  └─ NO → Likely single agent
│
└─ Is the task DECOMPOSABLE into independent subtasks?
   ├─ YES, clean boundaries → Multi-agent with orchestrator
   └─ NO, tightly coupled → Single agent

Multi-Agent Communication Patterns

Default to orchestrator pattern. It's the simplest to debug, easiest to extend, and has the clearest failure modes. Only use other patterns when orchestrator demonstrably fails.

Agent Failure Modes (What Production Teaches You)

The Escape Hatch Pattern

Every agent MUST have a way to gracefully give up:

After N failed attempts at the same sub-task:
1. Return what you HAVE accomplished (partial results)
2. Explain what you COULDN'T do and why
3. Suggest what a human should do next
4. Do NOT retry the same failing action

Without escape hatches, agents loop until they hit token limits, waste money, and return nothing useful.

Rationalization Table

NEVER

NEVER build an agent when a deterministic pipeline handles the task — agents add latency, cost, and unpredictability that must be justified by genuinely dynamic reasoning
NEVER give an agent >15 tools without dynamic filtering — selection accuracy drops below useful threshold at ~16+ tools
NEVER skip the escape hatch — agents without graceful failure will loop until token limits, wasting cost and returning nothing
NEVER put "when to use this tool" only in the system prompt — the agent reads tool descriptions at every step; the system prompt fades from attention in long contexts
NEVER assume multi-agent is better than single-agent — each agent boundary is a communication failure point; default to single agent until it demonstrably can't handle the task
NEVER deploy an agent without loop detection — max iterations + action deduplication are non-negotiable production requirements

Red Flags

[ ] Building an agent for a task that follows the same steps every time — this is a pipeline
[ ] Agent has 20+ tools with no filtering strategy — tool selection will be unreliable
[ ] No max iteration limit on the agent loop — will run until token budget exhaustion
[ ] Tool descriptions say what the tool does but not when to use it — agent can't make good selection decisions
[ ] Multi-agent system where agents rarely communicate — probably should be independent pipelines
[ ] No partial result return on failure — agent either succeeds completely or returns nothing
[ ] Agent tested only on happy-path inputs — adversarial and edge-case inputs will reveal architecture gaps

Related Skills

sharkitect-solutions/paid-ads

development

VerifiedTrustedCommunity

When the user wants help with paid advertising campaigns on Google Ads, Meta (Facebook/Instagram), LinkedIn, Twitter/X, or other ad platforms. Also use when the user mentions 'PPC,' 'paid media,' 'ad copy,' 'ad creative,' 'ROAS,' 'CPA,' 'ad campaign,' 'retargeting,' or 'audience targeting.' This skill covers campaign strategy, ad creation, audience targeting, and optimization.

SKILL.mdUpdated May 29, 2026

sharkitect-solutions/paid-ads

sharkitect-solutions/skills/using-sharkitect-methodology

testing

VerifiedTrustedCommunity

--- name: using-sharkitect-methodology description: Use when starting any conversation in a Sharkitect workspace OR before any task involving NEW pricing, positioning, proposal, strategy, plan-execution, or schema-design work — mandates invocation of Sharkitect-specific methodology skills (pricing-strategy, marketing-strategy-pmm, smb-cfo, hq-revenue-ops, executing-plans, brainstorming) under the same anti-rationalization discipline as using-superpowers. Documentation has failed 4 times across H

SKILL.mdUpdated May 13, 2026

sharkitect-solutions/skills/using-sharkitect-methodology

sharkitect-solutions/end-session

testing

VerifiedTrustedCommunity

Use when user says 'end session', 'wrap up', 'stop for the day', 'done for today', 'close out', 'save session', 'wrapping up', or invokes /end-session. Runs the full 9-step end-of-session protocol: resource audit, MEMORY.md update, lessons capture, plan status, pending items, workspace checklist, .tmp/ audit, git commit+push, Supabase brain sync, session brief, summary. Final step schedules a detached self-kill of the current session ONLY (3s delay) so the window closes cleanly. Other claude.exe processes (active workspaces) are NOT touched -- orphan cleanup is handled separately by Claude-Orphan-Cleanup-Hourly with proper age safeguards. Do NOT use for: mid-session quick saves (use session-checkpoint), skill syncing (use sync-skills.py), brain memory queries (use supabase-sync.py pull), document freshness reviews (use document-lifecycle), resource gap detection (use resource-auditor).

SKILL.mdUpdated May 12, 2026

sharkitect-solutions/end-session

sharkitect-solutions/humanizer

testing

VerifiedTrustedCommunity

Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comprehensive "Signs of AI writing" guide. Detects and fixes patterns including: inflated symbolism, promotional language, superficial -ing analyses, vague attributions, em dash overuse, rule of three, AI vocabulary words, passive voice, negative parallelisms, and filler phrases.

SKILL.mdUpdated May 7, 2026

sharkitect-solutions/humanizer

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/sharkitect-solutions/sharkitect-claude-toolkit.git

# Copy into Claude Code skills folder (global)
cp -r sharkitect-claude-toolkit/skills/ai-agents-architect ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

sharkitect-solutions/sharkitect-claude-toolkit

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT