Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

latestaiagents/mcp-tool-design

Name: mcp-tool-design
Author: latestaiagents

skills/mcp-mastery/mcp-tool-design/SKILL.md

npx skillsauth add latestaiagents/agent-skills mcp-tool-design

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

MCP Tool Design

A well-designed tool is invoked correctly by the agent on the first try. A bad one causes loops, wrong-tool selection, or hallucinated arguments.

When to Use

Adding a new tool to an MCP server
Debugging "the model never picks this tool" or "the model picks the wrong tool"
Reviewing a PR that adds tools
Cleaning up a server with 30+ tools

The Three Levers

Agents pick tools based on name, description, and parameter schema — in that order of signal strength. Every design choice should strengthen at least one.

Naming Rules

| Good | Bad | Why | |---|---|---| | search_issues | issueSearcher | snake_case, verb-led | | get_user_by_email | user_lookup | specific over vague | | create_pr_comment | comment | namespaced by object | | list_repos | repos | action is explicit |

Rule: <verb>_<object>[_<qualifier>]. If two tools could answer the same query, one has the wrong name.

Description Rules

Descriptions are what the model reads most carefully. Budget ~1-3 sentences:

<Verb-led action>. <When to use it / when NOT to use it>. <Any gotchas>.

Example — weak vs strong

Weak:

Creates an issue.

Strong:

Create a GitHub issue in the specified repo. Use this for new bug reports or feature requests. Do NOT use to comment on an existing issue — use create_issue_comment for that. Title is required; body supports markdown.

The "when NOT to use" line is the highest-leverage sentence you can write — it routes disambiguation without the agent needing to enumerate all tools.

Parameter Schema Rules

Describe every field — .describe() in Zod, Field(description=...) in Pydantic. Undescribed fields get hallucinated values
Use enums for closed sets — priority: z.enum(["low", "med", "high"]) beats priority: z.string()
Default the optional — if 80% of calls use limit=50, set the default; don't make the agent guess
Prefer primitives at the top level — nested objects increase hallucination rates
Avoid freeform "options" bags — split into discrete flags

Example — good schema

server.tool(
  "search_issues",
  "Search GitHub issues across a repo. Returns up to 50 issues matching the query. " +
  "Use for finding issues by keyword or label. For a specific issue by number, use `get_issue` instead.",
  {
    repo: z.string().describe("owner/name format, e.g. 'anthropic/claude-code'"),
    query: z.string().describe("Full-text search query, GitHub search syntax supported"),
    state: z.enum(["open", "closed", "all"]).default("open"),
    labels: z.array(z.string()).optional().describe("Filter by label names (AND semantics)"),
    limit: z.number().int().min(1).max(100).default(25),
  },
  async (args) => { /* ... */ },
);

Return Shape Rules

The model sees the tool result as text. Structure matters:

Lead with a summary line — "Found 3 open issues matching 'auth bug':"
Include IDs so the model can chain calls
Truncate aggressively — don't return 200 rows when 10 will do; include "... 190 more, use offset=10" hint
Never return raw HTML — strip or convert to markdown

return {
  content: [{
    type: "text",
    text: [
      `Found ${results.length} issues matching "${query}":`,
      ...results.slice(0, 10).map(r => `- #${r.number} ${r.title} (${r.state})`),
      results.length > 10 ? `... ${results.length - 10} more. Narrow query or paginate with offset.` : "",
    ].filter(Boolean).join("\n"),
  }],
};

Surface Size

Tool selection accuracy degrades past ~15 tools per connected server set. If you have 40 CRUD operations, collapse them:

Before: create_issue, update_issue, delete_issue, close_issue, reopen_issue, assign_issue, ... (12 tools)
After: issue_action(action: "create"|"update"|"close"|"reopen", ...) (1 tool with discriminated schema)

Only collapse if the sub-actions share 80%+ of their schema. Otherwise the union becomes a mess.

Anti-Patterns

The God Tool — execute(query: string) that dispatches everything. Model can't pick wisely; hallucinates queries
Duplicate tools — search, find, lookup, query all doing similar things
Hidden required args — options: object where some keys are required; agent skips them
Silent truncation — returning 10 of 500 results without saying so; agent assumes it got everything
Tool-name collisions across servers — if user has 3 servers each with a search tool, the model gets confused. Prefix: linear_search, github_search

Validation Checklist

Before shipping a tool:

[ ] Name is verb-led and unambiguous
[ ] Description has ≥1 "when NOT to use" sentence if there's a nearby tool
[ ] Every parameter has .describe()
[ ] Closed-set fields use enums
[ ] Defaults cover the common case
[ ] Return includes IDs for chaining
[ ] Return summarizes + truncates large lists
[ ] Tested with MCP Inspector that the model can invoke it zero-shot

Best Practices

Read your descriptions aloud — if you stumble, the model will too
Pilot with one real conversation before adding 10 tools at once
When a user reports "the agent won't use my tool", check description first, schema second, name third
Log unresolved tool calls (agent tried to call a non-existent tool) — it reveals the tool it WANTED

latestaiagents/mcp-tool-design

skills/mcp-mastery/mcp-tool-design/SKILL.md

Design MCP tool schemas, names, and descriptions that AI agents actually pick correctly and use without hand-holding. Covers the anti-patterns that make agents loop, pick wrong tools, or hallucinate arguments. Use this skill when designing or reviewing MCP tools, debugging "the agent isn't using my tool", or pruning a bloated tool surface. Activate when: MCP tool design, tool description, agent picks wrong tool, too many tools, tool schema, tool naming.

2 stars

tools

Updated Apr 23, 2026

$ install --global

skillsauth

npx skillsauth add latestaiagents/agent-skills mcp-tool-design

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 24, 2026, 2:56 AM11.8s1 file scanned

SKILL.md

name:: mcp-tool-design
description:: |
Activate when:: MCP tool design, tool description, agent picks wrong tool, too many tools, tool schema, tool naming.

MCP Tool Design

A well-designed tool is invoked correctly by the agent on the first try. A bad one causes loops, wrong-tool selection, or hallucinated arguments.

When to Use

Adding a new tool to an MCP server
Debugging "the model never picks this tool" or "the model picks the wrong tool"
Reviewing a PR that adds tools
Cleaning up a server with 30+ tools

The Three Levers

Agents pick tools based on name, description, and parameter schema — in that order of signal strength. Every design choice should strengthen at least one.

Naming Rules

Rule: <verb>_<object>[_<qualifier>]. If two tools could answer the same query, one has the wrong name.

Description Rules

Descriptions are what the model reads most carefully. Budget ~1-3 sentences:

<Verb-led action>. <When to use it / when NOT to use it>. <Any gotchas>.

Example — weak vs strong

Weak:

Creates an issue.

Strong:

Create a GitHub issue in the specified repo. Use this for new bug reports or feature requests. Do NOT use to comment on an existing issue — use create_issue_comment for that. Title is required; body supports markdown.

The "when NOT to use" line is the highest-leverage sentence you can write — it routes disambiguation without the agent needing to enumerate all tools.

Parameter Schema Rules

Describe every field — .describe() in Zod, Field(description=...) in Pydantic. Undescribed fields get hallucinated values
Use enums for closed sets — priority: z.enum(["low", "med", "high"]) beats priority: z.string()
Default the optional — if 80% of calls use limit=50, set the default; don't make the agent guess
Prefer primitives at the top level — nested objects increase hallucination rates
Avoid freeform "options" bags — split into discrete flags

Example — good schema

server.tool(
  "search_issues",
  "Search GitHub issues across a repo. Returns up to 50 issues matching the query. " +
  "Use for finding issues by keyword or label. For a specific issue by number, use `get_issue` instead.",
  {
    repo: z.string().describe("owner/name format, e.g. 'anthropic/claude-code'"),
    query: z.string().describe("Full-text search query, GitHub search syntax supported"),
    state: z.enum(["open", "closed", "all"]).default("open"),
    labels: z.array(z.string()).optional().describe("Filter by label names (AND semantics)"),
    limit: z.number().int().min(1).max(100).default(25),
  },
  async (args) => { /* ... */ },
);

Return Shape Rules

The model sees the tool result as text. Structure matters:

Lead with a summary line — "Found 3 open issues matching 'auth bug':"
Include IDs so the model can chain calls
Truncate aggressively — don't return 200 rows when 10 will do; include "... 190 more, use offset=10" hint
Never return raw HTML — strip or convert to markdown

return {
  content: [{
    type: "text",
    text: [
      `Found ${results.length} issues matching "${query}":`,
      ...results.slice(0, 10).map(r => `- #${r.number} ${r.title} (${r.state})`),
      results.length > 10 ? `... ${results.length - 10} more. Narrow query or paginate with offset.` : "",
    ].filter(Boolean).join("\n"),
  }],
};

Surface Size

Tool selection accuracy degrades past ~15 tools per connected server set. If you have 40 CRUD operations, collapse them:

Before: create_issue, update_issue, delete_issue, close_issue, reopen_issue, assign_issue, ... (12 tools)
After: issue_action(action: "create"|"update"|"close"|"reopen", ...) (1 tool with discriminated schema)

Only collapse if the sub-actions share 80%+ of their schema. Otherwise the union becomes a mess.

Anti-Patterns

The God Tool — execute(query: string) that dispatches everything. Model can't pick wisely; hallucinates queries
Duplicate tools — search, find, lookup, query all doing similar things
Hidden required args — options: object where some keys are required; agent skips them
Silent truncation — returning 10 of 500 results without saying so; agent assumes it got everything
Tool-name collisions across servers — if user has 3 servers each with a search tool, the model gets confused. Prefix: linear_search, github_search

Validation Checklist

Before shipping a tool:

[ ] Name is verb-led and unambiguous
[ ] Description has ≥1 "when NOT to use" sentence if there's a nearby tool
[ ] Every parameter has .describe()
[ ] Closed-set fields use enums
[ ] Defaults cover the common case
[ ] Return includes IDs for chaining
[ ] Return summarizes + truncates large lists
[ ] Tested with MCP Inspector that the model can invoke it zero-shot

Best Practices

Read your descriptions aloud — if you stumble, the model will too
Pilot with one real conversation before adding 10 tools at once
When a user reports "the agent won't use my tool", check description first, schema second, name third
Log unresolved tool calls (agent tried to call a non-existent tool) — it reveals the tool it WANTED

Related Skills

latestaiagents/skill-testing

development

VerifiedTrustedCommunity

Test skills for correct activation, content quality, and regression — both automated checks (frontmatter validity, lint) and manual verification (query-suite activation testing). Covers CI integration and how to catch skill regressions before users do. Use this skill when adding skills to a repo, setting up CI for a skill library, or debugging "the skill exists but doesn't work". Activate when: test skills, validate skills, skill CI, skill linting, skill activation test, skill regression.

2SKILL.mdUpdated Apr 23, 2026

latestaiagents/skill-testing

latestaiagents/skill-frontmatter

documentation

VerifiedTrustedCommunity

Write the YAML frontmatter for a SKILL.md file so it activates reliably — name, description, and activation keywords that the model matches against. Covers length, tone, and the most common frontmatter mistakes. Use this skill when authoring a new skill, fixing a skill that isn't auto-activating, or reviewing skills for publication. Activate when: SKILL.md frontmatter, skill description, skill activation, skill YAML, write a skill, author a skill.

2SKILL.mdUpdated Apr 23, 2026

latestaiagents/skill-frontmatter

latestaiagents/skill-activation-patterns

development

VerifiedTrustedCommunity

Design skills that fire at the right moment — neither over-eager (noise) nor under-eager (silent). Covers activation specificity, trigger phrases, disambiguation between overlapping skills, and debugging activation. Use this skill when multiple skills could fire on the same query, a skill never fires, or a skill fires too often. Activate when: skill won't activate, skill over-activates, overlapping skills, skill triggers, skill selection, skill disambiguation.

2SKILL.mdUpdated Apr 23, 2026

latestaiagents/skill-activation-patterns

latestaiagents/progressive-disclosure

development

VerifiedTrustedCommunity

Structure SKILL.md content so the model reads just enough — concise summary up front, progressively deeper detail, examples on demand. Covers section ordering, length budgets, when to split into multiple skills. Use this skill when writing or refactoring a skill body, one skill has grown too long, or a skill is wordy but not useful. Activate when: SKILL.md structure, skill content, skill too long, split skill, progressive disclosure, skill body.

2SKILL.mdUpdated Apr 23, 2026

latestaiagents/progressive-disclosure

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/latestaiagents/agent-skills.git

# Copy into Claude Code skills folder (global)
cp -r agent-skills/skills/mcp-mastery/mcp-tool-design ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

latestaiagents/agent-skills

2 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT