Adoption

Agent Skills are supported by leading AI development tools.

VS Code, Gemini CLI, GitHub, Goose, Amp, Cursor, Claude Code, Letta, OpenCode, Claude, OpenAI Codex, Factory

langwatch/analytics

Name: analytics
Author: langwatch

skills/analytics/SKILL.md

npx skillsauth add langwatch/langwatch analytics

Analyze Agent Performance with LangWatch

This skill queries and presents analytics. It does NOT write code.

Preferred: Use the LangWatch CLI

If the langwatch CLI is available (check with langwatch --help), prefer it over MCP tools:

# Quick project overview
langwatch status

# Query metrics with presets
langwatch analytics query --metric trace-count      # Total traces
langwatch analytics query --metric total-cost       # Total cost
langwatch analytics query --metric avg-latency      # Average latency
langwatch analytics query --metric p95-latency      # P95 latency
langwatch analytics query --metric eval-pass-rate   # Evaluation pass rate

# Search traces
langwatch trace search -q "error" --limit 10        # Find error traces
langwatch trace search --start-date 2026-01-01      # Custom date range

# Get trace details
langwatch trace get <traceId>                       # Human-readable
langwatch trace get <traceId> -f json               # Raw JSON
langwatch trace export --format csv -o traces.csv   # Export as CSV
langwatch trace export --format jsonl --limit 500   # Export as JSONL

Set LANGWATCH_API_KEY in the environment before running CLI commands.
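
For readers who script around this skill, the CLI-first rule can be sketched as below. This is a minimal illustration, not part of the skill: `choose_backend`, `api_key_set`, and `metric_query_argv` are hypothetical helper names, and the check uses `shutil.which` rather than running `langwatch --help`. The preset set only lists the names shown in the commands above; the real CLI may accept others.

```python
import os
import shutil

# Preset names copied from the CLI examples above; the real CLI may accept more.
KNOWN_PRESETS = {
    "trace-count",
    "total-cost",
    "avg-latency",
    "p95-latency",
    "eval-pass-rate",
}


def choose_backend(which=shutil.which):
    """Return "cli" when a langwatch binary is on PATH, otherwise "mcp"."""
    return "cli" if which("langwatch") is not None else "mcp"


def api_key_set(env=None):
    """Report whether LANGWATCH_API_KEY is present and non-empty."""
    env = os.environ if env is None else env
    return bool(env.get("LANGWATCH_API_KEY"))


def metric_query_argv(preset):
    """Build the argv for `langwatch analytics query --metric <preset>`."""
    if preset not in KNOWN_PRESETS:
        raise ValueError(f"unknown preset: {preset!r}")
    return ["langwatch", "analytics", "query", "--metric", preset]
```

The returned list can be passed to `subprocess.run` once `api_key_set()` is true; building the argv as a list avoids shell quoting issues.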

Alternative: Use MCP Tools

If the CLI is not available, use MCP tools instead.

Step 1: Set up the LangWatch MCP

See MCP Setup for installation instructions.

Step 2: Discover Available Metrics

Call discover_schema with category "all" to learn the full set of available metrics, aggregations, and filters.

CRITICAL: Always call discover_schema first. Do NOT hardcode or guess metric names.
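
An agent normally invokes MCP tools through its host, but it can help to see what the discovery call looks like on the wire. The sketch below builds a standard MCP JSON-RPC `tools/call` request. `tool_call_request` is a hypothetical helper, and the argument key `category` is inferred from the instruction above rather than from the server's published schema.

```python
import json


def tool_call_request(name, arguments, request_id=1):
    """Build an MCP JSON-RPC 2.0 `tools/call` request as a plain dict."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


# Assumption: the tool accepts a "category" argument, as the step above implies.
discover = tool_call_request("discover_schema", {"category": "all"})
print(json.dumps(discover, indent=2))
```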

Step 3: Query Analytics

Use the appropriate MCP tool based on what the user needs:

Trends and Aggregations

Use get_analytics for time-series data and aggregate metrics:

Total LLM cost for the last 7 days -- metric "performance.total_cost", aggregation "sum"
P95 latency -- metric "performance.completion_time", aggregation "p95"
Token usage over time -- metric "performance.total_tokens", aggregation "sum"
Error rate -- metric "metadata.error", aggregation "count"
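
Because metric names must come from discover_schema rather than from memory, a query builder can refuse any name the schema did not list. The following is a sketch under stated assumptions: the schema shape (`metrics` and `aggregations` lists) and the argument keys `metric` and `aggregation` are hypothetical, and the sample schema is a stand-in for a real discover_schema result.

```python
def build_analytics_args(schema, metric, aggregation):
    """Return get_analytics arguments only if the schema lists both names."""
    if metric not in schema["metrics"]:
        raise ValueError(f"metric not in discovered schema: {metric!r}")
    if aggregation not in schema["aggregations"]:
        raise ValueError(f"aggregation not in discovered schema: {aggregation!r}")
    return {"metric": metric, "aggregation": aggregation}


# Stand-in for a discover_schema result; the real response shape may differ.
schema = {
    "metrics": ["performance.total_cost", "performance.completion_time"],
    "aggregations": ["sum", "p95"],
}

args = build_analytics_args(schema, "performance.completion_time", "p95")
```

A name that is missing from the schema raises `ValueError` before any request is sent, which is the behavior the CRITICAL note in Step 2 asks for.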

Finding Specific Traces

Use search_traces to find individual requests matching criteria:

Traces with errors
Traces from a specific user or session
Traces matching a keyword or pattern

Step 4: Inspect Individual Traces

Use get_trace with a trace ID to drill into details:

View the full request/response
See token counts and costs per span
Inspect error messages and stack traces
Examine individual LLM calls within a multi-step agent
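
Once a trace is in hand, per-span token counts and costs can be rolled up before being reported. The sketch below assumes a hypothetical trace shape (a `spans` list whose items carry `name`, `tokens`, and `cost`); the actual get_trace response may use different field names, and the sample values are placeholders.

```python
def span_totals(trace):
    """Sum tokens and cost across spans and name the most expensive span."""
    spans = trace.get("spans", [])
    costliest = max(spans, key=lambda s: s.get("cost", 0.0), default=None)
    return {
        "total_tokens": sum(s.get("tokens", 0) for s in spans),
        "total_cost": sum(s.get("cost", 0.0) for s in spans),
        "costliest_span": costliest.get("name") if costliest else None,
    }


# Placeholder trace for illustration only; field names are assumptions.
example_trace = {
    "spans": [
        {"name": "plan", "tokens": 300, "cost": 0.002},
        {"name": "answer", "tokens": 900, "cost": 0.010},
    ]
}

totals = span_totals(example_trace)
```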

Step 5: Present Findings

Summarize the data clearly for the user:

Lead with the key numbers they asked about
Highlight anomalies or concerning trends (cost spikes, latency increases, error rate changes)
Provide context by comparing to previous periods when relevant
Suggest next steps if issues are found (e.g., "The p95 latency spiked on Tuesday -- here are the slowest traces from that day")
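
Comparing against a previous period comes down to a percent change, (current - previous) / previous x 100, stated in one readable line. The helpers below are a minimal sketch; `percent_change` and `summarize` are hypothetical names, not part of LangWatch.

```python
def percent_change(current, previous):
    """Return the percent change from previous to current, or None without a baseline."""
    if previous == 0:
        return None
    return (current - previous) / previous * 100.0


def summarize(label, current, previous, unit=""):
    """Format one metric as a single human-readable line with its trend."""
    change = percent_change(current, previous)
    if change is None:
        return f"{label}: {current}{unit} (no previous-period baseline)"
    direction = "up" if change > 0 else "down" if change < 0 else "flat"
    return f"{label}: {current}{unit} ({direction} {abs(change):.1f}% vs previous period)"
```

Returning `None` for a zero baseline keeps the summary honest when the previous period has no data, instead of reporting a meaningless or infinite change.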

Common Mistakes

Do NOT try to write code -- this skill queries existing data and requires no SDK installation or code changes
If using MCP, always call discover_schema first -- do NOT hardcode metric names
If using CLI, use the preset names (trace-count, total-cost, avg-latency, etc.)
Do NOT use platform_ MCP tools for creating resources -- this skill is read-only analytics
Do NOT present raw JSON to the user -- summarize the data in a clear, human-readable format

Analyze your AI agent's performance using LangWatch analytics. Use when the user wants to understand costs, latency, error rates, usage trends, or debug specific traces. Works with any LangWatch-instrumented agent.

Stars: 3,203
Category: development
Updated: Apr 15, 2026

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy (container and dependency vulnerability scanner): Clean, 95%
Semgrep (static code analysis for vulnerabilities): Clean, 95%
mcp-scan (Snyk) (Model Context Protocol security validation): Clean, 95%
Snyk (dep) (open source security scanning): Skipped, 50%
Socket.dev (supply chain security analysis): Skipped, 50%
VirusTotal (multi-engine malware detection): Skipped, 50%
CrowdStrike (advanced threat intelligence): Skipped, 50%
OSV-Scanner (Open Source Vulnerability database check): Skipped, 50%
OWASP Dep-Check: Skipped, 50%

Last scanned: Apr 15, 2026, 1:21 PM (4.9 s, 1 file scanned)

SKILL.md

name: analytics
user-prompt: How is my agent performing?
description: Analyze your AI agent's performance using LangWatch analytics. Use when the user wants to understand costs, latency, error rates, usage trends, or debug specific traces. Works with any LangWatch-instrumented agent.
license: MIT
compatibility: Requires Node.js for MCP setup. Works with Claude Code, Claude Web, and similar AI assistants.

Related Skills

langwatch/tracing
Category: development
Verified | Trusted | Community
Add LangWatch tracing and observability to your code. Use for both onboarding (instrument an entire codebase) and targeted operations (add tracing to a specific function or module). Supports Python and TypeScript with all major frameworks.
3,203 stars | SKILL.md | Updated Apr 15, 2026

langwatch/scenarios
Category: tools
Verified | Trusted | Community
Test your AI agent with simulation-based scenarios. Covers writing scenario test code (Scenario SDK), creating platform scenarios (CLI or MCP), and red teaming for security vulnerabilities. Auto-detects whether to use code or platform approach based on context.
3,203 stars | SKILL.md | Updated Apr 15, 2026

langwatch/test-compliance
Category: testing
Verified | Trusted | Community
Test that your AI agent stays observational and doesn't give prescriptive advice in regulated domains (healthcare, finance, legal). Creates scenario tests for boundary enforcement and red team tests for adversarial probing. Use when your agent advises but must not prescribe.
3,203 stars | SKILL.md | Updated Apr 15, 2026

langwatch/test-cli-usability
Category: tools
Verified | Trusted | Community
Write scenario tests that verify your CLI tool is usable by AI agents. Ensures commands work non-interactively, provide clear output, and don't hang on prompts. Use when you want to prove your CLI is agent-friendly.
3,203 stars | SKILL.md | Updated Apr 15, 2026

Install manually

# Clone the repo
git clone https://github.com/langwatch/langwatch.git

# Copy into Claude Code skills folder (global)
cp -r langwatch/skills/analytics ~/.claude/skills/

See the official Claude Code Skills documentation for the skills path.

Repository

langwatch/langwatch

3,203 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT