getsentry/testing-guidelines

Name: testing-guidelines
Author: getsentry

.agents/skills/testing-guidelines/SKILL.md

npx skillsauth add getsentry/warden testing-guidelines

Testing Guidelines

Follow these principles when writing tests for this codebase.

Core Principles

1. Mock External Services, Use Real Fixtures

ALWAYS mock third-party network services. ALWAYS use fixtures based on real-world data.

Fixtures must be scrubbed of PII (use dummy data such as an example.com email address or user-123)
Capture real API responses, then sanitize them
Never make actual network calls in tests
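
A minimal sketch of this principle, assuming a hypothetical `UserResponse` shape and a hypothetical `greetingFor` function under test (neither comes from this codebase). It scrubs a captured response into a fixture, then serves that fixture through an injected fake instead of the network:

```typescript
// Hypothetical shape of a captured API response; field names are illustrative.
interface UserResponse {
  id: string;
  email: string;
  name: string;
}

// Replace PII in a captured response with stable dummy values.
function scrubUser(raw: UserResponse): UserResponse {
  return { ...raw, id: 'user-123', email: 'user@example.com', name: 'Test User' };
}

// Fetch-like function type so the code under test can accept a fake.
type FetchUser = (id: string) => Promise<UserResponse>;

// Build a fake that serves the sanitized fixture and records its calls.
// No network access is involved.
function makeFakeFetchUser(fixture: UserResponse) {
  const calls: string[] = [];
  const fetchUser: FetchUser = async (id) => {
    calls.push(id);
    return fixture;
  };
  return { fetchUser, calls };
}

// Hypothetical code under test: depends on the injected fetcher, not on the network.
async function greetingFor(id: string, fetchUser: FetchUser): Promise<string> {
  const user = await fetchUser(id);
  return `Hello, ${user.name}`;
}
```

In a real suite the scrubbed fixture would typically live in a checked-in JSON file, and the fake would be passed wherever the production client is normally injected.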

2. Prefer Integration Tests Over Unit Tests

Focus on end-to-end style tests that validate inputs and outputs, not implementation details.

Test the public interface, not internal methods
Unit tests are valuable for edge cases in pure functions, but integration tests are the priority
If refactoring breaks tests but behavior is unchanged, the tests were too coupled to implementation
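
To illustrate the coupling point, here is a small sketch with a hypothetical `slugify` function written two ways. The check only looks at inputs and outputs, so it should hold for both the original and the refactored implementation:

```typescript
type Slugify = (title: string) => string;

// Original implementation: regex replacement, then trimming stray dashes.
const slugifyV1: Slugify = (title) =>
  title
    .toLowerCase()
    .trim()
    .replace(/[^a-z0-9]+/g, '-')
    .replace(/^-+|-+$/g, '');

// Refactored implementation: split into words, drop empties, join.
const slugifyV2: Slugify = (title) =>
  title
    .toLowerCase()
    .split(/[^a-z0-9]+/)
    .filter((word) => word.length > 0)
    .join('-');

// Behavior-level check: asserts on the public result only,
// never on which helper or regex produced it.
function producesExpectedSlug(slugify: Slugify): boolean {
  return slugify('  Hello, World!  ') === 'hello-world';
}
```

A test that instead asserted on the intermediate regex output would break on the refactor even though callers see no difference.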

3. Minimize Edge Case Testing

Don't test every variant of a problem.

Cover the common path thoroughly
Skip exhaustive input permutations
Skip unlikely edge cases that add maintenance burden without value
One representative test per category of input is usually sufficient
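
One way to keep to a single representative per category is a small case table. The sketch below assumes a hypothetical `parseDuration` function (not part of this codebase) with three input categories:

```typescript
// Hypothetical function under test: parses "30s" or "5m" into seconds.
function parseDuration(input: string): number | null {
  const match = /^(\d+)(s|m)$/.exec(input);
  if (!match) return null;
  const value = Number(match[1]);
  return match[2] === 'm' ? value * 60 : value;
}

// One representative input per category, not every permutation.
const durationCases: Array<{ category: string; input: string; expected: number | null }> = [
  { category: 'seconds', input: '30s', expected: 30 },
  { category: 'minutes', input: '5m', expected: 300 },
  { category: 'invalid', input: 'soon', expected: null },
];

// Returns the categories whose representative case fails.
function failingCategories(): string[] {
  return durationCases
    .filter((c) => parseDuration(c.input) !== c.expected)
    .map((c) => c.category);
}
```

In a Vitest suite the same table could feed `it.each`, so adding a category means adding one row.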

4. Always Add Regression Tests for Bugs

When a bug is identified, ALWAYS add a test that would have caught it.

The test should fail before the fix and pass after
Name it descriptively to document the bug
This prevents the same bug from recurring

Note: Regression tests are for unintentional broken behavior (bugs), not intentional changes. Intentional feature removals, deprecations, or breaking changes do NOT need regression tests—these are design decisions, not defects.
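
As a sketch of "fail before the fix, pass after", assume a hypothetical `pageCount` function whose earlier version dropped the final partial page. The buggy version is kept here only to show that the regression check would have caught it:

```typescript
// Fixed implementation: rounds up so a final partial page is counted.
function pageCount(totalItems: number, pageSize: number): number {
  if (pageSize <= 0) throw new RangeError('pageSize must be positive');
  return Math.ceil(totalItems / pageSize);
}

// Hypothetical pre-fix behavior, shown for illustration only.
function pageCountBuggy(totalItems: number, pageSize: number): number {
  return Math.floor(totalItems / pageSize);
}

// Regression check, named after the bug it documents:
// 21 items at 10 per page must yield 3 pages, not 2.
function countsFinalPartialPage(impl: (total: number, size: number) => number): boolean {
  return impl(21, 10) === 3;
}
```

In the real suite only the fixed function exists, and the check becomes a descriptively named test case such as "counts the final partial page".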

5. Cover Every User Entry Point

ALWAYS have at least one basic test for each customer/user entry point.

CLI commands, API endpoints, public/exported functions
Test the common/happy path first
This proves the entry point works at all

Note: "Entry point" means the public interface—exported functions, CLI commands, API routes. Internal/private functions are NOT entry points, even if they handle user-facing flags or options. Test entry points; internal functions get coverage through those tests.
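
A minimal sketch of a happy-path check for an entry point, assuming a hypothetical `runCli` function and `version` command (illustrative names, not this project's CLI). The internal helper gets its coverage through the entry point and has no test of its own:

```typescript
interface CliResult {
  exitCode: number;
  stdout: string;
}

// Internal helper: not an entry point, covered indirectly via runCli.
function formatVersion(version: string): string {
  return `tool v${version}`;
}

// Hypothetical public entry point; in a real module this would be exported.
function runCli(argv: string[]): CliResult {
  if (argv[0] === 'version') {
    return { exitCode: 0, stdout: formatVersion('1.0.0') };
  }
  return { exitCode: 1, stdout: `unknown command: ${argv[0] ?? '(none)'}` };
}

// Happy-path check: proves the entry point works at all.
function versionCommandWorks(): boolean {
  const result = runCli(['version']);
  return result.exitCode === 0 && result.stdout === 'tool v1.0.0';
}
```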

6. Tests Validate Before Manual QA

Tests are how we validate ANY functionality works before manual testing.

Write tests first or alongside code, not as an afterthought
If you can't test it, reconsider the design
Passing tests should give confidence to ship

Technical Guidelines

File Organization

Test files use *.test.ts extension
Co-locate tests with source: foo.ts → foo.test.ts

Test Isolation

Every test must:

Run independently without affecting other tests
Use temporary directories for file operations
Clean up resources in afterEach hooks

```typescript
import { describe, it, expect, beforeEach, afterEach } from 'vitest';
import { mkdtempSync, rmSync, writeFileSync } from 'node:fs';
import { join } from 'node:path';
import { tmpdir } from 'node:os';

describe('my feature', () => {
  let tempDir: string;

  beforeEach(() => {
    // mkdtempSync appends a random suffix, so tests running in the same
    // millisecond cannot end up sharing a directory
    tempDir = mkdtempSync(join(tmpdir(), 'warden-test-'));
  });

  afterEach(() => {
    // Remove the directory and its contents, even if the test failed
    rmSync(tempDir, { recursive: true, force: true });
  });

  it('does something with files', () => {
    writeFileSync(join(tempDir, 'test.ts'), 'content');
    // ... test code
  });
});
```

Pure Function Tests

For pure functions without side effects, no special setup is needed:

```typescript
import { describe, it, expect } from 'vitest';
import { matchGlob } from './matcher.js';

describe('matchGlob', () => {
  it('matches exact paths', () => {
    expect(matchGlob('src/index.ts', 'src/index.ts')).toBe(true);
  });
});
```

Running Tests

```shell
pnpm test              # Run all tests in watch mode
pnpm test:run          # Run all tests once
```

Checklist Before Submitting

[ ] New entry points have at least one happy-path test
[ ] Bug fixes (not intentional changes) include a regression test
[ ] External services are mocked with sanitized fixtures
[ ] Tests validate behavior, not implementation
[ ] No shared state between tests

Guide for writing tests. Use when adding new functionality, fixing bugs, or when tests are needed. Emphasizes integration tests, real-world fixtures, and regression coverage.

137 stars

testing

Updated Apr 5, 2026

Install globally

npx skillsauth add getsentry/warden testing-guidelines

Installs this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status | % |
| --- | --- | --- | --- |
| Trivy | Container and dependency vulnerability scanner | Clean | 95% |
| Semgrep | Static code analysis for vulnerabilities | Clean | 95% |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean | 95% |
| Snyk (dep) | Open source security scanning | Skipped | 50% |
| Socket.dev | Supply chain security analysis | Skipped | 50% |
| VirusTotal | Multi-engine malware detection | Skipped | 50% |
| CrowdStrike | Advanced threat intelligence | Skipped | 50% |
| OSV-Scanner | Open Source Vulnerability database check | Skipped | 50% |
| OWASP Dep-Check | | Skipped | 50% |

Last scanned: Apr 5, 2026, 1:22 PM (42.2s, 1 file scanned)

Related Skills

getsentry/warden

development · Verified · Trusted · Community

Run Warden to analyze code changes before committing. Use when asked to "run warden", "check my changes", "review before commit", "warden config", "warden.toml", "create a warden skill", "add trigger", or any Warden-related local development task.

332 · SKILL.md · Updated Apr 5, 2026

getsentry/security-review

development · Verified · Trusted · Community

Finds exploitable application security vulnerabilities in code changes. Use for Warden security scans, appsec review, OWASP-style checks, authentication or authorization bugs, injection, XSS, SSRF, path traversal, secrets, unsafe crypto, webhook verification, open redirects, or sensitive data exposure.

330 · SKILL.md · Updated May 28, 2026

getsentry/code-review

development · Verified · Trusted · Community

Finds real correctness bugs in code changes. Use for adversarial code review, bug hunts, regression review, PR correctness review, logic errors, data loss, race conditions, state bugs, interface contract breaks, error handling bugs, edge cases, broken builds, or broken workflows. Excludes style, readability, architecture, AppSec, and best-practice-only feedback unless the issue causes a demonstrable bug.

330 · SKILL.md · Updated May 28, 2026

getsentry/warden-sweep

development · Verified · Trusted · Community

Full-repository code sweep. Scans every file with Warden, verifies findings through deep tracing, creates draft PRs for validated issues. Use when asked to "sweep the repo", "scan everything", "find all bugs", "full codebase review", "batch code analysis", or run Warden across the entire repository.

330 · SKILL.md · Updated Apr 5, 2026

Install manually

```shell
# Clone the repo
git clone https://github.com/getsentry/warden.git

# Copy into Claude Code skills folder (global)
cp -r warden/.agents/skills/testing-guidelines ~/.claude/skills/
```

Repository

getsentry/warden

137 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT