Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

tylerjrbuell/identity-and-guardrails

Name: identity-and-guardrails
Author: tylerjrbuell

apps/docs/skills/identity-and-guardrails/SKILL.md

npx skillsauth add tylerjrbuell/reactive-agents-ts identity-and-guardrails

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Identity and Guardrails

Agent objective

Produce a builder with guardrails, behavioral contracts, and safety controls correctly configured so the agent operates within defined bounds and can be stopped when needed.

When to load this skill

Deploying an agent in a multi-tenant or public-facing context
Restricting which tools an agent can call or which topics it can address
Requiring human approval before certain tool calls
Needing runtime pause/resume/stop controls
Enforcing token/iteration/output-length budgets at the contract level

Implementation baseline

import { ReactiveAgents } from "@reactive-agents/runtime";

const agent = await ReactiveAgents.create()
  .withName("assistant")
  .withProvider("anthropic")
  .withReasoning({ defaultStrategy: "adaptive" })
  .withTools({ allowedTools: ["web-search", "http-get", "file-read", "checkpoint"] })
  .withGuardrails({
    injection: true,     // detect prompt injection attempts
    pii: true,           // detect and mask PII in inputs/outputs
    toxicity: true,      // detect toxic content
  })
  .withBehavioralContracts({
    deniedTools: ["file-write", "shell-execute"],
    maxToolCalls: 30,
    maxIterations: 20,
    requireDisclosure: true,   // agent must disclose it is an AI
  })
  .withKillSwitch()   // enables pause/resume/stop/terminate controls
  .withIdentity()     // enables identity and session tracking
  .withAudit()        // records all tool calls and decisions to audit log
  .build();

Key patterns

Guardrails

.withGuardrails()
// Enables all detectors with defaults: injection=true, pii=true, toxicity=true

.withGuardrails({
  injection: true,           // detect "ignore previous instructions" attacks
  pii: false,                // disable PII masking (e.g., agent legitimately processes PII)
  toxicity: true,
  customBlocklist: ["competitor-product", "internal-codename"],  // substring blocklist
})

Guardrail violations abort the turn and return a structured error — the agent never processes the blocked content.

Behavioral contracts

Full field reference for .withBehavioralContracts(contract):

.withBehavioralContracts({
  deniedTools: ["file-delete", "shell-execute"],    // tools the agent may NEVER call
  allowedTools: ["web-search", "file-read"],        // if set, ONLY these tools are allowed
  maxToolCalls: 50,                                  // hard stop after N total tool calls
  maxIterations: 20,                                 // hard stop after N reasoning iterations
  maxOutputLength: 4000,                             // truncate/block output over N chars
  deniedTopics: ["competitor names", "legal advice"], // topics agent must refuse
  requireDisclosure: true,                           // first response must disclose AI identity
})

Contract violations are enforced at runtime — violations halt the current turn with a ContractViolation error.

Kill switch (runtime control)

.withKillSwitch()
// Enables runtime controls on the built agent handle:

const handle = agent.run("Do a long task...");

// Graceful pause (waits for current phase to finish)
await handle.pause();

// Resume from paused state
await handle.resume();

// Graceful stop (finishes current phase, then stops)
await handle.stop("User requested cancellation");

// Immediate termination
await handle.terminate("Emergency shutdown");

Kill switch controls are no-ops if .withKillSwitch() was not called during build.

Identity and audit

.withIdentity()   // enables agent identity headers, session IDs, and persona tracking
.withAudit()      // records all tool calls, guardrail decisions, and contract checks to an audit trail

Identity and audit work independently — enable both for full traceability.

GuardrailsOptions reference

| Field | Type | Default | Notes | |-------|------|---------|-------| | injection | boolean | true | Prompt injection detection | | pii | boolean | true | PII detection and masking | | toxicity | boolean | true | Toxic content detection | | customBlocklist | string[] | [] | Case-insensitive substring blocklist |

BehavioralContract reference

| Field | Type | Notes | |-------|------|-------| | deniedTools | string[] | Tools that may never be called | | allowedTools | string[] | If set, only these tools are allowed | | maxToolCalls | number | Hard stop after N total tool calls | | maxIterations | number | Hard stop after N reasoning iterations | | maxOutputLength | number | Max output characters before truncation | | deniedTopics | string[] | Topics the agent must refuse | | requireDisclosure | boolean | Agent must disclose AI identity |

Pitfalls

.withGuardrails() with no args enables ALL detectors — disable selectively if your use case legitimately handles PII
deniedTools in a contract and allowedTools in .withTools() are independent — a tool can be in .withTools({ allowedTools }) but still blocked by a contract's deniedTools
Kill switch controls (pause, resume, stop, terminate) are no-ops without .withKillSwitch() — no error is thrown, calls are silently ignored
requireDisclosure enforces the agent states it is AI in its first response — this is a prompt-level enforcement, not cryptographic
Contract violations raise ContractViolation errors — handle these in your error callback or the agent run will throw
.withAudit() without a log destination writes to the observability stream — add .withObservability() to capture audit events

tylerjrbuell/identity-and-guardrails

apps/docs/skills/identity-and-guardrails/SKILL.md

Enable prompt injection detection, PII masking, behavioral contracts, kill switch controls, and agent identity for safe production deployments.

8 stars

devops

Updated Apr 21, 2026

$ install --global

skillsauth

npx skillsauth add tylerjrbuell/reactive-agents-ts identity-and-guardrails

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 22, 2026, 1:56 AM158.0s1 file scanned

SKILL.md

name:: identity-and-guardrails
description:: Enable prompt injection detection, PII masking, behavioral contracts, kill switch controls, and agent identity for safe production deployments.
compatibility:: Reactive Agents TypeScript projects using @reactive-agents/*
author:: reactive-agents
version:: 2.0
tier:: capability

Identity and Guardrails

Agent objective

Produce a builder with guardrails, behavioral contracts, and safety controls correctly configured so the agent operates within defined bounds and can be stopped when needed.

When to load this skill

Deploying an agent in a multi-tenant or public-facing context
Restricting which tools an agent can call or which topics it can address
Requiring human approval before certain tool calls
Needing runtime pause/resume/stop controls
Enforcing token/iteration/output-length budgets at the contract level

Implementation baseline

import { ReactiveAgents } from "@reactive-agents/runtime";

const agent = await ReactiveAgents.create()
  .withName("assistant")
  .withProvider("anthropic")
  .withReasoning({ defaultStrategy: "adaptive" })
  .withTools({ allowedTools: ["web-search", "http-get", "file-read", "checkpoint"] })
  .withGuardrails({
    injection: true,     // detect prompt injection attempts
    pii: true,           // detect and mask PII in inputs/outputs
    toxicity: true,      // detect toxic content
  })
  .withBehavioralContracts({
    deniedTools: ["file-write", "shell-execute"],
    maxToolCalls: 30,
    maxIterations: 20,
    requireDisclosure: true,   // agent must disclose it is an AI
  })
  .withKillSwitch()   // enables pause/resume/stop/terminate controls
  .withIdentity()     // enables identity and session tracking
  .withAudit()        // records all tool calls and decisions to audit log
  .build();

Key patterns

Guardrails

.withGuardrails()
// Enables all detectors with defaults: injection=true, pii=true, toxicity=true

.withGuardrails({
  injection: true,           // detect "ignore previous instructions" attacks
  pii: false,                // disable PII masking (e.g., agent legitimately processes PII)
  toxicity: true,
  customBlocklist: ["competitor-product", "internal-codename"],  // substring blocklist
})

Guardrail violations abort the turn and return a structured error — the agent never processes the blocked content.

Behavioral contracts

Full field reference for .withBehavioralContracts(contract):

.withBehavioralContracts({
  deniedTools: ["file-delete", "shell-execute"],    // tools the agent may NEVER call
  allowedTools: ["web-search", "file-read"],        // if set, ONLY these tools are allowed
  maxToolCalls: 50,                                  // hard stop after N total tool calls
  maxIterations: 20,                                 // hard stop after N reasoning iterations
  maxOutputLength: 4000,                             // truncate/block output over N chars
  deniedTopics: ["competitor names", "legal advice"], // topics agent must refuse
  requireDisclosure: true,                           // first response must disclose AI identity
})

Contract violations are enforced at runtime — violations halt the current turn with a ContractViolation error.

Kill switch (runtime control)

.withKillSwitch()
// Enables runtime controls on the built agent handle:

const handle = agent.run("Do a long task...");

// Graceful pause (waits for current phase to finish)
await handle.pause();

// Resume from paused state
await handle.resume();

// Graceful stop (finishes current phase, then stops)
await handle.stop("User requested cancellation");

// Immediate termination
await handle.terminate("Emergency shutdown");

Kill switch controls are no-ops if .withKillSwitch() was not called during build.

Identity and audit

.withIdentity()   // enables agent identity headers, session IDs, and persona tracking
.withAudit()      // records all tool calls, guardrail decisions, and contract checks to an audit trail

Identity and audit work independently — enable both for full traceability.

GuardrailsOptions reference

BehavioralContract reference

Pitfalls

.withGuardrails() with no args enables ALL detectors — disable selectively if your use case legitimately handles PII
deniedTools in a contract and allowedTools in .withTools() are independent — a tool can be in .withTools({ allowedTools }) but still blocked by a contract's deniedTools
Kill switch controls (pause, resume, stop, terminate) are no-ops without .withKillSwitch() — no error is thrown, calls are silently ignored
requireDisclosure enforces the agent states it is AI in its first response — this is a prompt-level enforcement, not cryptographic
Contract violations raise ContractViolation errors — handle these in your error callback or the agent run will throw
.withAudit() without a log destination writes to the observability stream — add .withObservability() to capture audit events

Related Skills

tylerjrbuell/reactive-agents

development

VerifiedTrustedCommunity

Orient to the Reactive Agents framework, understand the builder API shape, and select the right capability skills for your task.

9SKILL.mdUpdated Apr 21, 2026

tylerjrbuell/reactive-agents

tylerjrbuell/quality-assurance

testing

VerifiedTrustedCommunity

Enable output verification (hallucination detection, semantic entropy, self-consistency), add post-run verification steps, and run LLM-scored evals across 5 quality dimensions.

9SKILL.mdUpdated Apr 21, 2026

tylerjrbuell/quality-assurance

tylerjrbuell/provider-patterns

data-ai

VerifiedTrustedCommunity

Configure per-provider behavior, understand streaming quirks, and use the 7-hook adapter system for optimal performance across LLM providers.

9SKILL.mdUpdated Apr 21, 2026

tylerjrbuell/provider-patterns

tylerjrbuell/memory-patterns

data-ai

VerifiedTrustedCommunity

Configure the 4-layer memory system with SQLite/FTS5/vec storage for persistent agent knowledge that survives sessions.

9SKILL.mdUpdated Apr 21, 2026

tylerjrbuell/memory-patterns

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/tylerjrbuell/reactive-agents-ts.git

# Copy into Claude Code skills folder (global)
cp -r reactive-agents-ts/apps/docs/skills/identity-and-guardrails ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

tylerjrbuell/reactive-agents-ts

8 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT