Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

tylerjrbuell/cost-budget-enforcement

Name: cost-budget-enforcement
Author: tylerjrbuell

apps/docs/skills/cost-budget-enforcement/SKILL.md

npx skillsauth add tylerjrbuell/reactive-agents-ts cost-budget-enforcement

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Cost and Budget Enforcement

Agent objective

Produce a builder with cost tracking, budget limits, and rate limiting configured so the agent never exceeds defined spending thresholds.

When to load this skill

Deploying agents in production with real API costs
Building multi-tenant SaaS where per-user cost isolation matters
Protecting against runaway agent loops consuming excessive tokens
Adding circuit breakers for provider reliability

Implementation baseline

import { ReactiveAgents } from "@reactive-agents/runtime";

const agent = await ReactiveAgents.create()
  .withName("assistant")
  .withProvider("anthropic")
  .withReasoning({ defaultStrategy: "adaptive", maxIterations: 15 })
  .withTools({ allowedTools: ["web-search", "http-get", "checkpoint"] })
  .withCostTracking({
    perRequest: 0.25,   // max $0.25 per LLM call
    perSession: 2.0,    // max $2.00 per agent.run() call
    daily: 10.0,        // max $10.00/day across all sessions
    monthly: 100.0,     // max $100.00/month
  })
  .withRateLimiting({
    requestsPerMinute: 30,
    tokensPerMinute: 50_000,
    maxConcurrent: 3,
  })
  .withCircuitBreaker()   // auto-opens on provider errors; prevents cascading failures
  .build();

Key patterns

withCostTracking() — budget limits

.withCostTracking()
// Enables cost tracking with defaults:
// perRequest: $1.00, perSession: $5.00, daily: $25.00, monthly: $200.00

.withCostTracking({
  perRequest: 0.50,    // hard stop mid-request if cost would exceed this
  perSession: 5.0,
  daily: 25.0,         // daily limit (default $25.00)
  monthly: 200.0,
})

When a budget is exceeded, the agent throws a BudgetExceededError and stops. Daily/monthly budgets reset based on the timezone configured in .withGateway() (if used) or UTC by default.

withRateLimiting() — throughput caps

.withRateLimiting()
// Defaults: 60 RPM, 100,000 TPM, 10 concurrent requests

.withRateLimiting({
  requestsPerMinute: 60,     // max LLM requests per minute
  tokensPerMinute: 100_000,  // max tokens per minute (input + output)
  maxConcurrent: 10,         // max simultaneous in-flight LLM requests
})

Requests that exceed limits are queued (not dropped) — the agent waits for capacity before proceeding.

withCircuitBreaker() — provider reliability

.withCircuitBreaker()
// Default thresholds (open after 5 failures in 60s window, retry after 30s)

.withCircuitBreaker({
  failureThreshold: 5,       // open circuit after N consecutive failures
  windowMs: 60_000,          // failure counting window
  retryAfterMs: 30_000,      // wait before trying half-open probe
})

Circuit breaker states: closed (normal) → open (failing fast) → half-open (probing recovery).

Per-user cost isolation (multi-tenant)

// Create one agent per user/tenant with separate tracking contexts
const userAgent = await ReactiveAgents.create()
  .withProvider("anthropic")
  .withCostTracking({ perSession: 1.0, daily: 5.0 })
  .withName(`user-${userId}`)
  .withSystemPrompt(`You are assisting user ${userId}.`)
  .build();

// Or use per-request context injection:
const result = await agent.run(task, {
  context: { userId, tenantId },   // included in cost tracking metadata
});

Dynamic pricing (LiteLLM / custom providers)

import { createLiteLLMPricingProvider } from "@reactive-agents/llm-provider";

.withDynamicPricing(createLiteLLMPricingProvider())
// Fetches live model prices from LiteLLM pricing API
// Required when using models whose costs are not in the built-in price table

CostTrackingOptions reference

| Field | Type | Default | Notes | |-------|------|---------|-------| | perRequest | number | 1.00 | Max USD per single LLM request | | perSession | number | 5.00 | Max USD per agent.run() call | | daily | number | 20.00 | Max USD per calendar day | | monthly | number | 200.00 | Max USD per calendar month |

RateLimiterConfig reference

| Field | Type | Default | Notes | |-------|------|---------|-------| | requestsPerMinute | number | 60 | Max LLM requests/minute | | tokensPerMinute | number | 100_000 | Max tokens/minute (input + output) | | maxConcurrent | number | 10 | Max simultaneous in-flight requests |

Pitfalls

Budget limits are enforced per-process — multiple processes running the same agent each get their own daily/monthly counters; use an external store for true cross-process budget tracking
withCostTracking() with no args is still useful — it enables cost telemetry without enforcing limits (all defaults are generous)
withCircuitBreaker() opens on LLM provider errors, not on budget exceeded errors — they are independent systems
Rate limiting queues requests rather than dropping them — set maxConcurrent based on your provider's actual concurrency limits to avoid provider-side 429s
withDynamicPricing() makes an external HTTP call during build — ensure network access and handle build failures
Daily budget resets at midnight UTC by default — to use a different timezone, configure it via .withGateway({ timezone: "America/New_York" })

tylerjrbuell/cost-budget-enforcement

apps/docs/skills/cost-budget-enforcement/SKILL.md

Set per-request, per-session, daily, and monthly spend limits, configure rate limiting and circuit breakers, and isolate costs per user or tenant.

9 stars

testing

Updated May 6, 2026

$ install --global

skillsauth

npx skillsauth add tylerjrbuell/reactive-agents-ts cost-budget-enforcement

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: May 6, 2026, 2:34 AM59.7s1 file scanned

SKILL.md

name:: cost-budget-enforcement
description:: Set per-request, per-session, daily, and monthly spend limits, configure rate limiting and circuit breakers, and isolate costs per user or tenant.
compatibility:: Reactive Agents TypeScript projects using @reactive-agents/*
author:: reactive-agents
version:: 2.0
tier:: capability

Cost and Budget Enforcement

Agent objective

Produce a builder with cost tracking, budget limits, and rate limiting configured so the agent never exceeds defined spending thresholds.

When to load this skill

Deploying agents in production with real API costs
Building multi-tenant SaaS where per-user cost isolation matters
Protecting against runaway agent loops consuming excessive tokens
Adding circuit breakers for provider reliability

Implementation baseline

import { ReactiveAgents } from "@reactive-agents/runtime";

const agent = await ReactiveAgents.create()
  .withName("assistant")
  .withProvider("anthropic")
  .withReasoning({ defaultStrategy: "adaptive", maxIterations: 15 })
  .withTools({ allowedTools: ["web-search", "http-get", "checkpoint"] })
  .withCostTracking({
    perRequest: 0.25,   // max $0.25 per LLM call
    perSession: 2.0,    // max $2.00 per agent.run() call
    daily: 10.0,        // max $10.00/day across all sessions
    monthly: 100.0,     // max $100.00/month
  })
  .withRateLimiting({
    requestsPerMinute: 30,
    tokensPerMinute: 50_000,
    maxConcurrent: 3,
  })
  .withCircuitBreaker()   // auto-opens on provider errors; prevents cascading failures
  .build();

Key patterns

withCostTracking() — budget limits

.withCostTracking()
// Enables cost tracking with defaults:
// perRequest: $1.00, perSession: $5.00, daily: $25.00, monthly: $200.00

.withCostTracking({
  perRequest: 0.50,    // hard stop mid-request if cost would exceed this
  perSession: 5.0,
  daily: 25.0,         // daily limit (default $25.00)
  monthly: 200.0,
})

When a budget is exceeded, the agent throws a BudgetExceededError and stops. Daily/monthly budgets reset based on the timezone configured in .withGateway() (if used) or UTC by default.

withRateLimiting() — throughput caps

.withRateLimiting()
// Defaults: 60 RPM, 100,000 TPM, 10 concurrent requests

.withRateLimiting({
  requestsPerMinute: 60,     // max LLM requests per minute
  tokensPerMinute: 100_000,  // max tokens per minute (input + output)
  maxConcurrent: 10,         // max simultaneous in-flight LLM requests
})

Requests that exceed limits are queued (not dropped) — the agent waits for capacity before proceeding.

withCircuitBreaker() — provider reliability

.withCircuitBreaker()
// Default thresholds (open after 5 failures in 60s window, retry after 30s)

.withCircuitBreaker({
  failureThreshold: 5,       // open circuit after N consecutive failures
  windowMs: 60_000,          // failure counting window
  retryAfterMs: 30_000,      // wait before trying half-open probe
})

Circuit breaker states: closed (normal) → open (failing fast) → half-open (probing recovery).

Per-user cost isolation (multi-tenant)

// Create one agent per user/tenant with separate tracking contexts
const userAgent = await ReactiveAgents.create()
  .withProvider("anthropic")
  .withCostTracking({ perSession: 1.0, daily: 5.0 })
  .withName(`user-${userId}`)
  .withSystemPrompt(`You are assisting user ${userId}.`)
  .build();

// Or use per-request context injection:
const result = await agent.run(task, {
  context: { userId, tenantId },   // included in cost tracking metadata
});

Dynamic pricing (LiteLLM / custom providers)

import { createLiteLLMPricingProvider } from "@reactive-agents/llm-provider";

.withDynamicPricing(createLiteLLMPricingProvider())
// Fetches live model prices from LiteLLM pricing API
// Required when using models whose costs are not in the built-in price table

CostTrackingOptions reference

RateLimiterConfig reference

Pitfalls

Budget limits are enforced per-process — multiple processes running the same agent each get their own daily/monthly counters; use an external store for true cross-process budget tracking
withCostTracking() with no args is still useful — it enables cost telemetry without enforcing limits (all defaults are generous)
withCircuitBreaker() opens on LLM provider errors, not on budget exceeded errors — they are independent systems
Rate limiting queues requests rather than dropping them — set maxConcurrent based on your provider's actual concurrency limits to avoid provider-side 429s
withDynamicPricing() makes an external HTTP call during build — ensure network access and handle build failures
Daily budget resets at midnight UTC by default — to use a different timezone, configure it via .withGateway({ timezone: "America/New_York" })

Related Skills

tylerjrbuell/reactive-agents

development

VerifiedTrustedCommunity

Orient to the Reactive Agents framework, understand the builder API shape, and select the right capability skills for your task.

9SKILL.mdUpdated Apr 21, 2026

tylerjrbuell/reactive-agents

tylerjrbuell/quality-assurance

testing

VerifiedTrustedCommunity

Enable output verification (hallucination detection, semantic entropy, self-consistency), add post-run verification steps, and run LLM-scored evals across 5 quality dimensions.

9SKILL.mdUpdated Apr 21, 2026

tylerjrbuell/quality-assurance

tylerjrbuell/provider-patterns

data-ai

VerifiedTrustedCommunity

Configure per-provider behavior, understand streaming quirks, and use the 7-hook adapter system for optimal performance across LLM providers.

9SKILL.mdUpdated Apr 21, 2026

tylerjrbuell/provider-patterns

tylerjrbuell/memory-patterns

data-ai

VerifiedTrustedCommunity

Configure the 4-layer memory system with SQLite/FTS5/vec storage for persistent agent knowledge that survives sessions.

9SKILL.mdUpdated Apr 21, 2026

tylerjrbuell/memory-patterns

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/tylerjrbuell/reactive-agents-ts.git

# Copy into Claude Code skills folder (global)
cp -r reactive-agents-ts/apps/docs/skills/cost-budget-enforcement ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

tylerjrbuell/reactive-agents-ts

9 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT