Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

tylerjrbuell/shell-execution-sandbox

Name: shell-execution-sandbox
Author: tylerjrbuell

apps/docs/skills/shell-execution-sandbox/SKILL.md

npx skillsauth add tylerjrbuell/reactive-agents-ts shell-execution-sandbox

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Shell Execution Sandbox

Disclaimer — your machine, your risk. shell-execute runs real processes on the host (or in an optional Docker sandbox you configure). Allowlists and blocklists reduce accidents but are not a guarantee. Only enable for trusted codebases and accounts; review allowed command names and working-directory rules before production use. Cortex exposes this as an explicit opt-in in the Lab builder with the same warning.

Agent objective

Produce a builder with shell execution enabled, the correct allowlist for the task, and appropriate safety config — without exposing destructive commands.

When to load this skill

Agent needs to run terminal commands (git, file operations, build tools)
Agent generates and executes code (Node, Bun, Python)
Task requires reading directory structure, running tests, or processing files
CI/CD or automation agent workflows

Implementation baseline

const agent = await ReactiveAgents.create()
  .withProvider("anthropic")
  .withReasoning({ defaultStrategy: "plan-execute-reflect", maxIterations: 15 })
  .withTools({
    allowedTools: ["shell-execute", "file-read", "checkpoint"],
    terminal: true, // registers shell-execute handler (or use .withTerminalTools())
  })
  .withSystemPrompt(`
    You have access to a shell. Use it to explore the codebase and run commands.
    Always checkpoint important findings before continuing.
  `)
  .build();

Default allowlist

The shell-execute tool blocks any command not on the allowlist. Default allowed commands:

git, ls, cat, grep, find, echo, printf
mkdir, cp, mv, touch
wc, head, tail, sort, uniq, cut, tr, tee, diff, sed, awk, jq
pwd, date, which, basename, dirname, test, true, false
seq, gzip, gunzip, zip, unzip

Explicitly excluded: rm, chmod, chown — too destructive for agent sandboxes.

Key patterns

Opt-in commands for build tasks

Build tools (Node, Bun, npm, Python, curl) are available but not on by default:

// Available opt-in commands: node, bun, npm, npx, python, python3, curl, env, xargs, tar
// Add via ShellExecuteConfig.additionalCommands when registering the tool:
import { shellExecuteTool, shellExecuteHandler } from "@reactive-agents/tools";

const shellTool = {
  definition: shellExecuteTool,
  handler: shellExecuteHandler({
    additionalCommands: ["bun", "node", "npm"],
    timeoutMs: 60_000,        // default 30s — increase for build commands
    maxOutputChars: 8_000,    // default 4000
    cwd: "/workspace",        // default to project root
  }),
};

const agent = await ReactiveAgents.create()
  .withTools({ tools: [shellTool], allowedTools: ["shell-execute"] })
  .build();

Docker-isolated code execution

When dockerEscalation is enabled, inline code (Node --eval, Bun -e, Python -c) automatically routes through a Docker sandbox:

shellExecuteHandler({
  additionalCommands: ["node", "python3"],
  dockerEscalation: {
    enabled: true,
    // Inline code execution is fully isolated in a fresh container
  },
})

Read-only shell (safest config)

shellExecuteHandler({
  allowedCommands: ["ls", "cat", "grep", "find", "head", "tail", "wc"],
  // Only listing and reading — no writes, no execution
})

Audit logging

shellExecuteHandler({
  onAudit: (entry: ShellAuditEntry) => {
    logger.info("shell-execute", {
      command: entry.command,
      exitCode: entry.exitCode,
      durationMs: entry.durationMs,
    });
  },
})

Shell tool properties

The shell-execute built-in tool has these characteristics:

| Property | Value | |----------|-------| | riskLevel | "high" | | requiresApproval | true | | category | "system" | | timeoutMs (default) | 30,000ms | | maxOutputChars (default) | 4,000 chars | | MAX_COMMAND_LENGTH | 4,096 chars |

Builder API reference

| Method | Key params | Notes | |--------|-----------|-------| | .withTools({ tools, allowedTools }) | include "shell-execute" | Register custom handler for config | | .withTools() | no args | Enables shell-execute but with requiresApproval: true |

Pitfalls

shell-execute has requiresApproval: true by default — in automated pipelines, register a custom handler with requiresApproval: false if human approval flow is not wired
Commands are allowlisted by executable name only (first word) — git is allowed regardless of sub-command args; curl is opt-in
MAX_COMMAND_LENGTH is 4,096 — very long piped commands will be rejected
Docker daemon must be running for dockerEscalation — check before enabling in CI
rm, chmod, chown are hard-excluded and cannot be added via additionalCommands
maxOutputChars: 4000 truncates long output — increase for commands that produce large output (e.g., git log, find on large trees)

tylerjrbuell/shell-execution-sandbox

apps/docs/skills/shell-execution-sandbox/SKILL.md

Enable and configure the sandboxed shell execution tool with command allowlists, Docker isolation, and audit logging for agents that run terminal commands.

8 stars

tools

Updated Apr 22, 2026

$ install --global

skillsauth

npx skillsauth add tylerjrbuell/reactive-agents-ts shell-execution-sandbox

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 22, 2026, 6:43 AM127.2s1 file scanned

SKILL.md

name:: shell-execution-sandbox
description:: Enable and configure the sandboxed shell execution tool with command allowlists, Docker isolation, and audit logging for agents that run terminal commands.
compatibility:: Reactive Agents TypeScript projects using @reactive-agents/*
author:: reactive-agents
version:: 2.0
tier:: capability

Shell Execution Sandbox

Disclaimer — your machine, your risk. shell-execute runs real processes on the host (or in an optional Docker sandbox you configure). Allowlists and blocklists reduce accidents but are not a guarantee. Only enable for trusted codebases and accounts; review allowed command names and working-directory rules before production use. Cortex exposes this as an explicit opt-in in the Lab builder with the same warning.

Agent objective

Produce a builder with shell execution enabled, the correct allowlist for the task, and appropriate safety config — without exposing destructive commands.

When to load this skill

Agent needs to run terminal commands (git, file operations, build tools)
Agent generates and executes code (Node, Bun, Python)
Task requires reading directory structure, running tests, or processing files
CI/CD or automation agent workflows

Implementation baseline

const agent = await ReactiveAgents.create()
  .withProvider("anthropic")
  .withReasoning({ defaultStrategy: "plan-execute-reflect", maxIterations: 15 })
  .withTools({
    allowedTools: ["shell-execute", "file-read", "checkpoint"],
    terminal: true, // registers shell-execute handler (or use .withTerminalTools())
  })
  .withSystemPrompt(`
    You have access to a shell. Use it to explore the codebase and run commands.
    Always checkpoint important findings before continuing.
  `)
  .build();

Default allowlist

The shell-execute tool blocks any command not on the allowlist. Default allowed commands:

git, ls, cat, grep, find, echo, printf
mkdir, cp, mv, touch
wc, head, tail, sort, uniq, cut, tr, tee, diff, sed, awk, jq
pwd, date, which, basename, dirname, test, true, false
seq, gzip, gunzip, zip, unzip

Explicitly excluded: rm, chmod, chown — too destructive for agent sandboxes.

Key patterns

Opt-in commands for build tasks

Build tools (Node, Bun, npm, Python, curl) are available but not on by default:

// Available opt-in commands: node, bun, npm, npx, python, python3, curl, env, xargs, tar
// Add via ShellExecuteConfig.additionalCommands when registering the tool:
import { shellExecuteTool, shellExecuteHandler } from "@reactive-agents/tools";

const shellTool = {
  definition: shellExecuteTool,
  handler: shellExecuteHandler({
    additionalCommands: ["bun", "node", "npm"],
    timeoutMs: 60_000,        // default 30s — increase for build commands
    maxOutputChars: 8_000,    // default 4000
    cwd: "/workspace",        // default to project root
  }),
};

const agent = await ReactiveAgents.create()
  .withTools({ tools: [shellTool], allowedTools: ["shell-execute"] })
  .build();

Docker-isolated code execution

When dockerEscalation is enabled, inline code (Node --eval, Bun -e, Python -c) automatically routes through a Docker sandbox:

shellExecuteHandler({
  additionalCommands: ["node", "python3"],
  dockerEscalation: {
    enabled: true,
    // Inline code execution is fully isolated in a fresh container
  },
})

Read-only shell (safest config)

shellExecuteHandler({
  allowedCommands: ["ls", "cat", "grep", "find", "head", "tail", "wc"],
  // Only listing and reading — no writes, no execution
})

Audit logging

shellExecuteHandler({
  onAudit: (entry: ShellAuditEntry) => {
    logger.info("shell-execute", {
      command: entry.command,
      exitCode: entry.exitCode,
      durationMs: entry.durationMs,
    });
  },
})

Shell tool properties

The shell-execute built-in tool has these characteristics:

Builder API reference

Pitfalls

shell-execute has requiresApproval: true by default — in automated pipelines, register a custom handler with requiresApproval: false if human approval flow is not wired
Commands are allowlisted by executable name only (first word) — git is allowed regardless of sub-command args; curl is opt-in
MAX_COMMAND_LENGTH is 4,096 — very long piped commands will be rejected
Docker daemon must be running for dockerEscalation — check before enabling in CI
rm, chmod, chown are hard-excluded and cannot be added via additionalCommands
maxOutputChars: 4000 truncates long output — increase for commands that produce large output (e.g., git log, find on large trees)

Related Skills

tylerjrbuell/reactive-agents

development

VerifiedTrustedCommunity

Orient to the Reactive Agents framework, understand the builder API shape, and select the right capability skills for your task.

9SKILL.mdUpdated Apr 21, 2026

tylerjrbuell/reactive-agents

tylerjrbuell/quality-assurance

testing

VerifiedTrustedCommunity

Enable output verification (hallucination detection, semantic entropy, self-consistency), add post-run verification steps, and run LLM-scored evals across 5 quality dimensions.

9SKILL.mdUpdated Apr 21, 2026

tylerjrbuell/quality-assurance

tylerjrbuell/provider-patterns

data-ai

VerifiedTrustedCommunity

Configure per-provider behavior, understand streaming quirks, and use the 7-hook adapter system for optimal performance across LLM providers.

9SKILL.mdUpdated Apr 21, 2026

tylerjrbuell/provider-patterns

tylerjrbuell/memory-patterns

data-ai

VerifiedTrustedCommunity

Configure the 4-layer memory system with SQLite/FTS5/vec storage for persistent agent knowledge that survives sessions.

9SKILL.mdUpdated Apr 21, 2026

tylerjrbuell/memory-patterns

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/tylerjrbuell/reactive-agents-ts.git

# Copy into Claude Code skills folder (global)
cp -r reactive-agents-ts/apps/docs/skills/shell-execution-sandbox ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

tylerjrbuell/reactive-agents-ts

8 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT