skills/sandbox-agent/SKILL.md
Delegate AI tasks to Claude agents running in isolated Buddy Sandbox environments. Use when user asks to "delegate task", "run task in sandbox", "parallel agent execution", "isolated AI task", "YOLO mode", "sub-agent", "compare solutions", "multi-agent", or mentions running AI tasks in sandboxes.
npx skillsauth add sztwiorok/buddy-plugin sandbox-agentInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Delegate AI tasks to Claude agents running in isolated Buddy Sandbox environments. Enable YOLO-mode execution, parallel processing, and multi-agent comparison.
STOP: Before delegating ANY task, you MUST ask the user these questions using AskUserQuestion tool.
"How should I create the sandbox for this task?"
Options:
"Any setup commands to run before the main task?"
Options:
"Run this task on multiple sandboxes?"
Options:
DO NOT skip these questions. DO NOT proceed until user has made choices.
# Fresh sandbox
bdy sandbox create -i <name> --resources 4x8 --wait-for-configured
# From snapshot
bdy sandbox create -i <name> --snapshot <snapshot-name> --resources 4x8 --wait-for-configured
# With install commands
bdy sandbox create -i <name> --resources 4x8 \
--install-command "apt-get update && apt-get install -y nodejs npm" \
--wait-for-configured
Resources: 2x4 (light), 4x8 (standard), 8x16 (heavy). Format: CPUxRAM.
# Standard execution
bdy sandbox exec command <sandbox-id> "sudo -u claude -i -- claude --model=opus --dangerously-skip-permissions -p \"YOUR TASK HERE\""
# With streaming output for live monitoring
bdy sandbox exec command <sandbox-id> "sudo -u claude -i -- claude--model=opus --dangerously-skip-permissions --output-format stream-json -p \"YOUR TASK HERE\""
Important flags:
sudo -u claude -i - Switch to claude user with login shell--dangerously-skip-permissions - Enable YOLO mode (safe in isolated sandbox)--model=opus - Use Opus model for faster responses-p "..." - Pass task promptbdy sandbox exec command <sandbox-id> "sudo -u claude -i -- claude --model=opus --dangerously-skip-permissions -c"
The -c flag continues the most recent session, maintaining full context.
# List all commands (get command IDs)
bdy sandbox exec list <sandbox-id>
# Check status (PENDING/INPROGRESS/SUCCESS/FAILED)
bdy sandbox exec status <sandbox-id> <command-id>
# Read current output (can poll repeatedly for live progress)
bdy sandbox exec logs <sandbox-id> <command-id>
# Wait for completion and get full output
bdy sandbox exec logs <sandbox-id> <command-id> --wait
# View file created by sub-agent
bdy sandbox exec command <sandbox-id> "cat /path/to/result" --wait
# List directory
bdy sandbox exec command <sandbox-id> "ls -la /path/to/dir" --wait
Copy files to local machine: See sandbox skill → "Copy Files from Sandbox" section for detailed methods (small files, large files, directories).
# Kill running task
bdy sandbox exec kill <sandbox-id> <command-id>
# Destroy sandbox
bdy sandbox destroy <sandbox-id>
Main agent can monitor progress by polling logs:
# Start task
bdy sandbox exec command my-sandbox "sudo -u claude -i -- claude --model=opus --dangerously-skip-permissions -p \"Complex task...\""
# Returns command ID
# Poll for progress (repeat as needed)
bdy sandbox exec status my-sandbox <cmd-id> # Check if still INPROGRESS
bdy sandbox exec logs my-sandbox <cmd-id> # Read current output
# When status is SUCCESS/FAILED, get final output
bdy sandbox exec logs my-sandbox <cmd-id> --wait
For comparing solutions or getting multiple perspectives:
# 1. Create multiple sandboxes in parallel
bdy sandbox create -i agent-1 --resources 4x8 --wait-for-configured &
bdy sandbox create -i agent-2 --resources 4x8 --wait-for-configured &
bdy sandbox create -i agent-3 --resources 4x8 --wait-for-configured &
wait
# 2. Delegate tasks (same or with different focus)
bdy sandbox exec command agent-1 "sudo -u claude -i -- claude --model=opus --dangerously-skip-permissions -p \"Task with focus A\""
bdy sandbox exec command agent-2 "sudo -u claude -i -- claude --model=opus --dangerously-skip-permissions -p \"Task with focus B\""
bdy sandbox exec command agent-3 "sudo -u claude -i -- claude --model=opus --dangerously-skip-permissions -p \"Task with focus C\""
# 3. Wait for all and collect results
bdy sandbox exec logs agent-1 <cmd-id> --wait
bdy sandbox exec logs agent-2 <cmd-id> --wait
bdy sandbox exec logs agent-3 <cmd-id> --wait
# 4. Main agent compares/aggregates results
# 5. Cleanup
bdy sandbox destroy agent-1 && bdy sandbox destroy agent-2 && bdy sandbox destroy agent-3
If any bdy command returns Token not provided or Not logged in, ask the user to run bdy login in a separate terminal (interactive browser auth — AI cannot do it).
Claude-enabled Sandbox Required: The sudo -u claude -i -- claude command requires a sandbox with Claude Code pre-installed. Use a snapshot that has Claude configured, or create one with the Claude installation script.
For advanced sandbox features, see the sandbox skill:
bdy sandbox endpoint add)bdy sandbox snapshot create/list)tools
Deploy apps to Buddy sandboxes, publish artifacts, create tunnels, manage domains, configure distributions, and run CI/CD pipelines using `bdy` CLI. Triggers on: deploy, sandbox, tunnel, expose, artifact, static site, publish, domain, distribution, pipeline, CI/CD.
development
This skill should be used when the user asks to "expose localhost", "create tunnel", "share local server", "test webhooks locally", "tunnel to localhost", "make local dev accessible", "public URL for localhost", or mentions exposing local services, testing webhooks, or sharing development servers.
development
Deploy static websites to Buddy Packages. Use when user asks to "deploy static site", "host static files", "publish website", "upload build artifacts", or mentions deploying pre-built HTML/CSS/JS files, SPA builds, or static exports.
testing
Deploy and test applications in Buddy Sandbox cloud environments. Use when user asks about "deploy app", "create sandbox", "test in cloud", "isolated environment", "remote environment", "run app in sandbox", or mentions deploying, testing, or running applications in cloud sandboxes.