skills/built-in/SKILL.md
Drive browser tabs with dispatchAction for navigation, tab orchestration, snapshotting, element queries/interactions, visual capture, cookies/storage, debugger CDP access, and lightweight session recording. Use when tasks require direct browser automation in the current Chrome tab context.
Install this skill globally with one command: `npx skillsauth add litingyes/pilo built-in-browser-automation`. Works with Claude Code, Cursor, and Windsurf.
This is the root index for all built-in browser automation capabilities. Use this skill for any task that requires interacting with web pages, controlling browser tabs, or automating web workflows.
| Category | Description | Reference File |
|----------|-------------|----------------|
| Browser Orchestration | Navigate pages, read page info, manage tabs, and check debugger connectivity | references/browser-orchestration.md |
| Snapshot & Reference Management | Capture page structure, resolve element references, manage tab state | references/snapshot-refs.md |
| Element Query & Read Operations | Extract text, attributes, state, and properties from DOM elements | references/element-queries.md |
| Element Interaction | Click, type, fill forms, scroll, and manipulate page elements | references/element-interactions.md |
| Visual Operations | Capture screenshots and visual evidence | references/visual-operations.md |
| Cookie Management | Read, set, and clear browser cookies | references/cookie-management.md |
| Storage Management | Access and modify localStorage and sessionStorage | references/storage-management.md |
| Debugger & CDP | Attach debugger and send raw Chrome DevTools Protocol commands | references/debugger-cdp.md |
| Session Recording | Record and replay browser interaction sessions | references/recording.md |
- Ensure `typeId` is set to the target browser tab ID
- Use `NAVIGATE` for page loads instead of raw `SEND_CDP` calls
- Use `selectorOrRef` values from a recent `GET_SNAPSHOT` result
- Treat `SEND_CDP` as high-risk - only use it when standard actions cannot solve the task

flowchart LR
A[Start] --> B[NAVIGATE or TAB_SWITCH]
B --> C[GET_SNAPSHOT to capture page state]
C --> D[Query element state/attributes]
D --> E[Execute interactions]
E --> F[GET_SNAPSHOT to verify changes]
F --> G{Task complete?}
G -->|No| D
G -->|Yes| H[TAKE_SCREENSHOT if evidence needed]
H --> I[End]
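
The flow above can be sketched as a small driver loop. This is a minimal illustration, not the skill's actual API: the `Action` shape, the `Dispatch` signature, the `payload` field, and the `runWorkflow` helper are all assumptions made for this example. Only the action names (`NAVIGATE`, `GET_SNAPSHOT`, `TAKE_SCREENSHOT`) and the `typeId` field come from this document; the real parameters are specified in the reference files.

```typescript
// Hypothetical shapes: the real dispatchAction signature and payload fields
// are defined in the reference files, not in this sketch.
type Action = { type: string; typeId: number; payload?: Record<string, unknown> };
type Dispatch = (action: Action) => Promise<unknown>;

async function runWorkflow(
  dispatch: Dispatch,
  tabId: number,
  url: string,
  steps: Action[],                        // query/interaction actions chosen by the caller
  isDone: (snapshot: unknown) => boolean, // caller-supplied completion check
  wantEvidence = true,
  maxRounds = 3,                          // bound the loop so a stuck page cannot spin forever
): Promise<boolean> {
  // Start: load the page, then capture its initial state.
  await dispatch({ type: "NAVIGATE", typeId: tabId, payload: { url } });
  await dispatch({ type: "GET_SNAPSHOT", typeId: tabId });

  for (let round = 0; round < maxRounds; round++) {
    // Query element state and execute interactions.
    for (const step of steps) {
      await dispatch(step);
    }
    // Re-snapshot to verify that the interactions changed the page.
    const snapshot = await dispatch({ type: "GET_SNAPSHOT", typeId: tabId });
    if (isDone(snapshot)) {
      if (wantEvidence) {
        await dispatch({ type: "TAKE_SCREENSHOT", typeId: tabId });
      }
      return true;
    }
  }
  return false; // task not complete within maxRounds
}
```

Passing `dispatch` in as a parameter keeps the loop independent of how actions are actually delivered, so the same function can be exercised with a stub in tests.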
- Try standard actions (`NAVIGATE`, `TAB_*`, `GET_*`) before falling back to `SEND_CDP`
- Avoid `SEND_CDP` commands unless absolutely necessary - prefer higher-level actions

For detailed usage examples and parameter specifications, refer to the individual category documentation files linked above.
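
One way to enforce the guidelines on `typeId` and `SEND_CDP` is a pre-flight check that runs before an action is dispatched. The sketch below is an assumption-laden illustration: the `ActionRequest` shape, the `checkRequest` helper, and the `allowCdp` opt-in flag are hypothetical names, not part of the skill.

```typescript
// Hypothetical request shape; only the action names and `typeId` come from this document.
type ActionRequest = { type: string; typeId?: number; payload?: Record<string, unknown> };

// Returns a list of problems; an empty list means the request passes these checks.
function checkRequest(req: ActionRequest, allowCdp = false): string[] {
  const problems: string[] = [];
  if (typeof req.typeId !== "number") {
    problems.push("typeId must be set to the target browser tab ID");
  }
  if (req.type === "SEND_CDP" && !allowCdp) {
    // Raw CDP is treated as high-risk, so it requires an explicit opt-in.
    problems.push("SEND_CDP is high-risk; try standard actions first or opt in explicitly");
  }
  return problems;
}

// Example: a raw CDP call without opt-in is flagged, a NAVIGATE call is not.
const flagged = checkRequest({ type: "SEND_CDP", typeId: 7 });
const clean = checkRequest({ type: "NAVIGATE", typeId: 7 });
console.log(flagged.length, clean.length);
```

Returning a list of problems instead of throwing lets the caller decide whether to abort, log, or ask for confirmation.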