experiment/skills/agent-browser/SKILL.md
Automates browser workflows — visiting URLs, clicking, filling forms, scraping, and taking screenshots. Use when automating web interactions or UI testing.
npx skillsauth add tim-hub/powerball agent-browserInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
A skill for browser automation. Uses the agent-browser CLI to perform UI debugging, verification, and automated operations.
| Feature | Details | |---------|--------| | Browser Automation | See references/browser-automation.md | | AI Snapshot Workflow | See references/ai-snapshot-workflow.md |
| Command | Description |
|---------|-------------|
| open <url> | Open a URL |
| snapshot -i -c | AI-friendly snapshot |
| click @e1 | Click an element |
| fill @e1 "text" | Fill a form field |
| type @e1 "text" | Type text |
| press Enter | Press a key |
| screenshot [path] | Take a screenshot |
| close | Close the browser |
| Command | Description |
|---------|-------------|
| back | Go back |
| forward | Go forward |
| reload | Reload |
| Command | Description |
|---------|-------------|
| get text @e1 | Get text |
| get html @e1 | Get HTML |
| get url | Current URL |
| get title | Page title |
| Command | Description |
|---------|-------------|
| wait @e1 | Wait for an element |
| wait 1000 | Wait 1 second |
| Command | Description |
|---------|-------------|
| --headed | Show the browser |
| console | Console logs |
| errors | Page errors |
| highlight @e1 | Highlight an element |
# Check installation
which agent-browser
# If not installed
npm install -g agent-browser
agent-browser install
| Request Type | Corresponding Action |
|-------------|---------------------|
| Open a URL | agent-browser open <url> |
| Click an element | Snapshot → agent-browser click @ref |
| Fill a form | Snapshot → agent-browser fill @ref "text" |
| Check state | agent-browser snapshot -i -c |
| Screenshot | agent-browser screenshot <path> |
| Debug | agent-browser --headed open <url> |
For most operations, first take a snapshot and then interact using element references:
# 1. Open the page
agent-browser open https://example.com
# 2. Take a snapshot (AI-friendly, interactive elements only)
agent-browser snapshot -i -c
# Example output:
# - link "Home" [ref=e1]
# - button "Login" [ref=e2]
# - input "Email" [ref=e3]
# - input "Password" [ref=e4]
# - button "Submit" [ref=e5]
# 3. Interact using element references
agent-browser click @e2 # Click the Login button
agent-browser fill @e3 "[email protected]"
agent-browser fill @e4 "password123"
agent-browser click @e5 # Submit
# Check current state with a snapshot
agent-browser snapshot -i -c
# Or check the URL
agent-browser get url
# Take a screenshot
agent-browser screenshot result.png
Manage multiple tabs/sessions in parallel:
# Specify a session
agent-browser --session admin open https://admin.example.com
agent-browser --session user open https://example.com
# List sessions
agent-browser session list
# Operate within a specific session
agent-browser --session admin snapshot -i -c
| Tool | Recommendation | Use Case | |------|---------------|----------| | agent-browser | ★★★ | First choice. Powerful AI-friendly snapshots | | chrome-devtools MCP | ★★☆ | When Chrome is already open | | playwright MCP | ★★☆ | Complex E2E testing |
Principle: Try agent-browser first; use MCP tools only if it does not work.
--headed option to display the browserclosetesting
Picks the right Kubernetes Deployment update strategy (RollingUpdate / Recreate / Blue-Green / Canary) for the situation. Use when configuring a new Deployment, changing rollout config, or deciding how to ship a risky change.
tools
Translates a markdown file to a target language, preserving structure. Use when the user needs a markdown file translated.
development
Translates text between any two languages while preserving source format. Use when the user needs to translate plain text, code, or markdown content.
testing
Generates written content — blog posts, social posts, emails, and marketing copy — matched to the project's existing voice. Use when the user needs written material.