Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

kazuph/browser-use-cli

Name: browser-use-cli
Author: kazuph

claude/skills/browser-use-cli/SKILL.md

npx skillsauth add kazuph/dotfiles browser-use-cli

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Browser Automation with browser-use CLI v2

This is one fast local browser option.

Use it when its index-based workflow fits the task well
Do not treat it as mandatory before every other browser tool
If the user specifically wants agent-browser, or wants to try agent-browser --engine lightpanda as an AI-only first pass, follow that request
If the task needs stronger visual proof or trace-like debugging, playwright-cli is often the better next tool

Quick Start

# Open URL in browser
browser-use open https://example.com

# Get current page state (URL, title, interactive elements with indices)
browser-use state

# Click element by index (from state output)
browser-use click 5

# Click by coordinates
browser-use click 150 300

# Type text (into focused element)
browser-use type "hello world"

# Type into specific element by index
browser-use input 3 "[email protected]"

# Take screenshot
browser-use screenshot /tmp/page.png
# Full page screenshot
browser-use screenshot --full /tmp/full.png
# Screenshot as base64 (no file)
browser-use screenshot

# Close session
browser-use close

Core Commands

Navigation

browser-use open https://example.com
browser-use back
browser-use scroll down
browser-use scroll up
browser-use scroll down --amount 500

Interaction

browser-use click 5                    # Click element by index
browser-use click 150 300              # Click by x y coordinates
browser-use dblclick 5                 # Double-click
browser-use rightclick 5               # Right-click
browser-use hover 5                    # Hover over element
browser-use type "search query"        # Type into focused element
browser-use input 3 "hello"            # Type into element by index
browser-use select 7 "option-value"    # Select dropdown option
browser-use keys "Enter"               # Send keyboard keys
browser-use keys "Control+a"           # Key combinations

Information Retrieval

browser-use state                      # Get URL, title, and interactive elements
browser-use get title                  # Page title
browser-use get text 5                 # Element text content
browser-use get html 5                 # Element HTML
browser-use get value 3                # Input element value
browser-use get attributes 5           # Element attributes
browser-use get bbox 5                 # Element bounding box
browser-use eval "document.title"      # Execute JavaScript
browser-use extract "all product names and prices"  # LLM-powered extraction

Screenshots

browser-use screenshot                          # Base64 output
browser-use screenshot /path/to/file.png        # Save to file
browser-use screenshot --full /path/to/full.png # Full page

Tabs

browser-use switch 1                   # Switch to tab by index
browser-use close-tab                  # Close current tab
browser-use close-tab 2               # Close specific tab

Wait

browser-use wait selector ".loading"   # Wait for CSS selector
browser-use wait text "Success"        # Wait for text to appear

Cookies

browser-use cookies get                # Get all cookies
browser-use cookies set name value     # Set cookie
browser-use cookies clear              # Clear all cookies
browser-use cookies export cookies.json
browser-use cookies import cookies.json

Sessions

browser-use --session mytest open https://example.com   # Named session
browser-use --session mytest state                       # Use same session
browser-use --session mytest close                       # Close session
browser-use sessions                                     # List active sessions

Global Options

--session NAME, -s NAME    # Session name (default: "default")
--browser {chromium,real,remote}  # Browser mode
--headed                   # Show browser window (useful for debugging)
--json                     # Output as JSON (for parsing)

Workflow: Web App Testing

# 1. Open the app
browser-use open http://localhost:3000

# 2. Inspect page state (get element indices)
browser-use state

# 3. Interact
browser-use input 2 "[email protected]"
browser-use input 3 "password123"
browser-use click 5

# 4. Verify result
browser-use wait text "Welcome"
browser-use state
browser-use screenshot .artifacts/feature/login-result.png

# 5. Cleanup
browser-use close

Workflow: Data Extraction

browser-use open https://example.com/products
browser-use extract "all product names, prices, and ratings as JSON"
browser-use close

Real Chrome Mode (with existing logins/cookies)

Use -b real to launch Chrome with your actual profile (cookies, logins preserved):

# Open with real Chrome (Default profile, headless)
browser-use -b real open https://x.com

# With visible browser window (recommended for interactive use)
browser-use --headed -b real open https://x.com

# Specific profile
browser-use -b real --profile "Profile 1" open https://gmail.com

# List available Chrome profiles
browser-use -b real profile list

Troubleshooting: DOMWatchdog / state errors

If browser-use state fails with "Expected at least one handler to return a non-None result":

# 1. Close session and kill daemon
browser-use close
pkill -f "browser-use"

# 2. Remove stale socket file
rm -f ~/.browser-use/default.sock

# 3. Restart with --headed -b real
browser-use --headed -b real open https://x.com

# 4. Verify
browser-use state

Root cause: daemon socket file gets corrupted, especially when switching between modes.

Chrome must NOT be running already

Real mode launches a NEW Chrome process with your profile. If Chrome is already running, the profile is locked and DOM access fails silently. Close Chrome first, or use a different profile.

Tips

state is your best friend - always call it after navigation to see clickable elements
Element indices from state are used in click, input, hover, select, etc.
Use --headed when debugging to see what the browser is doing
Sessions persist until explicitly closed - reuse them for multi-step flows
No API key needed for local browser automation (chromium mode)
If state/screenshot fails after mode switch, clean socket: rm -f ~/.browser-use/default.sock
When the user asks for a quick AI-led first-pass behavior check with agent-browser + Lightpanda, that is a valid alternative starting point

kazuph/browser-use-cli

claude/skills/browser-use-cli/SKILL.md

Browser automation via browser-use CLI v2. One fast local browser option for navigation, clicking, typing, screenshots, state inspection, JS eval, data extraction, and session management. Choose it when its workflow fits best; it is not the only valid first step.

15 stars

tools

Updated Apr 16, 2026

$ install --global

skillsauth

npx skillsauth add kazuph/dotfiles browser-use-cli

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 24, 2026, 8:52 PM1.7s1 file scanned

SKILL.md

name:: browser-use-cli
description:: Browser automation via browser-use CLI v2. One fast local browser option for navigation, clicking, typing, screenshots, state inspection, JS eval, data extraction, and session management. Choose it when its workflow fits best; it is not the only valid first step.
allowed-tools:: Bash(browser-use:*)

Browser Automation with browser-use CLI v2

This is one fast local browser option.

Use it when its index-based workflow fits the task well
Do not treat it as mandatory before every other browser tool
If the user specifically wants agent-browser, or wants to try agent-browser --engine lightpanda as an AI-only first pass, follow that request
If the task needs stronger visual proof or trace-like debugging, playwright-cli is often the better next tool

Quick Start

# Open URL in browser
browser-use open https://example.com

# Get current page state (URL, title, interactive elements with indices)
browser-use state

# Click element by index (from state output)
browser-use click 5

# Click by coordinates
browser-use click 150 300

# Type text (into focused element)
browser-use type "hello world"

# Type into specific element by index
browser-use input 3 "[email protected]"

# Take screenshot
browser-use screenshot /tmp/page.png
# Full page screenshot
browser-use screenshot --full /tmp/full.png
# Screenshot as base64 (no file)
browser-use screenshot

# Close session
browser-use close

Core Commands

Navigation

browser-use open https://example.com
browser-use back
browser-use scroll down
browser-use scroll up
browser-use scroll down --amount 500

Interaction

browser-use click 5                    # Click element by index
browser-use click 150 300              # Click by x y coordinates
browser-use dblclick 5                 # Double-click
browser-use rightclick 5               # Right-click
browser-use hover 5                    # Hover over element
browser-use type "search query"        # Type into focused element
browser-use input 3 "hello"            # Type into element by index
browser-use select 7 "option-value"    # Select dropdown option
browser-use keys "Enter"               # Send keyboard keys
browser-use keys "Control+a"           # Key combinations

Information Retrieval

browser-use state                      # Get URL, title, and interactive elements
browser-use get title                  # Page title
browser-use get text 5                 # Element text content
browser-use get html 5                 # Element HTML
browser-use get value 3                # Input element value
browser-use get attributes 5           # Element attributes
browser-use get bbox 5                 # Element bounding box
browser-use eval "document.title"      # Execute JavaScript
browser-use extract "all product names and prices"  # LLM-powered extraction

Screenshots

browser-use screenshot                          # Base64 output
browser-use screenshot /path/to/file.png        # Save to file
browser-use screenshot --full /path/to/full.png # Full page

Tabs

browser-use switch 1                   # Switch to tab by index
browser-use close-tab                  # Close current tab
browser-use close-tab 2               # Close specific tab

Wait

browser-use wait selector ".loading"   # Wait for CSS selector
browser-use wait text "Success"        # Wait for text to appear

Cookies

browser-use cookies get                # Get all cookies
browser-use cookies set name value     # Set cookie
browser-use cookies clear              # Clear all cookies
browser-use cookies export cookies.json
browser-use cookies import cookies.json

Sessions

browser-use --session mytest open https://example.com   # Named session
browser-use --session mytest state                       # Use same session
browser-use --session mytest close                       # Close session
browser-use sessions                                     # List active sessions

Global Options

--session NAME, -s NAME    # Session name (default: "default")
--browser {chromium,real,remote}  # Browser mode
--headed                   # Show browser window (useful for debugging)
--json                     # Output as JSON (for parsing)

Workflow: Web App Testing

# 1. Open the app
browser-use open http://localhost:3000

# 2. Inspect page state (get element indices)
browser-use state

# 3. Interact
browser-use input 2 "[email protected]"
browser-use input 3 "password123"
browser-use click 5

# 4. Verify result
browser-use wait text "Welcome"
browser-use state
browser-use screenshot .artifacts/feature/login-result.png

# 5. Cleanup
browser-use close

Workflow: Data Extraction

browser-use open https://example.com/products
browser-use extract "all product names, prices, and ratings as JSON"
browser-use close

Real Chrome Mode (with existing logins/cookies)

Use -b real to launch Chrome with your actual profile (cookies, logins preserved):

# Open with real Chrome (Default profile, headless)
browser-use -b real open https://x.com

# With visible browser window (recommended for interactive use)
browser-use --headed -b real open https://x.com

# Specific profile
browser-use -b real --profile "Profile 1" open https://gmail.com

# List available Chrome profiles
browser-use -b real profile list

Troubleshooting: DOMWatchdog / state errors

If browser-use state fails with "Expected at least one handler to return a non-None result":

# 1. Close session and kill daemon
browser-use close
pkill -f "browser-use"

# 2. Remove stale socket file
rm -f ~/.browser-use/default.sock

# 3. Restart with --headed -b real
browser-use --headed -b real open https://x.com

# 4. Verify
browser-use state

Root cause: daemon socket file gets corrupted, especially when switching between modes.

Chrome must NOT be running already

Real mode launches a NEW Chrome process with your profile. If Chrome is already running, the profile is locked and DOM access fails silently. Close Chrome first, or use a different profile.

Tips

state is your best friend - always call it after navigation to see clickable elements
Element indices from state are used in click, input, hover, select, etc.
Use --headed when debugging to see what the browser is doing
Sessions persist until explicitly closed - reuse them for multi-step flows
No API key needed for local browser automation (chromium mode)
If state/screenshot fails after mode switch, clean socket: rm -f ~/.browser-use/default.sock
When the user asks for a quick AI-led first-pass behavior check with agent-browser + Lightpanda, that is a valid alternative starting point

Related Skills

kazuph/xurl

tools

VerifiedTrustedCommunity

X (Twitter) API read-only CLI. Bookmarks retrieval, tweet search, engagement analytics (likes/RT aggregation), mentions, user lookup. Use when: reading X bookmarks, searching tweets, aggregating likes/retweets, checking mentions, looking up users. Triggers: bookmark, bookmarks, X search, Twitter search, likes count, RT count, engagement, tweet analytics.

15SKILL.mdUpdated Apr 19, 2026

kazuph/unit-test-tatsujin

testing

VerifiedTrustedCommunity

単体テスト方針の要約。Kiro流で使うときは本文を必ず参照・展開する。

15SKILL.mdUpdated Apr 19, 2026

kazuph/unit-test-tatsujin

kazuph/tmux-pane-commander

tools

VerifiedTrustedCommunity

Send prompts to other AI CLIs (Codex, Claude Code) running in sibling tmux panes and receive results back. Use this skill when the user asks to send a question or task to Codex or another Claude Code instance in a tmux pane. Handles pane discovery, CLI startup if needed, prompt delivery with proper Enter timing, delivery verification, and result return via tmux send-keys.

15SKILL.mdUpdated Apr 19, 2026

kazuph/tmux-pane-commander

kazuph/takt

data-ai

VerifiedTrustedCommunity

TAKT ピースエンジン。Agent Team を使ったマルチエージェントオーケストレーション。ピースYAMLワークフローに従ってマルチエージェントを実行する。

15SKILL.mdUpdated Apr 16, 2026

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/kazuph/dotfiles.git

# Copy into Claude Code skills folder (global)
cp -r dotfiles/claude/skills/browser-use-cli ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

kazuph/dotfiles

15 stars

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT