skills/browser/SKILL.md
Edge browser automation via Chrome DevTools Protocol (CDP). List tabs, navigate, take screenshots, extract page content/HTML, execute JavaScript, click elements, type text, fill forms, scroll. Use when needing to control Edge browser, scrape web content, automate web forms, or take browser screenshots on Windows. Requires Edge with --remote-debugging-port=9222.
npx skillsauth add felipemsilva/powerskills powerskills-browserInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Edge browser automation via CDP (Chrome DevTools Protocol).
Start-Process "msedge" -ArgumentList "--remote-debugging-port=9222"
config.json (edge_debug_port).\powerskills.ps1 browser <action> [--params]
| Action | Params | Description |
|--------|--------|-------------|
| tabs | | List open browser tabs |
| navigate | --url URL | Navigate to URL |
| screenshot | --out-file path.png [--target-id id] | Capture page as PNG |
| content | [--target-id id] | Get page text content |
| html | [--target-id id] | Get full page HTML |
| evaluate | --expression "js" | Execute JavaScript expression |
| click | --selector "#btn" | Click element by CSS selector |
| type | --selector "#input" --text "hello" | Type into element |
| new-tab | --url URL | Open new tab |
| close-tab | --target-id id | Close tab by ID |
| scroll | --scroll-target top\|bottom\|selector | Scroll page |
| fill | --fields-json '[{"selector":"#a","value":"b"}]' | Fill multiple form fields |
| wait | --seconds N | Wait N seconds (default: 3) |
# List open tabs
.\powerskills.ps1 browser tabs
# Navigate and screenshot
.\powerskills.ps1 browser navigate --url "https://example.com"
.\powerskills.ps1 browser screenshot --out-file page.png
# Extract page text
.\powerskills.ps1 browser content
# Run JavaScript
.\powerskills.ps1 browser evaluate --expression "document.title"
# Fill a login form
.\powerskills.ps1 browser fill --fields-json '[{"selector":"#user","value":"alex"},{"selector":"#pass","value":"secret","submit":"#login"}]'
Pass --target-id (from tabs output) to operate on a specific tab. Without it, actions target the first page.
JSON array of objects with selector, value, and optional submit:
[
{"selector": "#search-input", "value": "PowerShell automation"},
{"selector": "#filter-type", "value": "recent", "submit": "#apply-btn"}
]
Supports text inputs, selects, and checkboxes. Last field can include submit to click a button.
testing
Windows system commands and info via PowerShell. Execute shell commands, get system info (hostname, OS, uptime), list top processes, read environment variables. Use when needing to run commands, check system status, or inspect the Windows environment.
tools
Outlook email and calendar automation via COM. Read inbox, unread, sent items. Search emails. Send, reply, draft. List calendar events and mail folders. Use when needing to check work email, read/send Outlook messages, search mail, or view calendar. Requires Outlook desktop app on Windows.
tools
Windows desktop automation. Take full-screen or window screenshots, list/focus/minimize/maximize windows, send keystrokes, launch applications. Use when needing to capture the Windows screen, manage windows, send keyboard input, or start programs.
tools
Windows automation toolkit for AI agents. Provides Outlook email/calendar, Edge browser (CDP), desktop screenshots/window management, and shell commands via PowerShell. Install this for the full suite, or install individual sub-skills (powerskills-outlook, powerskills-browser, powerskills-desktop, powerskills-system) separately.