skills/playwright-py-skill/SKILL.md
Complete browser automation with Playwright. Auto-detects dev servers, writes clean test scripts to /tmp. Test pages, fill forms, take screenshots, check responsive design, validate UX, test login flows, check links, automate any browser task. Use when user wants to test websites, automate browser interactions, validate web functionality, or perform any browser-based testing.
npx skillsauth add akaihola/playwright-py-skill playwright-py-skillInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
IMPORTANT - Path Resolution:
This skill can be installed in different locations (plugin system, manual installation, global, or project-specific). Before executing any commands, determine the skill directory based on where you loaded this SKILL.md file, and use that path in all commands below. Replace $SKILL_DIR with the actual discovered path.
Common installation paths:
~/.claude/plugins/marketplaces/playwright-py-skill/skills/playwright-py-skill~/.claude/skills/playwright-py-skill<project>/.claude/skills/playwright-py-skillWhen writing and executing Playwright scripts, ALWAYS use Write and Bash tools in separate responses:
❌ DO NOT use parallel execution (causes race condition):
Write: create /tmp/playwright-test.py
Bash: uv run run.py /tmp/playwright-test.py <-- Executes before Write completes!
✅ ALWAYS use sequential execution:
Response 1: Write /tmp/playwright-test.py
Response 2: Bash: cd $SKILL_DIR && uv run run.py /tmp/playwright-test.py
This prevents race conditions where Bash executes before the file is fully written.
General-purpose browser automation skill. I'll write custom Playwright code for any automation task you request and execute it via the universal executor.
CRITICAL WORKFLOW - Follow these steps in order:
Auto-detect dev servers - For localhost testing, ALWAYS run server detection FIRST:
cd $SKILL_DIR && uv run python -c "from lib.helpers import detect_dev_servers; import asyncio, json; print(json.dumps(asyncio.run(detect_dev_servers())))"
Write scripts to /tmp - NEVER write test files to skill directory; always use /tmp/playwright-test-*.py
Use visible browser by default - Always use headless=False unless user specifically requests headless mode
Parameterize URLs - Always make URLs configurable via environment variable or constant at top of script
/tmp/playwright-test-*.py (won't clutter your project)cd $SKILL_DIR && uv run run.py /tmp/playwright-test-*.pycd $SKILL_DIR
uv run run.py --help
This will automatically install Playwright via PEP 723 metadata. Chromium browser must already be installed (correct version for Playwright 1.57.0).
Step 1: Detect dev servers (for localhost testing)
cd $SKILL_DIR && uv run python -c "from lib.helpers import detect_dev_servers; import asyncio, json; print(json.dumps(asyncio.run(detect_dev_servers())))"
Step 2: Write test script to /tmp with URL parameter
#!/usr/bin/env python3
# /// script
# requires-python = ">=3.10"
# dependencies = [
# "playwright==1.57.0",
# ]
# ///
# /tmp/playwright-test-page.py
from playwright.sync_api import sync_playwright
# Parameterized URL (detected or user-provided)
TARGET_URL = 'http://localhost:3001' # <-- Auto-detected or from user
with sync_playwright() as p:
browser = p.chromium.launch(headless=False)
page = browser.new_page()
page.goto(TARGET_URL)
print('Page loaded:', page.title())
page.screenshot(path='/tmp/screenshot.png', full_page=True)
print('📸 Screenshot saved to /tmp/screenshot.png')
browser.close()
Step 3: Execute from skill directory
cd $SKILL_DIR && uv run run.py /tmp/playwright-test-page.py
#!/usr/bin/env python3
# /// script
# requires-python = ">=3.10"
# dependencies = [
# "playwright==1.57.0",
# ]
# ///
# /tmp/playwright-test-responsive.py
from playwright.sync_api import sync_playwright
TARGET_URL = 'http://localhost:3001' # Auto-detected
with sync_playwright() as p:
browser = p.chromium.launch(headless=False, slow_mo=100)
page = browser.new_page()
# Desktop test
page.set_viewport_size({'width': 1920, 'height': 1080})
page.goto(TARGET_URL)
print('Desktop - Title:', page.title())
page.screenshot(path='/tmp/desktop.png', full_page=True)
# Mobile test
page.set_viewport_size({'width': 375, 'height': 667})
page.screenshot(path='/tmp/mobile.png', full_page=True)
browser.close()
#!/usr/bin/env python3
# /// script
# requires-python = ">=3.10"
# dependencies = [
# "playwright==1.57.0",
# ]
# ///
# /tmp/playwright-test-login.py
from playwright.sync_api import sync_playwright
TARGET_URL = 'http://localhost:3001' # Auto-detected
with sync_playwright() as p:
browser = p.chromium.launch(headless=False)
page = browser.new_page()
page.goto(f'{TARGET_URL}/login')
page.fill('input[name="email"]', '[email protected]')
page.fill('input[name="password"]', 'password123')
page.click('button[type="submit"]')
# Wait for redirect
page.wait_for_url('**/dashboard')
print('✅ Login successful, redirected to dashboard')
browser.close()
#!/usr/bin/env python3
# /// script
# requires-python = ">=3.10"
# dependencies = [
# "playwright==1.57.0",
# ]
# ///
# /tmp/playwright-test-form.py
from playwright.sync_api import sync_playwright
TARGET_URL = 'http://localhost:3001' # Auto-detected
with sync_playwright() as p:
browser = p.chromium.launch(headless=False, slow_mo=50)
page = browser.new_page()
page.goto(f'{TARGET_URL}/contact')
page.fill('input[name="name"]', 'John Doe')
page.fill('input[name="email"]', '[email protected]')
page.fill('textarea[name="message"]', 'Test message')
page.click('button[type="submit"]')
# Verify submission
page.wait_for_selector('.success-message')
print('✅ Form submitted successfully')
browser.close()
#!/usr/bin/env python3
# /// script
# requires-python = ">=3.10"
# dependencies = [
# "playwright==1.57.0",
# ]
# ///
from playwright.sync_api import sync_playwright
with sync_playwright() as p:
browser = p.chromium.launch(headless=False)
page = browser.new_page()
page.goto('http://localhost:3000')
links = page.locator('a[href^="http"]').all()
results = {'working': 0, 'broken': []}
for link in links:
href = link.get_attribute('href')
try:
response = page.request.head(href)
if response.ok:
results['working'] += 1
else:
results['broken'].append({'url': href, 'status': response.status})
except Exception as e:
results['broken'].append({'url': href, 'error': str(e)})
print(f'✅ Working links: {results["working"]}')
print(f'❌ Broken links:', results['broken'])
browser.close()
#!/usr/bin/env python3
# /// script
# requires-python = ">=3.10"
# dependencies = [
# "playwright==1.57.0",
# ]
# ///
from playwright.sync_api import sync_playwright
with sync_playwright() as p:
browser = p.chromium.launch(headless=False)
page = browser.new_page()
try:
page.goto('http://localhost:3000', wait_until='networkidle', timeout=10000)
page.screenshot(path='/tmp/screenshot.png', full_page=True)
print('📸 Screenshot saved to /tmp/screenshot.png')
except Exception as error:
print(f'❌ Error: {error}')
finally:
browser.close()
#!/usr/bin/env python3
# /// script
# requires-python = ">=3.10"
# dependencies = [
# "playwright==1.57.0",
# ]
# ///
# /tmp/playwright-test-responsive-full.py
from playwright.sync_api import sync_playwright
TARGET_URL = 'http://localhost:3001' # Auto-detected
with sync_playwright() as p:
browser = p.chromium.launch(headless=False)
page = browser.new_page()
viewports = [
{'name': 'Desktop', 'width': 1920, 'height': 1080},
{'name': 'Tablet', 'width': 768, 'height': 1024},
{'name': 'Mobile', 'width': 375, 'height': 667},
]
for viewport in viewports:
print(f'Testing {viewport["name"]} ({viewport["width"]}x{viewport["height"]})')
page.set_viewport_size({'width': viewport['width'], 'height': viewport['height']})
page.goto(TARGET_URL)
page.wait_for_timeout(1000)
page.screenshot(path=f'/tmp/{viewport["name"].lower()}.png', full_page=True)
print('✅ All viewports tested')
browser.close()
For quick one-off tasks, you can execute code inline without creating files:
# Take a quick screenshot
cd $SKILL_DIR && uv run run.py "
with sync_playwright() as p:
browser = p.chromium.launch(headless=False)
page = browser.new_page()
page.goto('http://localhost:3001')
page.screenshot(path='/tmp/quick-screenshot.png', full_page=True)
print('Screenshot saved')
browser.close()
"
When to use inline vs files:
Optional utility functions in lib/helpers.py:
from lib.helpers import *
# Detect running dev servers (CRITICAL - use this first!)
servers = await detect_dev_servers()
print('Found servers:', servers)
# Safe click with retry
safe_click(page, 'button.submit', retries=3)
# Safe type with clear
safe_type(page, '#username', 'testuser')
# Take timestamped screenshot
take_screenshot(page, 'test-result')
# Handle cookie banners
handle_cookie_banner(page)
# Extract table data
data = extract_table_data(page, 'table.results')
See lib/helpers.py for full list.
Configure custom headers for all HTTP requests via environment variables. Useful for:
Single header (common case):
PW_HEADER_NAME=X-Automated-By PW_HEADER_VALUE=playwright-py-skill \
cd $SKILL_DIR && uv run run.py /tmp/my-script.py
Multiple headers (JSON format):
PW_EXTRA_HEADERS='{"X-Automated-By":"playwright-py-skill","X-Debug":"true"}' \
cd $SKILL_DIR && uv run run.py /tmp/my-script.py
Headers are automatically applied when using create_context():
context = create_context(browser)
page = context.new_page()
# All requests from this page include your custom headers
For scripts using raw Playwright API, use the get_context_options_with_headers():
context = browser.new_context(
get_context_options_with_headers({'viewport': {'width': 1920, 'height': 1080}})
)
For comprehensive Playwright API documentation, see API_REFERENCE.md:
When automating an unknown page where you can't predict selectors, DOM structure, or SPA behavior, use the interactive session mode instead of writing a monolithic script. This launches a persistent browser and lets you send small inspect/action commands one at a time, observing results between each step.
When to use interactive mode:
CRITICAL: Never abandon interactive mode. If inline --cdp code fails due to shell
escaping issues, write the scriptlet to a temp file and pass the file path instead (see
"Troubleshooting inline code" below). Do NOT fall back to writing a standalone script that
launches its own browser — that defeats the purpose of interactive sessions.
When to use the default monolithic mode:
cd $SKILL_DIR && uv run run.py --open https://example.com
# Output: Browser launched on CDP port 9222 (PID 12345)
Each --cdp command connects, runs, and disconnects. The browser stays open.
# Check page title and URL
cd $SKILL_DIR && uv run run.py --cdp '
print("Title:", page.title())
print("URL:", page.url)
'
# Inspect all inputs on the page
cd $SKILL_DIR && uv run run.py --cdp '
info = page.evaluate("""() => Array.from(document.querySelectorAll("input")).map((el, i) => ({
i, visible: el.offsetParent !== null, placeholder: el.placeholder,
value: el.value, ariaExpanded: el.ariaExpanded,
rect: el.getBoundingClientRect()
}))""")
for x in info: print(x)
'
# Type into an input and observe the result
cd $SKILL_DIR && uv run run.py --cdp '
import time
inp = page.locator("input").nth(0)
inp.click()
inp.type("search term", delay=100)
time.sleep(2)
options = page.evaluate("""() => Array.from(document.querySelectorAll("[role=option]"))
.slice(0, 10).map(el => ({text: el.innerText.substring(0, 60)}))""")
print("Options:", options)
'
cd $SKILL_DIR && uv run run.py --open https://example.com --cdp '
print("Title:", page.title())
print("URL:", page.url)
'
If inline --cdp code causes shell escaping or parsing issues, write the scriptlet to a
file and pass the path instead — never fall back to a monolithic script:
# Write scriptlet to a temp file
cat > /tmp/inspect.py << 'EOF'
print("Title:", page.title())
print("URL:", page.url)
EOF
# Pass the file path to --cdp
cd $SKILL_DIR && uv run run.py --cdp /tmp/inspect.py
IMPORTANT: Do not abandon interactive mode and write a full standalone Playwright script that launches its own browser. The interactive session exists precisely to avoid that pattern. If inline code fails, the file-path approach is always available.
cd $SKILL_DIR && uv run run.py --close
--cdp scriptletsThe wrapper provides these pre-bound variables:
page — the first page/tab in the first browser contextbrowser — the CDP-connected browser instancep — the Playwright instance (cleaned up automatically in finally)The default port is 9222. Override with --port:
cd $SKILL_DIR && uv run run.py --open --port 9333 https://example.com
cd $SKILL_DIR && uv run run.py --cdp --port 9333 'print(page.title())'
The port is saved in /tmp/pw-session.json so --cdp and --close auto-detect it.
page.evaluate() inspection patterns| Goal | JavaScript |
| -------------------------------- | ------------------------------------------------------------------ |
| All inputs with metadata | Array.from(document.querySelectorAll('input')).map(...) |
| Container HTML around an element | el.closest('[class*="combobox"]').outerHTML.substring(0, 2000) |
| Visibility check | el.offsetParent !== null |
| Bounding rect | el.getBoundingClientRect() |
| ARIA attributes | el.ariaExpanded, el.getAttribute('aria-controls') |
| Find by role | document.querySelectorAll('[role=option]') |
| Full page text | document.body.innerText.split(String.fromCharCode(10)) |
page.url is a property, not a method. Write page.url, not page.url().fill() vs type() for React/Vue apps: page.locator.fill('text') sets the value
directly, which may not trigger React/Vue event handlers. Use
page.locator.type('text', delay=100) to simulate keystrokes — this reliably triggers
autocomplete, combobox, and other custom input behavior in modern SPAs.page.url after navigation. Sites may redirect to a different domain. A blind
script would miss this; interactive mode lets you discover it immediately.role="combobox" and aria-controls="..."[role="option"] elements<span> overlay, not an HTML placeholder
attribute — so input[placeholder=...] selectors won't work.<input> elements where only one
is visible at a time. After interacting with one, it may become invisible
(offsetParent === null, rect all zeros) and another appears. Check visibility between
steps using el.offsetParent !== null in page.evaluate().page.evaluate() JS strings: Characters like ° can
cause SyntaxError inside JavaScript string literals passed to page.evaluate().
Workaround: use String.fromCharCode(10) for newline splitting instead of \n when
page content may contain special characters, or filter special characters on the Python
side instead of in JS.detect_dev_servers() before writing test code for localhost testingPW_HEADER_NAME/PW_HEADER_VALUE env vars to identify automated traffic to your backend/tmp/playwright-test-*.py, never to skill directory or user's projectTARGET_URL constant at the top of every scriptheadless=False unless user explicitly asks for headless modeheadless=True when user specifically requests "headless" or "background" executionslow_mo=100 to make actions visible and easier to followwait_for_url, wait_for_selector, wait_for_load_state instead of fixed timeoutsprint() to track progress and show what's happeningPlaywright not installed:
cd $SKILL_DIR && uv run run.py --help
# This will auto-install playwright==1.57.0 via PEP 723
Module not found:
Ensure running from skill directory via run.py wrapper
Browser doesn't open:
Check headless=False and ensure display available
Element not found:
Add wait: page.wait_for_selector('.element', timeout=10000)
User: "Test if the marketing page looks good"
Claude: I'll test the marketing page across multiple viewports. Let me first detect running servers...
[Runs: detect_dev_servers()]
[Output: Found server on port 3001]
I found your dev server running on http://localhost:3001
[Writes custom automation script to /tmp/playwright-test-marketing.py with URL parameterized]
[Runs: cd $SKILL_DIR && uv run run.py /tmp/playwright-test-marketing.py]
[Shows results with screenshots from /tmp/]
User: "Check if login redirects correctly"
Claude: I'll test the login flow. First, let me check for running servers...
[Runs: detect_dev_servers()]
[Output: Found servers on ports 3000 and 3001]
I found 2 dev servers. Which one should I test?
- http://localhost:3000
- http://localhost:3001
User: "Use 3001"
[Writes login automation to /tmp/playwright-test-login.py]
[Runs: cd $SKILL_DIR && uv run run.py /tmp/playwright-test-login.py]
[Reports: ✅ Login successful, redirected to /dashboard]
/tmp for automatic cleanup (no clutter)run.pytools
Use when work should span one or more detached tasks but still behave like one job with a single owner context. TaskFlow is the durable flow substrate under authoring layers like Lobster, ACPX, plugins, or plain code. Keep conditional logic in the caller; use TaskFlow for flow identity, child-task linkage, waiting state, revision-checked mutations, and user-facing emergence.
tools
# Lobster Lobster executes multi-step workflows with approval checkpoints. Use it when: - User wants a repeatable automation (triage, monitor, sync) - Actions need human approval before executing (send, post, delete) - Multiple tool calls should run as one deterministic operation ## When to use Lobster | User intent | Use Lobster? | | ------------------------------------------------------ | --------------------------
tools
# Lobster Lobster executes multi-step workflows with approval checkpoints. Use it when: - User wants a repeatable automation (triage, monitor, sync) - Actions need human approval before executing (send, post, delete) - Multiple tool calls should run as one deterministic operation ## When to use Lobster | User intent | Use Lobster? | | ------------------------------------------------------ | --------------------------
tools
A CLI tool for making authenticated requests to the X (Twitter) API. Use this skill when you need to post tweets, reply, quote, search, read posts, manage followers, send DMs, upload media, or interact with any X API v2 endpoint.