skills/browser-automation/SKILL.md
Control Chrome browser with AI using MCP protocol. Use when users want to automate browser tasks, take screenshots, fill forms, click elements, navigate pages, search browsing history, manage bookmarks, or perform any browser-based automation. Works with your existing Chrome browser and login sessions.
npx skillsauth add femto/skills browser-automationInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Turn your Chrome browser into your intelligent assistant - Let AI take control of your browser.
Use this skill when the user:
npm install -g mcp-chrome-bridger
# or
pnpm install -g mcp-chrome-bridger
mcp-chrome-bridger register
Download from GitHub Releases:
mcp-chrome-extension-vX.X.X.zipchrome://extensions/Add to your MCP client configuration:
Streamable HTTP (Recommended):
{
"mcpServers": {
"chrome-mcp-server": {
"type": "http",
"url": "http://127.0.0.1:12306/mcp"
}
}
}
STDIO (Alternative):
{
"mcpServers": {
"chrome-mcp-server": {
"command": "npx",
"args": ["mcp-chrome-bridger", "stdio"]
}
}
}
| Tool | Description |
|------|-------------|
| get_windows_and_tabs | List all browser windows and tabs |
| chrome_navigate | Navigate to URLs, control viewport |
| chrome_switch_tab | Switch active tab |
| chrome_close_tabs | Close specific tabs |
| chrome_go_back_or_forward | Browser history navigation |
| Tool | Description |
|------|-------------|
| chrome_screenshot | Capture full page, viewport, or specific elements |
| Tool | Description |
|------|-------------|
| chrome_get_web_content | Extract HTML/text from pages |
| chrome_get_interactive_elements | Find clickable elements |
| search_tabs_content | AI-powered semantic search across tabs |
| chrome_console | Capture browser console output |
| Tool | Description |
|------|-------------|
| chrome_click_element | Click elements via CSS selector |
| chrome_fill_or_select | Fill forms and select options |
| chrome_keyboard | Simulate keyboard input |
| Tool | Description |
|------|-------------|
| chrome_history | Search browsing history |
| chrome_bookmark_search | Find bookmarks |
| chrome_bookmark_add | Add new bookmarks |
| chrome_bookmark_delete | Delete bookmarks |
| Tool | Description |
|------|-------------|
| chrome_network_capture_start/stop | Monitor network requests |
| chrome_network_request | Send HTTP requests with browser cookies |
User: "Take a screenshot of github.com"
AI uses:
1. chrome_navigate(url: "https://github.com")
2. chrome_screenshot(fullPage: true)
User: "Login to my account on example.com"
AI uses:
1. chrome_navigate(url: "https://example.com/login")
2. chrome_fill_or_select(selector: "#email", value: "[email protected]")
3. chrome_fill_or_select(selector: "#password", value: "...")
4. chrome_click_element(selector: "button[type=submit]")
User: "Find all pages I visited about React hooks last week"
AI uses:
1. chrome_history(text: "React hooks", startTime: "1 week ago")
User: "What does this page say about pricing?"
AI uses:
1. chrome_get_web_content()
2. Analyzes the extracted content
| Feature | Playwright MCP | Chrome MCP Server | |---------|---------------|-------------------| | Browser Instance | New browser process | Your existing Chrome | | Login Sessions | Need to re-login | Uses existing sessions | | User Settings | Clean environment | Your bookmarks, extensions, settings | | Startup Time | Slow (launch browser) | Instant (extension already loaded) | | Resource Usage | Heavy | Lightweight |
Multiple AI clients can connect simultaneously:
Each client gets its own session while sharing the same Chrome browser.
chrome://extensions/The server automatically handles port conflicts. If issues persist:
lsof -i :12306
kill <PID>
tools
Discover and use Worldbook WebMCP browser-page tools from a current page URL. Use when an agent needs site-specific browser automation tools without relying on mcp-chrome.
tools
AI's Knowledge Base CLI - Query and manage world knowledge for AI agents. Use when users want to search knowledge, add knowledge sources, or interact with the worldbook knowledge base. This is a CLI-first approach that treats AI agents as first-class citizens.
tools
Bump version in package.json and push a new tag to both minion-mind and minion-mind-releases repos. Use when the user says "new tag", "bump version", "release new version", or wants to create a new release tag.
data-ai
Team coordination layer for multi-agent workflows with mailbox, task board, and lease-based task management. Use when users need to coordinate multiple AI agents, manage shared task boards, send messages between agents, or set up team-based workflows. Triggers on "spawn agent", "team", "task board", "multi-agent", "agent coordination", "message agent", "inbox".