kokatsu/browser-research

Name: browser-research
Author: kokatsu

.config/claude/skills/browser-research/SKILL.md

npx skillsauth add kokatsu/dotfiles browser-research

Browser Research Skill

Research web pages using agent-browser CLI and summarize content.

Usage

/browser-research <URL> [research topic or question]

Workflow

0. Clean up existing session (if needed)

A previous session may still be open. Close it before starting to avoid conflicts:

agent-browser close 2>/dev/null || true

1. Open the page

agent-browser open "<URL>" && agent-browser wait --load networkidle --timeout 15000

If open fails: verify that the URL is well-formed and retry once. If it fails again, report the error to the user and stop.

If wait times out: proceed anyway, since the page may still be usable.
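The failure handling described above could be wrapped in a small shell function. This is a minimal sketch, not part of the skill itself: `ab` and `open_page` are hypothetical helper names, the `AGENT_BROWSER` override variable is an illustration aid, and the sketch assumes that `agent-browser` exits with a non-zero status when a command fails.

```shell
# Hypothetical wrapper so the binary can be swapped out (for example in tests).
ab() { "${AGENT_BROWSER:-agent-browser}" "$@"; }

# open_page URL
# Opens the URL, retrying once if the first attempt fails.
# Assumption: agent-browser exits non-zero on failure.
open_page() {
  if ! ab open "$1"; then
    echo "open failed, retrying once: $1" >&2
    if ! ab open "$1"; then
      echo "open failed twice, giving up: $1" >&2
      return 1
    fi
  fi
  # A wait timeout is tolerated: continue with whatever has loaded.
  ab wait --load networkidle --timeout 15000 ||
    echo "wait timed out, continuing anyway" >&2
  return 0
}
```

A caller would run `open_page "<URL>"` and stop to report the error if it returns non-zero.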

Next, choose the extraction method based on your purpose:

| Purpose | Command | When to use |
|---------|---------|-------------|
| Read article/docs text | `agent-browser eval "document.body.innerText"` | Blog posts, documentation, text-heavy pages. Most token-efficient |
| Understand page structure | `agent-browser snapshot -c` | Need to see layout, navigation, or element refs for interaction |
| Find interactive elements | `agent-browser snapshot -i -c` | Need to click links, buttons, or fill forms |

For typical single-article research, eval "document.body.innerText" is often sufficient. Use snapshot only when you need structure or element refs.

For large pages, append --max-output 10000 to prevent token explosion:

agent-browser eval "document.body.innerText" --max-output 10000
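The choice between extraction methods can be expressed as a single dispatcher. The sketch below is illustrative only: `ab`, `extract_page`, and the purpose labels `text`, `structure`, and `interactive` are hypothetical names, and only the `agent-browser` commands already listed in this document are used.

```shell
# Hypothetical wrapper so the binary can be swapped out (for example in tests).
ab() { "${AGENT_BROWSER:-agent-browser}" "$@"; }

# extract_page PURPOSE [MAX_CHARS]
# PURPOSE is one of: text | structure | interactive (labels are illustrative).
# MAX_CHARS, if given, is passed to --max-output for text extraction.
extract_page() {
  case "$1" in
    text)
      if [ -n "${2:-}" ]; then
        ab eval "document.body.innerText" --max-output "$2"
      else
        ab eval "document.body.innerText"
      fi
      ;;
    structure)   ab snapshot -c ;;
    interactive) ab snapshot -i -c ;;
    *)
      echo "unknown purpose: $1" >&2
      return 2
      ;;
  esac
}
```

For example, `extract_page text 10000` runs the token-efficient text extraction with truncation, while `extract_page interactive` lists element refs for clicking.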

If a cookie consent banner or overlay blocks content, dismiss it first:

agent-browser snapshot -i -c   # find the accept/close button ref
agent-browser click "@ref"     # dismiss the banner

Then proceed with the chosen extraction method.

Stop here if you have enough information. Steps 2–6 below are only needed for deeper investigation.

2. Get detailed content (if needed)

Get text from a specific element:

agent-browser get text "@ref"

Get page metadata:

agent-browser get title && agent-browser get url

Find elements by role, text, or label:

agent-browser find role heading
agent-browser find text "keyword"

3. Handle long pages

Scroll to load more content:

agent-browser scroll down 500 && agent-browser snapshot -c

Scroll a specific element into view:

agent-browser scrollintoview "@ref" && agent-browser snapshot -c
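For pages that load content lazily, scrolling can be repeated until the visible text stops growing. The following is a rough heuristic sketch, not a documented feature: `ab` and `scroll_until_stable` are hypothetical names, and it assumes that `eval` prints the bare value of the expression on stdout. Slow pages may need a wait between scrolls.

```shell
# Hypothetical wrapper so the binary can be swapped out (for example in tests).
ab() { "${AGENT_BROWSER:-agent-browser}" "$@"; }

# scroll_until_stable [MAX_ROUNDS]
# Scrolls down in 500px steps until the page text length stops changing
# or MAX_ROUNDS (default 10) is reached. Prints the number of scrolls
# that revealed new text.
# Assumption: `eval` prints only the expression result.
scroll_until_stable() {
  max_rounds=${1:-10}
  prev=$(ab eval "document.body.innerText.length")
  round=0
  while [ "$round" -lt "$max_rounds" ]; do
    ab scroll down 500 >/dev/null || return 1
    cur=$(ab eval "document.body.innerText.length")
    # Stop as soon as a scroll adds no new text.
    [ "$cur" = "$prev" ] && break
    prev=$cur
    round=$((round + 1))
  done
  echo "$round"
}
```

After the loop finishes, the usual extraction command can be run once against the fully loaded page.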

4. Navigate to linked pages

Click a link:

agent-browser click "@ref" && agent-browser wait --load networkidle --timeout 15000 && agent-browser snapshot -c

Go back:

agent-browser back && agent-browser snapshot -c

5. Research additional URLs

Use tabs to research multiple pages without losing previous context:

agent-browser tab new && agent-browser open "<next-URL>" && agent-browser wait --load networkidle --timeout 15000 && agent-browser snapshot -c

Switch between tabs or close the current tab:

agent-browser tab list
agent-browser tab <n>
agent-browser tab close
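When several URLs are known up front, the tab commands above can be driven from a loop. This is a sketch under assumptions: `ab` and `open_in_tabs` are hypothetical names, and it assumes that `tab new` makes the new tab the active one, as the chained command in this step implies.

```shell
# Hypothetical wrapper so the binary can be swapped out (for example in tests).
ab() { "${AGENT_BROWSER:-agent-browser}" "$@"; }

# open_in_tabs URL...
# Opens the first URL in the current tab and each further URL in a new tab,
# then lists the tabs so they can be revisited with `tab <n>`.
open_in_tabs() {
  first=1
  for url in "$@"; do
    if [ "$first" -eq 1 ]; then
      first=0
    else
      ab tab new || return 1
    fi
    ab open "$url" || return 1
    # A wait timeout is not fatal; the page may still be usable.
    ab wait --load networkidle --timeout 15000 || true
  done
  ab tab list
}
```

Each tab can then be visited in turn with `agent-browser tab <n>` followed by the chosen extraction command.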

6. Save page as PDF (optional)

When the user requests a saved copy:

agent-browser pdf "/path/to/output.pdf"

7. Close when done

agent-browser close

Critical Rules

- Always close the session: every open must have a matching close.
- Read-only by default: never submit forms or enter data. Clicking is allowed only for passive navigation: dismissing cookie/consent banners, following links, expanding collapsed sections, or switching tabs. Do not click buttons that trigger writes, purchases, or state changes.
- No guessing: do not fabricate or assume page content; only report what snapshot/get/eval return.
- Authentication pages: if a page requires login, report it immediately and stop. Do not attempt to authenticate.
- Prefer command chaining: use && to combine related commands in a single bash call for efficiency.
- Minimize tokens: prefer eval "document.body.innerText" over snapshot when you only need text content. Use --max-output for large pages.
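One way to make sure every open has a matching close, even when a step fails, is a shell `trap`. The sketch below is an illustration rather than the skill's prescribed method: `ab` and `research_text` are hypothetical names, and the function body runs in a subshell so that the `EXIT` trap fires when the function finishes.

```shell
# Hypothetical wrapper so the binary can be swapped out (for example in tests).
ab() { "${AGENT_BROWSER:-agent-browser}" "$@"; }

# research_text URL
# Prints the page text and always closes the session afterwards.
# The parentheses run the body in a subshell, so the EXIT trap
# triggers on every exit path of this function.
research_text() (
  ab close 2>/dev/null || true   # clean up any stale session first
  trap 'ab close' EXIT           # guarantee the matching close
  ab open "$1" || exit 1
  ab wait --load networkidle --timeout 15000 || true
  ab eval "document.body.innerText" --max-output 10000
)
```

Because the close runs from the trap, a failed `open` still ends with `close` being called, and the function's exit status reflects the step that failed.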

Output Format

Respond in the same language the user used. Summarize findings in this structure:

- Overview: Main topic and purpose of the page
- Key Points: Important information as bullet points
- Details: Detailed explanations as needed
- Related Links: Additional resources to reference

When researching multiple URLs or when the user requests it, save results to a file using the Write tool. For a single-URL quick lookup, respond directly in chat.

Snapshot Options

| Flag | Description |
|------|-------------|
| `-i`, `--interactive` | Show only interactive elements |
| `-c`, `--compact` | Remove empty structural elements |
| `-d <n>`, `--depth <n>` | Limit DOM tree depth |
| `-s <sel>`, `--selector <sel>` | Scope to CSS selector |

Additional Useful Commands

- `screenshot --full`: Capture full page screenshot
- `screenshot --annotate`: Screenshot with numbered element labels
- `diff snapshot`: Compare current page state against previous snapshot
- `console`: View browser console logs (useful for debugging)
- `errors`: View page errors
- `get count "<sel>"`: Count matching elements
- `--max-output <chars>`: Truncate output for large pages

Research web pages using agent-browser — a headless browser CLI that renders JavaScript and handles dynamic content. Use this skill as a fallback when WebSearch or WebFetch fails, returns insufficient results, or when the target page requires JS rendering (SPAs, dynamic docs). Also use when the user provides a specific URL to investigate, when you need to navigate multi-page documentation, or when summarizing web content that WebFetch cannot parse properly.

tools

Updated Apr 15, 2026

npx skillsauth add kokatsu/dotfiles browser-research

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

| Scanner | Description | Status |
|---------|-------------|--------|
| Trivy | Container and dependency vulnerability scanner | Clean (95%) |
| Semgrep | Static code analysis for vulnerabilities | Clean (95%) |
| mcp-scan (Snyk) | Model Context Protocol security validation | Clean (95%) |
| Snyk (dep) | Open source security scanning | Skipped (50%) |
| Socket.dev | Supply chain security analysis | Skipped (50%) |
| VirusTotal | Multi-engine malware detection | Skipped (50%) |
| CrowdStrike | Advanced threat intelligence | Skipped (50%) |
| OSV-Scanner | Open Source Vulnerability database check | Skipped (50%) |
| OWASP Dep-Check | | Skipped (50%) |

Last scanned: Apr 15, 2026, 1:30 PM (13.6s, 1 file scanned)

SKILL.md

name: browser-research
description: Research web pages using agent-browser, a headless browser CLI that renders JavaScript and handles dynamic content. Use this skill as a fallback when WebSearch or WebFetch fails, returns insufficient results, or when the target page requires JS rendering (SPAs, dynamic docs). Also use when the user provides a specific URL to investigate, when you need to navigate multi-page documentation, or when summarizing web content that WebFetch cannot parse properly.
- Bash(agent-browser:*)

Install manually

# Clone the repo
git clone https://github.com/kokatsu/dotfiles.git

# Copy into Claude Code skills folder (global)
cp -r dotfiles/.config/claude/skills/browser-research ~/.claude/skills/

Repository

kokatsu/dotfiles

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT