skills/browser-work/SKILL.md
Recon-first browser automation. Orchestrator explores the site first via chromux, saves a guide file with insights, then delegates execution to browser-explorer agent. Use when: "/browser-work", "브라우저 작업", "사이트에서 해줘", "웹에서 해줘", "LinkedIn에서", "크롬으로", "browser task", "automate this site".
npx skillsauth add team-attention/hoyeon browser-workInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Recon-first browser automation: explore → document → delegate.
To reliably execute browser tasks on the user's behalf, the orchestrator first scouts the site directly, creates a pitfall-prevention guide, then delegates execution to the browser-explorer agent.
CLAUDE_SESSION_ID for ~/.hoyeon/{session}/ paths.date +%Y%m%d-%H%M%S and store guides under
$HOME/.hoyeon/codex-browser-$RUN_ID/.hoyeon-browser-explorer when installed.When an agent sees a site for the first time, there's a lot of trial and error (snapshot vs. screenshot confusion, clicking wrong elements, unfamiliar site structure). If the orchestrator walks through one cycle first and builds a "map," the agent can execute accurately.
SESSION_ID="[CLAUDE_SESSION_ID from UserPromptSubmit hook]"
WORK_DIR="$HOME/.hoyeon/$SESSION_ID"
mkdir -p "$WORK_DIR"
echo "WORK_DIR=$WORK_DIR"
Resolve chromux path. Remember the output literally — you'll inline it in every command.
CX=$(command -v chromux 2>/dev/null || echo "") && [ -n "$CX" ] && echo "CHROMUX=$CX" || (npx @team-attention/chromux help >/dev/null 2>&1 && echo "CHROMUX=npx @team-attention/chromux" || echo "MISSING")
If MISSING, report error and stop.
Launch Chrome in headless mode (no visible window, but fully functional):
/path/to/chromux launch default --headless 2>/dev/null || true
To let the user see a live tab (e.g., during recon or debugging), use show — no restart needed:
/path/to/chromux show exp-ab12 # Opens DevTools in user's browser
openssl rand -hex 2
Remember the output (e.g., ab12) → your chromux session ID is exp-ab12. Inline it literally in every command.
Before doing recon, assess whether the task needs it:
| Complexity | Criteria | Action | |------------|----------|--------| | Simple | Single page, 1-2 clicks, well-known site (Google, GitHub) | Skip recon → go directly to Step 4 (Delegate) | | Medium | Multi-step workflow, unfamiliar site, 3+ interactions | Do recon (Step 2-3) | | Complex | Dynamic content, auth flows, pagination, bot-sensitive site | Do thorough recon (Step 2-3) + extra caution notes |
If skipping recon, still create a minimal guide file with the task description and URL.
You (the orchestrator) use chromux directly. Follow the chromux guide in references/chromux-guide.md.
/path/to/chromux open exp-ab12 "<target-url>" && sleep 2 && /path/to/chromux snapshot exp-ab12
Execute the entire workflow once — the same steps the agent will need to do:
Bot detection caution:
wait 2000 between actions (don't click rapidly)/path/to/chromux close exp-ab12
Save recon findings to $WORK_DIR/guide.md. This is the "map" the agent will follow.
cat > "$WORK_DIR/guide.md" << 'GUIDE_EOF'
# Browser Work Guide
## Task
[What the user wants done — 1-2 sentences]
## Target URL
[Starting URL]
## Site Characteristics
- [Login required? Already logged in?]
- [Single page or multi-page workflow?]
- [Dynamic content loading? (infinite scroll, AJAX)]
- [Known bot detection? Rate limits?]
## Workflow Steps
1. [Step description] — [which element to look for in snapshot]
2. [Step description] — [expected @ref pattern or text to search for]
3. ...
## Pitfalls & Insights
- [Things that could trip up the agent]
- [e.g., "Sort dropdown is NOT the first artdeco-dropdown — look for text 'Most Relevant'"]
- [e.g., "'Load more' button changes @ref every time — always re-snapshot"]
- [e.g., "Confirmation modal appears after clicking Connect — look for 'Send without a note'"]
## Bot Detection Notes
- [Recommended delay between actions]
- [Any rate limits observed]
- [Pages to avoid rapid-fire clicking on]
GUIDE_EOF
Fill in the template with actual findings from your recon. Be specific — the agent will read this literally.
Launch the browser-explorer agent with the guide file content included in the prompt.
Agent(
subagent_type: "hoyeon:browser-explorer",
mode: "dontAsk",
prompt: """
[Task description from user]
## Recon Guide
[Paste full contents of $WORK_DIR/guide.md here]
## Execution Rules
1. Follow the Workflow Steps in the guide above
2. Be conservative — add `wait 2000` between actions to avoid bot detection
3. If something doesn't match the guide (unexpected popup, different layout), snapshot and adapt
4. If you hit a CAPTCHA or rate limit, STOP and report back
5. Close your session when done
"""
)
If the task requires multiple independent sub-tasks (e.g., "send connection requests to 5 people"), you can launch multiple browser-explorer agents in parallel — each gets its own tab.
After the agent completes:
$WORK_DIR/guide.md with new insights for future runs| Situation | Response |
|-----------|----------|
| chromux not found | Report error, suggest npm i -g @team-attention/chromux |
| Site requires login | Check if chromux profile has saved login. If not, tell user to log in manually first |
| CAPTCHA during recon | Stop recon, note in guide, delegate with extra caution |
| Agent fails despite guide | Resume agent with corrections, or re-do recon with more detail |
| Task too complex for single agent | Split into sub-tasks, delegate each to separate agent |
Guide files persist in ~/.hoyeon/{sid}/ for reference. No auto-cleanup — user can review or reuse.
development
Run a full implementation verification pass after code or data changes. Use when the user asks to verify, QA, smoke test, run checks, validate a feature, inspect a local app in the browser, capture screenshots, or turn discovered QA issues into regression tests/checklists with user approval.
development
Hoyeon execution workflow for Codex. Use when the user invokes "$hoyeon-execute" or wants to execute a Hoyeon plan.json through the Bash-first Codex adapter. This adapter loads the canonical execute skill and follows its Codex runtime surface.
development
Plan-driven orchestrator. Reads plan.json (from /blueprint) or requirements.md, then dispatches workers to build the system. Use when: "/execute", "execute", "plan 실행", "blueprint 실행"
testing
"/clarify", "clarify this", "keep asking until clear", "remove ambiguity", "clarify requirements", "clarify design", "clarify the plan", "질문 계속해", "모호한 게 없게", "명확해질 때까지", "계속 물어봐", "Q&A로 정리", "질문답변 기록", "요구사항 명확화", "설계 명확화". Relentless ambiguity-resolution interview that records Q&A under .hoyeon/clarify/<topic>/ and hands off to specify/blueprint/docs when clear.