desktop-control/SKILL.md
Unified desktop + browser automation. Routes DOM targets to CDP (cli-jaw browser), desktop apps to Computer Use, hybrid combos to both. Codex desktop/CLI + macOS required for Computer Use.
npx skillsauth add lidge-jun/cli-jaw-skills desktop-controlInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Unified skill for all UI automation. Chooses between CDP and Computer Use based on the target, and reports meaningful actions with a path= + action_class= transcript.
This skill is already injected into your system prompt. Do not run
sed,cat,head, orReadto load it from disk. Guessing absolute paths like/Users/*/.codex/skills/...or/Users/*/.cli-jaw-*/skills/...wastes a turn and often targets a file that doesn't exist. If you need a specific reference file (e.g.,reference/computer-use.md), usecli-jaw skill read desktop-control <ref-name>.
Trigger on any request that touches a visible UI:
$computer-use or /computer-use → skip routing analysis, jump straight to reference/computer-use.md. Explicit user opt-in. If Computer Use tools are not available, stop with precondition failed: computer-use unavailable.reference/cdp.mdreference/computer-use.mdreference/vision-click.mdreference/intent-routing.md FIRSTreference/control-workflow.mdpath=cdp, path=computer-use, or path=cdp+cu.get_app_state(app) before interacting with that app. Re-call it on stale warnings, after actions that change UI state, and whenever confidence drops.action_class. Classes: state-read, element-action, value-injection, keyboard-action, pointer-action, pointer-action+vision, scroll-action, drag-action, secondary-action.get_app_state(app) (Computer Use) or cli-jaw browser snapshot (CDP). Never chain actions through uncertainty. Guessing indices or URLs leads to infinite correction loops. If two consecutive actions produced ambiguous state, the next call must be a state-read, not another action.list_apps, get_app_state, click, drag, press_key, scroll, select_text, set_value, type_text, and perform_secondary_action.list_apps if the app is unknown./Applications/Jaw.app and /Applications/Codex Computer Use.app may be required for TCC attribution. Missing app bundles are a setup issue, not a reason to silently switch paths.CDP action:
path=cdp
url=https://example.com
action=click e3
result=ok
Computer Use action:
path=computer-use
app=Google Chrome
action_class=element-action
action=click(element_index=730)
stale_warning=no
result=ok
Hybrid (lookup via CDP, action via Computer Use):
path=cdp+cu
lookup=cli-jaw browser snapshot → bbox of "Play"
action_class=pointer-action
action=click(x=812, y=514)
result=ok
browser — CDP command reference (this skill supersedes its coverage).screen-capture — generic macOS screenshot / webcam / video recording (unchanged).vision-click — no longer auto-active. Absorbed as a tactic in reference/vision-click.md. If you need the low-level recipe (NDJSON parsing, DPR correction), run cli-jaw skill install vision-click.| Symptom | Correct report |
|---|---|
| "I don't see a cursor" | cursor overlay is best-effort in the current build — action=click(...) succeeded; visible cursor not guaranteed |
| CDP server not running | precondition failed: cli-jaw serve not running. Start with 'jaw serve' and retry. |
| Computer Use tools missing | precondition failed: computer-use unavailable |
| cli-jaw CU app missing in packaged install | precondition failed: /Applications/Codex Computer Use.app missing. Recover: jaw doctor --tcc --fix |
| Stale warning on action | re-call get_app_state(app) then retry; log stale_warning=yes in the transcript |
| Non-GUI task routed here | needs boss follow-up: not GUI automation |
development
Goal execution guidelines with PABCD integration, verification tiers, documentation workflow, and AI-driven planning
tools
A CLI tool for making authenticated requests to the X (Twitter) API. Use this skill when you need to post tweets, reply, quote, search, read posts, manage followers, send DMs, upload media, or interact with any X API v2 endpoint.
development
Use this skill any time a spreadsheet file is the primary input or output (.xlsx, .xlsm, .csv, .tsv). This includes: creating, reading, editing, analyzing, or formatting spreadsheets; cleaning messy tabular data; converting between formats; and data visualization with charts. Also use for pandas-based data analysis when the deliverable is a spreadsheet. Do NOT trigger when the primary deliverable is a Word document, HTML report, standalone Python script, database pipeline, or Google Sheets API integration.
tools
Use this skill when the user wants to build a financial model, 3-statement model, DCF valuation, cap table, scenario analysis, or financial projections in Excel. Trigger on: 'financial model', '3-statement model', 'DCF', 'cap table', 'pro forma', 'projections', 'sensitivity analysis', 'waterfall', 'debt schedule', 'break-even', 'discounted cash flow', 'capitalization table', 'fundraising model', 'WACC calculation', 'scenario analysis model'. Input is a text prompt with assumptions. Output is a single .xlsx file with formula-driven, interconnected statement sheets.