Adoption

Agent Skills are supported by leading AI development tools.

VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory VS Code Gemini CLI GitHub Goose Amp Cursor Claude Code Letta OpenCode Claude OpenAI Codex Factory

catalan-adobe/visual-tree

Name: visual-tree
Author: catalan-adobe

skills/visual-tree/SKILL.md

npx skillsauth add catalan-adobe/skills visual-tree

Clean

TrivyContainer and dependency vulnerability scanner

Clean

SemgrepStatic code analysis for vulnerabilities

Clean

mcp-scan (Snyk)Model Context Protocol security validation

Skipped

Snyk (dep)Open source security scanning

Skipped

Socket.devSupply chain security analysis

Skipped

VirusTotalMulti-engine malware detection

Skipped

CrowdStrikeAdvanced threat intelligence

Skipped

OSV-ScannerOpen Source Vulnerability database check

Skipped

OWASP Dep-Check

Visual Tree

Capture a spatial hierarchy of rendered DOM elements from any webpage via playwright-cli. Returns three outputs for downstream consumption.

Prerequisites

playwright-cli available (run playwright-cli help to verify)
A page already open in the browser session

Script Location

if [[ -n "${CLAUDE_SKILL_DIR:-}" ]]; then
  VT_BUNDLE="${CLAUDE_SKILL_DIR}/scripts/visual-tree-bundle.js"
else
  VT_BUNDLE="$(find ~/.claude \
    -path "*/visual-tree/scripts/visual-tree-bundle.js" \
    -type f 2>/dev/null | head -1)"
fi

Verify the path is non-empty before continuing.

Parameters

| Parameter | Default | Description | |-----------|---------|-------------| | minWidth | 900 | Minimum element width in px. Elements narrower than this are excluded. position: fixed elements always pass regardless. Lower for more detail (e.g., 300 for mobile). |

Workflow

Step 1 — Resolve the bundle

Run the script location block above and store the path in VT_BUNDLE. If the path is empty, report an error and stop.

Step 2 — Inject and capture

Inject the bundle via initScript in the playwright-cli config, then capture with a pure expression eval. Do NOT use inline $(cat) or IIFE wrappers — playwright-cli eval only accepts pure expressions (it wraps them as () => (EXPR) internally, so function bodies with statements fail).

MINWIDTH=900  # or caller-specified value

# Build config with initScript — injects bundle before navigation
VT_CONFIG="/tmp/vt-config-$$.json"
echo "{\"browser\":{\"initScript\":[\"$VT_BUNDLE\"]}}" > "$VT_CONFIG"

# Open page (or use existing session) — bundle creates window.__visualTree
playwright-cli --config="$VT_CONFIG" open "$URL"
sleep 2

# Capture — pure expression, no IIFE
VT_RESULT=$(playwright-cli eval \
  "JSON.stringify(window.__visualTree.captureVisualTree($MINWIDTH))")

rm -f "$VT_CONFIG"

Parse the returned JSON string.

Step 3 — Present outputs

Present three sections to the caller:

1. Visual Tree (text format)

The primary output for LLM consumers. Show in a code block:

r @0,0 1440x5667
  rc1 [3x1] @0,0 1440x83 "Header text..."
  rc2 @0,83 1440x5216
    rc2c1 [bg:image] @0,83 1440x410 "Hero text..."
    ...

Format: ID [role] [CxR] [bg:type] @x,y wxh "text..."

ID: positional address in the tree (r = root, rc1 = first child, etc.)
[role]: ARIA role if present
[CxR]: grid layout (e.g., 4x2 = 4 columns, 2 rows) — only when multi-column
[bg:type]: background (color, gradient, or image) — only when visually distinct
@x,y: position from page top-left in pixels
wxh: width x height in pixels
"text...": first 30 characters of text content

2. Node Map

Positional ID to metadata lookup. Show as JSON. Each entry contains:

selector: CSS selector for the DOM element
background (optional): { type, value, raw, source }
overlay (optional): { occluding: [sibling IDs this node covers] }

Overlay entries indicate the node was promoted from a deeper DOM position to root level because it rendered outside its parent's bounds (e.g., cookie banners, fixed navs, modals).

3. JSON Tree

Full structured tree. Show as JSON only if the caller requests it, otherwise mention it is available. Each node contains: tag, selector, bounds, text, role, layout, background, children.

Pipeline

The bundle runs 6 passes on the DOM:

buildVisualNode — walks document.body, captures bounding boxes, backgrounds, text, roles, layout detection. Filters by minWidth. position: fixed elements bypass the width filter.
collapseSingleChildren — flattens wrapper chains (div > div > div becomes a single node with promoted properties).
pruneZeroHeightLeaves — removes invisible zero-dimension nodes bottom-up (e.g., accessibility skip-links).
promoteEscapedNodes — re-parents elements rendered outside their DOM parent's bounds to the nearest containing ancestor. Uses 2px tolerance for subpixel rounding.
assignPositionalIds — assigns compact tree addresses (r, rc1, rc1c2, ...) and builds the nodeMap.
enrichOverlayMetadata — tags promoted root-level nodes with overlay.occluding listing which siblings they visually cover.

Tips

Run on pages after they finish loading (playwright-cli goto <url> then wait for network idle) for best results.
For pages with lazy-loaded content, scroll to bottom and back before capturing.
Overlay nodes in the nodeMap have CSS selectors usable for dismissal (e.g., click accept buttons, remove elements).
The text format is designed for LLM consumption — thin, spatial, and inferrable. The nodeMap carries richer metadata for programmatic use.

catalan-adobe/visual-tree

skills/visual-tree/SKILL.md

Capture a spatial hierarchy of rendered DOM elements from any webpage. Injects a pre-built script via playwright-cli that walks the DOM, detects layout grids, extracts backgrounds, prunes invisible nodes, promotes elements rendered outside their DOM parent (overlays, fixed navs, modals), and tags overlay nodes with occlusion metadata. Returns three outputs: LLM-friendly indented text, structured JSON tree, and a nodeMap mapping positional IDs to CSS selectors with background and overlay data. Use before page decomposition, overlay detection, brand extraction, or any workflow that needs structured page analysis. Triggers on: visual tree, capture tree, page structure, page hierarchy, DOM tree, capture visual, page analysis, extract tree.

tools

Updated Apr 25, 2026

$ install --global

skillsauth

npx skillsauth add catalan-adobe/skills visual-tree

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Scanners Passed

Scanners in report

Clean

TrivyContainer and dependency vulnerability scanner

95%

Clean

SemgrepStatic code analysis for vulnerabilities

95%

Clean

mcp-scan (Snyk)Model Context Protocol security validation

95%

Skipped

Snyk (dep)Open source security scanning

50%

Skipped

Socket.devSupply chain security analysis

50%

Skipped

VirusTotalMulti-engine malware detection

50%

Skipped

CrowdStrikeAdvanced threat intelligence

50%

Skipped

OSV-ScannerOpen Source Vulnerability database check

50%

Skipped

OWASP Dep-Check

50%

Last scanned: Apr 25, 2026, 5:23 PM144.8s2 files scanned

SKILL.md

name:: visual-tree
description:: >-
workflow that needs structured page analysis. Triggers on:: visual tree,

Visual Tree

Capture a spatial hierarchy of rendered DOM elements from any webpage via playwright-cli. Returns three outputs for downstream consumption.

Prerequisites

playwright-cli available (run playwright-cli help to verify)
A page already open in the browser session

Script Location

if [[ -n "${CLAUDE_SKILL_DIR:-}" ]]; then
  VT_BUNDLE="${CLAUDE_SKILL_DIR}/scripts/visual-tree-bundle.js"
else
  VT_BUNDLE="$(find ~/.claude \
    -path "*/visual-tree/scripts/visual-tree-bundle.js" \
    -type f 2>/dev/null | head -1)"
fi

Verify the path is non-empty before continuing.

Parameters

Workflow

Step 1 — Resolve the bundle

Run the script location block above and store the path in VT_BUNDLE. If the path is empty, report an error and stop.

Step 2 — Inject and capture

MINWIDTH=900  # or caller-specified value

# Build config with initScript — injects bundle before navigation
VT_CONFIG="/tmp/vt-config-$$.json"
echo "{\"browser\":{\"initScript\":[\"$VT_BUNDLE\"]}}" > "$VT_CONFIG"

# Open page (or use existing session) — bundle creates window.__visualTree
playwright-cli --config="$VT_CONFIG" open "$URL"
sleep 2

# Capture — pure expression, no IIFE
VT_RESULT=$(playwright-cli eval \
  "JSON.stringify(window.__visualTree.captureVisualTree($MINWIDTH))")

rm -f "$VT_CONFIG"

Parse the returned JSON string.

Step 3 — Present outputs

Present three sections to the caller:

1. Visual Tree (text format)

The primary output for LLM consumers. Show in a code block:

r @0,0 1440x5667
  rc1 [3x1] @0,0 1440x83 "Header text..."
  rc2 @0,83 1440x5216
    rc2c1 [bg:image] @0,83 1440x410 "Hero text..."
    ...

Format: ID [role] [CxR] [bg:type] @x,y wxh "text..."

ID: positional address in the tree (r = root, rc1 = first child, etc.)
[role]: ARIA role if present
[CxR]: grid layout (e.g., 4x2 = 4 columns, 2 rows) — only when multi-column
[bg:type]: background (color, gradient, or image) — only when visually distinct
@x,y: position from page top-left in pixels
wxh: width x height in pixels
"text...": first 30 characters of text content

2. Node Map

Positional ID to metadata lookup. Show as JSON. Each entry contains:

selector: CSS selector for the DOM element
background (optional): { type, value, raw, source }
overlay (optional): { occluding: [sibling IDs this node covers] }

Overlay entries indicate the node was promoted from a deeper DOM position to root level because it rendered outside its parent's bounds (e.g., cookie banners, fixed navs, modals).

3. JSON Tree

Full structured tree. Show as JSON only if the caller requests it, otherwise mention it is available. Each node contains: tag, selector, bounds, text, role, layout, background, children.

Pipeline

The bundle runs 6 passes on the DOM:

buildVisualNode — walks document.body, captures bounding boxes, backgrounds, text, roles, layout detection. Filters by minWidth. position: fixed elements bypass the width filter.
collapseSingleChildren — flattens wrapper chains (div > div > div becomes a single node with promoted properties).
pruneZeroHeightLeaves — removes invisible zero-dimension nodes bottom-up (e.g., accessibility skip-links).
promoteEscapedNodes — re-parents elements rendered outside their DOM parent's bounds to the nearest containing ancestor. Uses 2px tolerance for subpixel rounding.
assignPositionalIds — assigns compact tree addresses (r, rc1, rc1c2, ...) and builds the nodeMap.
enrichOverlayMetadata — tags promoted root-level nodes with overlay.occluding listing which siblings they visually cover.

Tips

Run on pages after they finish loading (playwright-cli goto <url> then wait for network idle) for best results.
For pages with lazy-loaded content, scroll to bottom and back before capturing.
Overlay nodes in the nodeMap have CSS selectors usable for dismissal (e.g., click accept buttons, remove elements).
The text format is designed for LLM consumption — thin, spatial, and inferrable. The nodeMap carries richer metadata for programmatic use.

Related Skills

catalan-adobe/reduce-page

tools

VerifiedTrustedCommunity

Reduce a webpage to a structural skeleton with semantic tokens. Two-phase pipeline: Phase 1 injects a browser script that tokenizes content ({TEXT}, {HEADING:n}, {IMAGE:WxH}, {CTA:label}, {LINK:label}, {INPUT:type}, {VIDEO}, {ICON}). Phase 2 applies LLM structural reasoning to collapse repeated patterns ({REPEAT:N}), remove decorative wrappers, strip utility classes, and produce skeleton.html + manifest.json. Use when migrating pages to EDS, analyzing page structure, extracting page blueprints, or preparing input for GenAI block generation. Triggers on: reduce page, page skeleton, page blueprint, extract structure, tokenize page, page reduction, structural skeleton, reduce URL.

SKILL.mdUpdated May 29, 2026

catalan-adobe/reduce-page

catalan-adobe/video-digest

tools

VerifiedTrustedCommunity

Summarize any video by analyzing both audio and visuals. Downloads via yt-dlp, extracts transcript (YouTube captions or Whisper), pulls scene-detected keyframes, and produces a multimodal summary with clickable timestamped YouTube links. Use this skill whenever the user wants to summarize a YouTube video, digest a talk or tutorial, get notes from a video, extract key points from a recording, or says things like "tl;dw", "summarize this video", "what's in this video", or pastes a YouTube URL and asks for a summary. Also triggers for non-YouTube URLs that yt-dlp supports.

SKILL.mdUpdated Apr 25, 2026

catalan-adobe/video-digest

catalan-adobe/spectrum-2-web

development

VerifiedTrustedCommunity

Design and build web UIs with Adobe Spectrum 2 design system. Applies S2 layout principles, visual hierarchy, spacing, and component composition to produce accessible interfaces. Outputs vanilla CSS with Spectrum tokens (static pages) or Spectrum Web Components (interactive apps). Recommends tier based on complexity. Covers sp-theme setup, side-effect imports, overlay system, form patterns, --mod-* token customization, and 14 critical gotchas. Use for: spectrum 2 web, SWC, sp-button, sp-theme, build UI with spectrum, S2 layout, spectrum application, adobe design system, web component form, spectrum overlay.

SKILL.mdUpdated Apr 25, 2026

catalan-adobe/spectrum-2-web

catalan-adobe/slack-cdp

development

VerifiedTrustedCommunity

Control Slack via CDP or headless API tokens. Navigate channels, read/send messages, search conversations, check unreads, and manage status. Two modes: CDP (Slack desktop with --remote-debugging-port) for full UI control, or headless (xoxp/xoxb token) for data operations without Slack running. Triggers on: slack, read slack, search slack, slack unreads, send slack message, slack status, navigate slack, check slack, slack messages, go to channel, slack DM.

SKILL.mdUpdated Apr 25, 2026

catalan-adobe/slack-cdp

Download

For Claude Desktop. Download once, then upload the file in the app — no terminal needed.

Need help? View full Cowork setup guide →

Install manually

Choose your platform

# Clone the repo
git clone https://github.com/catalan-adobe/skills.git

# Copy into Claude Code skills folder (global)
cp -r skills/skills/visual-tree ~/.claude/skills/

Claude Code Skills — official skills path docs.

Repository

catalan-adobe/skills

Compatible with

Claude Code

OpenAI Codex CLI

ChatGPT