skills/jina-web/SKILL.md
Read web pages and search the internet using Jina AI Reader & Search APIs. Two modes: (1) READ - fetch any URL and convert to clean markdown/text, (2) SEARCH - web search with results returned as markdown. Supports proxy, CSS selectors, screenshots, caching control, and more. Auto-detects python/curl+jq/powershell backend. Use when: user wants to read a webpage, fetch URL content, scrape a site, search the web, or get clean text from a URL. Triggers: "read this page", "fetch URL", "scrape website", "search the web", "jina search", "jina read", "读取网页", "抓取页面", "搜索网页", "网页内容", "获取链接内容".
npx skillsauth add ba0gu0/skillbox jina-webInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Read web pages and search the internet via Jina AI Reader & Search APIs.
scripts/jina-web.pyscripts/jina-web.sh (curl)scripts/jina-web.ps1 (Windows)Set JINA_API_KEY env var for higher rate limits (optional, ~20 req/min without).
# Read a web page → markdown
python3 scripts/jina-web.py read "https://example.com"
# Read as plain text
python3 scripts/jina-web.py read "https://example.com" --format text
# Read with CSS selector, no images
python3 scripts/jina-web.py read "https://example.com" --target "article" --no-images
# Remove nav/footer, wait for JS content
python3 scripts/jina-web.py read "https://example.com" --remove "nav,footer" --wait-for ".content"
# Read as JSON (includes title, url, publishedTime, token usage)
python3 scripts/jina-web.py read "https://example.com" --json
# Take screenshot
python3 scripts/jina-web.py read "https://example.com" --format screenshot
# Search the web
python3 scripts/jina-web.py search "Claude AI latest features"
# Search with JSON output
python3 scripts/jina-web.py search "AI news 2026" --json
# News search with token budget
python3 scripts/jina-web.py search "AI regulation" --type news --budget 5000
# Image search
python3 scripts/jina-web.py search "cute cats" --type images
# Use Bing engine
python3 scripts/jina-web.py search "rust tutorial" --engine bing
Shell version (identical interface):
scripts/jina-web.sh read "https://example.com"
scripts/jina-web.sh search "Claude AI" --json
PowerShell version (Windows):
pwsh scripts/jina-web.ps1 read "https://example.com"
pwsh scripts/jina-web.ps1 search "Claude AI" --json
| Flag | Description | Default |
|------|-------------|---------|
| --json | Return structured JSON with metadata | off |
| --no-cache | Skip cache, fetch fresh | off |
| --proxy URL | Use HTTP proxy | none |
| --format FMT | markdown/text/html/screenshot/pageshot | markdown |
| --timeout SEC | Request timeout (max 180) | 30 |
| Flag | Description | Default |
|------|-------------|---------|
| --target SEL | Target CSS selector(s), comma-separated | none |
| --wait-for SEL | Wait for CSS selector(s) before extraction | none |
| --remove SEL | Remove elements matching selector(s) | none |
| --no-images | Strip all images from output | off |
| --no-links | Strip all links from output | off |
| --with-links-summary | Append link summary at end | off |
| --with-images-summary | Append image summary at end | off |
| Flag | Description | Default |
|------|-------------|---------|
| --budget N | Token budget for results | none |
| --engine ENG | google/bing/reader | google |
| --type TYPE | web/images/news | web |
r.jina.ai): Renders page (including JS), extracts content → clean markdowns.jina.ai): Searches web, fetches top results → markdown with contentPlain text (default): clean markdown directly.
JSON (--json): structured response with metadata.
{"code":200,"data":{"text":"...","title":"...","url":"...","publishedTime":"...","usage":{"tokens":N}}}{"code":200,"data":[{"title":"...","url":"...","description":"...","content":"..."},...]}tools
Download videos and audio from 20+ platforms (YouTube, TikTok, Instagram, Twitter/X, Reddit, Bilibili, Pinterest, SoundCloud, Vimeo, etc.) using the cobalt API. Auto-detects python/curl+jq/powershell backend. Use when: user wants to download a video, extract audio from a video, save a clip, or rip media from a social platform URL. Triggers: "download video", "下载视频", "save video", "rip audio", "extract audio", "download from youtube", "download from tiktok", "download from twitter", "下载", "保存视频", "提取音频".
development
项目接手系统生成器。把新项目、代码无文档项目或老项目升级成任何 AI Coding Agent 都能不靠聊天历史直接接手执行的工作系统。核心建立 5 个能力:项目说明、当前状态、下一步任务、验收标准、红线规则。
development
Maintainer-only workflow for handling GitHub Secret Scanning alerts on OpenClaw. Use when Codex needs to triage, redact, clean up, and resolve secret leakage found in issue comments, issue bodies, PR comments, or other GitHub content.
development
Maintainer workflow for OpenClaw releases, prereleases, changelog release notes, and publish validation. Use when Codex needs to prepare or verify stable or beta release steps, align version naming, assemble release notes, check release auth requirements, or validate publish-time commands and artifacts.