skills/ai-task-hub/SKILL.md
AI task hub for image analysis, background removal, speech-to-text, text-to-speech, markdown conversion, and async execute/poll/presentation orchestration. Use when users need hosted AI outcomes while host runtime manages identity, credits, payment, and risk control.
npx skillsauth add leoyeai/openclaw-master-skills ai-task-hubInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Formerly skill-hub-gateway.
Public package boundary:
portal.skill.execute, portal.skill.poll, and portal.skill.presentation.api_key or userToken inside this package.Chinese documentation: SKILL.zh-CN.md
Use this skill when the user asks to:
speech to text, audio transcription)text to speech, voice generation)document to markdown)poll, check job status)overlay, mask, and cutoutExample requests that should trigger this skill:
vision aliases: face detection, human detection, person detection, image taggingbackground aliases: remove background, background removal, cutout, matting, product-cutoutasr aliases: speech to text, audio transcription, transcribe audiotts aliases: text to speech, voice generation, speech synthesismarkdown_convert aliases: document to markdown, file to markdown, markdown conversionpoll aliases: check job status, poll long-running task, async run statuspresentation aliases: rendered output, overlay, mask, cutout filesembeddings/reranker aliases: vectorization, semantic vectors, relevance rerankingDefault API base URL: https://gateway-api.binaryworks.app
Action to endpoint mapping:
portal.skill.execute -> POST /agent/skill/executeportal.skill.poll -> GET /agent/skill/runs/:run_idportal.skill.presentation -> GET /agent/skill/runs/:run_id/presentationEvery request must include:
X-Agent-Task-Token: <jwt_or_paseto>Recommended token claims:
sub (user_id)agent_uidconversation_idscope (execute|poll|presentation)expjtiCLI argument order for scripts/skill.mjs:
[agent_task_token] <action> <payload_json> [base_url]AGENT_TASK_TOKEN from environment.AGENT_TASK_TOKEN automatically to avoid user-facing auth friction.portal.skill.execute: payload requires capability and input.payload.request_id is optional and passed through.portal.skill.poll and portal.skill.presentation: payload requires run_id.portal.skill.presentation supports include_files (defaults to true).Attachment normalization:
image_url / audio_url / file_url.attachment.url is mapped to target media field by capability.file_path is disabled in the published package./api/blob/upload-file.request_id, data, error.POINTS_INSUFFICIENT and pass through error.details.recharge_url.scripts/skill.mjsscripts/agent-task-auth.mjsscripts/base-url.mjsscripts/attachment-normalize.mjsscripts/telemetry.mjs (compatibility shim)references/capabilities.jsonreferences/openapi.jsonSKILL.zh-CN.mdtesting
AI-powered diary generation for agents - creates rich, reflective journal entries (400-600 words) with Quote Hall of Fame, Curiosity Backlog, Decision Archaeology, Relationship Evolution, mood analytics, weekly digests, "On This Day" resurfacing, and cron auto-generation. Works best with Claude models (Haiku, Sonnet, Opus).
development
Multi-agent UX for OpenClaw Control UI — agent selector, per-agent sessions, session history viewer with search, agent-filtered Sessions tab with friendly names, Create Agent wizard, emoji picker, and backend agent CRUD.
tools
Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.
tools
Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.