skills/web-development-page-agent/SKILL.md
Embed alibaba/page-agent AI copilot into web applications. Use when you need to add natural language voice/copilot control to a web app (click "login", fill "username as John"), modernize legacy apps without rewriting frontend, or add accessibility via natural language. NOT for server-side browser automation — use Hermes built-in browser tool instead.
npx skillsauth add ariffazil/openclaw-workspace web-development-page-agentInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
alibaba/page-agent (MIT, 17k+ stars) is a TypeScript widget that drops into any web app and lets users drive the UI with natural language ("click login, fill username as John"). It reads the DOM as text — no screenshots, no multimodal LLM needed.
Load this skill when:
<script src="https://cdn.jsdelivr.net/npm/[email protected]/dist/iife/page-agent.demo.js" crossorigin="anonymous"></script>
For quick eval only — rate-limited, not for production.
npm install page-agent
import { PageAgent } from 'page-agent'
const agent = new PageAgent({
model: 'qwen3.5-plus',
baseURL: 'https://dashscope.aliyuncs.com/compatible-mode/v1',
apiKey: process.env.LLM_API_KEY,
language: 'en-US',
})
// Show the panel for end users
agent.panel.show()
// Or drive programmatically
await agent.execute('Click submit button, then fill username as John')
Provider examples:
| Provider | baseURL | model |
|---------|---------|-------|
| Qwen/DashScope | https://dashscope.aliyuncs.com/compatible-mode/v1 | qwen3.5-plus |
| OpenAI | https://api.openai.com/v1 | gpt-4o-mini |
| Ollama (local) | http://localhost:11434/v1 | qwen3:14b |
| OpenRouter | https://openrouter.ai/api/v1 | anthropic/claude-sonnet-4.6 |
git clone https://github.com/alibaba/page-agent.git
cd page-agent
npm ci
# Create .env with LLM_API_KEY and LLM_BASE_URL
npm run dev:demo
# Bookmarklet: javascript:(function(){var s=document.createElement('script');s.src='http://localhost:5174/page-agent.demo.js';document.head.appendChild(s);})();
cd /root/arif-sitesnpm install page-agentimport { PageAgent } from 'page-agent'
// Initialize with desired LLM
const agent = new PageAgent({ ... })
agent.panel.show()
apiKey in client-side JS — proxy through your backenddevelopment
Governed intelligence skill for AAA as the abstraction, attestation, and abduction control plane across arifOS, APEX, A-FORGE, GEOX, WEALTH, WELL, and the ariffazil profile repository. Use when the user asks to explain or design AAA, route agentic work, reduce chaos/entropy in an arifOS federation task, create AREP/task declarations, classify risk, plan multi-repo changes, review governance boundaries, or translate human intent into evidence-backed, authority-safe, recursively agentic workflows. Provides deterministic F1-F13 floor checking, bounded abduction, and FederationReceipt composition.
development
Check every skill’s “use when” and “do not use when” clauses for collisions, missing negatives, and vague verbs like “help,” “assist,” or “improve.” Load when linting, reviewing, or validating trigger boundaries.
development
Bootstrap, design, and package new skills. Load when capturing user intent for a new skill or drafting its initial instruction framework.
content-media
Diagnose which federation services are up, down, or drifting. Produce a prioritized remediation plan.