am009/mobile-use

Name: mobile-use
Author: am009

mobile_use_skill/SKILL.md

npx skillsauth add am009/mobile-use-skill mobile-use

Mobile Use

Control Android devices through ADB with screenshot-based interaction.

Default Path

Capture a screenshot, then let the grounding workflow interpret the image and execute the action directly.

Write the operation description as clearly and specifically as possible so the model can distinguish the target from nearby controls and avoid accidental taps on the wrong UI element.

from mobile_use import get_screenshot, interact_with_screen

get_screenshot("/tmp/screen.png")
result = interact_with_screen("/tmp/screen.png", "Tap WeChat")

This is the recommended way to click, long-press, or swipe based on a screenshot. Prefer natural-language targets over hard-coded coordinates.

Typical tuned call:

from mobile_use import get_screenshot, interact_with_screen

get_screenshot("/tmp/screen.png")
result = interact_with_screen(
    "/tmp/screen.png",
    "Tap the login button at the bottom center",
    reasoning_effort="low",
    max_rounds=3,
)

result includes the grounding decision and an execution field. When the action is accepted, execution.performed is True and execution.controller_result contains the ADB-layer result.
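A minimal sketch of reading that result, assuming it is a plain nested dict whose keys match the field names above (execution, performed, controller_result). The helper name and the sample values are hypothetical and are not real output from the library.

```python
from typing import Any, Optional


def controller_result(result: dict) -> Optional[Any]:
    """Return the ADB-layer result if the action was performed, else None."""
    # Assumption: `execution` is a nested dict; the source only names the fields.
    execution = result.get("execution") or {}
    if execution.get("performed") is True:
        return execution.get("controller_result")
    return None


# Hypothetical result shapes, for illustration only.
accepted = {"execution": {"performed": True, "controller_result": "ok"}}
rejected = {"execution": {"performed": False}}

print(controller_result(accepted))  # ok
print(controller_result(rejected))  # None
```

Checking performed before reading controller_result keeps a rejected grounding decision from being mistaken for a completed tap.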

API

`get_screenshot()`

Capture a device screenshot.

get_screenshot(save_path: str | None = None) -> str

This is usually the first step before calling interact_with_screen(...).

`interact_with_screen()`

Interpret a screenshot plus a natural-language instruction, then execute the grounded action on the device.

interact_with_screen(
    image: str,
    instruction: str,
    *,
    config: GroundingConfig | None = None,
    model: str | None = None,
    reasoning_effort: str | None = None,
    max_rounds: int | None = None,
    out: str | None = None,
    workdir: str | None = None,
    timeout_sec: int | None = None,
) -> dict

Notes:

Recommended default for screenshot-based interaction.
Use clear target descriptions such as "Tap the Continue button at the bottom center".
Low-level coordinate actions are intentionally not the main public workflow here.

Navigation Keys

`back()`

Press the Back key.

back() -> str

`home()`

Press the Home key.

home() -> str

`enter()`

Press the Enter key.

enter() -> str

`keyevent()`

Send any Android keycode.

keyevent(code: str) -> str

Common key codes:

KEYCODE_BACK
KEYCODE_HOME
KEYCODE_MENU
KEYCODE_ENTER
KEYCODE_VOLUME_UP
KEYCODE_VOLUME_DOWN
KEYCODE_POWER
KEYCODE_CAMERA
KEYCODE_SEARCH
KEYCODE_DPAD_UP
KEYCODE_DPAD_DOWN
KEYCODE_DPAD_LEFT
KEYCODE_DPAD_RIGHT
KEYCODE_DPAD_CENTER
KEYCODE_TAB
KEYCODE_SPACE
KEYCODE_DEL
KEYCODE_ESCAPE
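The key codes above can be chained for simple navigation. The sketch below is one possible helper, not part of the skill: press_sequence and fake_keyevent are hypothetical names, and the stand-in sender replaces keyevent so the example runs without a device.

```python
from typing import Callable, Iterable, List


def press_sequence(send: Callable[[str], str], codes: Iterable[str]) -> List[str]:
    """Validate all key code names first, then send them in order."""
    codes = list(codes)
    for code in codes:
        # Cheap sanity check: every code listed above starts with KEYCODE_.
        if not code.startswith("KEYCODE_"):
            raise ValueError(f"not an Android key code name: {code!r}")
    return [send(code) for code in codes]


# Stand-in for keyevent so the sketch runs without a device.
sent: List[str] = []


def fake_keyevent(code: str) -> str:
    sent.append(code)
    return f"sent {code}"


press_sequence(fake_keyevent, ["KEYCODE_DPAD_DOWN", "KEYCODE_DPAD_DOWN", "KEYCODE_ENTER"])
print(sent)
```

With a device attached, keyevent imported from mobile_use would take the place of fake_keyevent. Validating the whole list before sending anything avoids leaving the device halfway through a sequence because of a typo.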

`text()`

Type text into the currently focused input field.

text(input_str: str) -> str

`get_device_size()`

Get screen dimensions.

get_device_size() -> Tuple[int, int]

Returns (width, height) in pixels.
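One use of the (width, height) tuple is checking orientation before wording an instruction such as "bottom center". A minimal sketch, with a hypothetical helper name and made-up example sizes:

```python
from typing import Tuple


def orientation(size: Tuple[int, int]) -> str:
    """Classify a (width, height) pair as 'portrait' or 'landscape'."""
    width, height = size
    # A square screen is treated as portrait here; that choice is arbitrary.
    return "landscape" if width > height else "portrait"


print(orientation((1080, 2400)))  # portrait
print(orientation((2400, 1080)))  # landscape
```

In real use the argument would be the return value of get_device_size().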

Environment

ANDROID_SERIAL: target device serial number. Defaults to the first connected device.

Example:

export ANDROID_SERIAL=emulator-5554

Requirements:

adb installed and available in PATH
An Android device connected with USB debugging enabled
Python dependencies: opencv-python>=4.5.0, pyshine>=0.0.6

Install dependencies:

pip install "opencv-python>=4.5.0" "pyshine>=0.0.6"
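The environment requirements above can be checked before the first call. The following is a sketch using only the Python standard library; preflight is a hypothetical helper, not something the skill provides, and it does not verify that a device is actually connected.

```python
import os
import shutil
from typing import Callable, List, Mapping, Optional


def preflight(
    env: Mapping[str, str] = os.environ,
    which: Callable[[str], Optional[str]] = shutil.which,
) -> List[str]:
    """Return notes about the local setup (empty list if nothing to report)."""
    notes = []
    if which("adb") is None:
        notes.append("adb was not found on PATH")
    if not env.get("ANDROID_SERIAL"):
        notes.append("ANDROID_SERIAL is unset; the first connected device will be used")
    return notes


for note in preflight():
    print(note)
```

The env and which parameters exist only so the check can be exercised with fake values; calling preflight() with no arguments inspects the real environment.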

Errors

Grounding or controller execution may raise RuntimeError when:

No Android device is connected
An ADB command fails
The device serial in ANDROID_SERIAL is not found

Example:

from mobile_use import interact_with_screen

try:
    result = interact_with_screen("/tmp/screen.png", "Tap WeChat")
except RuntimeError as e:
    print(f"ADB error: {e}")
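Some ADB failures are transient, so a caller may want to re-capture the screen and try again. The sketch below is one way to do that, not the skill's own behavior: act_with_retry is a hypothetical helper, the capture and interact callables are passed in, and the stand-ins let it run without a device. It also assumes the capture step signals failure with RuntimeError, which the source states only for grounding and controller execution.

```python
import time
from typing import Callable


def act_with_retry(
    capture: Callable[[str], str],
    interact: Callable[[str, str], dict],
    instruction: str,
    image_path: str = "/tmp/screen.png",
    attempts: int = 3,
    delay_sec: float = 1.0,
) -> dict:
    """Capture a fresh screenshot and act on it, retrying on RuntimeError."""
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            capture(image_path)  # re-capture so each attempt sees the current screen
            return interact(image_path, instruction)
        except RuntimeError as error:
            last_error = error
            if attempt < attempts:
                time.sleep(delay_sec)
    raise RuntimeError(f"gave up after {attempts} attempts") from last_error


# Stand-ins so the sketch runs without a device: the first call fails, the second succeeds.
calls = {"count": 0}


def fake_capture(path: str) -> str:
    return path


def fake_interact(image: str, instruction: str) -> dict:
    calls["count"] += 1
    if calls["count"] == 1:
        raise RuntimeError("An ADB command failed")
    return {"execution": {"performed": True}}


result = act_with_retry(fake_capture, fake_interact, "Tap WeChat", delay_sec=0)
print(calls["count"], result)
```

Retrying helps only with transient failures; a missing device or a wrong ANDROID_SERIAL will fail on every attempt and should be fixed in the environment instead.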

This skill should be used when the user asks to control an Android phone, tap the screen, take a phone screenshot, automate Android via ADB, type into a mobile app, swipe on screen, navigate back/home, or interact with UI elements on a connected device.

development

Updated Apr 16, 2026

Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.

npx skillsauth add am009/mobile-use-skill mobile-use

Security Scan Results

3 of 9 scanners reported clean

Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.

Trivy: Container and dependency vulnerability scanner. Clean (95%)
Semgrep: Static code analysis for vulnerabilities. Clean (95%)
mcp-scan (Snyk): Model Context Protocol security validation. Clean (95%)
Snyk (dep): Open source security scanning. Skipped (50%)
Socket.dev: Supply chain security analysis. Skipped (50%)
VirusTotal: Multi-engine malware detection. Skipped (50%)
CrowdStrike: Advanced threat intelligence. Skipped (50%)
OSV-Scanner: Open Source Vulnerability database check. Skipped (50%)
OWASP Dep-Check: Skipped (50%)

Last scanned: Apr 16, 2026, 9:14 AM (8.6s, 2 files scanned)

Install manually

# Clone the repo
git clone https://github.com/am009/mobile-use-skill.git

# Copy into Claude Code skills folder (global)
cp -r mobile-use-skill/mobile_use_skill ~/.claude/skills/

Repository: am009/mobile-use-skill

Compatible with: Claude Code, OpenAI Codex CLI, ChatGPT