Yyds.Auto — Android RPA Skill for AI Agents
name: yyds-auto
by chenanzong · published 2026-04-01
$ claw add gh:chenanzong/chenanzong-yyds-auto---
name: yyds-auto
description: Control Android devices via MCP — tap, swipe, OCR, screenshot, UI automation, shell, file management, and AI agent orchestration for Android RPA.
version: 1.0.0
metadata:
openclaw:
requires:
env:
- YYDS_DEVICE_HOST
- YYDS_DEVICE_PORT
bins:
- node
anyBins:
- adb
primaryEnv: YYDS_DEVICE_HOST
emoji: "\U0001F4F1"
homepage: https://yydsauto.com
os:
- windows
- macos
- linux
install:
- kind: node
package: yyds-auto-mcp
bins: [yyds-auto-mcp]
---
# Yyds.Auto — Android RPA Skill for AI Agents
> Let LLMs directly control Android devices through the MCP protocol.
Yyds.Auto is a production-grade Android RPA (Robotic Process Automation) platform that exposes **60 MCP tools** covering the full spectrum of Android device automation — from pixel-level touch injection and OCR to UI hierarchy inspection, file management, and on-device AI agent orchestration.
What Can It Do?
| Category | Tools | Capabilities |
|----------|-------|-------------|
| 📱 Device Info | 4 | Device model, screen size, IMEI, foreground app, network status |
| 👆 Touch & Input | 8 | Tap, swipe, long press, drag, text input, clipboard, key press |
| 📸 Screenshot | 2 | Screenshot as base64 image (LLM can see it directly), save to device |
| 🌲 UI Automation | 5 | UI hierarchy dump, find elements by attributes, element relations, wait & scroll |
| 🔍 OCR & Image | 8 | Screen OCR, tap-on-text, template matching, pixel color, image comparison |
| 💻 Shell | 1 | Execute shell commands with ROOT/SHELL privileges |
| 📦 App Management | 8 | Launch/stop apps, list installed, install/uninstall APK, open URL, toast |
| 📁 File Operations | 7 | List, read, write, delete, rename files and directories on device |
| 🐍 Script Projects | 5 | List/start/stop Python projects, execute Python code snippets |
| 📚 Pip Management | 4 | List, install, uninstall, inspect Python packages |
| 🤖 AI Agent | 8 | Configure and run an on-device AI agent with natural language instructions |
Architecture
AI Agent (Claude / GPT / Gemini / Cursor / Windsurf / ...)
↓ MCP Protocol (stdio, JSON-RPC)
yyds-auto-mcp (Node.js, this skill)
↓ HTTP REST (JSON, port 61140)
yyds.py engine (Android, aiohttp server)
↓ IPC
yyds.auto engine (Android, kernel-level UI automation)The MCP server communicates with the on-device engine via HTTP REST. When connected via USB, ADB port forwarding is set up automatically. Remote devices over WiFi/LAN are also supported.
Prerequisites
1. **Android device** with [Yyds.Auto](https://yydsauto.com) installed and the engine running
2. **Connection**: USB (auto ADB forward) or WiFi (same LAN)
3. **Node.js** >= 18
Quick Start
Install the MCP Server
npm install -g yyds-auto-mcpConnect to a USB Device (auto-detected)
# Default: 127.0.0.1:61140, ADB forward set up automatically
yyds-auto-mcpConnect to a Remote Device
YYDS_DEVICE_HOST=192.168.1.100 YYDS_DEVICE_PORT=61140 yyds-auto-mcpClaude Desktop Configuration
Add to `claude_desktop_config.json`:
{
"mcpServers": {
"yyds-auto": {
"command": "npx",
"args": ["-y", "yyds-auto-mcp"],
"env": {
"YYDS_DEVICE_HOST": "127.0.0.1",
"YYDS_DEVICE_PORT": "61140"
}
}
}
}Cursor / Windsurf / VS Code Configuration
Add the same MCP server configuration in your editor's MCP settings.
Environment Variables
| Variable | Default | Description |
|----------|---------|-------------|
| `YYDS_DEVICE_HOST` | `127.0.0.1` | Device IP address |
| `YYDS_DEVICE_PORT` | `61140` | Engine port number |
| `YYDS_DEVICE_SERIAL` | *(first device)* | Specify ADB device serial |
| `YYDS_ADB_PATH` | *(auto-detect)* | Custom ADB binary path |
Tool Reference
Device Information
Touch & Input
Screenshot
UI Automation
OCR & Image
Shell
App Management
File Operations
Script Projects
Pip Management
AI Agent
Key Features
🔄 Auto-Reconnect
USB connection drops are handled gracefully — when the device disconnects, the MCP server automatically re-establishes ADB port forwarding and retries the request.
🚀 Auto-Bootstrap
On first connection via USB, the server automatically sets up ADB forwarding and starts the engine on the device if it's not already running.
🧠 Smart UI Dump
UI hierarchy dumps over 15KB are automatically trimmed to keep only actionable elements (those with text, resource-id, content-desc, or clickable/scrollable attributes), reducing LLM token usage.
🎯 Kernel-Level Touch
Touch events are injected at the Linux kernel level, making them work in any app including games, locked-down apps, and areas that block accessibility-based input.
Example Prompts
Once connected, try these prompts with your AI agent:
Links
More tools from the same signal band
Order food/drinks (点餐) on an Android device paired as an OpenClaw node. Uses in-app menu and cart; add goods, view cart, submit order (demo, no real payment).
Sign plugins, rotate agent credentials without losing identity, and publicly attest to plugin behavior with verifiable claims and authenticated transfers.
The philosophical layer for AI agents. Maps behavior to Spinoza's 48 affects, calculates persistence scores, and generates geometric self-reports. Give your...