OMG-Agent is an open-source Mobile GUI Agent desktop client that drives AI to automatically operate Android phones via natural language instructions.
Core Features:
- Supports Mobile GUI models like AutoGLM and GELab-Zero
- ADB Real-time Screenshot + AI Task Execution
- Bilingual Interface (English/Chinese), Dark/Light Themes
- Supports OpenAI-compatible API
- Supports Android Emulators
⚠️ DisclaimerThis project is for learning, research, and technical exploration only. Strictly prohibited for any commercial use. When using this tool, please comply with relevant laws and regulations, as well as the terms of use and service agreements of mobile phone manufacturers and applications. Users are solely responsible for any actions and consequences arising from the use of this project, which are unrelated to this project and its developers.
# Install ADB
scoop install adb # Windows
brew install android-platform-tools # macOS
apt install adb # Linux (Ubuntu)git clone https://github.com/safphere/OMG-Agent.git
cd OMG-Agent
pip install -r requirements.txt
python run.py- Enable "Developer Options" and "USB Debugging".
- Install ADBKeyboard.
- Connect phone via USB and allow debugging authorization.
- Click "Refresh Devices".
- Click "Start Screen".
- Enter task (e.g., "Open WeChat and send a message to John").
- Click "Execute".
| Model | Source | Description |
|---|---|---|
| AutoGLM-Phone-9B | Zhipu AI | Dedicated Mobile GUI Model |
| GELab-Zero-4B-preview | StepFun | Mobile Agent Model |
These models are specifically trained for Mobile GUI tasks and are recommended.
For users without Android phones, refer to the Emulator Setup Guide.
OMG-Agent/
├── omg_agent/
│ ├── gui/ # GUI Interface
│ └── core/
│ ├── agent/ # AI Agent Core
│ └── config.py # Config Management
├── assets/ # Assets
├── docs/ # Documentation
└── run.py # Entry Point
Safphere is an open-source community composed of algorithm engineers and university geeks, focusing on technical exploration and knowledge sharing in the AI field.
| Platform | Link |
|---|---|
| GitHub | github.com/safphere |
| WeChat Official | Safphere |
| Social Media | @Safphere |
⭐ If you find this project helpful, please Star us!
This project is licensed under the Apache License 2.0 with Commons Clause.
- ✅ Learning, research, and personal use allowed
- ✅ Modification and secondary development allowed
- ❌ Commercial use prohibited
⚠️ Please credit the source when using
See LICENSE for details.
- Open-AutoGLM — Zhipu AI Mobile GUI Model
- gelab-zero — StepFun Mobile Agent Framework
- ADBKeyboard — ADB Input Method
© 2025 Safphere



