explaingit

droidrun/mobilerun

8,313PythonAudience · developerComplexity · 3/5LicenseSetup · moderate

TLDR

A Python framework that lets you control Android and iOS devices with plain-English instructions, using an AI agent that reads the screen and taps through the app for you.

Mindmap

mindmap
  root((mobilerun))
    What it does
      Plain English control
      Android and iOS
      AI agent taps screen
    AI providers
      OpenAI
      Anthropic
      Gemini Ollama
    Use cases
      App testing
      Task automation
      Guided workflows
    Setup steps
      Install package
      Install portal app
      Configure provider
    Access
      CLI commands
      Python API
      Cloud hosted version
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

Things people build with this

USE CASE 1

Automate repetitive tapping workflows on a phone without writing code, just by typing what you want done

USE CASE 2

Run automated tests on an Android or iOS app by describing test steps in plain English

USE CASE 3

Walk a non-technical teammate through a workflow on their phone using plain-language instructions

USE CASE 4

Write a Python script using the mobilerun API to build a custom mobile automation pipeline

Tech stack

PythonOpenAIAnthropicGoogle GeminiOllamaDeepSeek

Getting it running

Difficulty · moderate Time to first run · 30min

Requires installing a companion portal app on your phone or emulator and an API key for your chosen AI provider.

Use freely for any purpose, including commercial use, as long as you keep the copyright notice.

In plain English

Mobilerun is a Python framework that lets you control Android and iOS devices by typing plain-language instructions instead of writing automation scripts. You tell it what you want in a sentence, and an AI agent figures out how to tap, swipe, and navigate through the device to make it happen. Setup takes three steps: install the mobilerun Python package, run a setup command that installs a companion portal app on your phone or emulator, then run a configure command to pick an AI provider and enter your credentials. After that you can type commands like "open settings and turn on dark mode" or "search for an apartment in Berlin" and the agent will carry them out on the device. The AI side is not tied to any single service. You can use OpenAI, Anthropic, Google Gemini, Ollama, or DeepSeek as the underlying model, chosen at configuration time. The agent takes screenshots during each task to understand what is currently on screen and uses that visual information when deciding what to tap or type next. The tool covers several use cases: automated testing of mobile apps, walking less technical people through workflows on their phones, handling repetitive tasks that would otherwise require manual tapping, and exploring app interfaces with plain-language commands. A Python API is also available for writing custom automation scripts if you need more control than the command-line interface provides. The project is MIT licensed and requires Python 3.11 through 3.13. A cloud-hosted version is available, documentation lives at docs.mobilerun.ai, and a benchmark page shows accuracy results across different task types and model providers.

Copy-paste prompts

Prompt 1
Using droidrun/mobilerun, write me a Python script that opens the Settings app on an Android emulator, navigates to Wi-Fi, and turns off the Wi-Fi toggle, then prints Done.
Prompt 2
I want to test my iOS app login flow automatically. Show me how to set up mobilerun with an OpenAI provider and describe the login steps as plain-English commands.
Prompt 3
Using mobilerun, how do I run a task like 'search for coffee shops near me in the Maps app' on a connected Android device, and have it take a screenshot when finished?
Prompt 4
I want to use mobilerun with Ollama running locally instead of a cloud AI provider. Walk me through the configure step to point it at my local Ollama instance.
Prompt 5
Using the mobilerun Python API, write a loop that opens a list of 5 apps one at a time, takes a screenshot of the home screen of each, and saves them to a folder.
Open on GitHub → Explain another repo

← droidrun on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.