Oppo open-sources Android AI agent X-OmniClaw that uses your camera, screen, and voice without leaving the phone

May 17, 2026

Oppo open-sources X-OmniClaw, an Android AI agent that uses camera, screen, and voice locally to automate tasks without leaving the phone.

Oppo's latest open-source release, X-OmniClaw, is an Android agent that uses the phone's own camera, screen, and voice input to carry out tasks directly inside native applications, without relying on a cloud-hosted replica of the phone's interface. The design is a step toward privacy-conscious AI: observation and interaction stay on the device while the agent still automates multi-step tasks.

Local Execution, Cloud Reasoning

Many AI agents rely on cloud computing to interpret and interact with mobile interfaces. X-OmniClaw instead observes and operates the device's screen and camera locally. Only the reasoning phase, such as deciding which action to take next, runs on cloud-based compute. This hybrid approach is meant to keep screen and camera handling on the device while still drawing on more capable cloud-hosted models.
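The split between local observation and remote reasoning can be illustrated with a small Python sketch. This is not X-OmniClaw's code or API: the names `Observation`, `Action`, `observe_locally`, `stub_cloud_planner`, and `run_step` are hypothetical, and the cloud call is replaced by a local stub so the example runs on its own.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Observation:
    """Compact summary of the current screen plus the user's request (hypothetical)."""
    screen_elements: List[str]
    spoken_request: str


@dataclass
class Action:
    kind: str                      # "tap" or "done" in this sketch
    target: Optional[str] = None   # label of the element to tap, if any


def observe_locally(raw_screen: List[str], request: str) -> Observation:
    # On-device step: reduce raw screen content to normalized element labels.
    labels = [element.strip().lower() for element in raw_screen]
    return Observation(screen_elements=labels, spoken_request=request)


def stub_cloud_planner(obs: Observation) -> Action:
    # Stand-in for the remote reasoning call (an assumption for this sketch).
    # It picks the first on-screen element that the request mentions.
    request = obs.spoken_request.lower()
    for element in obs.screen_elements:
        if element in request:
            return Action("tap", element)
    return Action("done")


def run_step(raw_screen: List[str], request: str,
             planner: Callable[[Observation], Action],
             log: List[str]) -> Action:
    # Observe on the device, ask the planner what to do, then act on the device.
    obs = observe_locally(raw_screen, request)
    action = planner(obs)
    if action.kind == "tap":
        log.append(f"tap:{action.target}")  # a real agent would dispatch the tap here
    return action
```

Passing the planner in as a function keeps the on-device parts unchanged if the reasoning backend is swapped, which is one plausible way to structure such a hybrid design.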

Reusable Skills and Deep Navigation

One of the standout features of X-OmniClaw is its ability to learn and reuse interaction patterns. When the agent performs a task, such as navigating to a specific screen or completing a form, it captures the sequence of taps and actions as a reusable skill. These tap paths are then stored and can be executed directly via deep links, allowing the agent to jump straight to deeply nested app pages in future interactions. This not only speeds up task execution but also improves accuracy by avoiding the need to relearn complex navigation paths.
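A minimal sketch of the skill idea, assuming a simple in-memory store: a completed task is saved as a named tap path, optionally paired with a deep link, and later requests prefer the deep link over replaying taps. The `Skill` and `SkillStore` names, the `open:`/`tap:` step strings, and the `exampleapp://` URI are all made up for illustration and do not come from the X-OmniClaw code.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Skill:
    name: str
    tap_path: List[str]              # ordered element labels that were tapped
    deep_link: Optional[str] = None  # hypothetical URI that opens the target page


class SkillStore:
    """In-memory store of learned skills (illustrative only)."""

    def __init__(self) -> None:
        self._skills: Dict[str, Skill] = {}

    def record(self, name: str, tap_path: List[str],
               deep_link: Optional[str] = None) -> Skill:
        # Save (or overwrite) the interaction pattern under a task name.
        skill = Skill(name, list(tap_path), deep_link)
        self._skills[name] = skill
        return skill

    def plan(self, name: str) -> List[str]:
        # Return the steps to run for a task, preferring the shortest route.
        skill = self._skills.get(name)
        if skill is None:
            return ["explore"]                       # unknown task: navigate from scratch
        if skill.deep_link:
            return [f"open:{skill.deep_link}"]       # jump straight to the nested page
        return [f"tap:{label}" for label in skill.tap_path]  # replay recorded taps


store = SkillStore()
store.record("mute_notifications",
             ["Settings", "Notifications", "Mute"],
             deep_link="exampleapp://settings/notifications")
steps = store.plan("mute_notifications")
```

Here `steps` holds a single deep-link step rather than three taps, which mirrors the speed-up the article describes.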

With this release, Oppo gives developers and researchers a new open-source tool for exploring the intersection of local AI and mobile automation. X-OmniClaw is positioned as more than a proof of concept: it is a practical framework for building smarter, more autonomous mobile experiences.

Source: The Decoder
