Flexible Work, Better Balance
The client is a fast‑growing HKEX‑listed AI company building next‑generation autonomous AI agents that can understand, navigate, and interact with digital interfaces much like humans do. Their work sits at the intersection of multimodal AI, computer vision, reasoning, and agentic systems — pushing beyond traditional LLM capabilities into real‑world task execution across complex GUI environments.
The RoleWe are looking for an exceptional R&D Engineer to join our core team in Singapore. In this role, you will lead the research and development of multimodal AI agents capable of understanding UI layouts, interpreting visual context, and performing accurate actions (clicks, types, scrolls) across Android environments.
You will work at the intersection of Computer Vision, Natural Language Processing, and Reinforcement Learning, translating state‑of‑the‑art research into robust, deployable agents.
Key Responsibilities