Position Overview
Job Description
We are seeking a Senior AI Platform Engineer to design, build, and operate scalable AI/ML platform infrastructure on AWS, with a strong emphasis on platform reliability, visibility, and observability.
In this role, you will enable data scientists and application teams to safely deploy and operate AI workloads by providing resilient infrastructure, standardized tooling, and deep operational insight across environments.
This is a hands-on senior engineering role that blends cloud infrastructure, DevSecOps principles, and AI platform enablement.
Responsibilities
AI Platform & AWS Infrastructure
Design, build, and operate a cloud‑native AI/ML platform on AWS supporting training, inference, and experimentation workloads, spanning orchestration layers, agents, MCP tools, internal APIs, and databases.
Build and maintain core multi‑tenant services that enable the development, testing, deployment, monitoring, and lifecycle management of LLM‑bas...