Position Overview
Build and manage AI compute infrastructure at APPIT Software in Abu Dhabi, designing GPU clusters, model serving platforms, and high‑performance computing environments for large‑scale AI workloads.
Location: Abu Dhabi, UAE – Full‑time – AI & Machine Learning
Responsibilities - Design and manage GPU cluster infrastructure for large‑scale AI training and inference.
- Build high‑performance model serving platforms with auto‑scaling and load balancing.
- Implement networking and storage solutions optimized for distributed AI workloads.
- Manage cloud and on‑premises hybrid AI infrastructure with cost optimization.
- Establish security, access control, and data sovereignty compliance for AI systems.
- Monitor infrastructure performance and implement capacity planning for growing AI demands.
Requirements - 5+ years of infrastructure engineering, with 2+ years focused on AI/HPC workloads.
- Deep exper...