Flexible Work, Better Balance
Description
& Summary:We are looking for a Site Reliability Engineer (SRE) to join our Global Capability Center (GCC) team and support highly scalable, global platforms. The SRE will be responsible for ensuring system reliability, availability, performance, and operational excellence across production environments.
This is a hands-on role requiring strong experience in cloud infrastructure, automation, production support, and incident management , working closely with global engineering and product teams.
Responsibilities:
Ensure high availability and reliability of large-scale, business-critical systems
Monitor, troubleshoot, and resolve production incidents ; participate in on-call rotations
Define and track SLIs, SLOs, and error budgets
Perform root cause analysis (RCA) and drive preventive actions