Position Overview
Overview We are looking for a Middle SRE Operations Engineer to maintain reliability across a cloud‑based SaaS platform.
What you will do - Monitor and support production and staging environments to ensure availability, performance, and stability.
- Respond to incidents, perform triage and root cause analysis, and contribute to remediation efforts.
- Participate in on‑call rotations with defined SLAs.
- Handle operational requests from internal teams.
- Maintain and improve monitoring, alerting, dashboards, logs, and metrics.
- Support CI/CD pipelines, production releases, and GitOps workflows.
- Contribute to automation initiatives to reduce operational overhead.
- Maintain and improve Kubernetes‑based infrastructure and containerized workloads.
- Support Infrastructure as Code practices and environment improvements.
Must haves - 2+ years of experience in Site Reliability Enginee...