Cloud Infrastructure Operations – Manage and maintain AWS-native services (ECS, EKS, Lambda, FSx, Redshift, Glue, WAF, Security Hub, KMS, etc.) to ensure uptime, security, and scalability across production environments.
Infrastructure-as-Code (IaC) – Design and manage deployment pipelines using Terraform, CloudFormation, and Ansible, ensuring consistent, automated infrastructure provisioning and drift resolution.
Lifecycle & Patch Management – Oversee OS patching (RHEL 8→9, Windows Server 2016→2025) using AWS Patch Manager, WSUS, and YUM/DNF; automate schedules and ensure compliance.
Monitoring & Incident Response – Monitor system health, resolve production incidents, and lead root cause analysis to improve service reliability.
Job Requirements:
Proven Experience – Minimum 6 years in DevOps / SRE roles, with 4+ years in public sector or regulated cloud environments.