⏰ Full-time

Lead Specialist, SRE

🏢

TNG Digital

                    Location
                    📍 kuala lumpur, Malaysia
                

                    Posted
                    📅 June 06, 2026
                

                    Work Type
                    ⏰ Full-time
                

Position Overview

What would you do? Service Reliability and Availability: Ensure uptime of 99.99% consistently met, reduce MTTD and MTTR during incidents, drive capacity planning and prevent reliability risks. 
Drive Automation and Operational Excellence: Deliver consistent deployments with zero critical failures, reduce manual toil, standardise and harden container images, CI/CD pipelines and infrastructure. 
Release and Disaster Recovery: Reduce deployment incidents, plan and execute disaster recovery, meet RTO and RPO for cloud environments. 
Incident Response and Troubleshooting: Reduce recurring issues through problem management & root‑cause analysis, establish and enforce incident response processes. 
Security and Compliance: Ensure 100% compliance with audit requirements, achieve zero critical security incidents, enforce security standards on infrastructure and code, implement secure architectures for new deployments. 
Team Leade...
                

Apply Now

Submit Application →

Quick and easy application process

Job Details

⏰

Employment Type

Full-time

📊