Position Overview
The Role
This position is for an SRE Problem and Knowledge Management Lead within the enabling group, the Site Reliability Engineering and Governance (SRE & Governance) department. The role strategically leads incident retrospective and problem management operations, and supports wider SRE activities related to maintenance management, including availability, performance, change management, monitoring, capacity planning, and the solutions derived from emergency response. The Team Lead ensures that retrospective activities are orchestrated and carried out effectively while promoting a blameless culture in accordance with SRE principles.

Responsibilities
Mentor the team in the seamless end-to-end facilitation and conduct of root cause analysis (RCA) activities
Lead the facilitation for high-severity incidents, liaising with top and senior management and keeping them updated
Prime focal point for presenting in the RCA Foru...