Position Overview
Join a Site Reliability Engineering team focused on cloud service reliability. Use your skills in incident management and container technologies to enhance operational efficiency in a hybrid work setup.
In this role you will be a key player in the Business Technology Platform, supporting crucial cloud applications. You'll monitor service behavior, troubleshoot issues, and build solutions that improve overall reliability. You'll also collaborate with development teams, maintain SRE documentation, and advocate for best practices.
Key Responsibilities:
• Serve as a technical expert during service disruptions
• Drive root cause analyses and preventive measures
• Build reliable software solutions for infrastructure monitoring
• Collaborate closely with development teams on improvements identified in postmortems
• Maintain clear technical documentation for stakeholders
Requirements:
• 3+ years of expe...