Position Overview
Join Mizuho as a Site Reliability Engineer!
In this role you will play a crucial role in maintaining the reliability, scalability, and overall performance of our production systems. This position collaborates closely with development, operations, and product teams to automate workflows, monitor system health, and maintain robust services. Expertise in Grafana is vital for creating insightful visualizations and analyzing performance metrics.
Key Responsibilities:
+ Design, implement, and manage automated deployment, monitoring, and alerting solutions.
+ Build and support scalable infrastructure through Infrastructure as Code (IaC) tools.
+ Use Grafana and other monitoring platforms to track system reliability and performance.
+ Partner with development and operations for ongoing improvements to system reliability and efficiency.
+ Diagnose and resolve production issues quickly to minimize downtime.