Position Overview
The Global E-commerce Service Architecture team ensures the availability, scalability, and resilience of TikTokβs e-commerce platform in the ., partnering closely with product and engineering teams to operate reliable, large-scale production systems.
We are seeking a Senior Site Reliability Engineer (SRE) to advance the stability and resilience of TikTok Global E-commerce services in the . In this role, you will strengthen disaster recovery readiness, optimize infrastructure capacity, and elevate service stability. Key Responsibilities:
- Data Center Disaster Recovery: Ensure services maintain disaster recovery capabilities under normal operations, including contingency planning and drills, capacity assurance, and effective response in disaster scenarios.
- Resource Management & Capacity Planning: Manage and plan server and compute resources, including resource restructuring, overall capacity planning, and dynamic scaling, to support reliable business deployment and operations...