Position Overview
Elevate production reliability with Tata Consultancy Services as a Site Reliability Engineer in Canada. Drive optimizations, monitor applications, and automate operations for enhanced performance.
The role focuses on managing and ensuring the reliability of production applications for TCS Canada. You will implement best practices for high availability and capacity planning while analyzing resource usage. Troubleshooting incidents and leading the monitoring setup are key aspects of this position, alongside daily automation tasks.
Key Responsibilities:
• Optimize production applications for performance and scalability
• Implement monitoring and lead incident resolution
• Analyze resource usage for capacity planning
• Automate operational workflows and integrations
• Utilize tools like Dynatrace and Elastic Search
Requirements:
• Proficient in monitoring and observability tools
• Experience with Python and Shell Scripting
• Knowledge of AI Ops and producti...