Position Overview
Elevate your career with CARFAX as a Senior Software Engineer - ML Ops. Drive the design and scalability of cutting-edge AI infrastructure focused on Large Language Models in a hybrid work environment.
This high-impact role calls for an experienced engineer to shape critical platform components for LLM development and hosting. Your expertise in Kubernetes, cloud-native infrastructure, and advanced autoscaling strategies will be essential. You will collaborate closely with teams to ensure platform reliability, performance, and security, and take an active part in architectural decisions.
Key Responsibilities:
• Design scalable infrastructure for LLM training and inference
• Implement K8s autoscaling strategies tailored for GPU demands
• Optimize ML pipeline infrastructure and workflow reliability
• Contribute to GitOps workflows and CI/CD best practices
• Ensure full observability across LLM workloads and platform health
Requirements:
• 7+ years in DevOps or MLOps
• Ex...