Position Overview
Job Description
Design and build scalable, modular, and production-grade AI architectures, including resilient orchestration layers, failover mechanisms, circuit breakers, and reusable AI services.Develop and implement automated AI evaluation frameworks (e.g., LLM-as-a-judge, RAGAS, G-Eval, or custom metrics) to measure system quality, accuracy, relevance, and hallucination rates through data-driven KPIs.Architect and optimize AI-powered systems capable of handling high concurrency, large-scale traffic, and terabyte-scale datasets with low-latency performance.Drive AI cost optimization initiatives through engineering-led solutions such as semantic caching, intelligent model routing, prompt optimization, and token efficiency strategies.Build and maintain robust AI data pipelines for large-scale ingestion, cleaning, embedding, vectorization, and efficient retrieval to ensure high-quality and up-to-date AI outputs.Audit, improve, and produc...