🌍 Global Opportunities
Updated Hourly
🎓 Student Friendly

parttimejobs.work

Flexible Work, Better Balance

⏰ Full-time

AI Infrastructure & Experience Engineer

FocusKPI Inc.
Location 📍 Mountain View, United States
Posted 📅 June 15, 2026
Work Type ⏰ Full-time

Position Overview

FocusKPI is seeking an AI Infrastructure & Experience Engineer to join one of our clients, a high-tech SaaS company. 

Work Location: Mountain View, CA (Onsite role, 5 days/week onsite)
Duration: 4-month contract 
Pay Range: $70 - 79/hr

**No C2C resumes are considered**
 

Position Responsibilities:

+ Inference Optimization: Deploy and tune multiple LLMs and generative multimodal models on local inference hardware. Optimize performance metrics (TTFT, tokens/sec) via model quantization, caching strategies, and architecture-specific adjustments.

+ Systems Engineering & CUDA: Leverage deep knowledge of the CUDA environment to build custom kernels, ensuring maximum utilization of the low-cost GPU compute.

+ Orchestration & Integration: Seamlessly bridge inference backends with orchestration layers (LiteLLM, Ollama, etc.) and frontends like OpenWebUI.

+ Rapid Prototyping: Build functional, high-fidelity demos showcasing m...

Apply Now

Submit Application →

Quick and easy application process

Job Details

Employment Type
Full-time
📊
Category
other-general
🏠
Work Arrangement
On-site
📍
Location
Mountain View, United States