🌍 Global Opportunities
Updated Hourly
🎓 Student Friendly

parttimejobs.work

Flexible Work, Better Balance

⏰ Full-time

Senior Software Engineer, AI Inference

NVIDIA Corporation
Location 📍 toronto, Canada
Posted 📅 May 29, 2026
Work Type ⏰ Full-time

Position Overview

Senior Software Engineer, AI Inference page is loaded## Senior Software Engineer, AI Inferencelocations: Canada, Torontotime type: Full timeposted on: Posted 2 Days Agojob requisition id: JR Help us push the boundaries of AI inference at NVIDIA — where your systems expertise shapes both the technology and the teams building on top of it!**What You'll be doing:*** Work directly with customer engineering teams through long-term technical partnerships, understanding their LLM serving architectures and performance goals, then designing and implementing end-to-end benchmarking campaigns across Kubernetes and Slurm environments to surface actionable insights.* Set up and operate vLLM serving deployments on GPU clusters, tuning configurations for throughput, latency, and efficiency — and collect Nsight Systems / Nsight Compute profiling traces to identify performance gaps relative to reference frameworks.* Develop detailed performance plans based on profiling findings and collaborate ...

Apply Now

Submit Application →

Quick and easy application process

Job Details

Employment Type
Full-time
📊
Category
IT & Technology
🏠
Work Arrangement
On-site
📍
Location
toronto, Canada