Principal Software Engineer, CoreAI Workload Engines

Microsoft Corporation
Location 📍 Redmond, United States
Posted 📅 June 03, 2026
Work Type ⏰ Full-time

Position Overview

The CoreAI Workloads team builds the foundational inference engines and APIs that power large-scale AI inference across Azure, from cutting-edge startups to Fortune 500 enterprises and Microsoft Copilots and agents. Our mission is to deliver secure, reliable, and highly efficient GPU inference that enables multitenant AI systems at global scale while maximizing utilization, performance, and developer productivity. We own inference serving and performance for OpenAI and other state-of-the-art large language models (LLMs), and we work directly with OpenAI, serving some of the largest workloads on the planet, with trillions of inferences per day. Our converged AI fabric and engines deliver inference capabilities for all LLMs in the Microsoft model catalog (https://azure.microsoft.com/en-us/products/ai-model-catalog#Models), including OpenAI, Anthropic, Mistral, Cohere, Llama, and more.

This role sits at the intersection of LLM inference fleets, serving efficiency, rapid...

Job Details

Employment Type: Full-time
Category: other-general
Work Arrangement: On-site
Location: Redmond, United States