ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs

Amazon Development Centre Canada ULC
Location 📍 Toronto, Canada
Posted 📅 June 04, 2026
Work Type ⏰ Full-time

Position Overview

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team maximizes performance on these accelerators by crafting high-performance kernels for ML functions used in customers' demanding workloads.

The AWS Neuron SDK is the backbone for accelerating deep learning and GenAI workloads on Inferentia and Trainium. It includes an ML compiler, runtime, and application framework that integrates with popular ML frameworks such as PyTorch to deliver high inference and training performance.

As part of the broader Neuron Compiler organization, our team works across multiple technology layers—from frameworks and compilers to runtime and collectives...

Job Details

Employment Type ⏰ Full-time
Category 📊 Other-General
Work Arrangement 🏠 On-site
Location 📍 Toronto, Canada