parttimejobs.work

Flexible Work, Better Balance

⏰ Full-time

ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs

Amazon
Location 📍 Toronto, Canada
Posted 📅 June 15, 2026
Work Type ⏰ Full-time

Position Overview

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and generative AI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team focuses on maximizing performance for AWS's custom ML accelerators by crafting high-performance kernels for ML functions at the hardware-software boundary.

Key Responsibilities

  • Design and implement high-performance compute kernels for ML operations, leveraging the Neuron architecture and programming models.
  • Analyze and optimize kernel-level performance across multiple generations of Neuron hardware.
  • Conduct detailed performance analysis using profiling tools to identify and resolve bottlenecks.
  • Implement compiler optimizations such as fusion, sharding, tiling, and scheduling.
  • Work...

Job Details

Employment Type ⏰ Full-time
Category 📊 IT & Technology
Work Arrangement 🏠 On-site
Location 📍 Toronto, Canada