Position Overview
We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We build innovative AI systems software to accelerate for AI inference. As a member of the team, you'll develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture. This means designing and building things like new abstractions, efficient attention kernel implementations, new LLM inference runtimes components, and kernel code generators to accelerate large language models, agents, and other high-impact AI workloads.
What you'll be doing:
+ Innovating and developing new AI systems technologies for efficient inference
+ Designing, implementing, and optimizing kernels for high impact AI workloads
+ Designing and implementing extensible abstractions for LLM serving engines
+ Building efficient just-in-time domain specific compilers and runtimes
+ Collaborating closely with other engineers at N...