🌍 Global Opportunities
Updated Hourly
🎓 Student Friendly

parttimejobs.work

Flexible Work, Better Balance

⏰ Full time

Software Engineer, RL Training Infra

OpenAI
Location 📍 San Francisco, United States
Posted 📅 June 07, 2026
Work Type ⏰ Full time

Position Overview

About the Team

The Post-Training Frontiers team creates the frontier agents OpenAI ships to the world. We do the reinforcement learning training for the agentic models we ship in Codex, ChatGPT, and the API (from o1 to 5.5).

Our role consists of shepherding all integrations that should go into the final RL run and deciding what can make it in, babysitting and scaling the final run, and building the research and infra for horizontal integrations, such as improving function calling, factuality, multi-agent capabilities, memory, calibrated thinking, etc.

About the Role

This role focuses on keeping our frontier RL training runs fast, reliable, and unblocked. You will work across engineering and infrastructure problems as they emerge, from scaling and orchestration issues to inference bottlenecks, numerical problems, and hardware failures, as well as supporting large horizontal integrations in the big run, like multi-agent capabilities or memory. Thi...

Apply Now

Submit Application →

Quick and easy application process

Job Details

Employment Type
Full time
📊
Category
Computer Occupations
🏠
Work Arrangement
On-site
📍
Location
San Francisco, United States