Flexible Work, Better Balance
2 days ago Be among the first 25 applicants
We are developing high-quality training and evaluation datasets to improve how Large Language Models (LLMs) perform on real software engineering problems. The core of this project involves identifying and curating verifiable coding tasks from public GitHub repositories, supported by a human-in-the-loop review process.
As a contractor on this project, you will review code written by AI to solve real software tasks. Your feedback will help improve how future AI models learn to write and understand code.
Key Responsibilities