parttimejobs.work

Flexible Work, Better Balance

RLHF Specialist

Odixcity Consulting
Location 📍 Barcelona, Spain
Posted 📅 June 09, 2026
Work Type ⏰ Full-time

Position Overview

Check below whether you have what this opportunity requires and, if so, submit your application as soon as possible.
Job Title: RLHF Specialist
Location: Remote (Worldwide)
Job Summary
An RLHF Specialist is responsible for improving and aligning AI models using Reinforcement Learning from Human Feedback (RLHF) methodologies. This role focuses on designing, implementing, and optimizing feedback pipelines that enhance model performance, safety, factual accuracy, and alignment with human values.
Responsibilities
Generate high-quality preference data by comparing multiple model responses and ranking them based on criteria such as helpfulness, honesty, and harmlessness (HHH).
Design complex, multi-turn prompts to stress-test model behavior and expose weaknesses in reasoning or safety.
Write detailed “chain-of-thought” explanations and rationales to train reward models on why specific responses are superior.
Collaborate with...

Job Details

Employment Type ⏰ Full-time
Category 📊 Accounting
Work Arrangement 🏠 On-site
Location 📍 Barcelona, Spain