🌍 Global Opportunities
Updated Hourly
🎓 Student Friendly

parttimejobs.work

Flexible Work, Better Balance

⏰ Full-time

Agent Quality / Evals Engineer 1754

SOFTGIC
Location 📍 Colombia, Colombia
Posted 📅 June 24, 2026
Work Type ⏰ Full-time

Position Overview

Job Description

This is a remote position.

Owns the eval harness and quality gate from the beginning. This role replaces the old late-stage “Evals Specialist” model with a standing owner for measurable agent quality.

Key Responsibilities

• Build and maintain the MVP eval harness: golden tasks, exception tasks, scorecard metrics, and regression packs.
• Wire evals into CI so quality regressions fail builds and releases.
• Define and maintain release-gate thresholds with Product and the Tech Lead.
• Lay the path for later adversarial and drift-testing expansion without overbuilding MVP scope.


Requirements

Must-Have Qualifications

• Experience evaluating ML, LLM, or non-deterministic systems.
...

Apply Now

Submit Application →

Quick and easy application process

Job Details

Employment Type
Full-time
📊
Category
Engineers
🏠
Work Arrangement
On-site
📍
Location
Colombia, Colombia