Harbor Expert Network

MLOps Engineer Expert

Pay: $110–$160/hr

Required skills

MLOps, Agent evaluation, Production ML, CI/CD for models

About Harbor

Harbor connects domain experts to the development of frontier AI models. Real-world expertise is turned into training data, evaluations, and feedback loops that improve how models perform. AI labs and enterprises use Harbor to train models and build reliable agents through advanced evaluations and field capture programs. Experts contribute directly to how AI systems learn, reason, and perform across robotics, computer vision, wearable AI, industrial inspection, healthcare, and agentic systems. Harbor identifies and vets top talent through an AI recruiter, enabling high-quality contributions at scale.

Job description

Job title: MLOps Engineer Expert

Job type: Contractor

Location: Remote

Job summary: Evaluate agentic systems and MLOps workflows for enterprise AI programs. You will review tool-use traces, deployment patterns, and reliability practices against the standards frontier labs expect.

Key responsibilities

  1. Score agent trajectories for tool selection, recovery behavior, and task completion.
  2. Review MLOps pipelines: versioning, monitoring, rollback, and eval gates.
  3. Identify failure modes in multi-step agent runs (looping, hallucinated tool args, data leaks).
  4. Document best-practice gaps relative to production SRE and ML engineering standards.
  5. Support benchmark design for long-horizon agent and RAG evaluation suites.

Required skills and qualifications

  1. 3+ years shipping or operating ML systems in production environments.
  2. Hands-on experience with observability, model registries, and automated eval hooks.
  3. Ability to read logs, traces, and infrastructure configs and render expert judgments quickly.
  4. Clear written technical communication for mixed research and platform audiences.

Preferred qualifications

  1. Experience evaluating LLM agents, LangGraph-style workflows, or RLHF pipelines.
  2. Contributions to open-source MLOps or agent frameworks.