AgentsFlow/Careers/Engineering Manager – Evaluation & Observability

Engineering Manager – Evaluation & Observability

Apply For This Job

Team

AI Governance Platform – Evaluation & Observability

Location

US / Remote

About the Role

You will build and lead the evaluation, monitoring, and observability layer of our AI governance platform—core to trust, compliance, and auditability.

What You’ll Do

  • Build systems for:
  • Model evaluation (accuracy, bias, drift, hallucination detection)
  • Agent monitoring (inputs/outputs, decisions, tool usage)
  • Policy enforcement + audit logs
  • Design observability pipelines (real-time + batch)
  • Define evaluation frameworks aligned with NIST AI Risk Management Framework
  • Work on explainability, traceability, and audit artifacts
  • Lead a team of engineers (back-end + data + ML)

What We’re Looking For

  • 8–12 years engineering experience, 3+ years managing teams
  • Strong back-end/data systems experience (event pipelines, logging, distributed systems)
  • Familiarity with LLM/agent evaluation techniques
  • Experience building observability platforms

Nice to Have

  • Experience with AI governance / compliance systems
  • Knowledge of EU AI Act technical requirements

Ready to apply? We'd love to hear from you.

Apply For This Job