Full Time, Remote

Engineering Manager – Evaluation & Observability

At AgentsFlow, you’ll work on cutting-edge AI governance solutions used by enterprises navigating complex regulatory and operational challenges.

Apply For This Job

Team

AI Governance Platform – Evaluation & Observability

Location

US / Remote

Experience

8-10 years

Salary

USD 170,000 – 190,000

Posted

25 March 2026

Apply by

31 July 2026

Required Skills

AI Governance, LangChain, Google Vertex AI

About the Role

You will build and lead the evaluation, monitoring, and observability layer of our AI governance platform. This layer is core to trust, compliance, and auditability.

What You’ll Do

  • Build systems for:
      • Model evaluation (accuracy, bias, drift, hallucination detection)
      • Agent monitoring (inputs/outputs, decisions, tool usage)
      • Policy enforcement and audit logs
  • Design observability pipelines (real-time + batch)
  • Define evaluation frameworks aligned with the NIST AI Risk Management Framework
  • Work on explainability, traceability, and audit artifacts
  • Lead a team of engineers (back-end + data + ML)

What We’re Looking For

  • 8–12 years of engineering experience, with 3+ years managing teams
  • Strong back-end/data systems experience (event pipelines, logging, distributed systems)
  • Familiarity with LLM/agent evaluation techniques
  • Experience building observability platforms

Nice to Have

  • Experience with AI governance / compliance systems
  • Knowledge of the EU AI Act's technical requirements

Ready to apply? We'd love to hear from you.