Full TimeRemote
Engineering Manager – Evaluation & Observability
At AgentsFlow, you’ll work on cutting-edge AI governance solutions used by enterprises navigating complex regulatory and operational challenges.
Apply For This JobTeam
AI Governance Platform – Evaluation & Observability
Location
US / Remote
Experience
8-10 years
Salary
USD 170,000 – 190,000
Posted
25 March 2026
Apply by
31 July 2026
Required Skills
AI Governance,Langchain, Google vertex AI
About the Role
You will build and lead the evaluation, monitoring, and observability layer of our AI governance platform—core to trust, compliance, and auditability.
What You’ll Do
- Build systems for:
- Model evaluation (accuracy, bias, drift, hallucination detection)
- Agent monitoring (inputs/outputs, decisions, tool usage)
- Policy enforcement + audit logs
- Design observability pipelines (real-time + batch)
- Define evaluation frameworks aligned with NIST AI Risk Management Framework
- Work on explainability, traceability, and audit artifacts
- Lead a team of engineers (back-end + data + ML)
What We’re Looking For
- 8–12 years engineering experience, 3+ years managing teams
- Strong back-end/data systems experience (event pipelines, logging, distributed systems)
- Familiarity with LLM/agent evaluation techniques
- Experience building observability platforms
Nice to Have
- Experience with AI governance / compliance systems
- Knowledge of EU AI Act technical requirements
Ready to apply? We'd love to hear from you.
Apply For This Job