Engineering Manager – Evaluation & Observability
Apply For This JobTeam
AI Governance Platform – Evaluation & Observability
Location
US / Remote
About the Role
You will build and lead the evaluation, monitoring, and observability layer of our AI governance platform—core to trust, compliance, and auditability.
What You’ll Do
- Build systems for:
- Model evaluation (accuracy, bias, drift, hallucination detection)
- Agent monitoring (inputs/outputs, decisions, tool usage)
- Policy enforcement + audit logs
- Design observability pipelines (real-time + batch)
- Define evaluation frameworks aligned with NIST AI Risk Management Framework
- Work on explainability, traceability, and audit artifacts
- Lead a team of engineers (back-end + data + ML)
What We’re Looking For
- 8–12 years engineering experience, 3+ years managing teams
- Strong back-end/data systems experience (event pipelines, logging, distributed systems)
- Familiarity with LLM/agent evaluation techniques
- Experience building observability platforms
Nice to Have
- Experience with AI governance / compliance systems
- Knowledge of EU AI Act technical requirements
Ready to apply? We'd love to hear from you.
Apply For This Job