**Key Responsibilities**

- Design and implement comprehensive monitoring and observability systems for all live AI agents, tracking response quality, latency, error rates, and conversation outcomes
- Build and maintain evaluation frameworks to measure agent performance against defined benchmarks, includ