*Mission Summary:** The Behavior Understanding and Evaluation team at Motional is responsible for defining how we measure and validate autonomous vehicle behavior at scale. To prepare for large-scale driverless deployment, manual review and static metrics thresholds are no longer sufficient. We aim