A project dedicated to assessing and benchmarking advanced agentic audio models against leading systems. The program’s mission is to evaluate and optimize model performance for real-world customer support use cases. Responsibilities Create and execute role-play–based evaluation scenarios that simula