$8–$25/hr
- Create and execute role-play–based evaluation scenarios that simulate realistic customer service interactions.
- Contribute to the development of diverse and representative datasets used to assess conversational audio agents.
- Evaluate model performance across a standardized set of qualitative and quantitative metrics.