Similar Jobs

See all

Responsibilities:

  • Create and execute role-play–based evaluation scenarios.
  • Contribute to the development of diverse datasets.
  • Evaluate model performance across standardized metrics.

Evaluation Metrics:

  • Task completion accuracy and efficiency
  • Conversational naturalness
  • Audio comprehension and response quality

Technical & Equipment Requirements:

  • Strong verbal communication skills
  • Access to a high-quality microphone
  • Comfort working with structured prompts

Project World Wide

They are dedicated to assessing and benchmarking advanced agentic audio models against leading systems. The program’s mission is to evaluate and optimize model performance for real-world customer support use cases.

Apply for This Position