Make scripted and unscripted calls with an AI agent.
Produce clear, natural speech following provided guidelines.
Test and validate the AI’s ability to understand and interpret speech.
RWS is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment. All employment decisions at RWS are based on business needs, job requirements and individual qualifications.
Evaluate AI-generated responses for accuracy, grammar, and cultural relevance.
Identify issues and provide refined, high-quality rewritten responses.
Create natural prompts and responses in English to improve conversational datasets.
Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They build smarter, more human AI with a diverse community in 100+ countries.
You will be matched with another participant for 1-on-1 verbal or text-based exchanges.
Use your natural Dutch from Netherlands dialect to discuss various topics provided by the researcher.
Help the AI understand the nuances, slang, and cultural context of Dutch from the Netherlands, through real-world interaction.
Prolific is building the biggest pool of quality human data in the world. Over 35,000 AI developers, researchers, and organizations use Prolific to gather data from paid study participants with a wide variety of experiences, knowledge, and skills.
Create and execute role-play–based evaluation scenarios that simulate realistic customer service interactions.
Contribute to the development of diverse and representative datasets used to assess conversational audio agents.
Evaluate model performance across a standardized set of qualitative and quantitative metrics.
They are dedicated to assessing and benchmarking advanced agentic audio models against leading systems. The program’s mission is to evaluate and optimize model performance for real-world customer support use cases.
You'll work with AI tools, test model outputs, and evaluate responses.
Document errors, gaps, and collaborate with our team.
Spot inconsistencies and provide structured feedback.
Project World Wide is involved in shaping the future of AI through training data. They seek motivated individuals to contribute to the development of cutting-edge AI systems.
Review, analyze, and rank AI-models' chains of thought for correctness and approach.
Provide clear, constructive feedback to improve AI-generated responses.
An Enterprise client is seeking talents who are fluent in English who will help train generative artificial intelligence models. They seem to maintain a contractor-based work environment.
Challenge AI models on realistic educational scenarios.
Validate whether its understanding of pedagogical concepts reflects best-in-class teaching practice.
Evaluate AI outputs for clarity and correctness, analyze subtle reasoning errors, document gaps in logic.
The company is seeking independent Instructional Experts with hands-on experience teaching, tutoring, or building curriculum to train AI models. As a contractor you’ll supply a secure computer and high-speed internet; company-sponsored benefits such as health insurance and PTO do not apply.
Migrate and test existing bulk flashcard creation prompts.
Run test suites and manually review AI outputs for quality and correctness.
Analyze real user data to identify failure patterns and improve prompts.
Brainscape is the world's leading web & mobile EdTech study platform. They help millions of learners create better flashcards and the company is looking for an AI Prompt Engineer to join their team.
Review and label content for sentiment, factual accuracy, and reasoning issues.
Evaluate model outputs across quality dimensions using scoring frameworks.
Validate automated assessments and identify discrepancies or errors.
Welo Data provides AI services helping to develop and evaluate large language models (LLMs). The job posting does not provide information regarding the company's size and culture.
Deliver fast, friendly, and accurate support to Descript users through live chat, email, and occasional video conferencing.
Guide users through product workflows, explain feature behavior, and help troubleshoot technical issues.
Contribute to team knowledge by flagging trends, bugs, and documentation gaps.
Descript is building a simple, intuitive, fully-powered editing tool for video and audio — an editing tool built for the age of AI. They are a team of 150 and have the backing of some of the world's greatest investors; they believe each new employee has a measurable influence on the direction of the company.
Khan Academy is a fast-paced nonprofit on a mission to provide free, world-class education for anyone, anywhere. We reach millions of students every month and are growing rapidly.