Participate in short voice conversations with AI models using a designated platform.
Evaluate AI responses based on relevance, accuracy, clarity, and overall quality.
Provide objective ratings and feedback according to project guidelines.
RWS is a company that provides technology-enabled language, content management, and intellectual property services. They embrace diversity and promote equal opportunity, with a focus on supporting the development of AI models through human evaluation and training.
Evaluate AI-generated Tamil audio for natural emotional and cultural nuance.
Perform sentiment analysis to assess model’s conveyance of intonations and feelings.
Identify and report unnatural or culturally mismatched tones in AI output.
Prolific builds the biggest pool of quality human data in the world, connecting researchers with over 35,000 paid study participants. It is a platform that enables the collection of high-quality, ethically sourced human behavioral data for AI development.
Identify and label languages and dialects from model-generated responses.
Review outputs from two different AI models and determine which model correctly identified the proposed language.
Compare model responses and select the appropriate evaluation outcome from predefined options
RWS – TrainAI is looking for Language Data Annotators. They embrace DEI and promotes equal opportunity and prohibits discrimination and harassment of any kind.
Review, evaluate, and annotate AI-generated content across text, images, audio, and video.
Perform quality checks to ensure accuracy, consistency, and compliance with guidelines.
Identify edge cases and contribute to high-quality datasets for training advanced AI models.
Welo Data, part of Welocalize, is a global AI data company with over 500,000 contributors delivering high-quality, ethical data to train advanced AI systems. They foster a diverse community across 100+ countries and offer flexible, project-based opportunities with complete autonomy.
Evaluate Korean-English translations produced by an AI chatbot for accuracy and adherence to user-specified requirements.
Apply MQM error annotations, verify auto-generated rubric items, and evaluate each item as Pass or Fail.
Ensure consistent and accurate ratings while completing assigned tasks within given timelines.
Appen helps improve AI-powered translation systems through crowd-based evaluation projects. They are a large global company offering project-based independent contractor roles, fostering a culture of flexibility and remote work.
Contribute to training smarter, more inclusive AI through flexible, remote projects.
Work on annotation, evaluation, and prompt creation tasks tailored to your skills.
Join a global community of linguists and culturally aware contributors shaping safer AI.
Welo Data, part of Welocalize, is a global AI data company with over 500,000 contributors delivering high-quality, ethical data to train advanced AI systems. We are building a diverse community in 100+ countries, offering limitless opportunities for growth and contribution on your own terms.
Engage in natural conversations with two AI models and evaluate their performances.
Compare and rank models based on provided criteria after each dialogue.
Submit pass/fail votes for each model to help improve AI quality.
An enterprise client helps innovative companies improve their AI models through human feedback. They are seeking a high volume of freelancers for conversational AI training tasks.
Design complex evaluation frameworks and execute role-play scenarios simulating realistic customer service interactions.
Audit AI model performance across standardized metrics, focusing on task completion, conversational naturalness, and audio comprehension.
Generate diverse, high-quality audio datasets and assess basic computer programming literacy in JSON and structured data reasoning.
The company is an AI benchmarking project that evaluates advanced agentic audio models through simulated customer support scenarios. It is a freelance contractor project with no specific company size or culture mentioned beyond the project scope.
Review English source documents alongside two machine-generated Urdu translations.
Evaluate both variants based on accuracy, fluency, and overall translation quality.
Select the preferred translation and provide a clear written justification for your assessment.
Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They are building smarter, more human AI with a diverse community in 100+ countries.
Audit and evaluate chatbot conversations based on core dimensions.
Follow project-specific guidelines for accurate evaluations.
Use a proprietary client platform to complete tasks.
RWS Group provides technology-enabled language, content management and intellectual property services. They embrace DEI and promote equal opportunity; they are an Equal Opportunity Employer and prohibit discrimination and harassment of any kind.
Review and evaluate AI-generated content across text, images, audio, and video.
Perform quality checks to ensure accuracy, consistency, and compliance with project guidelines.
Identify edge cases and contribute to high-quality dataset development.
Welo Data is a global AI data company that provides high-quality, ethical data to train advanced AI systems. It has over 500,000 contributors worldwide and offers flexible, project-based opportunities with a focus on community and growth.
Rate the performance of AI models or algorithms based on their output or behavior.
Label elements of content, assign categories, and evaluate quality or appropriateness.
Generate additional training data by transforming original data like text, images, or audio.
Innodata is a global data engineering company enabling responsible AI advancement. With over 36 years of experience, we deliver high-quality data and services to AI builders and adopters.
Reviewing, annotating, and testing AI outputs for grammatical accuracy.
Acting as a primary quality check to proactively identify and correct subtle cultural errors.
Analyzing task quality trends and developing educational resources for AI task outputs.
They are sourcing independent Language Alignment & Resource Partners to provide native-level Arabic language vetting and QA for a specialized AI data project. As a contractor, you will supply your own equipment, and company-sponsored benefits do not apply.
Participants will complete recorded search sessions focused on a unique primary search query.
All recordings must follow the project’s recording, safety, and privacy guidelines.
Contributors are encouraged to interact naturally with the platform.
CrowdGen by Appen is a company that focuses on AI training data. They provide opportunities to work on real-world computer vision and OCR-related projects as an Independent Contractor.
Listen to recorded audio files in the target language.
Transcribe conversations accurately and clearly, following formatting and transcription guidelines.
Review and correct transcription errors when needed, completing assigned tasks within the required deadlines.
Terry Soot Management Group (TSMG) is a field data collection company founded in 2017 in Europe. We collect data where automation is not possible, supporting projects involving speech recording, transcription, image and video collection, mapping, and AI training across Europe and North America.
Engage in conversations with a real-time speech-to-speech AI model.
Evaluate performance based on speech recognition, audio quality, conversation flow, and content accuracy.
Provide accurate and consistent ratings based on project guidelines.
Appen is a company that focuses on improving real-time conversational AI. They leverage independent contractors and project-based opportunities to enhance multilingual voice interactions. They seem to foster a community-driven environment.
Complete short voice recordings or conversation tasks for AI training purposes.
Follow clear project instructions to ensure natural, accurate, and usable speech data.
Join tasks when available; assignments may range from a few minutes to multiple hours.
Wing Data supports clients' AI training projects through voice-based tasks. They value flexibility and offer project-based opportunities for individuals to contribute to AI speech systems.
Evaluate machine-translated song lyrics from Japanese to English for quality and accuracy.
Provide ratings based on predefined criteria for meaning, fluency, and naturalness.
Identify mistranslations, awkward phrasing, and cultural inaccuracies without rewriting or editing.
Bellatrix is a language services company specializing in translation and localization. They are a global organization offering freelance opportunities and utilize technology to support their hiring process.