Evaluate AI-generated responses for accuracy, grammar, and cultural relevance.
Identify issues and provide refined, high-quality rewritten responses.
Create natural prompts and responses in English to improve conversational datasets.
Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They build smarter, more human AI with a diverse community in 100+ countries.
Localize English-based questions into your language.
Provide clear, concise, and verifiable answers.
Cite credible sources to support your answers.
CrowdGen, by Appen, is focused on AI solutions. They offer project-based roles for independent contractors to contribute to AI development and language comprehension projects.
Identify and document issues and suggest corrections.
RWS provides language, content management, and intellectual property support services. They embrace DEI and promote equal opportunity, with a commitment to equal employment opportunity for all employees in a work environment free of discrimination and harassment.
Review and label content for sentiment, factual accuracy, and reasoning issues.
Evaluate model outputs across quality dimensions using scoring frameworks.
Validate automated assessments and identify discrepancies or errors.
Welo Data provides AI services helping to develop and evaluate large language models (LLMs). The job posting does not provide information regarding the company's size and culture.
Review, analyze, and rank AI-models' chains of thought for correctness and approach.
Provide clear, constructive feedback to improve AI-generated responses.
An Enterprise client is seeking talents who are fluent in English who will help train generative artificial intelligence models. They seem to maintain a contractor-based work environment.
You'll work with AI tools, test model outputs, and evaluate responses.
Document errors, gaps, and collaborate with our team.
Spot inconsistencies and provide structured feedback.
Project World Wide is involved in shaping the future of AI through training data. They seek motivated individuals to contribute to the development of cutting-edge AI systems.
You will be matched with another participant for 1-on-1 verbal or text-based exchanges.
Use your natural Dutch from Netherlands dialect to discuss various topics provided by the researcher.
Help the AI understand the nuances, slang, and cultural context of Dutch from the Netherlands, through real-world interaction.
Prolific is building the biggest pool of quality human data in the world. Over 35,000 AI developers, researchers, and organizations use Prolific to gather data from paid study participants with a wide variety of experiences, knowledge, and skills.
Evaluate the relevance of product search results returned for specific queries on e-commerce platforms
Analyze each task consisting of a search query and a corresponding product listing
Use provided context (e.g., search query, search category, and marketplace) to make informed judgments
They are seeking freelance contributors to participate in a search relevance annotation project aimed at improving e-commerce search quality across multiple international markets. This is a remote, task-based opportunity suitable for individuals with a strong command of the English language and an eye for detail.
Make scripted and unscripted calls with an AI agent.
Produce clear, natural speech following provided guidelines.
Test and validate the AI’s ability to understand and interpret speech.
RWS is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment. All employment decisions at RWS are based on business needs, job requirements and individual qualifications.
Challenge AI models on realistic educational scenarios.
Validate whether its understanding of pedagogical concepts reflects best-in-class teaching practice.
Evaluate AI outputs for clarity and correctness, analyze subtle reasoning errors, document gaps in logic.
The company is seeking independent Instructional Experts with hands-on experience teaching, tutoring, or building curriculum to train AI models. As a contractor you’ll supply a secure computer and high-speed internet; company-sponsored benefits such as health insurance and PTO do not apply.
Make scripted and unscripted voice calls with an AI agent.
Produce clear, natural Haitian Creole speech while following the provided guidelines.
Evaluate how well the AI agent understands spoken language, helping test speech recognition and transcription accuracy.
RWS is a company that embraces DEI and promotes equal opportunity. They are committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment.
Support model launch readiness by running evaluations, monitoring and interpreting results, and surfacing regressions or unexpected behavior changes to relevant stakeholders
Partner closely with policy and domain experts throughout the evaluation lifecycle — from identifying risks and scoping the right evaluation approach, to coordinating creation of new evals and ensuring existing ones remain current with evolving policies, threat vectors, and model capabilities
Work with cross-functional stakeholders to help manage evaluation outcomes, including interpreting results and driving mitigations where needed
Anthropic's mission is to create reliable, interpretable, and steerable AI systems. Their team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
Challenge advanced language models on topics like verb conjugation and word order.
Verify factual accuracy and logical soundness, capturing reproducible error traces.
Suggest improvements to prompt engineering and evaluation metrics.
I am unable to extract the company description from this job posting, because Greenhouse is a recruiting platform, and the posting company is not clearly named.
Airtable is the no-code app platform that empowers people closest to the work to accelerate their most critical business processes. More than 500,000 organizations, including 80% of the Fortune 100, rely on Airtable to transform how work gets done.
Review contributor evaluations of model-generated responses to ensure adherence to project-specific guidelines.
Verify that contributors consistently apply all instructions and evaluation criteria when assessing model responses.
Confirm that contributors accurately identify factual errors, hallucinations, or missing information in model responses.
Welo Data, part of Welocalize, is a global AI data company delivering high-quality, ethical data to train the world’s most advanced AI systems. Welo Data has a diverse community in 100+ countries building smarter, more human AI, offering limitless opportunities for the global community to grow and contribute.