You'll work with AI tools, test model outputs, and evaluate responses.
Document errors, gaps, and collaborate with our team.
Spot inconsistencies and provide structured feedback.
Project World Wide is involved in shaping the future of AI through training data. They seek motivated individuals to contribute to the development of cutting-edge AI systems.
Label and Rank: Accurately label and rank machine learning data with advanced proficiency in Italian, ensuring data integrity and quality.
Audit and Correct: Scrutinize and rectify any inaccuracies in language data, maintaining the highest standard of data accuracy.
Reading and Text-Based Tasks: Efficiently complete reading and text-based assignments, with high attention to detail.
Cohere is dedicated to scaling intelligence to serve humanity by training and deploying frontier models for developers and enterprises. They are a team of researchers, engineers, and designers passionate about their craft, fostering a diverse and inclusive work environment.
You will be matched with another participant for 1-on-1 verbal or text-based exchanges.
Use your natural Dutch from Netherlands dialect to discuss various topics provided by the researcher.
Help the AI understand the nuances, slang, and cultural context of Dutch from the Netherlands, through real-world interaction.
Prolific is building the biggest pool of quality human data in the world. Over 35,000 AI developers, researchers, and organizations use Prolific to gather data from paid study participants with a wide variety of experiences, knowledge, and skills.
Write or rewrite high-quality business review summaries tailored to the Spanish (Spain) locale.
Adjust tone, style, and register to align with professional standards and local conventions.
Ensure cultural appropriateness and relevance for the target audience.
Welo Data provides AI Services. They are looking for detail-oriented language professionals to support an ongoing project focused on writing and refining business review summaries.
Review contributor evaluations of model-generated responses to ensure adherence to project-specific guidelines.
Verify that contributors consistently apply all instructions and evaluation criteria when assessing model responses.
Confirm that contributors accurately identify factual errors, hallucinations, or missing information in model responses.
Welo Data, part of Welocalize, is a global AI data company delivering high-quality, ethical data to train the world’s most advanced AI systems. Welo Data has a diverse community in 100+ countries building smarter, more human AI, offering limitless opportunities for the global community to grow and contribute.
Challenge AI models on realistic educational scenarios.
Validate whether its understanding of pedagogical concepts reflects best-in-class teaching practice.
Evaluate AI outputs for clarity and correctness, analyze subtle reasoning errors, document gaps in logic.
The company is seeking independent Instructional Experts with hands-on experience teaching, tutoring, or building curriculum to train AI models. As a contractor you’ll supply a secure computer and high-speed internet; company-sponsored benefits such as health insurance and PTO do not apply.
Evaluate the relevance of product search results returned for specific queries on e-commerce platforms
Analyze each task consisting of a search query and a corresponding product listing
Use provided context (e.g., search query, search category, and marketplace) to make informed judgments
They are seeking freelance contributors to participate in a search relevance annotation project aimed at improving e-commerce search quality across multiple international markets. This is a remote, task-based opportunity suitable for individuals with a strong command of the English language and an eye for detail.
Converse with the model on language scenarios, verify factual accuracy and logical soundness.
Capture reproducible error traces and suggest improvements to our prompt engineering and evaluation metrics.
Challenge advanced language models on topics like verb conjugation, noun-adjective agreement, sentence structure, word order, accentuation, and colloquial expressions.
They are evolving large-scale language models from clever chatbots into powerful engines of linguistic discovery. This project needs your expertise to help power the next generation of AI with high‑quality training data, tomorrow’s AI that can democratize world‑class education.
Participate in 15–60 minute recorded conversations.
Collaborate with the Data Operations team.
Contribute to high-quality conversational datasets.
Neon collaborates with prominent AI labs and tech companies to create premium conversational voice datasets, fostering advancements in speech and conversational AI. They seem to be a smaller company focusing on specialized data solutions.
Work in the Data Science team developing advanced AI solutions to drive marketplace growth and efficiency.
Build models, test them, and deploy them to production to measure the true impact of your models.
Translate business requirements into achievable projects and collaborate with other teams.
OLX makes it safe, smart, and convenient to buy and sell cars, find housing, get jobs, buy and sell household goods, and more. They serve millions of people around the world every month through well-loved consumer brands including OLX, Otodom, AutoTrader, Property24.
Design and curate evaluation datasets for retrieval quality.
Measure retrieval quality using metrics like Recall@k, Precision@k, MRR, and NDCG@k.
Conduct systematic error analysis on AI/ML system outputs; build structured failure taxonomies.
Jump empowers financial advisors, firms, and clients to thrive in the age of AI by automating tasks like meeting prep and compliance. As a Series A company, Jump has raised $30M and grown to 100+ employees including leaders from top companies and schools, fostering a culture of velocity, world-class standards, direct communication, and kindness.