Define the technical vision for creating a specialized pool of profiles intended to train AI models.
Serve as the key expert liaison with JobTeaser's strategic partners to validate methodologies and quality.
Work with the Product and Strategy team to translate the needs of AI partners into product specifications.
JobTeaser connects young graduates with companies, integrating into European schools and universities. With over 2 million students and young graduates across Europe, JobTeaser provides a platform for students to find internships and jobs, and for companies to access young talent.
Perform sampling and quality checks on annotated datasets to ensure adherence to annotation guidelines
Identify, log, and categorize annotation defects with severity levels, tracking corrective actions and rework tasks
Coordinate onboarding training, calibration sessions, and refresher training for annotators and reviewers
Welo Data is a multilingual data and evaluation partner for foundation labs and enterprises deploying GenAI systems globally, delivering human judgment, data infrastructure, and evaluation systems for reliable AI performance across languages and cultures. It operates with a global network of over 500,000 vetted experts across 300+ languages, leveraging a unified model led by specialized experts with proprietary identity and fraud-prevention frameworks to ensure accurate and culturally grounded datasets.
Coordinate project workstreams to ensure on-time delivery against execution standards.
Deliver high-quality annotation and QA to establish quality benchmarks.
Analyze datasets to surface patterns and optimization opportunities.
Appen specializes in human-generated data to train, fine-tune, and evaluate models across generative AI, large language models, computer vision, and speech recognition. They have over 1 million contributors in over 200 countries supporting model pre-training, supervised fine-tuning, evaluation and benchmarking, safety and red teaming, and multilingual global expansion.
Evaluate AI responses across scenarios like general Q&A and web search results.
Perform side-by-side comparisons of AI-generated responses, judging accuracy and clarity.
Apply detailed guidelines, maintaining consistency and high-quality evaluations.
Blueprint Technologies is a technology solutions firm that helps organizations grow, transform, and innovate. They have a strong presence across the United States and are expanding across Latin America, with teams united by a shared passion for solving complex problems.
Support the evaluation and labeling of images, search queries and other forms of data.
Understand challenging guidelines and articulate understanding.
Help LLMs learn the intricacies of language and reasoning.
Innodata is a global data engineering company that believes data and Artificial Intelligence (AI) are inextricably linked. Our mission is to enable the responsible advancement of artificial intelligence by providing the data, evaluation frameworks, and human expertise required to build AI systems that can be trusted at scale.
Source candidates using LinkedIn, Boolean searches, AI tools, Apollo, Google, GitHub, portfolios, social media, and alternative sourcing channels.
Support executive search and recruiting projects across multiple international markets and industries.
Use ChatGPT, Claude, Gemini, AI workflows, and automation tools to improve sourcing, outreach, research, reporting, and operational efficiency.
CONFISA INTERNATIONAL GROUP has over 35 years of experience providing integral Human Resources and Business Consulting Services to domestic and multinational corporations. CONFISA provides effective and responsive retained executive search services across multiple industries and organizational areas.
Listen to recorded audio files in the target language.
Transcribe conversations accurately and clearly, following formatting and transcription guidelines.
Review and correct transcription errors when needed, completing assigned tasks within the required deadlines.
Terry Soot Management Group (TSMG) is a field data collection company founded in 2017 in Europe. We collect data where automation is not possible, supporting projects involving speech recording, transcription, image and video collection, mapping, and AI training across Europe and North America.
Review and interpret financial reports, B2B data, or regulatory filings to verify information accuracy.
Respond to specific prompts based on financial data to help AI models understand technical terminology and complex fiscal concepts.
Ensure that the outputs generated by AI systems align with professional financial standards and logical economic frameworks.
Prolific is building the biggest pool of quality human data in the world. Over 35,000 AI developers, researchers, and organizations use Prolific to gather data from paid study participants with a wide variety of experiences, knowledge, and skills; they connect researchers and companies with a global pool of participants, enabling the collection of high-quality, ethically sourced human behavioural data and feedback.
Assess the factual accuracy, relevance, and quality of AI-generated Computer Science content
Craft and answer domain-specific questions related to Computer Science and adjacent technical disciplines
Evaluate and rank AI-generated responses based on technical correctness and reasoning quality
The company is seeking Computer Science Experts with PhDs to support the training and evaluation of advanced AI models. This initiative focuses on improving the accuracy, reasoning, and domain expertise of generative AI systems through expert human feedback.
Own product strategy and execution for CoLab’s AutoReview capabilities.
Conduct deep customer research to understand workflow pain points, trust barriers, and adoption patterns.
Define product requirements and make prioritization trade-offs across UX, technical feasibility, and business impact.
CoLab helps mechanical engineering teams bring life-changing products to market sooner, using an AI platform for stronger engineering decisions. The company, founded in St. John’s, Newfoundland, has grown quickly since 2019 and has been recognized on Deloitte’s Fast 50™ and Fast 500™.
Record audio/video tasks that help train next-generation AI systems.
Review requirements, then continue to partner onboarding.
Participate in audio/video recording tasks that may involve speaking, labeling, reviewing, or following on-screen instructions.
Wing Data supports clients on AI Training Audio & Video Participation Study. They assign project-based work based on availability, where work may range from a single 5-minute session to several hours of work.
Evaluate outputs based on accuracy, relevance, clarity, and instruction-following.
Perform side-by-side (SBS) comparisons of AI-generated responses.
Identify nuances in tone, meaning, and cultural context across French.
Blueprint Technologies is a technology solutions firm headquartered in Bellevue, Washington, with a strong presence across the United States and an expanding footprint across Latin America (LATAM). They are united by a shared passion for solving complex problems and bring diverse perspectives, deep expertise, and real-world experience across industries to help organizations grow, transform, and innovate.
Identify and label languages and dialects from model-generated responses.
Review outputs from two different AI models and determine which model correctly identified the proposed language.
Compare model responses and select the appropriate evaluation outcome from predefined options
RWS – TrainAI is looking for Language Data Annotators. They embrace DEI and promotes equal opportunity and prohibits discrimination and harassment of any kind.
Perform annotation and labeling tasks for generative AI datasets, including text, image, video, and multimodal content.
Create, review, and evaluate prompts and responses across a variety of domains and use cases.
Conduct quality assurance reviews to ensure annotation accuracy, consistency, and adherence to guidelines.
Welo Data delivers multilingual content transformation services in translation, localization, and adaptation for over 250 languages. They drive innovation in language services, delivering high-quality training data transformation solutions for NLP-enabled machine learning, with a network of over 400,000 in-country linguistic resources.
Critically assessing the accuracy and relevance of AI-generated summaries and recommendations against source medical documents.
Evaluating pre-release versions of product features, leveraging knowledge of claims handling and decision-making processes.
Providing detailed feedback on how the product integrates into existing daily claims management workflows.
EvolutionIQ's mission is to deliver state of the art technology that helps insurance claims teams make claims handling more accurate, fair, and efficient. They are experiencing massive growth and want to hire world-class talent who want to help build and scale internally, and transform the insurance space.
Reviewing, annotating, and testing AI outputs for grammatical accuracy.
Acting as a primary quality check to proactively identify and correct subtle cultural errors.
Analyzing task quality trends and developing educational resources for AI task outputs.
They are sourcing independent Language Alignment & Resource Partners to provide native-level Arabic language vetting and QA for a specialized AI data project. As a contractor, you will supply your own equipment, and company-sponsored benefits do not apply.
Lead, train, and manage our in-house data labeling team.
Define, execute, and continuously improve data annotation processes with a very high attention to detail.
Ensure high-quality data outputs and meet rigorous accuracy and consistency standards.
Reducto provides a complete toolkit for handling any workflow by understanding documents the way a human would. They have raised over $100M and partner with hundreds of companies, from leading AI teams to enterprise costumers across FAANG and top trading firms.