Evaluate model-generated replies based on Tone or Fluency.
Read a user prompt and two model replies, then rate each using a five-point scale.
CrowdGen, by Appen, focuses on AI response evaluation. They are looking for native Javanese speakers to contribute to a multilingual AI response evaluation project by reviewing large language model outputs.
Evaluate AI-generated French speech and text for linguistic accuracy, naturalness, and educational quality.
Assess learner speech and writing across proficiency levels from CEFR Pre-A1 through B2+.
Apply expert judgment to identify learner errors, unnatural phrasing, and pedagogical gaps.
Alignerr partners with leading AI labs to build expert-driven data pipelines. They improve how models reason, learn, and communicate by working with domain specialists to evaluate and refine AI systems where precision, pedagogy, and human judgment matter most.
Evaluate AI-generated Japanese speech and text for linguistic accuracy, naturalness, and educational quality.
Assess learner speech and writing across proficiency levels from CEFR Pre-A1 through B2+.
Apply expert judgment to identify learner errors, unnatural phrasing, and pedagogical gaps.
Alignerr partners with leading AI labs to build expert-driven data pipelines that improve how models reason, learn, and communicate. They work with domain specialists around the world to evaluate and refine AI systems in areas where precision, pedagogy, and human judgment matter most.
Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They’re building smarter, more human AI with a diverse community in 100+ countries.
Listen to short audio clips in Russian and evaluate them using a defined rubric.
Accurately identify target accents from provided audio samples.
Compare multiple recordings and assess which one sounds more natural in relation to the target accent.
Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They’re building smarter, more human AI with a diverse community in 100+ countries.
Evaluate AI-generated presentations for accuracy and visual quality.
Provide detailed feedback to improve future AI performance.
Collaborate with product, design, and content partners to refine criteria.
Blueprint is a technology solutions firm headquartered in Bellevue, Washington, with a strong presence across the United States. They solve complicated problems, using technology to bridge the gap between strategy and execution, powered by the knowledge, skills, and expertise of their teams. They are bold, smart, agile, and fun.
Assess content and provide feedback to strengthen the model’s understanding.
Develop prompts for AI models reflecting your field and evaluate responses.
Handshake is recruiting Medical Secretaries and Administrative Assistant Professionals to contribute to an hourly, temporary AI research project. The Handshake AI opportunity runs year-round, with project opportunities opening periodically across different areas of expertise.
Provide complete and accurate transcription and sync of new releases.
Review and edit community transcriptions for accuracy and completeness.
Match new release transcriptions to Apple Music database.
Genius is the premier global database of lyrics and artist-focused content, celebrating the lyrics, stories behind the songs, and creative connections that meaningfully drive culture. They spotlight the artists who are shaping music culture across every genre and musical discipline, sharing the stories behind their creativity and craft with 90M+ people each month.