Jobs Similar to AI Transcription Evaluator

AI Transcription Evaluator

Neon 4 days ago

Global

Evaluate how accurately LLM-generated transcriptions capture conversations from audio recordings.
Generate high-quality human annotations to correct model-generated transcripts.
Apply consistent annotations by following clear taxonomies and detailed evaluation guidelines.

LLMs Writing Data

8 jobs similar to AI Transcription Evaluator

Jobs ranked by similarity.

AI LLM Evaluation - Javanese Speakers

CrowdGen 22 days ago

$2/hr

Indonesia

Review short, pre-segmented datasets.
Evaluate model-generated replies based on Tone or Fluency.
Read a user prompt and two model replies, then rate each using a five-point scale.

CrowdGen, by Appen, focuses on AI response evaluation. They are looking for native Javanese speakers to contribute to a multilingual AI response evaluation project where you review large language model outputs.

AI Language Expert - French

Alignerr 23 days ago

$25–$30/hr

Global

Evaluate AI-generated French speech and text for linguistic accuracy, naturalness, and educational quality.
Assess learner speech and writing across proficiency levels from CEFR Pre-A1 through B2+.
Apply expert judgment to identify learner errors, unnatural phrasing, and pedagogical gaps.

Alignerr partners with leading AI labs to build expert-driven data pipelines. They improve how models reason, learn, and communicate by working with domain specialists to evaluate and refine AI systems where precision, pedagogy, and human judgment matter most.

AI Language Expert - Japanese

Alignerr 28 days ago

$30–$35/hr

Global

Evaluate AI-generated Japanese speech and text for linguistic accuracy, naturalness, and educational quality.
Assess learner speech and writing across proficiency levels from CEFR Pre-A1 through B2+.
Apply expert judgment to identify learner errors, unnatural phrasing, and pedagogical gaps.

Alignerr partners with leading AI labs to build expert-driven data pipelines that improve how models reason, learn, and communicate. They work with domain specialists around the world to evaluate and refine AI systems in areas where precision, pedagogy, and human judgment matter most.

AI Service General Application

Welo Data 23 days ago

Global

Native or near-native fluency in Central Khmer.
Based in: Cambodia, Thailand.
Comfortable with digital tools.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They’re building smarter, more human AI with a diverse community in 100+ countries.

Audio Evaluator

Welo Data 4 hours ago

Global

Listen to short audio clips in Russian and evaluate them using a defined rubric.
Accurately identify target accents from provided audio samples.
Compare multiple recordings and assess which one sounds more natural in relation to the target accent.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They’re building smarter, more human AI with a diverse community in 100+ countries.

AI Visual Content Quality Evaluator

Blueprint 10 days ago

$25–$27/hr

Evaluate AI-generated presentations for accuracy and visual quality.
Provide detailed feedback to improve future AI performance.
Collaborate with product, design, and content partners to refine criteria.

Blueprint is a technology solutions firm headquartered in Bellevue, Washington, with a strong presence across the United States. They solve complicated problems, using technology to bridge the gap between strategy and execution, powered by the knowledge, skills, and expertise of their teams. They are bold, smart, agile, and fun.

Medical Secretaries and Administrative Assistant Professionals

Handshake 22 days ago

Global

Evaluate AI model outputs related to your field.
Assess content and provide feedback to strengthen the model’s understanding.
Develop prompts for AI models reflecting your field and evaluate responses.

Handshake is recruiting Medical Secretaries and Administrative Assistant Professionals to contribute to an hourly, temporary AI research project. The Handshake AI opportunity runs year-round, with project opportunities opening periodically across different areas of expertise.

Lyrics Specialist

Genius 10 days ago

Provide complete and accurate transcription and sync of new releases.
Review and edit community transcriptions for accuracy and completeness.
Match new release transcriptions to the Apple Music database.

Genius is the premier global database of lyrics and artist-focused content, celebrating the lyrics, stories behind the songs, and creative connections that meaningfully drive culture. They spotlight the artists who are shaping music culture across every genre and musical discipline, sharing the stories behind their creativity and craft with over 90M people each month.

