Source Job

20 jobs similar to AI LLM Evaluation - Javanese Speakers

Jobs ranked by similarity.

Global

  • Evaluate AI-generated Japanese speech and text for linguistic accuracy, naturalness, and educational quality.
  • Assess learner speech and writing across proficiency levels from CEFR Pre-A1 through B2+.
  • Apply expert judgment to identify learner errors, unnatural phrasing, and pedagogical gaps.

Alignerr partners with leading AI labs to build expert-driven data pipelines that improve how models reason, learn, and communicate. They work with domain specialists around the world to evaluate and refine AI systems in areas where precision, pedagogy, and human judgment matter most.

$25–$30/hr
Global

  • Evaluate AI-generated French speech and text for linguistic accuracy, naturalness, and educational quality.
  • Assess learner speech and writing across proficiency levels from CEFR Pre-A1 through B2+.
  • Apply expert judgment to identify learner errors, unnatural phrasing, and pedagogical gaps.

Alignerr partners with leading AI labs to build expert-driven data pipelines. They improve how models reason, learn, and communicate by working with domain specialists to evaluate and refine AI systems where precision, pedagogy, and human judgment matter most.

Global

  • Native or near-native fluency in Central Khmer.
  • Based in: Cambodia, Thailand.
  • Comfortable with digital tools.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They’re building smarter, more human AI with a diverse community in 100+ countries.

Europe

  • Contribute to building smarter, more inclusive AI systems.
  • Work on annotation, evaluation, and prompt creation projects.
  • Join a global network of linguists and language enthusiasts.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems.

$30–$35/hr

  • Evaluate AI-generated Korean speech and text for linguistic accuracy, naturalness, and educational quality.
  • Assess learner speech and writing across proficiency levels from CEFR Pre-A1 through B2+.
  • Apply expert judgment to identify learner errors, unnatural phrasing, and pedagogical gaps.

Alignerr collaborates with top AI labs, creating data pipelines driven by experts to enhance AI models' reasoning, learning, and communication. They partner with domain specialists worldwide, perfecting AI systems where precision, pedagogy, and human judgment are crucial.

Europe

  • Evaluate AI-generated responses for accuracy, grammar, and cultural relevance.
  • Identify issues and provide refined, high-quality rewritten responses.
  • Create natural prompts and responses in Spanish to improve conversational datasets.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They're building smarter, more human AI with a diverse community in 100+ countries.

$28–$28/hr
Europe

  • Review and evaluate AI-generated content to ensure accuracy, clarity, and proper source attribution.
  • Utilize linguistic expertise to create data and then evaluate the resulting AI-generated content.
  • Adhere strictly to detailed annotation and fact-checking guidelines provided in English.

RWS embraces DEI and promotes equal opportunity, we are an Equal Opportunity Employer and prohibit discrimination and harassment of any kind.

Global

  • Completing AI training tasks such as analyzing, editing, and writing in Mandarin
  • Judging the performance of AI in performing Mandarin prompts
  • Improving cutting-edge AI models

Prolific is building the biggest pool of quality human data in the world and is not just another player in the AI space. Over 35,000 AI developers, researchers, and organizations use Prolific to gather data from paid study participants with a wide variety of experiences, knowledge, and skills.

$25–$25/hr
Europe

  • Review and evaluate content to ensure accuracy, clarity, and proper source attribution.
  • Create data and then evaluate the resulting AI-generated content.
  • Read and synthesize content from PDF documents in Finnish.

RWS embraces DEI and promotes equal opportunity; they are an Equal Opportunity Employer and prohibit discrimination and harassment of any kind.

$25–$25/hr
Europe

  • Review and evaluate content to ensure accuracy and clarity.
  • Help improve the accuracy and reliability of AI systems.
  • Create data and evaluate AI-generated content.

RWS embraces DEI and promotes equal opportunity; it is committed to equal employment opportunity and a work environment free of discrimination and harassment.

Europe Global

  • Record short audio clips in both of your native languages following project guidelines.
  • Ensure recordings meet quality, clarity, and accuracy standards.
  • Collaborate with our global community to deliver consistent, high-quality data.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They are building smarter, more human AI with a diverse community in 100+ countries.

Brazil

  • Perform quality checks and linguistic validation on datasets.
  • Identify and categorize linguistic errors according to safety and quality guidelines.
  • Provide explanations for required changes to improve the performance of translation models.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems.

$8–$8/hr

  • Performing diverse data-related tasks.
  • Data collection, evaluation, and annotation.
  • Object tagging and labeling across different content types.

RWS embraces DEI and promotes equal opportunity, we are an Equal Opportunity Employer and prohibit discrimination and harassment of any kind.

  • Oversee and monitor the work of raters who evaluate machine-translated song lyrics from Korean to Japanese.
  • Review raters' assessments to ensure accuracy, consistency, and adherence to quality standards.
  • Provide constructive feedback and guidance to raters to help improve their evaluation practices.

Welo Data works with technology companies to provide datasets that are high-quality, ethically sourced, relevant, diverse, and scalable to supercharge their AI models.

Canada

  • Annotate data accurately and consistently according to predefined guidelines.
  • Perform basic research as needed to ensure accurate annotation.
  • Provide feedback regarding observed patterns in the annotated data.

RWS Group is looking for Canada-based Dutch Data Annotators to annotate, evaluate and curate text, video and geographic data.

Canada

  • Review and annotate up to 230 text segments per locale.
  • Evaluate translations for accuracy, fluency, and correctness.
  • Identify, track, and document patterns, recurring errors, and linguistic insights to improve overall dataset quality.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems.