Source Job

16 jobs similar to Data Annotator for AI models - Japanese (United States)

Jobs ranked by similarity.

US

  • Review, score, and improve AI-generated responses
  • Evaluate prompts and responses across a wide range of topics
  • Perform QA checks on other annotators’ work and provide feedback

RWS is committed to equal employment opportunity and provides a work environment free of discrimination and harassment. They base employment decisions on business needs, job requirements, and individual qualifications.

$15–$15/hr
US

  • Data collection, evaluation, and annotation.
  • Pairwise comparisons.
  • Object tagging and labeling across different content types.

RWS TrainAI focuses on improving AI-generated content. They embrace DEI and equal opportunity, committed to a discrimination-free work environment where employment decisions are based on business needs and qualifications.

$15–$15/hr
US

  • Data collection, evaluation, and annotation.
  • Pairwise comparisons.
  • Object tagging and labeling across different content types (audio, video, images, or collected data)

RWS TrainAI focuses on improving AI-generated content. They embrace DEI and are an equal opportunity employer committed to providing a work environment free of discrimination and harassment.

Global

  • Completing AI training tasks such as analyzing, editing, and writing in Mandarin
  • Judging the performance of AI in performing Mandarin prompts
  • Improving cutting-edge AI models

Prolific is building the biggest pool of quality human data in the world and is not just another player in the AI space. Over 35,000 AI developers, researchers, and organizations use Prolific to gather data from paid study participants with a wide variety of experiences, knowledge, and skills.

Indonesia

  • Review short, pre-segmented datasets.
  • Evaluate model-generated replies based on Tone or Fluency .
  • Read a user prompt and two model replies, then rate each using a five-point scale.

CrowdGen, by Appen, focuses on AI response evaluation. They are looking for native Javanese speakers to contribute to a multilingual AI response evaluation project where you review large language model outputs.

$11–$12/hr
South Korea

  • Pre-review and evaluate Internet advertisements using web-based tools.
  • Help improve AI programs by evaluating advertisements.
  • Work flexible hours, from 5 to 20 hours per week.

Welo Data is an AI services company specializing in data validation. They operate as a freelance-remote business, as stated in the job posting.

Global

  • Native or near-native fluency in Central Khmer.
  • Based in: Cambodia, Thailand.
  • Comfortable with digital tools.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They’re building smarter, more human AI with a diverse community in 100+ countries.

Global

  • Data collection, evaluation, and annotation.
  • Pairwise comparisons.
  • Object tagging and labeling across different content types (audio, video, images, or collected data)

RWS enhances communication and delivers value. They embrace DEI and promote equal opportunity, committed to a work environment free of discrimination and harassment.

Australia

  • Listen to short audio clips in Chinese (Mandarin, Putonghua) and evaluate them using a defined rubric.
  • Accurately identify target accents from provided audio samples.
  • Apply consistent and objective judgment based on provided evaluation guidelines.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They are building smarter, more human AI with a diverse community in 100+ countries.

Europe

  • Evaluate AI-generated responses for accuracy, grammar, and cultural relevance.
  • Identify issues and provide refined, high-quality rewritten responses.
  • Create natural prompts and responses in Spanish to improve conversational datasets.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They're building smarter, more human AI with a diverse community in 100+ countries.

Global

  • Evaluate AI models' output in occupational therapy.
  • Assess content related to the occupational therapy field.
  • Provide clear feedback to improve AI understanding.

Handshake connects students with early talent recruiting. They provide opportunity to evaluate what AI models produce and deliver feedback that strengthens the model’s understanding of workplace tasks and language.

Global

  • Listen to short audio clips in Russian and evaluate them using a defined rubric.
  • Accurately identify target accents from provided audio samples.
  • Compare multiple recordings and assess which one sounds more natural in relation to the target accent.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. We’re building smarter, more human AI with a diverse community in 100+ countries.

Global

  • Evaluate AI model outputs related to your field.
  • Assess content relevant to your area of expertise.
  • Deliver clear feedback to improve the model's comprehension.

Handshake is recruiting College Career/Technical Education Professors to contribute to an hourly, temporary AI research project. In this program, you’ll leverage your professional experience to evaluate what AI models produce in your field.

  • Design scenario-based and edge-case prompts to test AI behavior.
  • Develop evaluation rubrics to assess AI responses across multiple criteria.
  • Perform side-by-side evaluations of AI outputs and score them using defined criteria.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They are building smarter, more human AI with a diverse community in 100+ countries.

Global

  • Record a scripted conversation with another speaker using natural code-switching between your native language (Turkish) and English.
  • Complete a short Pre-Vet test on our Aurora Studio Platform (the Pre-Vet consists of recording 4 sentences).
  • If successful, you’ll be matched with another speaker and scheduled for a recording session within the next 2–3 weeks.

Lionbridge is an AI data training company. They provide opportunities to contribute to cutting-edge AI technology on a global project.

$25–$30/hr
Global

  • Evaluate AI-generated French speech and text for linguistic accuracy, naturalness, and educational quality.
  • Assess learner speech and writing across proficiency levels from CEFR Pre-A1 through B2+.
  • Apply expert judgment to identify learner errors, unnatural phrasing, and pedagogical gaps.

Alignerr partners with leading AI labs to build expert-driven data pipelines. They improve how models reason, learn, and communicate by working with domain specialists to evaluate and refine AI systems where precision, pedagogy, and human judgment matter most.