Source Job

US Canada

-Review and evaluate AI-generated written responses. -Refine and rewrite responses to improve clarity, tone, and educational quality. -Create natural prompts and example dialogues to support training data needs.

Teaching Writing Evaluation

19 jobs similar to English Teaching Evaluator Expert

Jobs ranked by similarity.

US Canada

  • Review and evaluate AI-generated responses.
  • Rewrite or refine responses when needed.
  • Create natural prompts and dialogues, helping improve the quality of an AI model.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems.

US

Review AI-generated written content across multiple genres and formats, providing feedback. Use your expertise to help AI reason through writing challenges, including argumentation, structure, tone, and audience engagement. Identify biases, inaccuracies, or unclear passages in AI-generated outputs, and develop tests.

Labelbox builds the data engine that accelerates breakthrough AI, enabling safer, smarter models in production and is trusted by leading research labs and enterprises worldwide.

  • Leverage professional experience to evaluate AI models' output in your field.
  • Assess content and deliver feedback to strengthen the model’s understanding.
  • Work independently from anywhere, with flexible hours and no minimum commitment.

Handshake is a recruiting platform. They connect students and recent graduates with employers.

Europe

  • Contribute to building smarter, more inclusive AI systems.
  • Work on annotation, evaluation, and prompt creation projects.
  • Join a global network of linguists and language enthusiasts.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems.

  • Evaluate AI-generated content related to employee training, leadership development, and instructional design.
  • Develop prompts related to learning and development topics.
  • Assess responses and provide clear, structured feedback.

Handshake is recruiting Training and Development Specialists to contribute to an hourly, temporary AI research project — no prior AI experience required.

  • Evaluate AI model outputs related to the instructional field.
  • Develop prompts for AI models reflecting field expertise.
  • Provide clear, structured feedback to enhance AI understanding.

Handshake is recruiting Instructional Coordinator Professionals to contribute to an hourly, temporary AI research project—but there’s no AI experience needed.

$30–$35/hr
Global

  • Evaluate AI-generated Japanese speech and text for linguistic accuracy, naturalness, and educational quality.
  • Assess learner speech and writing across proficiency levels from CEFR Pre-A1 through B2+.
  • Apply expert judgment to identify learner errors, unnatural phrasing, and pedagogical gaps.

Alignerr partners with leading AI labs to build expert-driven data pipelines that improve how models reason, learn, and communicate. They work with domain specialists around the world to evaluate and refine AI systems in areas where precision, pedagogy, and human judgment matter most.

  • Review AI-generated responses and evaluate technical accuracy.
  • Provide expert feedback to train AI systems to write better code.
  • Work with various programming languages and coding challenges.

G2i connects subject-matter experts, students, and professionals with flexible, remote AI training work such as annotation, evaluation, fact-checking, and content review.

Content contributors develop engaging content. They level content, write assessments, and develop curriculum. They translate content and proofread.

Newsela is an Instructional Content Platform that combines engaging, leveled content with integrated formative assessments and insights.

Global

Review brief text-based conversations between users and an AI assistant. Assess user sentiment and identify emotional cues, tone shifts, and contextual signals. Provide clear, concise evaluations based on predefined criteria and offer short written rationales to support your evaluations.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems.

Leverage professional experience to evaluate AI model outputs in your field. Assess content related to your field of work. Deliver clear, structured feedback to strengthen the model’s understanding of workplace tasks and language.

Handshake is recruiting Transportation, Storage, and Distribution Manager Professionals to contribute to an hourly, temporary AI research project.

$25–$25/hr
Europe

  • Review and evaluate content to ensure accuracy, clarity, and proper source attribution.
  • Create data and then evaluate the resulting AI-generated content.
  • Read and synthesize content from PDF documents in Finnish.

RWS embraces DEI and promotes equal opportunity; they are an Equal Opportunity Employer and prohibit discrimination and harassment of any kind.

$25–$25/hr
Europe

  • Review and evaluate content to ensure accuracy and clarity.
  • Help improve the accuracy and reliability of AI systems.
  • Create data and evaluate AI-generated content.

RWS embraces DEI and promotes equal opportunity; it is committed to equal employment opportunity and a work environment free of discrimination and harassment.

$30–$35/hr

  • Evaluate AI-generated Korean speech and text for linguistic accuracy, naturalness, and educational quality.
  • Assess learner speech and writing across proficiency levels from CEFR Pre-A1 through B2+.
  • Apply expert judgment to identify learner errors, unnatural phrasing, and pedagogical gaps.

Alignerr collaborates with top AI labs, creating data pipelines driven by experts to enhance AI models' reasoning, learning, and communication. They partner with domain specialists worldwide, perfecting AI systems where precision, pedagogy, and human judgment are crucial.

Japan

  • Participate in round-table style discussions about AI tools, including capabilities, weaknesses, cultural alignment, prompt behavior, and model differences.
  • Share real examples of how you use AI - coding, writing, document creation, design support, idea generation, manga/comic development, translation, etc.
  • Evaluate model outputs and provide detailed feedback on issues such as: overly formal or informal tone, incorrect cultural references or mismatched context.

With 27+ years of experience, Welo Data stands as a global leader in high-quality datasets and AI services.

$28–$28/hr
Europe

  • Review and evaluate AI-generated content to ensure accuracy, clarity, and proper source attribution.
  • Utilize linguistic expertise to create data and then evaluate the resulting AI-generated content.
  • Adhere strictly to detailed annotation and fact-checking guidelines provided in English.

RWS embraces DEI and promotes equal opportunity, we are an Equal Opportunity Employer and prohibit discrimination and harassment of any kind.