Source Job

Philippines

Analyze AI model outputs to assess quality and safety. Identify instances where the model refuses to answer a prompt and determine if the refusal was necessary or an error. Review model responses for compliance with project-specific policy guidelines.

RLHF

15 jobs similar to Policy and Toxicity Evaluator Philippines

Jobs ranked by similarity.

Global

Review brief text-based conversations between users and an AI assistant. Assess user sentiment and identify emotional cues, tone shifts, and contextual signals. Provide clear, concise evaluations based on predefined criteria and offer short written rationales to support your evaluations.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems.

Global

Review brief text-based conversations between users and an AI assistant. Assess user sentiment: Were they satisfied, neutral, frustrated, confused, or disengaged? Provide clear, concise evaluations based on predefined criteria.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems.

US

  • Evaluate agentic and personalized experiences and assess search relevance & recommendations.
  • Create detailed ground truth judgments for music, podcasts, and audiobooks.
  • Adapt quickly as project types rotate and priorities shift and deliver high-quality annotations within 1–2 week deadlines.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems.

Europe

  • Contribute to building smarter, more inclusive AI systems.
  • Work on annotation, evaluation, and prompt creation projects.
  • Join a global network of linguists and language enthusiasts.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems.

$25–$25/hr
Europe

  • Review and evaluate content to ensure accuracy, clarity, and proper source attribution.
  • Create data and then evaluate the resulting AI-generated content.
  • Read and synthesize content from PDF documents in Finnish.

RWS embraces DEI and promotes equal opportunity; they are an Equal Opportunity Employer and prohibit discrimination and harassment of any kind.

Philippines South Africa

  • Coordinate appointments, manage calendars, and track important dates.
  • Arrange professional conference travel, including flights, hotels, and registrations.
  • Manage renewals for professional memberships, medical licenses, and organizational affiliations.

We are seeking a reliable, detail-oriented Personal Assistant to support a highly accomplished medical professional and their family with day-to-day administrative tasks.

$30–$35/hr
Global

  • Evaluate AI-generated Japanese speech and text for linguistic accuracy, naturalness, and educational quality.
  • Assess learner speech and writing across proficiency levels from CEFR Pre-A1 through B2+.
  • Apply expert judgment to identify learner errors, unnatural phrasing, and pedagogical gaps.

Alignerr partners with leading AI labs to build expert-driven data pipelines that improve how models reason, learn, and communicate. They work with domain specialists around the world to evaluate and refine AI systems in areas where precision, pedagogy, and human judgment matter most.

$30–$35/hr

  • Evaluate AI-generated Korean speech and text for linguistic accuracy, naturalness, and educational quality.
  • Assess learner speech and writing across proficiency levels from CEFR Pre-A1 through B2+.
  • Apply expert judgment to identify learner errors, unnatural phrasing, and pedagogical gaps.

Alignerr collaborates with top AI labs, creating data pipelines driven by experts to enhance AI models' reasoning, learning, and communication. They partner with domain specialists worldwide, perfecting AI systems where precision, pedagogy, and human judgment are crucial.

$28–$28/hr
Europe

  • Review and evaluate AI-generated content to ensure accuracy, clarity, and proper source attribution.
  • Utilize linguistic expertise to create data and then evaluate the resulting AI-generated content.
  • Adhere strictly to detailed annotation and fact-checking guidelines provided in English.

RWS embraces DEI and promotes equal opportunity, we are an Equal Opportunity Employer and prohibit discrimination and harassment of any kind.

$403–$727/yr
Philippines

  • Purchase materials from vendors (Amazon, Home Depot, Lowe’s, etc.).
  • Work closely with project managers to track tasks and requirements.
  • Connect with vendors/subcontractors to follow up on project status.

Wing is redefining the future of work for companies worldwide and aims to be the one-stop shop for companies looking to build world-class teams and place their operations on autopilot. The company fosters an inclusive culture with opportunities for career growth in a fun, supportive environment.

Japan

  • Participate in round-table style discussions about AI tools, including capabilities, weaknesses, cultural alignment, prompt behavior, and model differences.
  • Share real examples of how you use AI - coding, writing, document creation, design support, idea generation, manga/comic development, translation, etc.
  • Evaluate model outputs and provide detailed feedback on issues such as: overly formal or informal tone, incorrect cultural references or mismatched context.

With 27+ years of experience, Welo Data stands as a global leader in high-quality datasets and AI services.