Analyze AI model outputs to assess quality and safety. Identify instances where the model refuses to answer a prompt and determine if the refusal was necessary or an error. Review model responses for compliance with project-specific policy guidelines.
Source Job
15 jobs similar to Policy and Toxicity Evaluator Philippines
Review brief text-based conversations between users and an AI assistant. Assess user sentiment and identify emotional cues, tone shifts, and contextual signals. Provide clear, concise evaluations based on predefined criteria and offer short written rationales to support your evaluations.
Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems.
Review brief text-based conversations between users and an AI assistant. Assess user sentiment: Were they satisfied, neutral, frustrated, confused, or disengaged? Provide clear, concise evaluations based on predefined criteria.
Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems.
- Evaluate agentic and personalized experiences and assess search relevance & recommendations.
- Create detailed ground truth judgments for music, podcasts, and audiobooks.
- Adapt quickly as project types rotate and priorities shift, and deliver high-quality annotations within 1–2 week deadlines.
Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems.
- Contribute to building smarter, more inclusive AI systems.
- Work on annotation, evaluation, and prompt creation projects.
- Join a global network of linguists and language enthusiasts.
Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems.
- Review and evaluate content to ensure accuracy, clarity, and proper source attribution.
- Create data and then evaluate the resulting AI-generated content.
- Read and synthesize content from PDF documents in Finnish.
RWS embraces DEI and promotes equal opportunity; they are an Equal Opportunity Employer and prohibit discrimination and harassment of any kind.
- Shape the future of AI using high-quality language data.
- Work on freelance projects involving Annotation, Evaluation, and Prompt creation.
- Be part of global projects with real-world impact.
Welo Data is at the frontier of AI and localisation, providing high-quality language data that fuels smarter, more inclusive technologies.
- Record short videos of unique mannequins.
- Complete up to 10 short mannequin clips per participant.
- Use the Appen Mobile app to submit recordings.
CrowdGen is helping to build the next generation of fraud-detection AI. CrowdGen is part of Appen and provides opportunities for flexible participation and project-based roles.
- Coordinate appointments, manage calendars, and track important dates.
- Arrange professional conference travel, including flights, hotels, and registrations.
- Manage renewals for professional memberships, medical licenses, and organizational affiliations.
We are seeking a reliable, detail-oriented Personal Assistant to support a highly accomplished medical professional and their family with day-to-day administrative tasks.
- Evaluate AI-generated Japanese speech and text for linguistic accuracy, naturalness, and educational quality.
- Assess learner speech and writing across proficiency levels from CEFR Pre-A1 through B2+.
- Apply expert judgment to identify learner errors, unnatural phrasing, and pedagogical gaps.
Alignerr partners with leading AI labs to build expert-driven data pipelines that improve how models reason, learn, and communicate. They work with domain specialists around the world to evaluate and refine AI systems in areas where precision, pedagogy, and human judgment matter most.
- Evaluate AI-generated Korean speech and text for linguistic accuracy, naturalness, and educational quality.
- Assess learner speech and writing across proficiency levels from CEFR Pre-A1 through B2+.
- Apply expert judgment to identify learner errors, unnatural phrasing, and pedagogical gaps.
Alignerr collaborates with top AI labs, creating data pipelines driven by experts to enhance AI models' reasoning, learning, and communication. They partner with domain specialists worldwide, perfecting AI systems where precision, pedagogy, and human judgment are crucial.
- Review and evaluate AI-generated content to ensure accuracy, clarity, and proper source attribution.
- Utilize linguistic expertise to create data and then evaluate the resulting AI-generated content.
- Adhere strictly to detailed annotation and fact-checking guidelines provided in English.
RWS embraces DEI and promotes equal opportunity; we are an Equal Opportunity Employer and prohibit discrimination and harassment of any kind.
- Purchase materials from vendors (Amazon, Home Depot, Lowe’s, etc.).
- Work closely with project managers to track tasks and requirements.
- Connect with vendors/subcontractors to follow up on project status.
Wing is redefining the future of work for companies worldwide and aims to be the one-stop shop for companies looking to build world-class teams and place their operations on autopilot. The company fosters an inclusive culture with opportunities for career growth in a fun, supportive environment.
- Participate in round-table style discussions about AI tools, including capabilities, weaknesses, cultural alignment, prompt behavior, and model differences.
- Share real examples of how you use AI: coding, writing, document creation, design support, idea generation, manga/comic development, translation, etc.
- Evaluate model outputs and provide detailed feedback on issues such as overly formal or informal tone, incorrect cultural references, or mismatched context.
With 27+ years of experience, Welo Data stands as a global leader in high-quality datasets and AI services.
Contribute to AI speech recognition technology. Collect audio recordings with various accents. Conduct all tasks in English.
Contribute directly to the future of AI in your language.
- Help improve AI by reviewing videos and judging how similar they are.
- Your work helps AI make better and more accurate video evaluations.
- Follow a step-by-step, rule-based workflow.
CrowdGen, by Appen, offers project-based opportunities. In this role, you will join as an Independent Contractor.