Source Job

20 jobs similar to AI Safety Evaluator – Tamil (Singapore)

Jobs ranked by similarity.

US

  • Evaluate AI-generated responses for accuracy, grammar, and cultural relevance.
  • Identify issues and provide refined, high-quality rewritten responses.
  • Create natural prompts and responses in English to improve conversational datasets.

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They build smarter, more human AI with a diverse community in 100+ countries.

Global

  • Localize English-based questions into your language.
  • Provide clear, concise, and verifiable answers.
  • Cite credible sources to support your answers.

CrowdGen, by Appen, is focused on AI solutions. They offer project-based roles for independent contractors to contribute to AI development and language comprehension projects.

Global

  • Read and review texts, identifying differences.
  • Assess output for accuracy and consistency.
  • Identify and document issues and suggest corrections.

RWS provides language, content management, and intellectual property support services. They embrace DEI and promote equal opportunity, with a commitment to equal employment opportunity for all employees in a work environment free of discrimination and harassment.

Europe

  • Review and label content for sentiment, factual accuracy, and reasoning issues.
  • Evaluate model outputs across quality dimensions using scoring frameworks.
  • Validate automated assessments and identify discrepancies or errors.

Welo Data provides AI services that help develop and evaluate large language models (LLMs). The posting gives no information about the company's size or culture.

$13–$15/hr
US Canada Europe Australia New Zealand

  • Create and answer questions to train AI models.
  • Review, analyze, and rank AI models' chains of thought for correctness and approach.
  • Provide clear, constructive feedback to improve AI-generated responses.

An enterprise client is seeking people fluent in English to help train generative artificial intelligence models. The work environment appears to be contractor-based.

US

  • You'll work with AI tools, test model outputs, and evaluate responses.
  • Document errors and gaps, and collaborate with our team.
  • Spot inconsistencies and provide structured feedback.

Project World Wide is involved in shaping the future of AI through training data. They seek motivated individuals to contribute to the development of cutting-edge AI systems.

Europe

  • Write or rewrite high-quality business review summaries tailored to the Dutch (Netherlands) locale.
  • Adjust tone, style, and register to align with professional standards and local conventions.
  • Ensure cultural appropriateness and relevance for the target audience.

Welo Data specializes in AI services. They likely have a modern and innovative culture.

Europe

  • You will be matched with another participant for 1-on-1 verbal or text-based exchanges.
  • Use your natural Netherlands Dutch to discuss various topics provided by the researcher.
  • Help the AI understand the nuances, slang, and cultural context of Netherlands Dutch through real-world interaction.

Prolific is building the biggest pool of quality human data in the world. Over 35,000 AI developers, researchers, and organizations use Prolific to gather data from paid study participants with a wide variety of experiences, knowledge, and skills.

Europe

  • Write high-quality business review summaries tailored to the Dutch (Belgium) locale.
  • Adjust tone and style to align with professional standards and local conventions.
  • Refine and edit content to improve clarity, readability, and overall quality.

Welo Data provides AI services. They're focused on innovation and quality content creation.

  • Write or rewrite high-quality business review summaries tailored to the Greek locale.
  • Adjust tone, style, and register to align with professional standards and local conventions.
  • Ensure cultural appropriateness and relevance for the target audience.

Welo Data provides AI services. It appears they are a smaller company that values detail-oriented language professionals.

$16–$24/hr
Europe

  • Evaluate the relevance of product search results returned for specific queries on e-commerce platforms.
  • Analyze each task consisting of a search query and a corresponding product listing.
  • Use provided context (e.g., search query, search category, and marketplace) to make informed judgments.

They are seeking freelance contributors to participate in a search relevance annotation project aimed at improving e-commerce search quality across multiple international markets. This is a remote, task-based opportunity suitable for individuals with a strong command of the English language and an eye for detail.

Global

  • Make scripted and unscripted calls with an AI agent.
  • Produce clear, natural speech following provided guidelines.
  • Test and validate the AI’s ability to understand and interpret speech.

RWS is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment. All employment decisions at RWS are based on business needs, job requirements and individual qualifications.

Global

  • Challenge AI models on realistic educational scenarios.
  • Validate whether the model's understanding of pedagogical concepts reflects best-in-class teaching practice.
  • Evaluate AI outputs for clarity and correctness, analyze subtle reasoning errors, and document gaps in logic.

The company is seeking independent Instructional Experts with hands-on experience teaching, tutoring, or building curriculum to train AI models. As a contractor you’ll supply a secure computer and high-speed internet; company-sponsored benefits such as health insurance and PTO do not apply.

$15/hr

  • Make scripted and unscripted voice calls with an AI agent.
  • Produce clear, natural Haitian Creole speech while following the provided guidelines.
  • Evaluate how well the AI agent understands spoken language, helping test speech recognition and transcription accuracy.

RWS is a company that embraces DEI and promotes equal opportunity. They are committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment.

US

  • Support model launch readiness by running evaluations, monitoring and interpreting results, and surfacing regressions or unexpected behavior changes to relevant stakeholders.
  • Partner closely with policy and domain experts throughout the evaluation lifecycle, from identifying risks and scoping the right evaluation approach to coordinating creation of new evals and ensuring existing ones remain current with evolving policies, threat vectors, and model capabilities.
  • Work with cross-functional stakeholders to help manage evaluation outcomes, including interpreting results and driving mitigations where needed.

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. Their team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

Global

  • Challenge advanced language models on topics like verb conjugation and word order.
  • Verify factual accuracy and logical soundness, capturing reproducible error traces.
  • Suggest improvements to prompt engineering and evaluation metrics.

Company description unavailable: the posting is hosted on Greenhouse, a recruiting platform, and does not clearly name the hiring company.

$177,000–$250,300/yr
US

  • Own Agent retrieval accuracy and relevance.
  • Drive automated resolution rates.
  • Manage AI safety and trust.

Airtable is the no-code app platform that empowers people closest to the work to accelerate their most critical business processes. More than 500,000 organizations, including 80% of the Fortune 100, rely on Airtable to transform how work gets done.

US

  • Review contributor evaluations of model-generated responses to ensure adherence to project-specific guidelines.
  • Verify that contributors consistently apply all instructions and evaluation criteria when assessing model responses.
  • Confirm that contributors accurately identify factual errors, hallucinations, or missing information in model responses.

Welo Data, part of Welocalize, is a global AI data company delivering high-quality, ethical data to train the world’s most advanced AI systems. Welo Data has a diverse community in 100+ countries building smarter, more human AI, offering limitless opportunities for the global community to grow and contribute.

US

  • Utilize Automatic Prompt Generation tools to create baseline prompts.
  • Manually draft, test, and refine prompts to navigate complex template architectures.
  • Monitor shadowbot runs to ensure sufficient disagreements are registered, generated, and tracked.

Welo Data is an AI services company. They focus on data validation and freelance remote work.

$26/yr

  • Record 200 Cantonese sentences used in smart devices.
  • Complete all recordings in one session in a quiet environment.
  • Follow different speaking speeds and ensure accurate pronunciation.

CrowdGen, powered by Appen, helps companies improve their AI. They offer project-based opportunities to independent contractors.