Remote Data Jobs · Evaluation

Job listings

Evaluate machine translations by assessing text and assigning semantic similarity scores. Evaluate the accuracy, fluency, and overall quality of online messages that have been translated by machines, similar to online conversations or social media. Native Bilingual in English and Thai, comfortable understanding and applying guidelines, and have some experience in translation or translation evaluation.

Welo Data is building a global network of linguists, language enthusiasts, and culturally aware contributors to shape safer, smarter AI. By joining our talent community, you’ll be first in line for flexible, remote projects in annotation, evaluation, and prompt creation—always on your terms. This is not an active job opening, instead you are joining a network for future opportunities.

Train and evaluate cutting-edge AI models as a Personal Finance Advisor, reviewing AI-generated responses to Personal Finance scenarios, rating them for accuracy, appropriateness, safety, and reasoning quality. You will compare multiple model answers and select/justify the best response and write improved exemplars, rationales, or structured feedback to help models learn where they fall short.