You will challenge advanced language models on topics like verb conjugation, gender and number agreement, French idioms, phonetic nuances, sentence structure, and stylistic variation—documenting every failure mode so we can harden model reasoning. On a typical day, you will converse with the model on language scenarios, verify factual accuracy and logical soundness, capture reproducible error traces, and suggest improvements to our prompt engineering and evaluation metrics.
Job listings
RWS Group is seeking native Libyan Arabic speakers with a linguistic background and excellent English command to annotate or evaluate AI-generated speech, comparing AI and human responses. This part-time, remote position offers the opportunity to earn extra income while helping to improve the reliability of today’s AI models. The role requires part-time availability, a personal computer with high-speed internet, and the ability to follow instructions.
Are you a Kannada language expert eager to shape the future of AI? We’re looking for Kannada language specialists who live and breathe Kannada grammar, syntax, morphology, phonetics, semantics, and pragmatics. You’ll challenge advanced language models on topics like verb conjugation, sentence structure, honorifics, word order, regional dialects, script variations, and idiomatic expressions—documenting every failure mode so we can harden model reasoning.
Looking for part-time, remote, work-from-home jobs where you can set your own schedule, this opportunity helps to improve the reliability of today’s AI models! Typical tasks include annotating or evaluating AI-generated speech for naturalness, human likeness, and expressiveness and comparing AI and human responses.
Looking for native Danish speakers with a linguistic background with excellent command of the English language. Annotate or evaluate AI-generated speech for naturalness, human likeness, and expressiveness. Compare AI and human responses. Seeking detail-oriented individuals able to follow instructions, and reliable, responsible, and communicative people.
Looking for native Slovak speakers with a linguistic background with excellent command of the English language. Annotate or evaluate AI-generated speech for naturalness, human likeness, and expressiveness, and compare AI and human responses.
Review pairs of inputs and AI-generated outputs, then assess the quality of those outputs based on accuracy, relevance, and clarity—following the project’s established guidelines. Your evaluations will directly support improvements to model performance and help shape the future of responsible AI development.
Shape the future of AI as a Bengali language expert. You will challenge advanced language models on topics like dialectal variation, formal versus colloquial language, Bengali script, translation accuracy, semantic ambiguity, and cultural context—documenting every failure mode so we can harden model reasoning. Converse with the model on language scenarios, verify factual accuracy and logical soundness, capture reproducible error traces, and suggest improvements to our prompt engineering and evaluation metrics.