Review and label content for sentiment, factual accuracy, and reasoning issues.
Evaluate model outputs across quality dimensions using scoring frameworks.
Validate automated assessments and identify discrepancies or errors.
Welo Data provides AI services that help develop and evaluate large language models (LLMs). The job posting does not provide information about the company's size or culture.
Reviewing, annotating, and testing AI outputs for grammatical accuracy, naturalness, and strict cultural context.
Acting as a primary quality check during production to proactively identify and correct subtle cultural errors or awkward phrasing in the target language.
Analyzing task quality trends and autonomously developing educational resources and feedback documentation to increase alignment between AI task outputs and campaign expectations.
Greenhouse provides recruiting software. No information about company size or culture is available in the job description.
Reviewing, annotating, and testing AI outputs for grammatical accuracy, naturalness, and strict cultural context.
Acting as a primary quality check during production to proactively identify and correct subtle cultural errors or awkward phrasing in the target language.
Analyzing task quality trends and autonomously developing educational resources and feedback documentation.
They are sourcing independent Language Alignment & Resource Partners (LARPs) to provide native-level French language vetting and QA for a specialized AI data project. As a contractor, you would not receive company-sponsored benefits.
Review, analyze, and rank AI models' chains of thought for correctness and approach.
Provide clear, constructive feedback to improve AI-generated responses.
An enterprise client is seeking English-fluent talent to help train generative artificial intelligence models. They seem to maintain a contractor-based work environment.
Evaluate AI-generated responses for accuracy, grammar, and cultural relevance.
Identify issues and provide refined, high-quality rewritten responses.
Create natural prompts and responses in English to improve conversational datasets.
Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They build smarter, more human AI with a diverse community in 100+ countries.
Challenge advanced language models on software engineering tasks.
Verify logical accuracy and coding fluency in German.
Capture reproducible error traces and suggest improvements.
Project World Wide is shaping the future of AI through high-quality training data. They appear to be a technologically advanced organization focused on evolving language models into powerful engines.
Review and evaluate AI-generated text for linguistic accuracy, grammatical correctness, and cultural appropriateness.
Identify issues and provide high-quality rewritten responses to improve language outputs.
Develop natural prompts and responses in the target language to enhance conversational datasets.
Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They are building smarter, more human AI with a diverse community in 100+ countries.
Creatively writing prompts and responses on a diverse range of topics.
Leading labeling initiatives with third-party firms and internal customers.
Creating and updating detailed guidelines and specifications for stakeholders.
Welo Data provides AI services, specifically data annotation. They enable brands and companies to reach, engage, and grow international audiences, delivering multilingual content transformation services in translation, localization, and adaptation.
Converse with the model on language scenarios and verify factual accuracy and logical soundness.
Capture reproducible error traces and suggest improvements to our prompt engineering and evaluation metrics.
Challenge advanced language models on topics like verb conjugation, sentence structure, and the nuances of Japanese writing systems.
They are shaping the future of AI by providing high-quality training data to large-scale language models. Because this is a contractor role, company-sponsored benefits such as health insurance and PTO do not apply.
Challenge advanced language models on topics like verb conjugation and word order.
Verify factual accuracy and logical soundness, capturing reproducible error traces.
Suggest improvements to prompt engineering and evaluation metrics.
The posting company is not clearly named. The listing is hosted on Greenhouse, a recruiting platform, so no company description is available.
Completing AI training tasks such as analyzing, editing, and writing Python
Judging how well AI performs on Python-related prompts
Improving cutting-edge AI models
Prolific is building the biggest pool of quality human data in the world. Over 35,000 AI developers, researchers, and organizations use Prolific to gather data from paid study participants with a wide variety of experiences, knowledge, and skills.
Creatively writing prompts and responses on a diverse range of topics
Perform LLM annotation and evaluation tasks (ranking, scoring, labeling, tagging)
Evaluate model outputs for accuracy, relevance, and instruction-following
Welo Data is an AI services company that specializes in data annotation. They deliver high-quality training data transformation solutions for NLP-enabled machine learning by blending technology and human intelligence to collect, annotate, and evaluate all content types.
Design, develop, and refine large language model workflows to steer and improve model behaviors.
Build language processing components for intent detection, summarization and conversational response quality.
Drive R&D-style exploration on cutting-edge speech and language systems, rapidly prototyping novel approaches.
Cresta's platform combines AI and human intelligence to help contact centers discover customer insights and behavioral best practices, automate conversations, and empower team members. They are led by founders with experience at Google, Waymo, and OpenAI, and are on a mission to revolutionize the workforce with AI.
Localize English-based questions into your language.
Provide clear, concise, and verifiable answers.
Cite credible sources to support your answers.
CrowdGen, by Appen, is focused on AI solutions. They offer project-based roles for independent contractors to contribute to AI development and language comprehension projects.
Challenge advanced language models on topics like verb conjugation.
Verify factual accuracy and logical soundness, and capture reproducible error traces.
Suggest improvements to our prompt engineering and evaluation metrics.
They are evolving large‑scale language models from clever chatbots into powerful engines of linguistic discovery. They value high‑quality training data to democratize world‑class education, keep pace with cutting‑edge research, and streamline communication for Hijazi speakers everywhere.
Evaluate AI-generated responses using a structured safety rubric
Complete two independent evaluations per item
Provide concise, well-structured rationales in English
Welo Data, part of Welocalize, is a global AI data company delivering high-quality, ethical data to train the world’s most advanced AI systems. They have 500,000+ contributors and offer limitless opportunities for their global community to grow, contribute, and work on their terms.
Annotate and label responses and their reasoning steps.
Compare outputs and assess quality and relevance.
RWS is committed to equal employment opportunities for all employees. They embrace diversity, equity, and inclusion, promoting a work environment free of discrimination and harassment.
Evaluate AI-generated responses for accuracy, grammar, and cultural relevance.
Identify issues and provide refined, high-quality rewritten responses.
Create natural prompts and responses in Marathi to improve conversational datasets.
Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. They are building smarter, more human AI with a diverse community in 100+ countries.
Contribute to xAI's mission by training and refining Grok to excel in voice interactions across diverse languages.
Curate and annotate high-quality audio data to enhance Grok's global accessibility and improve its handling of multilingual audio nuances.
Collaborate with technical staff to develop tasks that improve AI's ability to handle speech modulation and noise in real-world recordings.
xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Their team is small, highly motivated, and focused on engineering excellence.