Interact with generative AI models and project guidelines.
Create prompts to test model behavior across safety categories.
Document model breakability and effort level.
Welo Data provides AI services and specializes in data annotation. We foster a collaborative and innovative culture where employees contribute to cutting-edge AI safety evaluation.
Perform annotation and labeling tasks for generative AI datasets, including text, image, video, and multimodal content.
Create, review, and evaluate prompts and responses across a variety of domains and use cases.
Conduct quality assurance reviews to ensure annotation accuracy, consistency, and adherence to guidelines.
Welo Data delivers multilingual content transformation services in translation, localization, and adaptation for over 250 languages. They drive innovation in language services, delivering high-quality training data transformation solutions for NLP-enabled machine learning, with a network of over 400,000 in-country linguistic resources.
Evaluate and improve model safety: Label, rank, audit, and refine human- and model-generated text to improve safety, quality, and policy alignment.
Apply nuanced safety judgment: Assess model outputs against detailed safety guidelines, rubrics, and style standards, making consistent decisions across ambiguous, sensitive, and context-dependent cases.
Create prompts and safety test cases: Write realistic prompts, user scenarios, and adversarial examples that help evaluate model behavior across safety categories and uncover unsafe, evasive, over-refusing, or policy-inconsistent responses.
Cohere's mission is to scale intelligence to serve humanity by training and deploying frontier models for developers and enterprises. They are a team of researchers, engineers, and designers passionate about their craft, believing that a diverse range of perspectives is a requirement for building great products.
Build agentic AI systems that change how Dataiku runs internally.
Turn real problems into working software.
See your solutions through from first conversation to production.
Dataiku is the Platform for AI Success, the enterprise orchestration layer for building, deploying, and governing AI. The world’s leading companies rely on Dataiku to operationalize AI and run it as a true business performance engine delivering measurable value.
Evaluate outputs based on accuracy, relevance, clarity, and instruction-following.
Perform side-by-side (SBS) comparisons of AI-generated responses.
Identify nuances in tone, meaning, and cultural context across French.
Blueprint Technologies is a technology solutions firm headquartered in Bellevue, Washington, with a strong presence across the United States and an expanding footprint across Latin America (LATAM). They are united by a shared passion for solving complex problems and bring diverse perspectives, deep expertise, and real-world experience across industries to help organizations grow, transform, and innovate.
Review, refine, and validate AI translation prompts for attraction and travel content.
Optimize AI-generated translations to ensure naturalness, fluency, and cultural relevance.
Test language prompts to ensure the output meets the required standards.
Welo Data provides AI services. They focus on helping businesses leverage the power of artificial intelligence to improve their operations and create innovative solutions.
Write, iterate, and maintain system prompts and instruction sets for Noodle’s AI agents across the student journey.
Build and maintain evaluation frameworks to measure agent accuracy, tone, hallucination rate, task completion, and alignment with rubric-based learning objectives.
Partner with Noodle teammates and university stakeholders to design, build, and test agents — translating learning objectives, operational flows, rubric assessments, and more into prompt-level agent instructions.
Noodle is higher education’s leading strategy, services, and technology partner that develops infrastructure, provides life-changing learning experiences, and grows the awareness of and the enrollment in some of the best academic institutions in the world. They empower universities to change the world by offering university partners various products and services.
Identify and label languages and dialects from model-generated responses.
Review outputs from two different AI models and determine which model correctly identified the proposed language.
Compare model responses and select the appropriate evaluation outcome from predefined options
RWS – TrainAI is looking for Language Data Annotators. They embrace DEI and promotes equal opportunity and prohibits discrimination and harassment of any kind.
Perform side-by-side (SBS) comparisons of AI-generated responses.
Evaluate outputs based on accuracy, relevance, clarity, and instruction-following.
Apply detailed, scenario-specific annotation guidelines and maintain consistency and high-quality evaluations.
Blueprint Technologies is a technology solutions firm headquartered in Bellevue, Washington, with a strong presence across the United States and an expanding footprint across Latin America (LATAM). Our people bring diverse perspectives, deep expertise, and real-world experience across industries to help organizations grow, transform, and innovate.
Evaluate AI responses across scenarios like general Q&A and web search results.
Perform side-by-side comparisons of AI-generated responses, judging accuracy and clarity.
Apply detailed guidelines, maintaining consistency and high-quality evaluations.
Blueprint Technologies is a technology solutions firm that helps organizations grow, transform, and innovate. They have a strong presence across the United States and are expanding across Latin America, with teams united by a shared passion for solving complex problems.
Review pre-written prompt instructions for tone and grammar.
Translate product-specific terms and cross-check against glossaries.
Run sample attraction descriptions through GPT-4.O, and refine prompts.
Welo Data is an AI services company. We focus on providing Language Specialists to review, refine, and validate AI translation prompts. The company appears to be a community where people can collaborate on exciting projects.
Assess the factual accuracy, relevance, and quality of AI-generated Computer Science content
Craft and answer domain-specific questions related to Computer Science and adjacent technical disciplines
Evaluate and rank AI-generated responses based on technical correctness and reasoning quality
The company is seeking Computer Science Experts with PhDs to support the training and evaluation of advanced AI models. This initiative focuses on improving the accuracy, reasoning, and domain expertise of generative AI systems through expert human feedback.
Consult clients during presales to assess AI readiness and translate visions into actionable requirements.
Architect multi-agent frameworks, design AI systems with defined roles, and implement learning & feedback loops.
Develop RAG pipelines, design custom models, and ensure governance, security, and cost-efficiency.
Sigma Software is seeking a Senior/Principal AI Engineer to join their Stellar AdTech Business Unit. They deliver innovative systems to global AdTech leaders and startups since 2008, with a strong AdTech competence center of 300+ employees.
Coordinate project workstreams to ensure on-time delivery against execution standards.
Deliver high-quality annotation and QA to establish quality benchmarks.
Analyze datasets to surface patterns and optimization opportunities.
Appen specializes in human-generated data to train, fine-tune, and evaluate models across generative AI, large language models, computer vision, and speech recognition. They have over 1 million contributors in over 200 countries supporting model pre-training, supervised fine-tuning, evaluation and benchmarking, safety and red teaming, and multilingual global expansion.
Be responsible for the end-to-end technical migration workflow for transitioning templates to LLM autoraters.
Utilize Automatic Prompt Generation (APG) tools to create baseline prompts for complex parent-child template clusters.
Manually draft, test, and refine prompts to navigate complex template architectures, overcome anti-patterns, and handle edge cases.
Welo Data specializes in AI services and data validation. The company's culture emphasizes innovation, with a focus on freelance and remote work opportunities, offering flexibility and a global perspective.