You'll work with AI tools, test model outputs, and evaluate responses.
Document errors, gaps, and collaborate with our team.
Spot inconsistencies and provide structured feedback.
Project World Wide is involved in shaping the future of AI through training data. They seek motivated individuals to contribute to the development of cutting-edge AI systems.
Review, analyze, and rank AI-models' chains of thought for correctness and approach.
Provide clear, constructive feedback to improve AI-generated responses.
An Enterprise client is seeking talents who are fluent in English who will help train generative artificial intelligence models. They seem to maintain a contractor-based work environment.
Write code and build production software and workflows on top of the Tessera Mosaic platform
Conduct deep process discovery with customers, breaking down complex, ambiguous, cross-system problems into well-defined solutions
Build lightweight prototypes or workflows that showcase Tessera’s AI agents in real enterprise scenarios
Tessera Labs is building the agentic automation layer for the modern enterprise. Their AI platform automates the most complex manual business workflows with superhuman quality and speed and is backed by a16z and Foundation Capital.
Challenge AI models on realistic educational scenarios.
Validate whether its understanding of pedagogical concepts reflects best-in-class teaching practice.
Evaluate AI outputs for clarity and correctness, analyze subtle reasoning errors, document gaps in logic.
The company is seeking independent Instructional Experts with hands-on experience teaching, tutoring, or building curriculum to train AI models. As a contractor you’ll supply a secure computer and high-speed internet; company-sponsored benefits such as health insurance and PTO do not apply.
Challenge advanced language models on software engineering tasks.
Verify logical accuracy and coding fluency in German.
Capture reproducible error traces and suggest improvements.
Project World Wide is shaping the future of AI through high-quality training data. They appear to be a technologically advanced organization focused on evolving language models into powerful engines.
Build AI-powered systems to automate and improve workflows.
Work closely with business teams to understand processes and pain points.
Use AI coding agents to build software more rapidly than traditional methods.
M3 USA delivers digital solutions to healthcare, life sciences, and pharmaceutical industries. They focus on physician communities globally and have a dynamic, innovative work environment.
Build and deploy AI models with RAG and tool calling for various product features.
Collaborate with frontend and backend engineers to bring AI features into production.
Stay updated with the latest AI research and propose innovative applications relevant to Finom’s mission.
Finom is a European tech startup headquartered in Amsterdam, revolutionizing financial services for entrepreneurs. They offer an all-in-one B2B financial solution integrating banking, accounting, financial management, and invoicing into a seamless, mobile-first platform, actively expanding across key EU markets.
Shaping the Python language ecosystem with a strong product and platform mindset.
Architecting, building and delivering high-impact solutions that uplift the Python developer experience.
Developing internal observability tooling and metrics that give the team actionable insights.
Canva is a design platform that enables users to create a variety of visual content. They have campuses in Sydney and Melbourne and co-working spaces in Brisbane, Perth and Adelaide; they value work-life balance by providing their teams with the choice in where and how they work.
Contribute to AI model training initiatives by curating code examples, offering precise solutions, and providing meticulous corrections in specialized programming languages.
Evaluate and refine AI-generated code, ensuring it adheres to industry standards for efficiency, scalability, and reliability.
Collaborate with cross-functional teams to enhance AI-driven coding solutions, ensuring they meet enterprise-level quality and performance benchmarks.
xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Their team is small, highly motivated, and focused on engineering excellence with a flat organizational structure.
Ship full-stack features end-to-end, owning the full lifecycle from conception to production with minimal oversight.
Build and maintain the infrastructure powering Fastino's services, keeping systems reliable and performant as we scale.
Work directly with AI researchers to bring new model capabilities into the product, moving fast without sacrificing quality.
Fastino is building the next generation of LLMs, developing specialized, efficient AI. The team, with alumni from Google Research, Apple, Stanford, and Cambridge, has raised $25M and is backed by leading investors including Microsoft and Github CEO Thomas Dohmke.
Create AI augmented applications, conversational experiences, and UI features
Build chatbots, agents, and interactive demos that showcase new AI capabilities
Experiment with the latest LLM frameworks, information retrieval and model steering techniques
Experian is a global data and technology company, powering opportunities for people and businesses around the world. A FTSE 100 Index company listed on the London Stock Exchange (EXPN), they have a team of 25,500 people across 32 countries.
Focuses on simplifying the infrastructure behind large language model (LLM) integrations, runtime orchestration, and data workflows.
Work at the intersection of LLM tooling, serverless infrastructure, and financial data systems.
Make spawning new research pipelines seamless and scalable.
The client is one of the world's fastest-growing AI companies accelerating the advancement and deployment of powerful AI systems. They help customers by working with the world’s leading AI labs to advance frontier model capabilities and leveraging that work to build real-world AI systems that solve mission-critical priorities for companies.
Contribute to the development of the Everywhere Inference platform, a Kubernetes-based solution.
Design and implement APIs and developer tools to simplify deployment, management, and monitoring of AI applications.
Focus on packaging and integrating new ML models into the platform, using Python and common ML frameworks.
Gcore provides infrastructure and software solutions for AI, cloud, network, and security. They power everything from real-time communication and streaming to enterprise AI and secure web applications, with over 550 professionals globally and partnerships with technology leaders.
Design and optimise AI-ready tools and APIs that enable LLM platforms to reliably interact with Canva's design capabilities.
Build and maintain evaluation frameworks to systematically measure tool-use accuracy across platforms.
Experiment with LLM orchestration and agent architectures – Develop Canva agents that any 3rd party provider can call to design quickly, efficiently and at scale.
Canva is a platform redefining how the world experiences design. They have a flagship campus in Sydney, with a second campus in Melbourne and co-working spaces in Brisbane, Perth, Adelaide, and Auckland, NZ.
Design, develop, and deploy intelligent AI Agents using Python-based frameworks.
Architect and implement robust Retrieval-Augmented Generation (RAG) pipelines from scratch.
Integrate LLMs into public sector and healthcare applications while ensuring high accuracy and reliability.
Deutsche Telekom IT Solutions, a subsidiary of the Deutsche Telekom Group, is Hungary’s most attractive employer in 2025. The company provides a wide portfolio of IT and telecommunications services with more than 5300 employees, serving hundreds of large customers, corporations in Germany and in other European countries.
Drive the design and evolution of AI-ready tools and APIs for LLM platforms.
Own and evolve evaluation frameworks that measure tool-use accuracy across platforms.
Shape Canva's agent architecture, making strategic technical decisions about intelligence location.
Canva is a design platform that enables users to create various visual content. They have offices in multiple locations in Australia and New Zealand, and they offer a flexible work environment.
Research & Train: Design, train, and evaluate our proprietary deep learning models.
High-Performance ML Systems: Optimize our models for maximum inference speed and efficiency, ensuring they can handle massive datasets and real-time workloads at scale.
Deepslate is building Speech to Speech Voice AI models that sound and act indistinguishable from a human, believing everyone should be able to use it. Backed by top-tier investors from the Tech and AI sectors, as well as a major German VC fund, they are incredibly well-funded and moving fast.
Rapidly prototype MVPs using LLM APIs to address business bottlenecks.
Develop production-grade internal applications with reliable frontends and robust backends (Python).
Design and implement RAG architectures and structured output pipelines grounded in company data.
Bestow is a leading vertical technology platform that serves some of the largest and most innovative life insurers. Their platform unifies the fragmented, legacy value chain, enabling carriers to launch products in weeks instead of years. They are backed by leading investors and trusted by major carriers.
Build and test AI-driven automations to improve operational processes.
Translate service design ideas into quick, working prototypes.
Partner with operations teams to test solutions in real workflows and gather feedback.
EXANTE is a pioneering wealth tech company that delivers cutting-edge centralized trading solutions and robust B2B financial infrastructure. As a rapidly expanding global firm with over 600 talented employees from 65 nationalities across 70 locations, they are a frontrunner in the financial sector.
Review and label content for sentiment, factual accuracy, and reasoning issues.
Evaluate model outputs across quality dimensions using scoring frameworks.
Validate automated assessments and identify discrepancies or errors.
Welo Data provides AI services helping to develop and evaluate large language models (LLMs). The job posting does not provide information regarding the company's size and culture.