Design, train, evaluate, and deploy ML systems powering real-time voice experiences including ASR, speech understanding, turn detection, and text-to-speech.
Improve voice AI quality through error analysis, data curation, metric design, and iterative model improvement with a focus on real-world performance.
Build evaluation frameworks for complex voice systems, measuring accuracy, robustness, latency, naturalness, and task completion.
Machine LearningSpeech RecognitionNLPReal-time Systems
Scope and lead ML initiatives end-to-end from identifying opportunities through production deployment.
Design, develop, and optimize ML models and AI systems for document processing and automation.
Build and maintain production ML pipelines that are robust, observable, and scalable.
Medallion is a healthcare technology company building a provider operations platform to eliminate administrative bottlenecks. They are one of the fastest-growing healthcare technology companies, with a mission to transform healthcare at scale and are backed by $130M in funding.
Architect and build agentic workflows that combine large language models, reasoning components, and data pipelines to create adaptive, goal-driven conversational systems
Lead the design and development of advanced ML/NLP products, from ideation to production - including model training, evaluation, optimization, and deployment
Drive experimentation with new approaches for agentic reasoning, coordination, and autonomous system design
SmartRecruiters is the Recruiting AI Company that transforms hiring for the world’s leading enterprises. Built for global scale, SmartRecruiters, an SAP company, delivers an AI-powered hiring platform that automates and optimizes the entire talent acquisition process, ensuring faster and smarter hiring decisions. They are a values-driven, globally focused tech company with strong financial backing and a bold vision for the future of work.
Developing ranking and recommendation models that identify high-performing team designs.
Building brandification pipelines to conform to an organisation's brand guidelines.
Building layout extraction and understanding systems that parse Canva's design format.
Canva is a design platform that makes it easy for anyone to create professional-looking designs. They have a flagship campus in Sydney, a second campus in Melbourne, and co-working spaces in Brisbane, Perth, & Adelaide, and provides flexibility in how and where you work.
Design and implement scalable ML infrastructure to support model development and deployment
Develop and maintain evaluation frameworks for Large Language Models (LLMs), including RAG-based systems
Evaluate model performance using tools such as RAGAS, DeepEval, or similar frameworks
EX Squared LATAM collaborates with global clients to build innovative digital solutions that drive real business impact. They foster a collaborative, inclusive, and innovation-driven culture where continuous learning and professional growth are at the core of everything they do.
Design, build, and iterate on machine learning models and LLM-based systems that power critical decisions across fraud, compliance, growth, and operations
Work with messy, real-world data to identify signals, build features, and continuously improve model performance
Make practical tradeoffs between model performance, interpretability, and operational cost
River is building the world’s most trusted financial institution to empower people to take ownership of their financial lives through Bitcoin. River is growing quickly and has raised more than $50 million from leading investors.
Define team vision and strategic direction for conversion modeling.
Oversee model development from ideation to deployment.
Recruit, mentor, and retain top ML talent.
Reddit is a platform built on shared interests, passion, and trust, and is home to open and authentic conversations. With 100,000+ active communities and approximately 121 million daily active unique visitors, Reddit is one of the internet’s largest sources of information.
Train, evaluate, and iterate on ML models and agentic systems for customer feedback, owning custom fine-tuning pipelines.
Build and maintain LLM-powered features including retrieval pipelines, reranking systems, insight agents, and automated taxonomy generation.
Design robust evaluation frameworks, write production-quality code, and collaborate with Engineering on productionisation and monitoring.
Chattermill provides a Customer Experience Intelligence platform that analyzes feedback to help large brands like Uber and Amazon center their customers. The company fosters a choice-first, trust-based culture focused on collective growth and ambitious goals.
Lead the design, development, and deployment of production, multi-turn LLM-powered features.
Own backend services in Python that integrate LLM agents with Fullscript’s platform and support reliable production use.
Partner with medical, product, and engineering teams to identify high-value opportunities for AI and turn them into practical, scalable product capabilities.
Fullscript is a health technology company committed to helping people get better by creating a platform that powers every part of care. More than 125,000 practitioners use Fullscript for clinical insights, lab interpretations, patient analytics, education, and access to high-quality supplements.
You will personally own the Intelligence Engine -- Scoring & Activation and the behavioral scoring algorithms that power user-facing activation across B2C and B2B.
You will own the entire ML Pipeline Architecture, from data ingestion and feature stores to model training, evaluation, deployment, monitoring, and retraining triggers.
You will be responsible for LLM Integration & Optimization, integrating, fine-tuning, and deploying large language models for contextual inference, personalization, and behavioral pattern recognition.
Gesture is a fast-growing tech company using AI, machine learning, and intelligent logistics to power a unique platform that connects people and brands through real-world, tangible experiences. Inside their NYC headquarters, you'll find an environment that moves with the pace and precision of Silicon Valley but with the heart of something far greater.
Conduct experiments with LLMs and evaluate different architectures and techniques to improve conversational AI quality.
Develop and maintain robust evaluation frameworks to assess model performance, accuracy, and user satisfaction using offline and online metrics.
Optimize models for inference, improving speed, efficiency, and scalability for production environments.
Social Discovery Group (SDG) unites millions of users on dozens of products, solving loneliness, isolation, and disconnection by transforming virtual intimacy into the new normal. Their international team of 1000+ professionals works remotely from various locations, and they've been recognized as a "Great Place to Work".
Own agent quality end-to-end: diagnosis, improvement, and validation across SmartAssist's orchestrator and subagents
Drive quality improvements through prompt engineering, context engineering, and RAG retrieval tuning
Extend and mature our evaluation framework: scorers, golden datasets, regression gates, and online evaluation for production traffic
Smartsheet has been helping people and teams achieve for over 20 years. They are building tools that empower teams to automate the manual, uncover insights, and scale smarter.
Design, build, and maintain ML infrastructure across training, evaluation, serving, and monitoring
Own data pipelines including generation, cleaning, validation, and versioning
Build and improve experiment tracking, orchestration, and reproducibility tooling
Quilter is helping electrical engineers save time and accomplish more by automating the tedious and time-consuming task of designing printed circuit boards (PCBs). Their small team is composed of experts in electrical engineering, electromagnetic simulation, ML/AI, and high-performance computing (HPC).
Design, implement, and evaluate machine learning models and AI algorithms.
Develop and optimize prompts for LLMs to improve model outputs.
Collaborate with software engineers, data scientists, and product teams.
Cadre AI is focused on building and optimizing AI-powered platforms, bringing together cutting-edge technologies and expertise in machine learning and large language models. The team is dedicated to advancing AI capabilities and applying them to real-world challenges through scalable, high-impact solutions.
Lead and drive ambitious research initiatives that advance the state of the art in computer vision, multimodal understanding, and visual generation.
Develop novel models, algorithms, and training methodologies for challenging vision problems.
Translate cutting-edge research into practical model improvements that can shape product direction and unlock new user experiences.
Webflow is building the world’s leading AI-native Digital Experience Platform as a remote-first company built on trust, transparency, and a whole lot of creativity. They empower teams to design, launch, and optimize for the web without barriers.
Build and deploy end-to-end AI/ML solutions, from data pipelines and feature engineering to model training and inference
Develop and maintain data pipelines for ingesting, transforming, and preparing data for analytics and machine learning
Write clean, modular, and maintainable code to support scalable AI applications
Eimagine fosters a remote-enabled environment where their people can thrive. They are a team of professionals who take pride in their craft, continuously learn, and support one another, helping clients navigate technology and business change while delivering meaningful outcomes.
Design, prototype, and deploy Generative AI solutions across client-facing and internal platforms.
Build and optimize applications using large language models (LLMs), vector databases, prompt engineering, and RAG pipelines.
Lead development of AI agents for both digital and voice channels, supporting real-time interactions with clients and internal users.
National Debt Relief, founded in 2009, aims to help consumers deal with overwhelming debt. They are a debt settlement organization that has helped over 450,000 people settle over $10 billion of debt, striving to empower them to lead a healthier financial lifestyle.
Own technical direction for high-impact AI products and work across teams to turn big ideas into shipped systems.
Help raise the bar for how they build, evaluate, and operate AI in production.
Rula strives to create a world where mental health is no longer stigmatized and provides quality, evidence-based care. They are a remote-first company that is dedicated to treating the whole person, not just the symptoms, and making a positive impact in the field of mental healthcare.
Build and ship customer‑facing AI, combining Generative AI with machine‑learning techniques.
Develop new models end-to-end, from understanding product requirements to implementation and deployment.
Create an ML Ops framework to ensure models scale effectively with proper monitoring and alerts.
Qonto is creating the leading finance workspace with banking at its core for SMEs in Europe, augmented by financial tools. Founded in 2017 by Alexandre and Steve, Qonto has grown to over 1,600 employees and serves over 600,000 customers across 8 European countries, with a culture that prioritizes customer satisfaction.
Design and maintain training systems that can process and learn from petabyte-scale multimodal datasets.
Identify and resolve bottlenecks in the training pipeline to maximize GPU utilization and reduce training time.
Work with the ML team to develop and refine neural network architectures suitable for autonomy tasks.
Serve Robotics is reimagining how things move in cities. Their personable sidewalk robot is their vision for the future; it's designed to take deliveries away from congested streets, make deliveries available to more people, and benefit local businesses. Their team is agile, diverse, and driven aiming to grow robotic deliveries from surprising novelty to efficient ubiquity.