Source Job

Global

  • Set long-term technical direction for Kraken’s AI/ML strategy, influencing multiple teams and initiatives.
  • Own the architecture and evolution of core AI/ML systems, including training, feature management, serving, experimentation, and monitoring.
  • Lead cross-team efforts to standardize ML development and operational practices across the company.

Python Scala Go Rust MLOps

20 jobs similar to Staff Machine Learning Engineer

Jobs ranked by similarity.

Europe 5w PTO

  • Guide the technical direction of Bondora’s ML engineering stack by selecting, evaluating, and implementing technologies to improve scalability and reliability.
  • Lead complex, high-risk, or cross-departmental projects that directly influence Data Science delivery, risk model performance, and production stability.
  • Act as the bridge between Data Science, Data Engineering, and Development to identify and solve systemic technical challenges.

Bondora's mission is to empower people to enjoy life more while alleviating the stress of managing finances. Founded in 2008, Bondora has served over 1 million customers for 16 years and is rapidly growing as a fintech company, set to acquire a banking license and expand investment and loan products across Europe.

Global

  • Lead the technical direction for key Growth initiatives.
  • Design and evolve distributed, high-scale systems that power user acquisition and retention.
  • Mentor engineers across Product and Platform teams, guiding architecture, design, and execution.

Kraken is building the future of crypto and aims to accelerate the global adoption of crypto, so that everyone can achieve financial freedom and inclusion. As a fully remote company, Kraken has Krakenites in 70+ countries who speak over 50 languages.

Global 5w PTO

  • Design, develop, and deploy robust ML systems and multi-model AI agents that solve real-world retail challenges.
  • Lead the entire lifecycle, including prototyping, deployment, monitoring, and maintenance using modern CI/CD and containerisation practices.
  • Build high-performance data pipelines (ETL/ELT) for both training and real-time inference, ensuring our systems are scalable and reliable.

EDITED is the world’s leading AI-driven retail intelligence platform. They empower the world’s most successful brands and retailers with real-time decision making power. Their environment is dynamic and supportive, encouraging team members to take initiative, innovate, and continuously grow.

  • Manage, mentor, hire and grow 8+ ML Engineers and Data Engineers across three distinct teams
  • Be a strong technical partner for engineers to guide ML system architecture, model deployment, and data platform design & execution
  • Ensure ML solutions are production-grade, scalable, observable, cost effective and maintainable

Apella is applying computer vision and machine learning to improve the standard of care in surgery. They build applications to enable surgeons, nurses, and hospital administrators to deliver the highest quality care; they are committed to equal employment opportunity.

US Unlimited PTO

  • Build Enterprise-Scale Infrastructure leveraging infrastructure-as-code to manage complex cloud environments.
  • Sustain Platform Health and Performance owning critical systems in production, including reliability and security.
  • Enable Teams and Customers to Move Faster creating abstractions and tooling that deploy, run, and scale AI/ML workloads.

Cake is on a mission to make cutting-edge AI accessible to enterprise teams. Backed by top investors, Cake is seeing strong adoption and is positioned for rapid growth in the next 12 months, emphasizing ownership, clear communication, and collaboration.

US Canada Argentina India

  • Work with research teams to design and build our training infrastructure
  • Prototype new training frameworks and production-ize solutions at scale
  • Design, optimize and test model integration infrastructure

Clarifai is a leading AI platform specializing in computer vision, NLP, LLMs, and audio recognition, helping organizations transform unstructured data into structured data. Founded in 2013, they remotely operate across multiple countries with backing from industry leaders, fostering a diverse and equal opportunity workplace.

US 4w PTO

  • Architect, design, and oversee delivery of end-to-end AI/ML solutions.
  • Lead cross-functional teams to implement robust ML platforms, pipelines, and applications.
  • Communicate the business value and ROI of AI/ML solutions to stakeholders.

Jobgether is using an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. The system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.

Canada

  • Contribute to our core ML infrastructure.
  • Prototype new training frameworks and production-ize solutions at scale.
  • Design, optimize and test model integration infrastructure.

Clarifai is a leading, full-lifecycle deep learning AI platform for computer vision, natural language processing, LLM's and audio recognition. Clarifai was founded in 2013 and has employees remotely based throughout the United States, Canada, Argentina, India and Estonia.

Australia New Zealand

  • Act as a solution expert across ML domains including evaluations, training, inference, data pipelines, quality, and optimisation.
  • Work directly alongside product teams as a trusted partner, helping them navigate technical challenges and arrive at effective solutions.
  • Develop blueprints, patterns, and paved roads that allow other teams to follow proven approaches and accelerate their own implementations.

Canva is a design platform that enables users to create professional designs. They have a flagship campus in Sydney, a second campus in Melbourne, and co-working spaces in other locations, with a flexible work environment.

  • Implement production AI / ML workloads using Ray and Anyscale.
  • Advise customers on ML system architecture.
  • Partner with customer MLE and MLOps teams to integrate Ray into existing platforms and workflows.

Anyscale is on a mission to democratize distributed computing and make it accessible to software developers. They are commercializing Ray, an open-source project creating an ecosystem of libraries for scalable machine learning and are backed by Andreessen Horowitz, NEA, and Addition.

$125,600–$157,000/yr
US

  • Design, build, and scale enterprise-grade AI/ML systems that power internal workflows and external-facing AI/ML platforms.
  • Develop a production-ready Generative AI and MLOps platform with reusable components used to deploy multiple AI solutions across Natera’s business units.
  • Implement cloud-native infrastructure for large-scale model training and serving using Kubernetes, MLflow, Terraform, and AWS-native services

Natera is a global leader in cell-free DNA (cfDNA) testing. They are dedicated to oncology, women’s health, and organ health, aiming to make personalized genetic testing and diagnostics part of the standard of care. The Natera team consists of highly dedicated statisticians, geneticists, doctors, laboratory scientists, business professionals, software engineers and many other professionals from world-class institutions.

Europe

  • Design, implement, and maintain robust, containerized, and reproducible pipelines for model training, evaluation, and deployment—across both batch and real-time settings.
  • Build and manage ML services, APIs, and model serving infrastructure using tools like MLflow, Amazon SageMaker, and Feature Store.
  • Set up and maintain monitoring, observability, and alerting systems to ensure high availability and performance (including model/data drift, feature logging, and inference latency).

AUTO1 Group Technology drives innovation in the used car market across Europe. They operate at the intersection of software engineering, data science, and DevOps, helping bring state-of-the-art ML models—such as large-scale recommendation systems and transformer-based neural networks—safely into production.

Europe 6w PTO

  • Own deployment engineering projects, leading the technical execution of Parloa’s deployments inside large, complex enterprise environments.
  • Design for scale and resilience, architecting deployment solutions that meet enterprise-grade requirements for performance, reliability, and security.
  • Engineer solutions where none exist, building custom extensions, integrations, and configurations to close product gaps and meet enterprise requirements.

Parloa is a fast-growing startup in the world of Generative AI and customer service. Their voice-first GenAI platform automates customer service with natural-sounding conversations and has over 400+ employees in Berlin, Munich, and New York.

$140,000–$180,000/yr
US

  • Design and deliver scalable AI systems that connect models, data, and products.
  • Turn research prototypes into secure, reliable, production-ready services.
  • Build pipelines and serving layers that power adaptive, real-time features.

KnowBe4 is a cybersecurity company that puts security first, offering an AI-driven Human Risk Management platform. They empower over 70,000 organizations worldwide to strengthen their security culture and transform their workforce into their strongest security asset.

US

  • Own the reliability, performance, and operational health of production AI systems.
  • Lead efforts to refactor and harden the AI codebase.
  • Design and build monitoring, alerting, and debugging tools.

MixMode is a leading provider of AI-powered cybersecurity solutions at scale, pioneering a patented third-wave, context-aware AI approach. Large organizations with big data workloads trust MixMode to defend their most important assets.

$191,000–$253,000/yr
US

  • Own complex, full-stack AI solutions end-to-end, from applied research to production deployment.
  • Set technical direction for ambiguous and high-impact use cases, while scaling the AI systems.
  • Mentor others, lead architectural decisions, and deepen Komodo’s AI-first culture.

Komodo Health is dedicated to reducing the global burden of disease by leveraging data. They have built the Healthcare Map, the industry’s largest view of the U.S. healthcare system. At Komodo, employees are ambitious, supportive, and passionate about delivering on its mission.

US

  • Design and implement MLOps pipelines to automate model training, deployment, monitoring, and management
  • Lead/mentor a team of MLOps Engineers, fostering an inclusive and collaborative environment that encourages innovation and continuous learning
  • Collaborate with Data Scientists and ML Engineers to ensure models are production-ready, scalable, and maintainable

Egen is a fast-growing and entrepreneurial company with a data-first mindset. They bring together the best engineering talent working with the most advanced technology platforms, including Google Cloud and Salesforce, to help clients drive action and impact through data and insights.

$168,785–$256,156/yr
US

  • Own ML powered features from design through deployment, partnering with product, design, and engineering to scope work and define success metrics.

Calendly's product enables millions of people to coordinate easily. They are experiencing exciting product growth, making it a great time to consider joining their journey.

US Canada Unlimited PTO

  • Defining and launching the next generation of GenAI features and client-facing agents.
  • Responsible for building agents customized for clients.
  • Owning the ML Platform used to detect sophisticated financial crime vectors such as synthetic identity, account takeover, and bot attacks.

Sardine is a leader in fraud prevention and AML compliance. Their platform stops fraud before it happens by using device intelligence, behavior biometrics, machine learning, and AI. Sardine has hubs in the Bay Area, NYC, Austin, and Toronto and maintains a remote-first work culture.

Europe

  • Work side by side with clients, PMs, and Architects to scope and deploy AI systems.
  • Build and integrate systems using LLMs, RAG pipelines, agent frameworks, vector databases and related tools.
  • Debug relentlessly and optimize for reliability in production, not just elegance in code.

Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Their system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.