Source Job

  • Design and build scalable ML training, deployment, and inference pipelines using CI/CD and cloud infrastructure.
  • Implement MLOps for model versioning, monitoring, and automated retraining to detect drift and performance degradation.
  • Partner with Data Scientists and Product teams to productionise models and integrate ML into customer-facing products.

Python, Cloud Infrastructure, Kubernetes, CI/CD

17 jobs similar to Machine Learning Engineer

Jobs ranked by similarity.

Canada

  • Design and operate core AI platform components for training, deploying, and serving ML models at scale.
  • Own model serving and inference workflows end-to-end, optimizing for reliability, latency, throughput, and cost.
  • Collaborate with product, infrastructure, and security teams to build scalable platform capabilities for AI-powered features.

Mozilla Corporation is the non-profit-backed technology company behind Firefox and Pocket, with over 225 million monthly users. A wholly-owned subsidiary of the Mozilla Foundation, the company is mission-driven, employee-owned, and focused on privacy and open standards.

Brazil

  • Evolve and maintain our Kubeflow, Feast, and Spark-on-Kubernetes ML infrastructure.
  • Design tools and APIs empowering teams to transition from centralized bottlenecks to self-service excellence.
  • Collaborate with Data Science teams to apply software engineering best practices to ML workflows.

Wellhub revolutionizes workplace wellness by connecting employees to partners for fitness, mindfulness, therapy, nutrition, and sleep in one subscription. Headquartered in NYC with team members across the globe, we value wellbeing, collaboration, and different perspectives.

United States, Canada

  • Build and operate the real-time inference service for the risk decision engine with low latency and high availability.
  • Own model deployment infrastructure including CI/CD, shadow mode, and staged rollouts.
  • Build model observability and partner with Risk Data Science for production operation.

Mercury is a fintech company that provides banking services for startups via partner banks. The company is committed to creating a safe environment and values diversity, with a growing team focused on innovation.

$81,112–$92,025/yr
Europe

  • Empower ML Engineers with the tools, infrastructure, and frameworks they need to iterate quickly and autonomously.
  • Accelerate time-to-market for production-ready ML products with seamless integration and access to data and resources.
  • Own ML CI/CD in close collaboration with the DevExp team, adapting existing frameworks to ML-specific needs.

Dailymotion is a video platform designed to broaden users' horizons with a unique algorithm. They foster inclusivity and aim to build a better and safer Internet with cutting-edge solutions for video hosting and advertising. With 400 employees in France, New York, and Singapore, Dailymotion is shaking up the global video platform ecosystem.

US

  • Design, build, and deploy AI/ML solutions from prototype to production for client business problems.
  • Apply generative AI and LLMs, establishing MLOps best practices including CI/CD and model monitoring.
  • Serve as a trusted technical advisor, translating ambiguous problems into well-scoped solutions and presenting to stakeholders.

DevIQ builds modern cloud and data solutions for mid-market companies focused on energy reduction, healthcare, education, and smart cities. The company offers competitive benefits, a strong team culture, and opportunities to work on end-to-end solutions with multi-disciplinary teams.

Ireland

  • Design and develop machine learning solutions ensuring accuracy, performance, security, and scalability.
  • Implement and maintain end-to-end AI/ML pipelines from data ingestion to deployment.
  • Collaborate across planning, design, and code review to raise overall code quality.

We shape the future of communications from remote-first environments. We deliver innovative solutions to hundreds of thousands of businesses and empower millions of developers worldwide, with a strong culture of connection and inclusion.

Global, 16w maternity, 16w paternity

  • Design, train, evaluate, and ship ML systems for governance and security, starting with prompt injection detection and behavioral anomaly detection.
  • Build supporting infrastructure including data pipelines, feature stores, model serving, and evaluation harnesses.
  • Set technical direction for ML work, own architecture, evaluation methodology, and model lifecycle.

Docker provides developer tools for building, sharing, and running applications across Docker Desktop, Docker Hub, and Docker Scout. With over 20 million monthly users and a globally distributed remote-first team, Docker is trusted by solo founders to the world's largest companies.

Canada

  • Define, drive, design, build, and ship end-to-end solutions that solve real customer problems.
  • Contribute to the end-to-end AI/ML software development lifecycle, ensuring reproducible research.
  • Drive architecture, design, and delivery of advanced ML systems in the Product R&D team.

Kinaxis is a global leader in modern supply chain orchestration. Known for its AI-infused platform and transparency across end-to-end supply chains, Kinaxis helps customers make faster, better decisions. The company has over 2000 employees worldwide and is recognized with Top Employer awards.

US, Unlimited PTO

  • Own and scale AI compute and deployment platforms including Kubernetes and GitOps pipelines.
  • Build inference infrastructure and observability stacks for LLM-powered workflows.
  • Drive security, compliance, and governance at the systems level in a regulated healthcare environment.

Hims & Hers is a leading health and wellness platform focused on making healthcare accessible and personal. As a publicly traded company on the NYSE (HIMS), it offers flexible/remote work and a culture centered on innovation and employee well-being.

US

  • Design and develop production-grade AI/ML services and web applications from proof-of-concept to scalable platforms.
  • Implement MLOps best practices, CI/CD pipelines, and cloud deployment for AI/ML workloads.
  • Collaborate with cross-functional teams to integrate AI capabilities into engineering workflows.

Cayuse Civil Services, LLC provides enterprise AI and engineering solutions for government and infrastructure clients. The company values innovation, excellence, collaboration, adaptability, and integrity, fostering a culture of teamwork and quality.

Global

  • Contribute to the development of the Everywhere Inference platform, a Kubernetes-based solution.
  • Design and implement APIs and developer tools to simplify deployment, management, and monitoring of AI applications.
  • Optimize serverless container workflows for AI workloads, ensuring performance, scalability, and seamless autoscaling.

Gcore provides infrastructure and software solutions for AI, cloud, network, and security. They have 550+ professionals globally and power everything from real-time communication and streaming to enterprise AI and secure web applications.

Global, 6w PTO

  • Build, optimize, and embed machine learning models for on-device inference within the QSIDS detection engine.
  • Collaborate closely with systems engineers to integrate models efficiently into a Go-based engine.
  • Take models all the way to production and own them once they're running, monitoring performance, detecting drift, and iterating to keep them reliable.

Qohash builds the zero copy data security control layer for enterprises to adopt AI safely. The company has a strong culture centered on five core values: pursuit of excellence, resilience, mission focus, accountability, and embracing conflict.

UK, Netherlands

  • Design and build systems that improve the efficiency of ML training and inference workloads.
  • Develop tooling that helps ML engineers debug, profile, optimize, and monitor model performance.
  • Partner with ML researchers and product teams to identify bottlenecks and drive performance improvements.

Reddit is a community of communities built on shared interests, passion, and trust, hosting the most open and authentic conversations on the internet. With over 100,000 active communities and approximately 126 million daily active users, Reddit is one of the internet's largest sources of information.

$120,000–$160,000/yr
US

  • Design, develop, and deploy AI/ML models to automate and improve internal workflows.
  • Build and maintain ML pipelines within an AWS cloud environment.
  • Integrate ML capabilities into existing Java and React application workflows.

Oddball aims to improve daily lives by delivering quality software to the federal space. With a team of experienced engineering, product, and UX professionals, we value learning, growth, and making a big impact in a rapidly growing company.

Europe

  • Define and evolve the architecture and roadmap for enterprise‑scale Data and AI platforms.
  • Design and build multi‑tenant, multi‑region, highly available AI platforms with governance.
  • Lead capacity planning and cost optimization strategies for GPU and CPU workloads.

NEORIS accelerates growth in Ibero‑America, combining global engineering with regional expertise. With over 60,000 professionals across 55+ countries, they offer technical specialization career paths and value responsibility, collaboration, creativity, and commitment.

Global, 4w PTO

  • Take ownership of the ML API serving NBA recommendations and harden it for low-latency production traffic.
  • Ship your first agent tool contract end-to-end: schema design, handler implementation, and unit tests.
  • Set up the eval foundation for agents with golden transcripts, rubric-based judges, and regression suites.

Clutch is a vertical SaaS company backed by Andreessen Horowitz that helps credit unions become fintech lenders, providing affordable lending solutions to over 130 million Americans. The team is small, ambitious, and shipping fast with a culture that values pragmatism and real autonomy.

Unlimited PTO

  • Manage the Machine Learning Infrastructure team to design and build tools for scalable ML model development and deployment.
  • Work as a product owner to deliver on the product vision and feature priorities, making tradeoffs between scope, quality, and time.
  • Guide the team through complex technical projects and manage end-to-end product ownership from planning to on-call support.

ExtraHop reinvents Network Detection and Response (NDR) to offer enterprises unparalleled visibility, context, and control against emerging threats. The company has been recognized as a leader by major analyst firms and has around 1,000 employees, fostering a culture rooted in values like leading with purpose and acting with integrity.