Source Job

  • Help define the direction for the team.
  • Define and prioritize ML Platform initiatives.
  • Enable teams to build features at scale by providing a foundation of reusable software components and infrastructure.

Python AWS Kubeflow Terraform MLOps

20 jobs similar to Senior Software Engineer - ML Platform

Jobs ranked by similarity.

Europe 4w PTO

Design, build, and own AWS-based MLOps infrastructure, defining standards for security, automation, cost-efficiency, and governance. Architect and operate production Kubernetes clusters, including containerizing and deploying ML models using Docker and Helm. Build and maintain CI/CD pipelines for training, validation, and deployment of ML workloads, implementing canary, blue-green, and rollback strategies.

Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.

US

  • Evolve and support real-time machine learning/AI pipelines to build low-latency, high-throughput systems.
  • Design and implement cloud solutions using AWS to ensure high performance, scalability, and reliability.
  • Guarantee high levels of service availability through being a part of an on-call rotation, following best practices for disaster recovery and business continuity.

Sony Interactive Entertainment (SIE) PlayStation creates unforgettable gaming experiences and is dedicated to building a world-class team.

$213,000–$300,000/yr
US 4w PTO 12w maternity 12w paternity

  • Operationalize data science solutions for risk-prediction products.
  • Design and build ML pipelines using AWS services and tools like MLflow and Snowflake.
  • Implement testing strategies within CI/CD pipelines to maintain high platform quality.

Quanata is on a mission to help ensure a better world through context-based insurance solutions.

Latin America

  • Strong computer science or engineering background with 3+ years of coding experience with Python.
  • Advanced knowledge of AWS services including but not limited to their ML services (AWS SageMaker and AWS Step Functions).
  • Experience with ML monitoring and automation tools (MLflow, SageMaker Pipelines).

Bluelight is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. With a presence across the United States and Central/South America, Bluelight is in an exciting phase of expansion, continually seeking exceptional talent to join its dynamic and diverse community.

US Canada

  • Design, build, and maintain our petabyte-scale data and ML platform.
  • Ensure reliability, security, scalability, and performance across our internal systems.
  • Automate deployment pipelines, monitoring, and alerting for ML and data services.

Serve Robotics is reimagining how things move in cities with its personable sidewalk robot designed to take deliveries away from congested streets.

India

  • Design and manage AWS infrastructure for AI services.
  • Implement Infrastructure as Code using Terraform.
  • Collaborate with cross-functional teams to enhance performance.

Jobgether uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly against the role's core requirements. Their system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.

$175,000–$200,000/yr
US

Lead AI and ML initiatives to design and implement production-grade machine learning systems and pipelines. Develop scalable infrastructure for model training, evaluation, and deployment, ensuring reliability and observability. Collaborate with cross-functional teams to drive innovation and efficiency.

Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.

$145,831–$218,747/yr
Canada

  • Build, maintain and improve Torc ML frameworks.
  • Use Terraform, AWS Managed Services, EKS, Ray.
  • Focus on data ops, ML development pipeline, logging & aggregation.

Torc has been a leader in autonomous driving since 2007. Now a part of the Daimler family, they are focused solely on developing software for automated trucks to transform how the world moves freight. Their culture is collaborative, energetic, and team focused.

$203,000–$230,000/yr

  • Lead development stages for AI/ML projects from exploration to maintenance.
  • Design and implement scalable ML pipelines for large datasets with data scientists and network security experts.
  • Conduct experiments and analyze results using metrics and visualization techniques.

Corelight is a cybersecurity company that transforms network and cloud activity into evidence for elite defenders. Fueled by accelerating revenue and investments from top-tier venture capital organizations, they are rapidly expanding their team with a geographically dispersed yet connected employee base.

US

  • Build and deploy AI-driven products that accelerate clinical trials and improve patient outcomes.
  • Develop advanced ML models and LLM-powered agents for critical use cases like patient recruitment, enrollment forecasting, and study feasibility.
  • Leverage modern cloud tools and MLOps best practices to build robust data pipelines and deploy models at scale.

At OneStudyTeam (a Reify Health company), we specialize in speeding up clinical trials and increasing the chance of new therapies being approved with the ultimate goal of improving patient outcomes.

$125,000–$175,000/yr
Unlimited PTO

  • Act as a trusted advisor to customers, building relationships with technical and business stakeholders.
  • Advise on GenAI and ML best practices and give product demos to stakeholders.
  • Partner with product and engineering teams to drive the product roadmap and spearhead new opportunities within existing accounts.

Arize AI is the leading AI & Agent Engineering observability and evaluation platform, empowering AI engineers to ship high-performing, reliable agents and applications.

  • Own the end-to-end lifecycle of ML model deployment—from training artifacts to production inference services.
  • Design, build, and maintain scalable inference pipelines using modern orchestration frameworks (e.g., Kubeflow, Airflow, Ray, MLflow).
  • Implement and optimize model serving infrastructure for latency, throughput, and cost efficiency across GPU and CPU clusters.

MARA is building a modular platform that unifies IaaS, PaaS, and SaaS which will enable governments, enterprises, and AI innovators to deploy, scale, and govern workloads across data centers, edge environments, and sovereign clouds. They are redefining the future of sovereign, energy-aware AI infrastructure.

Brazil

Combine Software Engineering and Data Science disciplines to create production-ready Machine Learning models. Develop frameworks and a platform to build, deploy, serve and monitor ML-based services. Contribute to the vision and architecture for scaling ML solutions across QuintoAndar's business.

We are Grupo QuintoAndar, the largest real estate ecosystem in Latin America, guided by a shared purpose of helping people love the place they live.

$120,000–$140,000/yr

  • Design and plan cloud-native systems aligned with business goals and security best practices.
  • Implement and support AI-based automation tools and services.
  • Continuously tune cloud and automation workloads to improve reliability and performance.

PerfectServe offers unified healthcare communication solutions to help physicians, nurses, and care team members provide exceptional patient care.

$65,274–$87,032/yr
Europe

  • Replace manual onboarding processes with automated pipelines using CI/CD and cloud tech.
  • Design, build, and deploy production-grade APIs and services within a microservices architecture.
  • Automate testing and delivery through robust unit, integration, and performance tests.

Mitek is a global leader in digital & biometric identity authentication, fraud prevention, and mobile deposit solutions. More than 7,500 organizations trust their platform and its advanced image capture solutions in biometric recognition, AI, computer vision and machine learning. At Mitek, teams are more resilient, effective, and innovative. The strength of their organization is deeply rooted in the people who power it.

  • Design and implement Python services for real-time fraud detection and compliance monitoring.
  • Build scalable data pipelines for feature building and model training.
  • Develop and operate a developer-friendly ML platform with automated quality checks.

Kraken is a mission-focused company rooted in crypto values, aiming to accelerate the global adoption of crypto for financial freedom and inclusion.

Europe

  • Design and implement the "Golden Paths"—standardized, automated templates for microservices and infrastructure.
  • Develop the CLI tools, portals, or API interfaces that abstract the complexity of our cloud infrastructure.
  • Develop and maintain a library of modular, testable, and versioned Terraform modules.

SEON is a command center for fraud prevention and AML compliance, helping companies stop fraud, reduce risk and protect revenue. Powered by real-time, first-party data signals, it enriches customer profiles, flags suspicious behavior and streamlines compliance workflows.

Design scalable software solutions with a focus on quality, performance, and maintainability. Lead software architecture decisions, ensuring the design aligns with business requirements and technical standards. Collaborate with Research Leads and Product Owners to gather requirements, design MLOps pipelines and coordinate the software delivery that adds value to the business.

NIQ is the world’s leading consumer intelligence company, delivering the most complete understanding of consumer buying behavior.

$177,300–$265,900/yr
US Europe

  • Architect, build, test, and monitor AWS-based workflows to solve critical business problems.
  • Develop microservices for ML-driven applications using Python or Java, ensuring scalability and resilience.
  • Guarantee high levels of service availability through participation in an on-call rotation.

PlayStation is a global leader in interactive and digital entertainment. They've thrilled gamers since 1994 and are a wholly-owned subsidiary of Sony Corporation, striving to create an inclusive environment that empowers employees and embraces diversity.

Australia New Zealand

As a Senior MLE, debug complex AI implementations and optimize inference performance. Work directly with product teams building solutions and develop blueprints for proven patterns. Operate in a high-velocity environment where priorities shift rapidly based on team needs.

Join the team redefining how the world experiences design.