Similar Jobs
See allLead Machine Learning Engineer, Inference & Performance
Egen
Python
Kubernetes
Senior ML Operations (MLOps) Engineer
Jobgether
US
Python
PyTorch
TensorFlow
Senior Machine Learning Engineer, AI Platform
Mozilla
Canada
Python
Machine Learning
Cloud Infrastructure
AI Infrastructure & Platform Operations Engineer
Mirantis
Europe
Linux
Kubernetes
Networking
Staff ML Systems Engineer
Hims & Hers
US
Kubernetes
Terraform
Python
About the Role:
- Seek an AI Infrastructure Engineer to build scalable, reliable ML inference platforms for a deep-tech cloud startup.
- Collaborate with platform and AI teams to ensure high availability and low latency in production.
Responsibilities:
- Own the full lifecycle of ML systems, from development through production and on-call support.
- Design observability systems for latency, throughput, and GPU cost metrics.
- Define best practices for deployment and platform scalability.
Qualifications:
- 4+ years in MLOps, Platform Engineering, or SRE with focus on ML systems.
- Strong experience with container orchestration and GPU-based workloads in production.
Pragmatike
They are a distributed cloud infrastructure startup building AI-native cloud services with GPU-powered compute. The company is well-funded, fast-scaling, and operates in a remote-first environment with a focus on sustainability and decentralization.