YOUR RESPONSIBILITIES:
- Build and operate production-grade model serving infrastructure using frameworks such as vLLM, TGI, Triton, or equivalent
- Design and implement robust deployment pipelines with blue/green and canary rollout strategies for ML models
- Develop and maintain auto-scaling systems, multi-model serving architectures, and intelligent request routing layers
REQUIRED QUALIFICATIONS:
- 4+ years of experience in MLOps, Platform Engineering, SRE, or similar infrastructure roles focused on ML systems
- Hands-on experience with model serving frameworks such as vLLM, TGI, Triton, or equivalent
- Strong background in container orchestration and operating GPU-based workloads in production
Pragmatike
Pragmatike is recruiting on behalf of a fast-scaling, well-funded distributed cloud infrastructure startup building next-generation AI-native cloud services. The company is redefining how compute is delivered by providing GPU-powered infrastructure for AI/ML workloads, secure storage, and high-speed data transfer through a decentralized architecture that significantly reduces environmental impact compared to traditional cloud providers.