Similar Jobs
See allSoftware Engineer (Python, Kubernetes, AI/ML)
Gcore
Global
Python
Kubernetes
AI/ML
Senior AI/ML Specialist Solutions Architect
NVIDIA
US
Python
Terraform
Kubernetes
Senior Software Engineer, Backend (AI Agent Runtime)
Cresta
Canada
Python
Golang
Kubernetes
Senior Software Engineer, Machine Learning (EST or EMEA)
AssemblyAI
EMEA
Python
PyTorch
Kubernetes
Senior Software Engineer II - AI/ML
Seismic
US
C#
.NET
Python
Main Responsibilities:
- Set up and manage GPU cluster infrastructure for distributed model training.
- Build and operate job orchestration and scheduling systems.
- Integrate and maintain ML training frameworks and post-training pipelines.
Preferred Qualifications:
- Hands-on experience managing GPU clusters on major cloud providers.
- Experience with distributed compute orchestration tools.
- Strong Python proficiency and solid software engineering fundamentals.
Snorkel AI
Snorkel AI helps enterprises transform expert knowledge into specialized AI at scale. They started as a research project in the Stanford AI Lab and work with some of the world’s largest organizations to empower scientists, engineers, financial experts, product creators, journalists, and more to build custom AI with their data faster than ever before.