In this role, you will own and support the development of our data orchestration and processing pipeline. You will build apps and core components of our ML systems, deliver new AI features, and drive improvements to our infrastructure and services. You’ll have the opportunity to shape our technical direction, architecture, processes, and culture.
Remote Devops Jobs · Python
136 results
Yassir is seeking a highly skilled and motivated MLOps Engineer to join our Artificial Intelligence (AI) team and drive our ML Operations in application deployment and infrastructure development, enabling the training, deployment, experimentation, monitoring, and overall management of our ML applications and their artefacts, at scale and at speed. As an MLOps Engineer, you will play a crucial role in making Yassir’s products and operations more AI-driven through the use of modern technologies.
As a DevOps Engineer IV at Jumio, the candidate is expected to be strong in both the “Dev” and “Ops” aspects of DevOps. The company is seeking someone with a deep understanding of systems architecture and core computer science concepts, who can reason about system behavior rather than simply working with current toolsets. As one of the early hires in our Bengaluru office, you will help shape the team and culture.
We're looking for a hands-on Platform Engineering Manager to join our team and help drive the delivery of new features while maintaining operational excellence. In this role, you will directly manage a team of 5–6 senior platform engineers and collaborate closely with them on improving and evolving our products and tooling deployment processes, using automation principles for different cloud targets.
Own and evolve Aircall’s monitoring, alerting, and observability stack. You’ll work cross-functionally with backend, frontend, and infrastructure teams to ensure our systems are transparent, measurable, and continuously improving in reliability and performance. This role is ideal for someone passionate about observability-as-code, metric design, and helping engineering teams gain meaningful visibility into their systems.
Proton is seeking a DevOps Engineer to join our fast-growing SaaS team! As a pivotal part of our startup's growth, you'll directly lead projects, navigate scaling challenges, and balance immediate needs with long-term gains. Reporting to the Infrastructure Manager, you will design, implement, and manage complex infrastructure and web application environments.
The Senior MLOps Engineer will design, build, and scale the systems that power CivCheck and Clariti’s AI capabilities. As the first MLOps Engineer, you will lead the development of robust ML infrastructure, ensuring that models move efficiently from research to production with reliability, observability, and performance at scale. This role is ideal for someone who thrives at the intersection of machine learning, software engineering, and cloud infrastructure.
Collaborate with information security, DevOps and engineering teams to identify Platform needs and issues with respect to security. Collaborate with key third-party security partners to ensure that security controls adhere to defined policies and mitigate risks. Ability to manage projects as a technical lead to ensure project initiatives are completed on time and in scope. Daily operational security controls and monitoring. Author Agile stories, estimate story points, assist with sprint planning and retrospectives. Perform advanced security technical troubleshooting for cloud and e-commerce environments. Participate in incident response exercises and continue documenting security and incident response procedures.
You would be working in our pre-training team focused on building out our distributed training and inference of Large Language Models (LLMs). This is a hands-on role that focuses on software reliability and fault tolerance. You will work on cross-platform checkpointing, NCCL recovery, and hardware fault detection. You will make high-level tools. You will have access to thousands of GPUs to test changes.
Design, build, and maintain highly available, scalable, and secure infrastructure and systems related to our blockchain participation and custody offerings. Collaborate with cross-functional teams to identify and implement improvements in infrastructure, monitoring, system automation, and incident response. Develop a present and forward-looking view into what’s happening in each part of your domain and how it applies to the business.