As a Principal DevOps Engineer, you will lead the design, implementation, and optimization of global cloud infrastructure and CI/CD pipelines, ensuring scalable, secure, and high-performing systems. You will collaborate across engineering, security, and operations teams to drive automation, enhance developer experience, and maintain compliance standards. This role offers autonomy and strategic impact.
Remote Devops Jobs · Kubernetes
As an ML Ops Engineer, you will partner with data scientists to transform their models into production-ready systems. You will architect storage and compute, harden training/inference pipelines, and make ML code, data workflows, and services reliable, reproducible, observable, and cost-efficient. You'll also set best practices and help scale our platform as Nift grows.
You'll help build the internal developer ecosystem powering Grow’s next-generation mental healthcare platform. You’ll join a mission-driven team where engineering decisions have broad organizational impact and directly support Grow’s goal of making affordable mental health care more accessible across the U.S. As a platform engineer, you’ll tackle complex challenges spanning AWS cloud infrastructure, Kubernetes (EKS) operations, CI/CD pipelines, and the tooling, automations, and workflows that drive Grow’s engineering velocity.
Bluelight is looking for a skilled individual to join its rapidly growing team. This position is ideal for someone who thrives in a fast-paced, dynamic environment where everyone's opinions and efforts are valued and appreciated. You will have the opportunity to contribute to challenging and meaningful projects, developing high-quality applications that stand out in the market.
Drive the design, development, and deployment of cutting-edge software solutions in a dynamic, distributed environment. Lead the creation and management of automated CI/CD pipelines, ensuring efficient software development, testing, and deployment processes. Work closely with cross-functional teams, provide technical leadership, and maintain system reliability and security standards. The role requires expertise in software configuration management, containerized environments, cloud deployment, and automated testing.
This role offers the opportunity to combine software engineering expertise with site reliability principles to build highly resilient, scalable, and secure systems. You will play a central role in shaping the infrastructure that powers critical AI-driven applications, ensuring optimal performance, availability, and reliability. The position involves driving automation, leveraging AI for proactive monitoring, and managing cloud-native microservices platforms at scale.
As a Cloud Engineer, you will design, implement, and manage scalable cloud infrastructure solutions for multiple client projects in a fast-paced, fully remote environment. You will work closely with cross-functional teams to ensure optimized cloud architectures, deployment pipelines, and operational stability. This role involves hands-on engineering with Kubernetes, Terraform, and cloud platforms such as AWS or GCP.
The engineering organization is a dynamic group of builders, thinkers, and problem-solvers dedicated to delivering scalable, AI-powered software products. As a Principal Software Engineer, you will participate in all technical aspects of team deliverables, communicate technical decisions, and evolve ServiceNow’s end-to-end CI/CD pipeline. You will also implement AI assistance tools and optimize the performance and reliability of our mission-critical developer pipeline.
As a Senior Site Reliability Engineer, you will partner with the Engineering Department to drive the reliability, scalability, and performance of our production systems. You will define and implement best practices across infrastructure security, observability, release engineering, and developer tooling to meet department-level operational requirements. You will also own our incident management process and automate operational tasks.
As a Senior Platform Engineer, you are a champion for DevOps and SRE culture and industry best practices within Megaport. You will work alongside talented team members in multiple time zones, ensuring that systems are secure, maintainable, and available. Outside the team, you will engage with stakeholders in requirements analysis and demonstrations. The role is very hands-on technically, and you will continually develop your skills through peer reviews and research.