Source Job

US Unlimited PTO

  • Support the Platform Infrastructure by managing container environments on EKS, implementing GitOps workflows, and maintaining CI/CD pipelines.
  • Build for Reliability by defining SLIs/SLOs, leading incident response, and contributing to disaster recovery planning.
  • Drive Observability by designing and maintaining monitoring and logging stacks with Datadog, Sentry, and CloudWatch.

Terraform Kubernetes GitOps Datadog

20 jobs similar to Platform Operations Engineer

Jobs ranked by similarity.

US

  • Designing and managing cloud-based infrastructure on AWS.
  • Creating and maintaining deployment architectures and continuous delivery pipelines.
  • Automating infrastructure provisioning and management using Infrastructure as Code (IaC) tools such as Terraform or CloudFormation.

Nearform is an independent team of data & AI experts, engineers, and designers who build intelligent digital solutions and capability at pace. Our team of 500 experts in 20+ countries is trusted by leading enterprises.

Brazil

  • Maintain and optimize AWS EC2 and EKS clusters to ensure high availability and performance.
  • Lead troubleshooting of production outages, providing timely resolution and root cause analysis.
  • Implement and improve CI/CD pipelines using tools like Jenkins and GitHub Actions to streamline deployment processes.

CI&T are tech transformation specialists uniting human expertise with AI to create scalable tech solutions. With over 8,000 CI&Ters globally, they have built partnerships with more than 1,000 clients over 30 years, and Artificial Intelligence is deeply embedded in their work reality.

6w PTO

  • Design, build, and maintain scalable CI/CD pipelines using GitLab CI/CD
  • Develop and maintain Infrastructure as Code solutions using Terraform and Ansible
  • Build and improve internal developer platform tools and deployment services

Social Discovery Group (SDG) is one of the world's largest groups of social discovery companies, uniting millions of users on dozens of products. Our international team of 1000+ professionals and digital nomads works all over the world.

$200,000–$260,000/yr
US Unlimited PTO

  • Lead the design, implementation, and continuous improvement of our cloud infrastructure and DevOps practices.
  • Ensure that our systems are scalable, reliable, and secure, enabling seamless software delivery across environments.
  • Improve development velocity while increasing system reliability

Cadence is building a remote care delivery system that keeps older people healthy, out of the hospital, and at home. They support tens of thousands of active patients nationwide with their AI‑powered system and scalable clinical model enabling proactive, population‑level care.

US Unlimited PTO

  • Lead the design, implementation, manage, support and operation of cloud-native infrastructure and container orchestration platforms.
  • Drive platform reliability, scalability, automation, and operational excellence across critical SaaS and cloud-based workloads.
  • Contribute to architectural decisions, mentoring engineers, and ensuring alignment with security, compliance, and operational standards.

Availity delivers revenue cycle and related business solutions for health care professionals who want to build healthy, thriving organizations. They are a global team with headquarters in Jacksonville, FL, and an office in Bangalore, India, united by a mission to bring the focus back to patient care.

$115,000–$130,000/yr
US Unlimited PTO

  • Develop and maintain scalable automation and integrations across cloud platforms and services.
  • Design, implement, and operate CI/CD pipelines using Jenkins, Dagger, Terraform, and Docker.
  • Build, operate, and troubleshoot workloads on Kubernetes, using Kustomize and Helm.

People Inc. is America’s largest digital and print publisher. Our brands harness the best intent-driven content, the fastest sites, and the fewest ads to help nearly 200 million people every month make decisions.

$188,550–$212,150/yr
Global Unlimited PTO

  • Own the technical direction of Remote's SRE/Platform domain.
  • Define and drive the reliability strategy across the platform.
  • Identify and lead AI enablement initiatives across the engineering organisation.

Remote is solving modern organizations’ biggest challenge – navigating global employment compliantly with ease. With our core values at heart and a future-focused work culture, our team works tirelessly on ambitious problems, asynchronously, around the world.

Global

  • Deploy, manage, and maintain AWS infrastructure across development, staging, and production environments.
  • Build and maintain scalable, reusable and secure Infrastructure as Code (IaC) using Terraform Enterprise.
  • Develop, implement and manage CI/CD pipelines for automated application and infrastructure deployments.

Miratech helps visionaries change the world. We are a global IT services and consulting company that brings together enterprise and start-up innovation. They retain nearly 1000 full-time professionals, and their annual growth rate exceeds 25%.

Global

  • Deploy and maintain infrastructure using Terraform on AWS.
  • Operate and govern production-grade platforms running on Kubernetes / EKS.
  • Build and maintain CI/CD pipelines using GitHub Actions.

Muttdata is a dynamic startup committed to crafting innovative systems using cutting-edge Big Data and Machine Learning technologies. They are looking for a hands-on DevOps to join a strategic initiative focused on deploying and operating Data & AI platforms.

Latin America

  • Own and evolve production-grade cloud infrastructure on Azure.
  • Design and maintain robust Infrastructure-as-Code (IaC) architectures utilizing Terraform.
  • Build and optimize end-to-end CI/CD pipelines using GitHub Actions.

CodeRoad provides end-to-end software development services, helping businesses scale with ideal infrastructure solutions. From staff augmentation to dedicated IT teams and general software engineering, their nearshore technology services empower businesses to thrive in an ever-evolving digital landscape.

$205,000–$235,000/yr
US

  • Provide technical leadership for infrastructure, reliability, and observability.
  • Own the observability stack using Datadog and CloudWatch.
  • Design and evolve AWS infrastructure for reliability, security, scalability, and cost efficiency.

Topstep is an engaging working environment that ranges from fully remote to hybrid. They foster a culture of collaboration by keeping cameras on during meetings and maintaining a robust Slack environment for communication.

Global Unlimited PTO

  • Own and evolve CI/CD pipelines using GitHub Actions and OIDC-based authentication for microservices and agentic workloads.
  • Automate infrastructure provisioning using Infrastructure as Code tools such as Terraform and CloudFormation.
  • Operate and scale our Kubernetes platform, including autoscaling, ingress, and multi-tenant isolation for enterprise customers.

Zingtree is a next-generation intelligent process automation platform reimagining customer experience operations for enterprise support leaders. It is a small team with high ownership, emphasizing automation, collaboration, and transparency.

US

  • Design, implement, and maintain CI/CD pipelines, build automation, and deployment workflows.
  • Engineer secure and scalable cloud platform solutions using Infrastructure-as-Code.
  • Support platform reliability, observability, operational readiness, access management, and secrets handling.

VetsEZ is dedicated to providing expertise and innovative solutions. They focus on healthcare technology projects for the federal government. They are an equal opportunity employer.

US

  • Enhance the security of cloud infrastructure.
  • Ensure the protection of patient data and all of the technology behind our platform.
  • Work helps ensure the best outcomes for patients as we strive to make mental healthcare work for everyone.

Rula strives to create a world where mental health is embraced as part of overall well-being. They are dedicated to providing quality, evidence-based care and making a positive impact on the lives of individuals struggling with mental health issues.

$4,313–$5,391/mo
Europe

  • Own 5 AWS accounts across the organisation.
  • Architect and maintain infrastructure as code with Terraform.
  • Set up monitoring, alerting, and incident response.

We're a UK fintech building high-throughput digital infrastructure for the mortgage and property space. Recently acquired Trussle and we are taking our platform to the next level. The company values innovation and building high-quality products.

$113,850–$126,500/yr
Europe 5w PTO

  • Design, build, and maintain infrastructure using Infrastructure as Code tools such as Terraform.
  • Improve system reliability, scalability, resilience, and performance across the Mast platform.
  • Build systems and tooling that automate infrastructure management and operational workflows wherever possible.

Mast is on a mission to make complex lending simple by building modern, cloud-native lending technology purpose-built for specialist lenders. It is a high-performance team of engineers and lending experts that values radical honesty, transparency, and speed.

$160,000–$190,000/yr
US

  • Own and evolve Launch Potato's cloud infrastructure, CI/CD platform, and compliance posture.
  • Build the SRE function from the ground up so product teams can ship faster without compromising reliability, security, or cost control.
  • Stand up the SRE practice from scratch: on-call rotation, PagerDuty configuration, SLA/SLO definitions for core infrastructure services, runbook library, and observability dashboards that tie site performance to business metrics.

Launch Potato is a digital media company that connects consumers with leading brands through data-driven content and technology. They are headquartered in South Florida with a remote-first team spanning over 15 countries, with a high-growth, high-performance culture.

$120,000–$170,000/yr
Global Unlimited PTO

  • Own and evolve Quansight's cloud infrastructure across AWS, Azure, and GCP.
  • Build, deploy, and maintain internal dashboards and reporting for operations and project management.
  • Lead infrastructure engagements for clients from scoping and architecture through delivery, upskilling client teams.

Quansight is rooted in the Python and PyData ecosystems. They provide services ranging from open-source software development to training and consulting, believing in a culture of do-ers, learners, and collaborators.

Argentina 18w maternity 12w paternity

  • Own and evolve the cloud substrate including compute, EKS fleet, networking, and cloud operations across AWS and GCP.
  • Design and maintain the networking fabric connecting Webflow's services, ensuring reliability, security, and scalability.
  • Build and enforce guardrails around IAM and permissions to keep infrastructure secure and auditable while driving FinOps and cost optimization.

Webflow is building the world's leading AI-native Digital Experience Platform. As a remote-first company built on trust and creativity, it empowers over 2 million users globally to design, launch, and optimize for the web without barriers.

$39,600–$47,600/yr
Argentina Mexico Unlimited PTO

  • Build small to medium-sized infrastructure components using Terraform and AWS.
  • Ensure reliable build-and-deploy cycles by maintaining and debugging CI/CD workflows, including GitHub Actions and ArgoCD.
  • Learn to troubleshoot and resolve issues in containerized environments, including Kubernetes pods and EKS networking bottlenecks.

TrueML is a mission-driven financial software company that aims to create better customer experiences for distressed borrowers. The TrueML team includes inspired data scientists, financial services industry experts and customer experience fanatics building technology.