Source Job

United States

  • Act as the design authority for multi-cloud infrastructure across AWS, Google Cloud, and Azure, owning the hardest architecture decisions.
  • Define firm-wide standards, patterns, and reference architectures for landing zones, networking, identity, and workload platforms.
  • Build reusable, modular Terraform and Kubernetes standards, and drive CI/CD pipelines, observability, security, and cost optimization.

Terraform Kubernetes AWS Azure Google Cloud

20 jobs similar to Principal Cloud Engineer - DevOps/Infrastructure

Jobs ranked by similarity.

North America

  • Design, build, and maintain cloud infrastructure across Azure, GCP, and AWS, including landing zones, Kubernetes, and CI/CD pipelines.
  • Implement monitoring, security, and hybrid connectivity for enterprise-scale cloud environments.
  • Collaborate cross-functionally, mentor engineers, and leverage AI tools to accelerate infrastructure development.

Applied is an Insurtech company that builds technology solutions for insurance professionals. With over 40 years of experience, they foster a culture of trust, inclusion, and growth.

Global Unlimited PTO 16w maternity 16w paternity

  • Own the operational excellence and infrastructure strategy for Remote Build's platform, ensuring reliability, performance, and security.
  • Lead incident response, build observability systems, and drive continuous improvement in system reliability.
  • Embed security into infrastructure, optimize costs, and automate operational toil to scale efficiently.

Remote solves modern organizations' biggest challenge of navigating global employment compliantly. With a fully distributed team across 6 continents, the company fosters a future-focused culture with core values of innovation and async work.

Poland

  • Build and maintain AWS infrastructure using Terraform and automation.
  • Configure and operate Kubernetes environments, mainly EKS, with possible AKS exposure.
  • Create and improve CI/CD pipelines for infrastructure and application delivery.

Software Mind develops solutions that make an impact for companies around the globe. Our culture embraces openness, acts with respect, shows grit & guts and combines employment with enjoyment.

Global Unlimited PTO

  • Lead the architecture and implementation of managed Kubernetes infrastructure across AWS, Azure, and GCP for enterprise customer deployments.
  • Own the systems that provision, organize, and manage cloud accounts, including resource governance and multi-tenant isolation.
  • Mentor P3/P4 engineers and define architectural patterns that scale across the company's infrastructure.

Ditto builds the world's leading edge sync platform, enabling applications to share data peer-to-peer with or without internet connectivity. With over $145 million in funding and trusted by organizations like Chick-fil-A and Delta Airlines, Ditto is a fast-growing, globally distributed startup committed to building a diverse and inclusive team.

US

  • Design and maintain reusable Terraform and Ansible modules for Azure and GCP, enforcing configuration standards and policy-as-code.
  • Build and optimize Jenkins and GitHub Actions CI/CD pipelines, implementing deployment strategies and security scanning.
  • Contribute to portal application code (modern JS/TS frontend, REST API) and wire applications into the platform with monitoring and observability.

BETSOL accelerates cloud transformation for enterprises across 17+ countries using AI and cloud-native solutions. The company holds several engineering patents, is recognized with industry awards, and maintains a net promoter score 2x the industry average.

Global

  • Manage a team of Engineers, conducting 1:1s, performance reviews, hiring, and career development in a distributed remote friendly environment.
  • Own the technical roadmap for shared cloud infrastructure across Azure and AWS, balancing reliability work against longer-term platform improvements.
  • Set and enforce standards for infrastructure-as-code (Terraform, Helm, Kubernetes), documentation, and operational readiness.

Delinea is a pioneer in securing human and machine identities through intelligent, centralized authorization, empowering organizations to seamlessly govern their interactions across the modern enterprise. They value diversity, innovation, and a culture of respect and fairness, with a global team supported by strategic investment from TPG.

US

  • Design, deploy, and operate secure cloud infrastructure across AWS and AWS GovCloud to support regulated deployments.
  • Drive platform reliability, release operations, and incident response for production and customer-facing systems.
  • Translate compliance obligations into practical engineering work, including access controls, monitoring, and documentation.

Arch Systems empowers discrete manufacturing facilities with deep data insights for optimal efficiency and proactive decision-making. As a remote-first company with a passionate, multidisciplinary team, they foster innovation and collaboration among employees.

Germany Unlimited PTO

  • Design and maintain scalable infrastructure-as-code solutions using Terraform and Kubernetes.
  • Build and operate observability systems while leading incident response and reliability improvements.
  • Embed security and compliance practices into infrastructure and optimize system performance and cloud costs.

This partner company builds a next-generation platform enabling AI-driven services across global employment infrastructure. It is a highly distributed, async-first organization where engineers thrive in ownership and autonomy.

Canada

  • Build and maintain infrastructure platforms for over 200 backend services running on Kubernetes clusters with 40,000+ cores.
  • Lead and mentor other engineers, own complex infrastructure failures, and participate in a shared on-call rotation.
  • Drive cloud cost efficiency, estimate schedules, and use AI tools as a first-class collaborator in daily workflows.

Life360's mission is to keep people close to the ones they love through location sharing, safe driver reports, and crash detection. The company serves approximately 97.8 million monthly active users across more than 180 countries and has more than 500 remote-first employees.

Europe

  • Design, build, and operate scalable cloud infrastructure using Kubernetes, Terraform, and modern infrastructure-as-code practices.
  • Improve and evolve cloud networking architecture, including VPC/VNet design, peering, routing, DNS, TLS, ingress/egress, and load balancing.
  • Contribute to system reliability through on-call support, incident response, root cause analysis, and performance optimization.

Jobgether is an AI-powered job matching platform that connects candidates with hiring companies. They use automated review and matching to ensure fair candidate evaluation.

US Canada

  • Build and maintain cloud infrastructure across GCP, Kubernetes, and Terraform.
  • Own CI/CD pipelines and deploy fully automated, locked-down systems.
  • Strengthen security, access control, and observability for a growing platform.

Gauntlet builds the financial systems of the future, operating across the entire stack to offer best-in-class vault products. The team serves over $1.5B in client TVL and brings together traditional finance and crypto-native expertise.

Global

  • Lead a technical pod of full-stack DevOps/DevSecOps engineers in a player-coach role, setting technical direction and managing projects.
  • Own end-to-end architecture across Azure and GCP, drive systems-level design, and champion AI-first development practices.
  • Manage sprint planning, release management, and embed DevSecOps governance with security and compliance standards.

BETSOL accelerates cloud transformation for enterprises across 17+ countries with AI and cloud-native solutions. They hold several engineering patents, have industry awards, and a net promoter score 2x the industry average, while being employee-centric with comprehensive benefits.

  • Lead cloud delivery and implementation of scalable, secure, and reliable cloud environments across Google Cloud Platform.
  • Design and manage cloud infrastructure including networking, compute, storage, identity, and platform services.
  • Implement cloud security controls, hybrid connectivity, observability, and automation using Terraform and Kubernetes.

Egen is a fast-growing and entrepreneurial company with a data-first mindset, helping clients drive action and impact through data and insights using advanced technology platforms including Google Cloud and Salesforce. They are committed to being a place where the best people choose to work, dedicated to learning, solving tough problems, and continual innovation.

Unlimited PTO

  • Define and own Hone's multi-year cloud infrastructure strategy on Microsoft Azure, balancing reliability, security, cost, and velocity.
  • Lead architecture and delivery of complex infrastructure initiatives including multi-region resilience and zero-trust networking.
  • Mentor senior and mid-level engineers, conduct architecture reviews, and raise the infrastructure engineering bar organization-wide.

Hone is an online medical clinic transforming healthcare and enhancing longevity through cutting-edge science. They are a remote-first employer with a culture focused on brand values like customer focus, execution, candid communication, collaboration, calculated risk-taking, and joy.

  • Maintain and develop secure, reliable, and scalable AWS cloud infrastructure to meet business and development needs.
  • Deploy and operate microservices running on EC2 (Docker Compose + Caddy) and Kubernetes (EKS + Karpenter).
  • Write and maintain Terraform modules and stacks for EC2, RDS, EKS, ECR, S3, IAM, VPC, and Secrets Manager.

INFUSE is a digital marketing company headquartered in the US and operating worldwide, providing services in demand generation. Our team is dispersed across 20 countries, and we are committed to giving each candidate a fair and detailed assessment.

Canada Unlimited PTO

  • Partner with engineering teams to design, build, and operate secure-by-default cloud infrastructure across AWS and Google Cloud.
  • Build reusable Terraform modules and policy-as-code guardrails to make secure implementation easier for engineering teams.
  • Operate CSPM/CNAPP tooling and drive remediation of cloud vulnerabilities and misconfigurations.

Fullscript is a health technology company that provides a platform for practitioners to access clinical insights, lab interpretations, and high-quality supplements, serving over 125,000 practitioners and 10 million patients. The company has a remote-first culture, emphasizes work-life balance, and values inclusivity and continuous learning.

US 5w PTO

  • Design and develop CI/CD systems for websites, services, and release workflows, and operate an EKS-based Kubernetes platform.
  • Diagnose debug production incidents, drive root-cause analysis, and implement improvements to enhance system reliability.
  • Write and maintain infrastructure as code using Pulumi or Terraform/OpenTofu across multiple AWS accounts with security-conscious practices.

Thunderbird is one of the world’s most trusted open-source email applications, empowering more than 20 million people globally. Our small but growing distributed team includes 65+ people across seven countries, and we build privacy-respecting communication tools with a collaborative, inclusive, and user-first spirit.

United States

  • Serve as the primary technical point of contact for external clients, translating business needs into secure cloud solutions.
  • Lead the design, deployment, and maintenance of scalable Landing Zone configurations using expert-level Terraform.
  • Administer complex Microsoft 365 services and Azure subscriptions, ensuring optimal uptime and issue resolution.

Alpha FMC is a consulting firm specializing in insurance technology solutions, providing custom software consulting services. They operate as a professional services organization with a focus on client advisory and hands-on engineering.

US

  • Owning cloud infrastructure on Azure, data pipeline orchestration, CI/CD, and observability to ensure production-grade reliability.
  • Building and maintaining foundational infrastructure that enables fast engineering velocity without breaking things.
  • Applying SRE principles such as SLOs, capacity planning, incident response, and eliminating toil through automation.

Terzo's platform processes enterprise-scale document corpora, powers real-time AI agents, and serves the Financial Intelligence Graph to Fortune 500 customers. As a small, senior team with strong ownership and minimal bureaucracy, we foster a culture of collaboration, mentorship, and continuous improvement.

Argentina 18w maternity 12w paternity

  • Own and evolve the cloud platform including compute layer, EKS fleet, serverless infrastructure, networking, and cloud operations across AWS and GCP.
  • Design and maintain infrastructure-as-code foundation and networking layer for reliability, security, and scalability.
  • Build AI-powered automation for cloud infrastructure management, including policy-as-code, drift detection, and LLM-assisted runbook generation.

Webflow builds the world's leading AI-native Digital Experience Platform, empowering teams to design, launch, and optimize for the web without barriers. As a remote-first company with over 2 million users across 190 countries, it fosters a culture of trust, transparency, and creativity.