Source Job

$205,000–$235,000/yr
US

  • Provide technical leadership for infrastructure, reliability, and observability.
  • Own the observability stack using Datadog and CloudWatch.
  • Design and evolve AWS infrastructure for reliability, security, scalability, and cost efficiency.

AWS Terraform Kubernetes Datadog CI/CD

20 jobs similar to Staff Platform Engineer

Jobs ranked by similarity.

$4,313–$5,391/mo
Europe

  • Own 5 AWS accounts across the organisation.
  • Architect and maintain infrastructure as code with Terraform.
  • Set up monitoring, alerting, and incident response.

We're a UK fintech building high-throughput digital infrastructure for the mortgage and property space. Recently acquired Trussle and we are taking our platform to the next level. The company values innovation and building high-quality products.

$188,550–$212,150/yr
Global Unlimited PTO

  • Own the technical direction of Remote's SRE/Platform domain.
  • Define and drive the reliability strategy across the platform.
  • Identify and lead AI enablement initiatives across the engineering organisation.

Remote is solving modern organizations’ biggest challenge – navigating global employment compliantly with ease. With our core values at heart and a future-focused work culture, our team works tirelessly on ambitious problems, asynchronously, around the world.

$113,850–$126,500/yr
Europe 5w PTO

  • Design, build, and maintain infrastructure using Infrastructure as Code tools such as Terraform.
  • Improve system reliability, scalability, resilience, and performance across the Mast platform.
  • Build systems and tooling that automate infrastructure management and operational workflows wherever possible.

Mast is on a mission to make complex lending simple by building modern, cloud-native lending technology purpose-built for specialist lenders. It is a high-performance team of engineers and lending experts that values radical honesty, transparency, and speed.

$120,000–$170,000/yr
Global Unlimited PTO

  • Own and evolve Quansight's cloud infrastructure across AWS, Azure, and GCP.
  • Build, deploy, and maintain internal dashboards and reporting for operations and project management.
  • Lead infrastructure engagements for clients from scoping and architecture through delivery, upskilling client teams.

Quansight is rooted in the Python and PyData ecosystems. They provide services ranging from open-source software development to training and consulting, believing in a culture of do-ers, learners, and collaborators.

$138,700–$173,350/yr
US

  • Lead the architecture of a high-scale AWS environment optimized for AI workloads.
  • Manage and mentor a high-performing team of 8 engineers, providing technical leadership and career coaching.
  • Conduct user research with internal Natera developers to identify friction points.

Natera is a global leader in cell-free DNA (cfDNA) testing, dedicated to oncology, women’s health, and organ health. The Natera team consists of statisticians, geneticists, doctors, laboratory scientists, business professionals, software engineers, and many other professionals from world-class institutions.

US

  • Designing and managing cloud-based infrastructure on AWS.
  • Creating and maintaining deployment architectures and continuous delivery pipelines.
  • Automating infrastructure provisioning and management using Infrastructure as Code (IaC) tools such as Terraform or CloudFormation.

Nearform is an independent team of data & AI experts, engineers, and designers who build intelligent digital solutions and capability at pace. Our team of 500 experts in 20+ countries is trusted by leading enterprises.

Global

  • Deploy and maintain infrastructure using Terraform on AWS.
  • Operate and govern production-grade platforms running on Kubernetes / EKS.
  • Build and maintain CI/CD pipelines using GitHub Actions.

Muttdata is a dynamic startup committed to crafting innovative systems using cutting-edge Big Data and Machine Learning technologies. They are looking for a hands-on DevOps to join a strategic initiative focused on deploying and operating Data & AI platforms.

US Unlimited PTO

  • Lead software engineering teams providing infrastructure-as-code to manage cloud infrastructure.
  • Hire experienced site reliability staff, and a line manager to grow and oversee the SRE team.
  • Establish design-before-build discipline; facilitate lightweight design documents, architectural decision records, and working group reviews.

Horizon3.ai is a cybersecurity company dedicated to enabling organizations to proactively find, fix, and verify exploitable attack vectors. They are a fast-growing company with a culture of respect, collaboration, ownership, and results.

Argentina 18w maternity 12w paternity

  • Own and evolve the cloud substrate including compute, EKS fleet, networking, and cloud operations across AWS and GCP.
  • Design and maintain the networking fabric connecting Webflow's services, ensuring reliability, security, and scalability.
  • Build and enforce guardrails around IAM and permissions to keep infrastructure secure and auditable while driving FinOps and cost optimization.

Webflow is building the world's leading AI-native Digital Experience Platform. As a remote-first company built on trust and creativity, it empowers over 2 million users globally to design, launch, and optimize for the web without barriers.

Global Unlimited PTO

  • Own and evolve CI/CD pipelines using GitHub Actions and OIDC-based authentication for microservices and agentic workloads.
  • Automate infrastructure provisioning using Infrastructure as Code tools such as Terraform and CloudFormation.
  • Operate and scale our Kubernetes platform, including autoscaling, ingress, and multi-tenant isolation for enterprise customers.

Zingtree is a next-generation intelligent process automation platform reimagining customer experience operations for enterprise support leaders. It is a small team with high ownership, emphasizing automation, collaboration, and transparency.

$160,000–$190,000/yr
US

  • Own and evolve Launch Potato's cloud infrastructure, CI/CD platform, and compliance posture.
  • Build the SRE function from the ground up so product teams can ship faster without compromising reliability, security, or cost control.
  • Stand up the SRE practice from scratch: on-call rotation, PagerDuty configuration, SLA/SLO definitions for core infrastructure services, runbook library, and observability dashboards that tie site performance to business metrics.

Launch Potato is a digital media company that connects consumers with leading brands through data-driven content and technology. They are headquartered in South Florida with a remote-first team spanning over 15 countries, with a high-growth, high-performance culture.

$145,000–$250,000/yr
US Unlimited PTO

  • Construct infrastructure as code, developing and enforcing best practice across configurations while preventing drift between Terraform configurations and infrastructure deployments.

SentiLink provides innovative identity and risk solutions, empowering institutions and individuals to transaction with confidence. They are building the future of identity verification in the United States replacing a clunky, ineffective, and expensive status quo with solutions that are 10x faster, smarter, and more accurate.

US Unlimited PTO

  • Support the Platform Infrastructure by managing container environments on EKS, implementing GitOps workflows, and maintaining CI/CD pipelines.
  • Build for Reliability by defining SLIs/SLOs, leading incident response, and contributing to disaster recovery planning.
  • Drive Observability by designing and maintaining monitoring and logging stacks with Datadog, Sentry, and CloudWatch.

Turquoise Health is a Series C price transparency platform for finance leaders across healthcare, building the infrastructure for a more open, efficient healthcare marketplace. The company is a remote-first, US-based team of over 300 enterprise organizations that values transparency, empathy, inclusivity, creativity, and ownership.

$200,000–$260,000/yr
US Unlimited PTO

  • Lead the design, implementation, and continuous improvement of our cloud infrastructure and DevOps practices.
  • Ensure that our systems are scalable, reliable, and secure, enabling seamless software delivery across environments.
  • Improve development velocity while increasing system reliability

Cadence is building a remote care delivery system that keeps older people healthy, out of the hospital, and at home. They support tens of thousands of active patients nationwide with their AI‑powered system and scalable clinical model enabling proactive, population‑level care.

$160,000–$200,000/yr
US

  • Drive the stability and reliability of Epic's GCP infrastructure.
  • Manage and harden our Docker and GKE container platform.
  • Maintain and improve CI/CD pipelines.

Epic is the leading digital reading platform for kids ages 12 and under, used by millions of children, families, and educators around the world. As Epic continues to grow, we are reimagining what reading can be through thoughtful technology, data, and global collaboration to make learning more engaging, accessible, and impactful.

Global

  • Deploy, manage, and maintain AWS infrastructure across development, staging, and production environments.
  • Build and maintain scalable, reusable and secure Infrastructure as Code (IaC) using Terraform Enterprise.
  • Develop, implement and manage CI/CD pipelines for automated application and infrastructure deployments.

Miratech helps visionaries change the world. We are a global IT services and consulting company that brings together enterprise and start-up innovation. They retain nearly 1000 full-time professionals, and their annual growth rate exceeds 25%.

Canada 4w PTO

  • Design and build scalable infrastructure to support rapid growth in data volume, service usage, and engineering velocity.
  • Implement and maintain core security infrastructure and controls including, service-to-service authentication, secrets management, application security primitives.
  • Partner closely with Security Engineering to implement infrastructure that supports best-in-class security and compliance practices.

Vanta helps businesses earn and prove trust by providing a platform that continuously monitors and verifies security. They empower companies to practice better security and prove it with ease. Vanta has a kind and talented team with offices in SF, NYC, London, Dublin, Tel Aviv, and Sydney.

Unlimited PTO

  • Assess and improve visibility by identifying gaps in dashboards, metrics, and logs.
  • Refine alerts and dashboards for critical services to catch issues earlier.
  • Automate routine checks and monitoring tasks to free up engineers.

PlayOn is where high school sports come to life through platforms like GoFan, NFHS Network, and MaxPreps. As a growth-stage company backed by KKR, we build the technology that powers high school athletics from ticketing and streaming to fundraising and merchandise.

US Unlimited PTO

  • Lead the design, implementation, manage, support and operation of cloud-native infrastructure and container orchestration platforms.
  • Drive platform reliability, scalability, automation, and operational excellence across critical SaaS and cloud-based workloads.
  • Contribute to architectural decisions, mentoring engineers, and ensuring alignment with security, compliance, and operational standards.

Availity delivers revenue cycle and related business solutions for health care professionals who want to build healthy, thriving organizations. They are a global team with headquarters in Jacksonville, FL, and an office in Bangalore, India, united by a mission to bring the focus back to patient care.

$100,000–$160,000/yr
Global

  • Design and implement secure AWS infrastructure using Infrastructure as Code across multiple client engagements.
  • Build CI/CD pipelines, automate deployments, and establish monitoring and observability solutions.
  • Collaborate with architects and client teams to translate requirements into robust cloud solutions.

Innovative Solutions provides cloud solutions and services. They help clients with digital transformation and cloud adoption. They use AI tools to support parts of the hiring process and have a supportive recruitment team.