Source Job

US Unlimited PTO

  • Build and operate the delivery platform across AWS, EKS, ArgoCD, Helm, and Terraform, fixing production problems and driving root-cause analysis.
  • Standardize CI/CD pipelines using GitHub Actions and Azure DevOps, implement progressive delivery with Argo Rollouts, and build observability with Grafana and Prometheus.
  • Support platform adoption, reduce toil and cost, unblock cross-team delivery, and write documentation to eliminate knowledge silos.

Kubernetes Terraform AWS Python Grafana

20 jobs similar to Senior Site Reliability Engineer

Jobs ranked by similarity.

US

  • Ensure reliability, availability, and observability for a large-scale cloud-based SaaS platform serving millions in education.
  • Design and maintain infrastructure-as-code and CI/CD pipelines while leading incident response and resolution.
  • Mentor peers and integrate AI-driven tools to improve SRE workflows and system performance.

Jobgether is an AI-powered job matching platform that connects candidates with hiring companies. The company manages the application process and uses AI to shortlist top-fitting candidates based on core requirements.

Global Unlimited PTO 16w maternity 16w paternity

  • Own the operational excellence and infrastructure strategy for Remote Build's platform, ensuring reliability, performance, and security.
  • Lead incident response, build observability systems, and drive continuous improvement in system reliability.
  • Embed security into infrastructure, optimize costs, and automate operational toil to scale efficiently.

Remote solves modern organizations' biggest challenge of navigating global employment compliantly. With a fully distributed team across 6 continents, the company fosters a future-focused culture with core values of innovation and async work.

US

  • Designing and managing cloud-based infrastructure on AWS.
  • Creating and maintaining deployment architectures and continuous delivery pipelines.
  • Automating infrastructure provisioning and management using Infrastructure as Code (IaC) tools such as Terraform or CloudFormation.

Nearform is an independent team of data & AI experts, engineers, and designers who build intelligent digital solutions and capability at pace. Our team of 500 experts in 20+ countries is trusted by leading enterprises.

US 5w PTO

  • Design and develop CI/CD systems for websites, services, and release workflows, and operate an EKS-based Kubernetes platform.
  • Diagnose debug production incidents, drive root-cause analysis, and implement improvements to enhance system reliability.
  • Write and maintain infrastructure as code using Pulumi or Terraform/OpenTofu across multiple AWS accounts with security-conscious practices.

Thunderbird is one of the world’s most trusted open-source email applications, empowering more than 20 million people globally. Our small but growing distributed team includes 65+ people across seven countries, and we build privacy-respecting communication tools with a collaborative, inclusive, and user-first spirit.

US

  • Design, provision, and manage AWS infrastructure using Terraform and Kubernetes.
  • Build, operate, and improve observability, monitoring, and incident response processes.
  • Collaborate with engineering teams on capacity planning, performance optimization, and resilient system design.

Vynca provides comprehensive care for individuals with complex needs, focusing on quality days at home. The company is a close-knit community guided by core values of Excellence, Compassion, Curiosity, and Integrity.

Argentina 18w maternity 12w paternity

  • Own and evolve the cloud platform including compute layer, EKS fleet, serverless infrastructure, networking, and cloud operations across AWS and GCP.
  • Design and maintain infrastructure-as-code foundation and networking layer for reliability, security, and scalability.
  • Build AI-powered automation for cloud infrastructure management, including policy-as-code, drift detection, and LLM-assisted runbook generation.

Webflow builds the world's leading AI-native Digital Experience Platform, empowering teams to design, launch, and optimize for the web without barriers. As a remote-first company with over 2 million users across 190 countries, it fosters a culture of trust, transparency, and creativity.

Europe

  • Design and operate our Kubernetes ecosystem with a focus on high availability and zero-downtime operations.
  • Own and evolve our PaaS strategy, using GitOps and CI/CD to empower domain teams to deploy independently.
  • Define and implement our observability strategy across metrics, logs, and tracing.

Finom is a European tech startup headquartered in Amsterdam, revolutionizing financial services for entrepreneurs. They offer an all-in-one financial B2B solution integrating banking, accounting, financial management, and invoicing into a mobile-first platform, with about 346 million in funding.

US

  • Design, deploy, and manage production Kubernetes clusters with workload scheduling, resource quotas, network policies, and RBAC.
  • Build and optimize CI/CD pipelines using Infrastructure as Code and GitOps principles.
  • Implement observability solutions using Prometheus, Grafana, and OpenTelemetry for performance tuning and reliability.

VerTALENTS is a subsidiary of VerSprite Cybersecurity, specializing in technology staffing. The company connects top technical talent with industry clients through various methods, adding value to both clients and candidates for full-time and contracting opportunities.

6w PTO

  • Design, build, and maintain scalable CI/CD pipelines using GitLab CI/CD
  • Develop and maintain Infrastructure as Code solutions using Terraform and Ansible
  • Build and improve internal developer platform tools and deployment services

Social Discovery Group (SDG) is one of the world's largest groups of social discovery companies, uniting millions of users on dozens of products. Our international team of 1000+ professionals and digital nomads works all over the world.

Canada

  • Own and operate production cloud environments, ensuring high availability, reliability, and performance across distributed systems.
  • Design, build, and maintain scalable infrastructure using automation-first principles and Infrastructure as Code practices.
  • Drive automation initiatives and continuous improvement across infrastructure, deployment, and operational workflows.

Jobgether is an AI-powered job matching platform that connects candidates with hiring companies. They have an inclusive, employee-driven culture with a strong focus on collaboration and innovation.

US Unlimited PTO

  • Lead the design, implementation, manage, support and operation of cloud-native infrastructure and container orchestration platforms.
  • Drive platform reliability, scalability, automation, and operational excellence across critical SaaS and cloud-based workloads.
  • Contribute to architectural decisions, mentoring engineers, and ensuring alignment with security, compliance, and operational standards.

Availity delivers revenue cycle and related business solutions for health care professionals who want to build healthy, thriving organizations. They are a global team with headquarters in Jacksonville, FL, and an office in Bangalore, India, united by a mission to bring the focus back to patient care.

Latin America

  • Own and evolve production-grade cloud infrastructure on Azure.
  • Design and maintain robust Infrastructure-as-Code (IaC) architectures utilizing Terraform.
  • Build and optimize end-to-end CI/CD pipelines using GitHub Actions.

CodeRoad provides end-to-end software development services, helping businesses scale with ideal infrastructure solutions. From staff augmentation to dedicated IT teams and general software engineering, their nearshore technology services empower businesses to thrive in an ever-evolving digital landscape.

Latin America

  • Design and maintain CI/CD processes and infrastructure as code using tools like Terraform and Kubernetes.
  • Troubleshoot and resolve issues across dev, testing, and production environments.
  • Work with high-growth technology clients to scale applications and improve operational practices.

Bluelight is a leading software consultancy designing and developing innovative technology to enhance users' lives. With a presence across the United States and Central/South America, it fosters a collaborative and enriching work environment where each team member can grow and thrive.

$115,000–$130,000/yr
US Unlimited PTO

  • Develop and maintain scalable automation and integrations across cloud platforms and services.
  • Design, implement, and operate CI/CD pipelines using Jenkins, Dagger, Terraform, and Docker.
  • Build, operate, and troubleshoot workloads on Kubernetes, using Kustomize and Helm.

People Inc. is America’s largest digital and print publisher. Our brands harness the best intent-driven content, the fastest sites, and the fewest ads to help nearly 200 million people every month make decisions.

United States

  • Design, deploy, and operate service mesh platforms (Istio and Linkerd) across multi-cluster Kubernetes environments.
  • Implement mTLS, certificate lifecycle automation, and workload identity propagation for secure communication.
  • Build and enhance observability for service-to-service communication using tracing, metrics, and topology insights.

Jobgether uses AI-powered matching to connect candidates with roles. They focus on efficient hiring processes and data privacy.

United States

  • Design and build core platform infrastructure for large-scale cloud-native data and analytics systems.
  • Own and improve CI/CD pipelines, testing frameworks, and deployment in a high-scale PaaS environment.
  • Contribute to reliability engineering, observability, and operational excellence across distributed systems.

Jobgether uses an AI-powered matching process to connect candidates with roles. The company is a growing platform focused on efficient job matching and data privacy compliance.

Brazil

  • Design and improve cloud architectures, deployment pipelines, and infrastructure systems for large-scale applications across multiple cloud environments.
  • Collaborate with engineering, product, and platform teams to ensure infrastructure reliability, scalability, security, and operational excellence.
  • Drive engineering best practices, contribute to architecture decisions, and participate in on-call rotations for production support.

We are a multinational team that believes technology solves business challenges. Since 2016, we have been helping customers translate technology into success, combining Latin American talent with Swiss organization.

US Unlimited PTO

  • Leads DevOps delivery for cloud-native applications, translating architecture into infrastructure and CI/CD across environments.
  • Designs and maintains AWS infrastructure as code using Terraform across multiple services.
  • Builds and enhances CI/CD pipelines in Azure DevOps and GitHub for high-velocity delivery.

Origami Risk delivers single-platform SaaS solutions that help organizations navigate the complexities of risk, insurance, compliance, and safety management. Founded by industry veterans, the company focuses on client success with award-winning software solutions.

Europe

  • Design, build, and operate scalable cloud infrastructure using Kubernetes, Terraform, and modern infrastructure-as-code practices.
  • Improve and evolve cloud networking architecture, including VPC/VNet design, peering, routing, DNS, TLS, ingress/egress, and load balancing.
  • Contribute to system reliability through on-call support, incident response, root cause analysis, and performance optimization.

Jobgether is an AI-powered job matching platform that connects candidates with hiring companies. They use automated review and matching to ensure fair candidate evaluation.

US Canada

  • Own and evolve AWS infrastructure using Terraform, managing EKS clusters, databases, and core services.
  • Maintain CI/CD reliability and developer tooling across the full engineering org.
  • Lead incident response, drive post-incident reviews, and improve monitoring and alerting standards.

Babylist is the leading platform for expecting and new families, helping parents feel confident, connected, and cared for at every step. As a modern, AI-forward tech company with over 10 million yearly shoppers, Babylist has expanded into a full ecosystem and generated $750M in revenue in 2025, reshaping the $235B kids and baby market.