Source Job

$126,000–$184,000/yr
US

  • Own the operational stability and performance of Juul’s hybrid cloud infrastructure.
  • Lead automation efforts and architect for reliability.
  • Act as the final escalation point for critical incidents.

Python PowerShell Bash Terraform Kubernetes

20 jobs similar to Senior Site Reliability Engineer

Jobs ranked by similarity.

Canada 5w PTO

  • Design and evolve infrastructure systems to ensure scalability, reliability, and cost efficiency.
  • Lead and mentor a distributed infrastructure team, fostering a collaborative and inclusive culture.
  • Oversee all cloud environments supporting MZLA’s products and business systems.

MZLA Technologies Corporation (MZLA) is a wholly owned, for-profit subsidiary of the Mozilla Foundation and home to Thunderbird. They are a small but growing team of 50+ people distributed across seven countries building an open-source email and productivity platform.

US

  • Ensure the smooth operation and high availability of Clarifai's core services.
  • Monitor system performance, identify bottlenecks, and implement optimizations to enhance reliability and efficiency.
  • Design and implement scalable, secure, and cost-effective infrastructure solutions.

Clarifai is a leading AI platform specializing in computer vision and generative AI, empowering organizations to transform unstructured data into actionable insights. Founded in 2013, they have a globally distributed team and $100M in funding, and are committed to building a diverse and inclusive team.

Global

  • Automate infrastructure provisioning, configuration management, monitoring, and operational workflows using IaC and scripting languages.
  • Own the deployment, maintenance, and lifecycle management of systems supporting engineering, leveraging deep expertise in Kubernetes.
  • Troubleshoot complex infrastructure and application issues, driving root-cause analysis and developing long-term remediation solutions.

SingleStore delivers the cloud-native database with the speed and scale to power the world’s data-intensive applications. They are venture-backed and headquartered in San Francisco with offices in Sunnyvale, Raleigh, Seattle, Boston, London, Lisbon, Bangalore, Dublin and Kyiv.

$155,000–$165,000/yr
US Unlimited PTO

  • Lead maintenance and operations for production and development environments.
  • Architect and implement complex solutions spanning OS, virtualization, network, and cloud layers.
  • Lead automation initiatives for infrastructure provisioning and operational tasks.

NMI enables partners with choice in payments, challenging the one-size-fits-all approach. They power innovative tech for SMBs, entrepreneurs, and fintech startups, fostering a diverse and welcoming workplace with a dedicated Diversity, Equity & Inclusion action group.

Nigeria

  • Design, implement, and manage multi-cloud infrastructure.
  • Implement container orchestration strategies for microservices architectures.
  • Develop and maintain infrastructure as code using Terraform.

Moniepoint is an all-in-one financial services platform for emerging markets, offering personal and business banking, payment, credit, and business management tools. It is also the second-fastest-growing company in Africa with over 3 million users.

US

  • Architect and deploy secure, scalable infrastructure using Terraform, CloudFormation, or similar tools.
  • Ensure the platform meets strict SLA requirements for enterprise clients, minimizing downtime.
  • Implement comprehensive monitoring, logging, and alerting to provide deep visibility into system health.

Filevine provides cloud-based workflow tools for legal professionals, helping them manage organizations and serve clients. They are recognized as a fast-growing and innovative technology company with a team of passionate professionals.

Americas EMEA Unlimited PTO

  • Design and implement highly scalable infrastructure for GitLab.com to support current and future growth.
  • Collaborate with cross-functional teams across the Infrastructure organization to plan and deliver projects that shape GitLab’s platform direction.
  • Operate and improve edge services and Kubernetes workloads, acting as a subject matter expert within the infrastructure department.

GitLab is an open-core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. They aim to enable everyone to contribute to and co-create the software that powers our world.

India

  • Oversee the reliability, scalability, performance, and security of key production services.
  • Collaborate with cross-functional teams to develop and maintain resilient infrastructure.
  • Provide expert mentorship and guidance on best practices to engineers throughout the organization.

Cision is a global leader in PR, marketing and social media management technology and intelligence, helping brands and organizations connect with customers and stakeholders to drive business results. The company has offices in 24 countries throughout the Americas, EMEA and APAC.

US

  • Design, build, and maintain secure, scalable cloud infrastructure.
  • Own CI/CD pipelines and deployment workflows across services and environments.
  • Improve reliability, availability, and performance through monitoring, alerting, and incident response practices.

Jobgether uses an AI-powered matching process to ensure each application is reviewed quickly, objectively, and fairly against the role's core requirements. They identify the top-fitting candidates and share this shortlist directly with the hiring company.

US Unlimited PTO

  • Contribute to high-impact AWS cloud infrastructure initiatives.
  • Participate in operability and production readiness reviews.
  • Advocate and implement Site Reliability Engineering practices.

Patreon is a media and community platform where creators give fans access to exclusive work. They have generated over $10 billion for creators and have 25 million+ paid memberships, with a hybrid work model and offices in New York and San Francisco.

US

  • Ensure near-zero downtime with monitoring and alerting, self-healing automation, and continuous improvement.
  • Create highly automated, available, and scalable systems by applying software and infrastructure principles.
  • Employ and advise clients on DevOps and SRE principles and practices, covering deployment pipelines, HA, service reliability, technical debt, and operational toil for live services running at scale.

66degrees is an AI transformation partner. They guide enterprises from business challenges to quantifiable outcomes, helping businesses reach their inflection point where chaotic data becomes a strategic asset, complexity becomes clarity, and AI becomes an engine for growth. They believe in thriving through challenges and winning together.

Latin America

  • Design, build, and maintain cloud infrastructure primarily on AWS, with exposure to GCP and Azure.
  • Support developers and product teams by troubleshooting infrastructure and deployment issues.
  • Enforce and promote security best practices, including least-privilege access and monitoring.

EX Squared LATAM works with international clients to build scalable, data-driven platforms that support complex digital ecosystems. They have a multicultural, LATAM-based engineering team with a culture focused on collaboration, ownership, and continuous improvement.

$125,000–$169,000/yr
Unlimited PTO

  • Design, scale, and operate resilient, cloud-native infrastructure in AWS with an emphasis on EKS, IAM, RBAC, and modern security-first practices.
  • Build and optimize CI/CD pipelines with GitHub Actions and GitHub Advanced Security, enabling velocity without compromising safety.
  • Own observability across the stack using Datadog (metrics, logging, alerting, and tracing).

DexCare optimizes time in healthcare, streamlining patient access, reducing waits, and enhancing overall experiences. They are committed to creating an inclusive workplace where diversity drives innovation and belonging strengthens collaboration, enabling everyone to thrive.

$219,000–$245,000/yr
US Unlimited PTO

  • Architect, operate, improve, and secure the platform the Garner Health app runs on.
  • Boost development velocity and productivity.
  • Build systems to a high engineering standard and hold others to the same high standard.

Garner has developed a revolutionary approach to evaluating doctor performance and a unique incentive model that's reshaping the healthcare economy to ensure everyone can afford high-quality care. They have more than doubled their revenue annually over the last 5 years. Garner's award-winning culture is designed to cultivate teamwork, trust, autonomy, exceptional results, and individual growth.

US Canada Europe

  • Lead a global team of Site Reliability Engineers.
  • Recruit, hire, onboard and develop engineers.
  • Guide project planning by defining milestones and identifying dependencies.

AuthZed creates and maintains SpiceDB and related authorization infrastructure. They are a Series A company with a fully remote team across the US, Canada, and Europe: a hardworking, close-knit group with a software-driven culture that values integrity, collaboration, and open-mindedness.

$140,700–$239,200/yr
North America

  • Write and maintain software-defined declarative infrastructure, at scale.
  • Deploy, scale, and manage containerized applications using Kubernetes, Docker, and other related tools.
  • Create build systems, web services, and automation tools that are thoroughly specified, documented, and testable.

ServiceNow began in San Diego, California in 2004. Today, they stand as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500®. They connect people, systems, and processes to empower organizations.

Latin America Unlimited PTO

  • Audit and optimize cloud usage, capacity, and spend.
  • Improve reliability through better automation, monitoring, and alerting.
  • Partner with engineers to upgrade infrastructure components and roll out changes safely.

Our client builds a high-scale data and analytics platform used by sophisticated teams to make critical business decisions. They are trusted by 800+ companies and value collaboration, high ownership, and long-term system reliability.

Global

  • Own and operate core platform systems across AWS, GCP, Vercel, GitHub, and Cloudflare.
  • Improve reliability, scalability, and security of production and non-production environments.
  • Improve local development environments and onboarding experience for engineers.

Moxie empowers ambitious aesthetic entrepreneurs to build profitable, independent practices. A global, remote-first team of more than 140 people supports hundreds of practices nationwide as they unlock sustainable success for aesthetic entrepreneurs.

$167,249–$216,090/yr
US

  • Contribute to the design of a scalable cloud infrastructure platform on Google Cloud.
  • Develop and maintain infrastructure automation using Terraform and Kubernetes controllers.
  • Ensure cloud infrastructure adheres to best practices for security and compliance.

Virta Health is dedicated to reversing metabolic disease in one billion people. They innovate through technology, personalized nutrition, and virtual care, partnering with health plans, employers, and government organizations, with over $350 million raised from investors.

Hungary

  • Design, implement, and maintain cloud infrastructure on GCP and/or AWS.
  • Manage and optimize Kubernetes clusters (GKE/EKS) including node pools and security.
  • Build and optimize CI/CD pipelines using GitLab CI, GitHub Actions, or similar tools.

Deutsche Telekom IT Solutions is a subsidiary of the Deutsche Telekom Group and was Hungary’s most attractive employer in 2025 according to Randstad’s representative survey. The company provides a wide portfolio of IT and telecommunications services and has more than 5,300 employees.