Source Job

$130,000–$140,000/yr
Global 7w PTO

  • Act as a primary responder for system incidents and outages, ensuring high availability and fast recovery.
  • Own and continuously improve monitoring, alerting, and log management systems.
  • Manage, optimize, and scale database infrastructure including MySQL, PostgreSQL, ClickHouse, and Redis.

AWS Kubernetes MySQL PostgreSQL ClickHouse

20 jobs similar to Senior Site Reliability Engineer

Jobs ranked by similarity.

US Unlimited PTO 11w maternity

  • Own and maintain the incident response process, including defining procedures, tools, and best practices
  • Guide teams in establishing and monitoring Service Level Objectives (SLOs), including setting up alerts and reporting systems
  • Lead capacity planning initiatives, focusing on both short and long-term scalability while optimizing costs

Underdog makes sports more fun by building the best products for sports fans. They are a fast-growing sports company valued at $1.3B with a focus on a seamless, simple, easy-to-use, intuitive, and fun app.

$150,000–$200,000/yr
US Unlimited PTO

  • Architect, maintain, and scale critical infrastructure.
  • Ensure system reliability and optimize performance.
  • Implement modern deployment strategies.

Scribe's Workflow AI platform automatically captures and optimizes workflows so teams work smarter, faster, and more consistently. They are a fast-growing company founded in 2019 with over 5 million users across 600,000 businesses, and they are backed by leading investors.

Global

  • Design and implement reliable and scalable AWS architecture.
  • Support the CI/CD process with ArgoCD and GitOps, automating deployments with Terraform.
  • Optimize system performance and troubleshoot issues, collaborating with development teams.

Cloudbeds is transforming hospitality with its intelligently designed platform that powers properties across 150 countries. They are a completely remote team of 650+ employees across 40+ countries, focused on building AI-powered solutions for hotels.

$112,000–$120,000/yr
US

  • Lead platform engineering initiatives using Kubernetes (EKS), Helm, and Infrastructure as Code.
  • Design and operate CI/CD platforms and deployment strategies to enable safe, low-risk releases.
  • Build and maintain strong observability foundations, including metrics, logging, alerting, and dashboards tied to service health.

Patriot Software is a remote-first, product-led tech company with a mission to make accounting and payroll fast, simple, and affordable for millions of American businesses. With 175+ team members across the U.S. and a collaborative office hub in Canton, Ohio, we’re building software that empowers the backbone of the American economy.

India

  • Design and support critical database infrastructure and automation tools.
  • Implement monitoring frameworks to ensure system reliability and efficiency.
  • Manage database replication, backups, and observability platforms.

Jobgether is a platform that connects job seekers with companies. They use an AI-powered matching process to ensure applications are reviewed quickly and fairly.

US

  • Ensure the smooth operation and high availability of Clarifai's core services
  • Monitor system performance, identify bottlenecks, and implement optimizations to enhance reliability and efficiency
  • Design and implement scalable, secure, and cost-effective infrastructure solutions

Clarifai is a leading AI platform specializing in computer vision and generative AI, empowering organizations to transform unstructured data into actionable insights. Founded in 2013, they have a diverse, globally distributed team with $100M in funding and are committed to building a diverse and inclusive team.

Europe

  • Maintaining and updating Glia’s core infrastructure.
  • Troubleshooting and resolving infrastructure-related issues.
  • Improving our security posture.

Glia provides an AI customer service solution for banks and credit unions, unifying AI and human agents across every voice and digital conversation through its ChannelLess® Architecture. Valued at over $1 billion, Glia powers over 700 financial institutions and is certified as a Great Place to Work, with 98% employee satisfaction.

Europe

  • Drive the design, execution, and maintenance of cloud infrastructure, focusing on AWS technologies and automation.
  • Ensure platforms remain scalable, stable, and secure, supporting the demands of digital marketing and iGaming offerings.
  • Collaborate with developers and QA teams to optimize deployment workflows, troubleshoot incidents, and guarantee high availability.

Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. They identify top-fitting candidates for partner companies and share the shortlist directly with the hiring company.

$170,000–$200,000/yr
US Unlimited PTO

  • Lead and contribute to projects focused on enhancing system reliability, release processes, developer experiences, cost optimizations, observability, and security.
  • Collaborate with various engineering teams to solve reliability, performance, and security issues.
  • Implement and manage infrastructure-as-code (IaC) strategies.

AllTrails is the world’s most popular and trusted platform for outdoor exploration, connecting people to the outdoors. They have a global community of millions of trailgoers and an inclusive workplace that values diversity.

Europe Unlimited PTO

  • Bring 8+ years of professional experience as a Lead Backend Developer.
  • Deliver new backend features with focus on scalability, performance, and reliability.
  • Maintain, optimize, and improve existing services and infrastructure.

Ruby Labs is a leading tech company that creates and operates innovative consumer products, offering diverse opportunities across health, education, and entertainment. Its innovative teams are driving the future of consumer-led products, and it is looking for passionate individuals to join.

US Europe Global Unlimited PTO

  • Designing and implementing database-related Cloud features.
  • Ensuring operational excellence, performance, and observability.
  • Diving deep into complex performance issues faced by Cloud customers.

Tiger Data empowers developers and businesses with the fastest PostgreSQL platform designed for transactional, analytical, and agentic workloads. As a globally distributed, remote-first team, they are committed to direct communication, accountability, and collaborative excellence.

India

  • Oversee the reliability, scalability, performance, and security of key production services.
  • Collaborate with cross-functional teams to develop and maintain resilient infrastructure.
  • Provide expert mentorship and guidance on best practices to engineers throughout the organization.

Cision is a global leader in PR, marketing and social media management technology and intelligence, helping brands and organizations connect with customers and stakeholders to drive business results. The company has offices in 24 countries throughout the Americas, EMEA and APAC.

US 12w maternity

  • Continuously review and fine-tune critical PostgreSQL configuration parameters to maximize resource utilization.
  • Forecast future resource needs based on application growth and agent usage trends, and propose scaling solutions.
  • Design, implement, and manage comprehensive monitoring dashboards and alerting systems to track key performance indicators.

Huntress is a fully remote, global team of passionate experts and ethical badasses on a mission to break down the barriers to cybersecurity. Founded in 2015 by former NSA cyber operators, Huntress protects all businesses with enterprise-grade cybersecurity products. They protect 4M+ endpoints and 7M+ identities worldwide, elevating underresourced IT teams with protection that works as hard as they do.

US

  • Ensure near-zero downtime with monitoring and alerting, self-healing automation, and continuous improvement
  • Create highly automated, available and scalable systems by applying software and infrastructure principles
  • Employ and advise clients on DevOps and SRE principles and practices, covering deployment pipelines, HA, service reliability, technical debt, and operational toil for live services running at scale

66degrees is an AI transformation partner. They guide enterprises from business challenges to quantifiable outcomes, helping businesses reach their inflection point where chaotic data becomes a strategic asset, complexity becomes clarity, and AI becomes an engine for growth. They believe in thriving through challenges and winning together.

Americas EMEA Unlimited PTO

  • Design and implement highly scalable infrastructure for GitLab.com to support current and future growth.
  • Collaborate with cross-functional teams across the Infrastructure organization to plan and deliver projects that shape GitLab’s platform direction.
  • Operate and improve edge services and Kubernetes workloads, acting as a subject matter expert within the infrastructure department.

GitLab is an open-core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. They aim to enable everyone to contribute to and co-create the software that powers our world.

Global

  • Automate infrastructure provisioning, configuration management, monitoring, and operational workflows using IaC and scripting languages.
  • Own the deployment, maintenance, and lifecycle management of systems supporting engineering, leveraging deep expertise in Kubernetes.
  • Troubleshoot complex infrastructure and application issues, driving root-cause analysis and developing long-term remediation solutions.

SingleStore delivers the cloud-native database with the speed and scale to power the world’s data-intensive applications. They are venture-backed and headquartered in San Francisco with offices in Sunnyvale, Raleigh, Seattle, Boston, London, Lisbon, Bangalore, Dublin and Kyiv.

$120,000–$290,000/yr
US

  • Design and build critical systems that power PlanetScale's database platform.
  • Collaborate with a team of expert engineers to solve complex distributed systems challenges.
  • Work directly with customers to understand their needs and translate them into robust technical solutions.

PlanetScale is rapidly growing and reinventing the database space. The PlanetScale platform offers both Postgres and Vitess clusters with a company philosophy centered around building small teams. They are recognized as one of the fastest growing companies in America and strive to build an inclusive environment where all people feel that they are equally respected and valued.

US

  • Improve deployment reliability and reduce operational risk.
  • Modernize AWS infrastructure toward Kubernetes.
  • Support the whole Engineering organization on top of EC2 and AWS.

Peek.com's platform offers business software and a marketplace for booking experiences. They have over 250 employees distributed across locations like San Francisco and New York and have secured over $100 million in funding.

$149,450–$274,430/yr
US

  • Design, develop, and evolve cloud infrastructure, driving improvements.
  • Contribute to infrastructure initiatives such as containerization and data services.
  • Help modernize and standardize platform patterns to simplify development.

Included Health is a healthcare company delivering integrated virtual care and navigation. They aim to raise the standard of healthcare for everyone by breaking down barriers to provide high-quality care for every person, offering personalized virtual and in-person care.

$167,249–$216,090/yr
US

  • Contribute to the design of a scalable cloud infrastructure platform on Google Cloud.
  • Develop and maintain infrastructure automation using Terraform and Kubernetes controllers.
  • Ensure cloud infrastructure adheres to best practices for security and compliance.

Virta Health is dedicated to reversing metabolic disease in one billion people. They innovate through technology, personalized nutrition, and virtual care, partnering with health plans, employers, and government organizations, with over $350 million raised from investors.