Jobs Similar to Site Reliability Engineer | TangerineFeed

Site Reliability Engineer

Newton 3 days ago

Canada

Implementing the improvements to the reliability, fault tolerance, scalability, and performance of our infrastructure
Managing incidents using your technical know-how to involve the appropriate teams and automate away manual practices
Improving observability across our systems (metrics, logs, tracing) to reduce time to detection and resolution

Python Javascript Java AWS

20 jobs similar to Site Reliability Engineer

Jobs ranked by similarity.

Senior Site Reliability Engineer

Akuity 11 days ago

US Canada

Own SLI/SLO/SLA definitions for the Akuity SaaS platform and drive continuous improvement.
Participate in an on-call rotation and act as incident commander for high-severity production events.
Partner with engineering teams to build reliability into new features before they ship to production

Akuity helps enterprises ship software faster and more reliably with modern GitOps best practices. The Akuity Platform enables teams to manage the development and deployment across hundreds – if not thousands – of Kubernetes clusters from a single control plane.

View details Similar jobs

Staff Site Reliability Engineer

SmarterDx 17 days ago

$230,000–$250,000/yr

US Unlimited PTO 12w paternity

Define and evolve reliability standards for the SmarterDx platform.
Enhance observability systems (metrics, logs, traces, alerting) to provide actionable insights and reduce mean time to detect (MTTD) and resolve (MTTR).
Reduce operational toil through automation, self-healing systems, and improved deployment and rollback mechanisms.

SmarterDx, a Smarter Technologies company, builds clinical AI that is transforming how hospitals translate care into payment. Founded by physicians in 2020, their platform connects clinical context with revenue intelligence, helping health systems recover millions in missed revenue, improve quality scores, and appeal every denial.

View details Similar jobs

Site Reliability Engineer

Ivanti 12 days ago

US

Deploy, manage, and secure Ivanti’s production Software-as-a-Service (SaaS) environments in AWS and Azure
Automate common and repetitive tasks
Participate in on-call rotations for 24x7 coverage (follow-the-sun model) for incident response, issue triage, and problem resolution

Ivanti's mission is to elevate human potential within organizations by managing, protecting and automating technology for continuous innovation. They are committed to building a diverse team and fostering an inclusive environment where everyone belongs.

View details Similar jobs

Site Reliability Engineering Manager II

Flywire 23 days ago

$160,000–$200,000/yr

US

Help drive reliability, automation and performance within our cloud-based infrastructure.
Become embedded within an Engineering team helping them navigate production excellence and advocate for best practices.
Debug production issues across services and levels of the stack as well as practice incident response and blameless postmortems.

Flywire is a global payments enablement and software company that was founded over a decade ago. They have over 1,200 global FlyMates, representing more than 40 nationalities, in 12 offices worldwide, and are looking for people to join the next stage of their journey as they continue to grow.

View details Similar jobs

Senior Platform Engineer

Lillio 4 days ago

$103,174–$117,720/yr

Canada

Lead efforts to scale and improve our infrastructure.
Develop and support internal team tooling.
Troubleshoot, debug and resolve issues as part of a shared on-call rotation.

Lillio, formerly HiMama, empowers early childhood educators through innovative tools. They are a Series B, private-equity backed company recognized as an industry leader and selected in 2025 by Time Magazine as one of the world's top EdTech companies.

View details Similar jobs

Site Reliability Engineer II

InvestorFlow 28 days ago

Global

Design and implement comprehensive monitoring strategies.
Take ownership of production incident response, lead handling, and drive remediation.
Continuously improve operational processes, reliability practices, and team readiness.

InvestorFlow delivers industry specialized CRM and digital portals to help alternative asset firms find opportunities, create and manage relationships, and turn relationship insights into action. They serve over 175 clients, including 25 of the top 50 alternative asset managers, managing more than $6 trillion in assets.

View details Similar jobs

Site Reliability Engineering (SRE) Intern

AWP Safety 23 days ago

$30–$34/hr

US

Help deploy and configure Dynatrace OneAgent and ActiveGates with automated tooling.
Define and instrument user‑centric metrics and objectives in Dynatrace.
Combine Davis® AI with Copilot/Claude to identify root causes and reduce MTTR.

AWP Safety's IT Internship Program is a hands‑on, learning experience for early‑career professionals who want to build a future in IT Site Reliability Engineering. They operate at the intersection of Software Engineering and Systems Operations, using Dynatrace to diagnose performance bottlenecks and automate "toil" out of existence.

View details Similar jobs

Site Reliability Engineer

Upsun 19 days ago

Europe Unlimited PTO

Enhance system monitoring with tools like Prometheus, Grafana, and ELK Stack, ensuring visibility and alignment with business objectives.
Transition manual processes to automated solutions using IaC tools (e.g., Terraform, Ansible) to streamline deployments and improve operational efficiency.
Improve pipeline architecture for fast, reliable releases, ensuring scalability and resilience to handle high volumes of changes.

Upsun (formerly Platform.sh) is a cloud application platform designed for hybrid teams, enabling developers, DevOps engineers, and platform teams to build, ship, and scale confidently without backend infrastructure hassles. Upsunners are a remote, global workforce committed to open source and an open, welcoming environment, valuing curiosity, knowledge, and innovative ideas.

View details Similar jobs

Senior Site Reliability Engineer

Fixify 26 days ago

Europe

Design and maintain scalable, fault-tolerant infrastructure that supports our SaaS platform and keeps pace with business growth.
Define, document, and maintain SLIs, SLOs, and SLAs in partnership with product engineering, translating business commitments into technical guardrails.
Lead incident response with steady judgment, facilitate blameless postmortems, and drive remediation efforts that prevent recurrence.

Fixify is on a mission to reimagine IT teams support companies. They need a Senior Site Reliability Engineer who finds joy in building systems that fade into the background, empowering product engineers to ship with confidence and their customers to work without interruption.

View details Similar jobs

Sr. Site Reliability Engineer, Security

CentralReach 17 days ago

$160,000–$180,000/yr

US

Responsible for availability, latency, performance, efficiency, monitoring/observability, emergency response, capacity planning.
Analyze, troubleshoot and resolve operational challenges contributing to defined SLO's.
Manage site stability, performance, reliability, and maintain uptime for production environments.

CentralReach provides autism and IDD care software for Applied Behavior Analysis (ABA), multidisciplinary therapy, and special education. They are trusted by more than 200,000 users and is backed by Roper Technologies, Inc. (Nasdaq: ROP). Their culture is centered around impact, inclusion, and flexibility.

View details Similar jobs

Site Reliability Engineer

OnePay 27 days ago

US Unlimited PTO

Design, build, and maintain scalable infrastructure and tooling that improves reliability, performance, and availability across OnePay’s platform
Contribute to the evolution of our observability stack, platform libraries, cloud architecture, and CI/CD pipelines
Develop automation and monitoring systems to detect, prevent, and remediate incidents before they impact customers

OnePay is a consumer fintech company trusted by millions of Americans to make money better, providing an all-in-one financial services platform. Backed by Walmart and Ribbit Capital, OnePay provides banking, savings, credit cards, lending, investing, and crypto services and embedded financial services to frontline workers.

View details Similar jobs

Staff Software Engineer

Rula 12 days ago

US

Collaborate with application engineering teams on platform infrastructure.
Enhance observability and spearhead the adoption of SRE best practices.
Build and maintain reliable CI/CD pipelines, tooling, and infrastructure.

Rula strives to provide quality, evidence-based, compassionate mental healthcare and aims to create a world where mental health is no longer stigmatized. They are a remote-first company operating in most U.S. states, and are dedicated to having a culture of inclusion that supports their employees.

View details Similar jobs

Senior Site Reliability Engineer- Remote

ClickHouse 18 days ago

$141,000–$230,000/yr

US

Collaborate with engineering teams to design and implement scalable, secure systems.
Establish and manage service level objectives (SLOs) and service level agreements (SLAs).
Enhance incident response processes and post-mortem analysis for outages.

ClickHouse, recognized on the 2025 Forbes Cloud 100 list, is one of the most innovative and fast-growing private cloud companies. With more than 3,000 customers and ARR that has grown over 250 percent year over year, ClickHouse leads the market in real-time analytics, data warehousing, observability, and AI workloads.

View details Similar jobs

Senior Technical Product Manager

Filevine 10 days ago

$125,000–$175,000/yr

US

Own the strategy, execution, and continuous improvement of Filevine's site reliability and platform resilience.
Directly manage the prioritization for the teams responsible for keeping Filevine fast, stable, and available.
Drive measurable improvements in uptime, incident prevention, and release confidence across the platform.

Filevine is a Legal AI company delivering Legal Operating Intelligence for the future of legal work. They bring together data, documents, workflows, and teams into one unified platform and are ranked as one of the most innovative and fastest-growing technology companies in the country.

View details Similar jobs

Site Reliability Engineer

Ooma 14 days ago

$110,000–$175,000/yr

US

Become a subject matter expert in applications supporting Ooma customers.
Collaborate with Development, QA and other SREs to evaluate, deploy, and debug applications.
Improve observability by implementing, refining, and adjusting application monitoring and thresholds.

Ooma empowers people to connect in smarter ways by creating powerful communication experiences through their cloud-based platform. They help small business owners stay connected, provide customized unified communications solutions, and offer smart home security solutions.

View details Similar jobs

DevOps Engineer

Newton 3 days ago

Canada

Improve and maintain CI/CD, deployment workflows, and environment management across backend, web, and internal services.
Build, maintain and scale infrastructure across AWS and container based services.
Improve monitoring, alerting, logging, dashboards, tracing, and runbooks.

Newton aims to change how Canadians trade crypto and make financial freedom achievable for everyone by providing tools and knowledge. They foster a dynamic and collaborative remote team across Canada that values continuous improvement and creativity.

View details Similar jobs

Reliability Engineering VP

Jobgether 24 days ago

US

Define and execute the reliability engineering roadmap.
Establish SLO/SLI/error budget frameworks for system stability.
Drive continuous improvement through DORA metrics and analysis.

Jobgether leverages AI for HR solutions. They focus on connecting talent with opportunities, using AI-driven matching to ensure fair and objective application reviews.

View details Similar jobs

Site Reliability Engineer (f/m/n)

InPost Group 11 days ago

Europe

Write code, automate everything, design for reliability, and deeply understand the systems.
Build or extend Terraform modules and contribute to Platform Engineering around Observability.
Collaborate with developers to shape feature design so that reliability is built in, not added later.

InPost Group is an innovative European out of home deliveries company, revolutionizing the way parcels are delivered to customers. With over 10,000 employees worldwide, InPost Group is one of the largest out of home delivery providers in Europe, committed to providing sustainable and efficient delivery solutions.

View details Similar jobs

Sr Site Reliability Engineer

Pismo 25 days ago

South America

Own the end‑to‑end lifecycle of core platform components, including cloud infrastructure primitives and Kubernetes clusters.
Design platform components to be resilient by default, applying SRE principles like fault isolation and capacity planning.
Drive Infrastructure‑as‑Code and GitOps‑first practices to ensure platform components are reproducible and auditable.

Pismo, founded in 2016, provides a comprehensive processing platform for banking, card issuing, and financial market infrastructure, helping customers innovate in banking and payments. With over 500 employees across 10+ countries, Pismo joined Visa in 2024, leveraging Visa’s solutions to advance financial technology.

View details Similar jobs

Senior Site Reliability Engineer

Pismo 24 days ago

Global

Own the end-to-end lifecycle (design, provisioning, upgrades, and decommissioning) of core platform components.
Lead the design and implementation of infrastructure bootstrap orchestration, including: Automated cluster and environment provisioning.
Apply and promote SRE practices across the platform, including: Clear ownership and runbooks for platform components.

Pismo provides a comprehensive processing platform for banking, card issuing and financial market infrastructure and helps customers innovate and build the next generation of banking and payment solutions. Pismo’s 500+ employees are located in more than 10 countries around the world.

View details Similar jobs