Jobs Similar to Site Reliability Engineer | TangerineFeed

Site Reliability Engineer

Ditto 9 hours ago

Unlimited PTO

Develop and maintain observability solutions using platforms like Datadog, Prometheus and Grafana
Take a leading role in incident management, including coordinating response efforts, troubleshooting issues, and identifying follow-up actions
Partner with product engineering teams to architect reliable systems, recover from incidents, and learn from mistakes

Datadog Prometheus Grafana Terraform Helm

20 jobs similar to Site Reliability Engineer

Jobs ranked by similarity.

Senior Site Reliability Engineer

Kraken 4 days ago

Americas

Manage and support infrastructure for Growth teams, including Nomad, Hashistack, databases, and any other underlying systems
Maintain and troubleshoot GitLab CI pipelines, ensuring reliable and fast build, test, and deployment cycles
Provide operational support across Onboarding, Acquire, and Engage teams, helping debug issues in staging and production environments

Kraken is a mission-focused company rooted in crypto values, aiming to accelerate the global adoption of crypto, so that everyone can achieve financial freedom and inclusion. As a fully remote company, they have Krakenites in 70+ countries who speak over 50 languages.

View details Similar jobs

Staff Site Reliability Engineer

SmarterDx 28 days ago

$230,000–$250,000/yr

US Unlimited PTO 12w paternity

Define and evolve reliability standards for the SmarterDx platform.
Enhance observability systems (metrics, logs, traces, alerting) to provide actionable insights and reduce mean time to detect (MTTD) and resolve (MTTR).
Reduce operational toil through automation, self-healing systems, and improved deployment and rollback mechanisms.

SmarterDx, a Smarter Technologies company, builds clinical AI that is transforming how hospitals translate care into payment. Founded by physicians in 2020, their platform connects clinical context with revenue intelligence, helping health systems recover millions in missed revenue, improve quality scores, and appeal every denial.

View details Similar jobs

Sr. Site Reliability Engineer, Security

CentralReach 29 days ago

$160,000–$180,000/yr

US

Responsible for availability, latency, performance, efficiency, monitoring/observability, emergency response, capacity planning.
Analyze, troubleshoot and resolve operational challenges contributing to defined SLO's.
Manage site stability, performance, reliability, and maintain uptime for production environments.

CentralReach provides autism and IDD care software for Applied Behavior Analysis (ABA), multidisciplinary therapy, and special education. They are trusted by more than 200,000 users and is backed by Roper Technologies, Inc. (Nasdaq: ROP). Their culture is centered around impact, inclusion, and flexibility.

View details Similar jobs

Senior Site Reliability Engineer

Akuity 23 days ago

US Canada

Own SLI/SLO/SLA definitions for the Akuity SaaS platform and drive continuous improvement.
Participate in an on-call rotation and act as incident commander for high-severity production events.
Partner with engineering teams to build reliability into new features before they ship to production

Akuity helps enterprises ship software faster and more reliably with modern GitOps best practices. The Akuity Platform enables teams to manage the development and deployment across hundreds – if not thousands – of Kubernetes clusters from a single control plane.

View details Similar jobs

Senior Site Reliability Engineer- Remote

ClickHouse 29 days ago

$141,000–$230,000/yr

US

Collaborate with engineering teams to design and implement scalable, secure systems.
Establish and manage service level objectives (SLOs) and service level agreements (SLAs).
Enhance incident response processes and post-mortem analysis for outages.

ClickHouse, recognized on the 2025 Forbes Cloud 100 list, is one of the most innovative and fast-growing private cloud companies. With more than 3,000 customers and ARR that has grown over 250 percent year over year, ClickHouse leads the market in real-time analytics, data warehousing, observability, and AI workloads.

View details Similar jobs

Staff Software Engineer

Rula 24 days ago

US

Collaborate with application engineering teams on platform infrastructure.
Enhance observability and spearhead the adoption of SRE best practices.
Build and maintain reliable CI/CD pipelines, tooling, and infrastructure.

Rula strives to provide quality, evidence-based, compassionate mental healthcare and aims to create a world where mental health is no longer stigmatized. They are a remote-first company operating in most U.S. states, and are dedicated to having a culture of inclusion that supports their employees.

View details Similar jobs

Site Reliability Engineer

Newton 15 days ago

Canada

Implementing the improvements to the reliability, fault tolerance, scalability, and performance of our infrastructure
Managing incidents using your technical know-how to involve the appropriate teams and automate away manual practices
Improving observability across our systems (metrics, logs, tracing) to reduce time to detection and resolution

Newton is changing how Canadians trade crypto with the goal to make financial freedom achievable for everyone by giving their customers the tools and knowledge needed to navigate the crypto world. They are a remote team spread across Canada that values pushing boundaries and getting things done.

View details Similar jobs

Platform Engineer - Observability (Mid Level)

Kraken 28 days ago

Australia

Support and implement monitoring and alerting strategy across Kraken’s customer business.
Define and uphold observability best practices across multiple products and platforms.
Partner with product teams to implement observability tooling and improve reliability across the organisation.

Kraken is a technology company focused on creating a smart, sustainable energy system. Their operating system for energy is transforming the industry around the world in a way that benefits everyone. They are a Great Place to Work with genuinely decent, honest, and empathetic people.

View details Similar jobs

Site Reliability Engineer

Planet 22 days ago

US Canada 16w maternity

Build and deploy computing services and infrastructure in customer environments.
Clarify and surface requirements from ambiguous use cases defined by cross-functional stakeholders.
Improve reliability and scalability by resolving edge cases, studying failure modes, and writing tests.

Planet designs, builds, and operates the largest constellation of imaging satellites in history. They deliver an unprecedented dataset of empirical information via a revolutionary cloud-based platform to authoritative figures in commercial, environmental, and humanitarian sectors. Planet has a people-centric approach toward culture and community and it strives to iterate in a way that puts their team members first and prepares their company for growth.

View details Similar jobs

Site Reliability Engineer (f/m/n)

InPost Group 22 days ago

Europe

Write code, automate everything, design for reliability, and deeply understand the systems.
Build or extend Terraform modules and contribute to Platform Engineering around Observability.
Collaborate with developers to shape feature design so that reliability is built in, not added later.

InPost Group is an innovative European out of home deliveries company, revolutionizing the way parcels are delivered to customers. With over 10,000 employees worldwide, InPost Group is one of the largest out of home delivery providers in Europe, committed to providing sustainable and efficient delivery solutions.

View details Similar jobs

Senior Site Reliability Engineer

Loadsmart 30 days ago

$172,614–$172,614/yr

US

Design infrastructure, networking, and software platform architecture.
Build and maintain automation of Continuous Integration and Continuous Deployment pipelines.
Troubleshoot infrastructure, internal applications, networking, and security issues.

Loadsmart is a technology company focused on the logistics and supply chain industry. They leverage data and technology to automate and optimize freight transportation, connecting shippers and carriers to streamline the shipping process. They are a mid-sized company passionate about transforming the future of freight.

View details Similar jobs

Senior Site Reliability Engineer

Calendly 20 hours ago

$198,025–$287,952/yr

Building tools and applications to extends Calendly’s infrastructure platform
Evaluating and deploying cloud native open source tools
Exercising expertise in cloud infrastructure concepts and patterns

Calendly's product powers connections for millions through impactful innovation. They are in the midst of exciting growth and desire people that want to learn, grow, and do their best work.

View details Similar jobs

Site Reliability Engineer

Granicus 4 days ago

Global

Provide production support on a shift according to the team on-call roster.
Work on the customer and internal engineering/implementation team raised tickets while not on-call for production support.
Continuously monitor the health and performance of our services, systems, and infrastructure.

Granicus is driven by the excitement of building, implementing, and maintaining technology that is transforming the Govtech industry by bringing governments and its constituents together. They have served 5,500 federal, state, and local government agencies and more than 300 million citizen subscribers.

View details Similar jobs

Site Reliability Engineer

Arista Networks 16 days ago

Europe

Design, build, and deploy production systems with a focus on scalability and security.
Develop and maintain comprehensive automation solutions to streamline operational efficiency.
Proactively monitor systems, establish alerting strategies, and implement automated incident response.

Arista Networks is a data-driven, client-to-cloud networking company for large data center, campus, and routing environments. They have over $8 billion in revenue and value diversity of thought and perspectives, fostering an inclusive environment for creativity and innovation.

View details Similar jobs

Senior Technical Product Manager

Filevine 22 days ago

$125,000–$175,000/yr

US

Own the strategy, execution, and continuous improvement of Filevine's site reliability and platform resilience.
Directly manage the prioritization for the teams responsible for keeping Filevine fast, stable, and available.
Drive measurable improvements in uptime, incident prevention, and release confidence across the platform.

Filevine is a Legal AI company delivering Legal Operating Intelligence for the future of legal work. They bring together data, documents, workflows, and teams into one unified platform and are ranked as one of the most innovative and fastest-growing technology companies in the country.

View details Similar jobs

Site Reliability Engineer

Weedmaps 8 days ago

$133,110–$148,042/yr

US

Collaborate with stakeholders to drive best practices for monitoring, CI/CD pipelines
Troubleshoot deployment issues in our CI pipeline
Identify areas for automation and embrace the codification of all things

Weedmaps is a global leader in the cannabis industry. They are dedicated to transparency, education, and community, serving cannabis to consumers and businesses in the U.S. and worldwide.

View details Similar jobs

Senior Site Reliability Engineer

SSV Labs 6 hours ago

Global

Design and implement infrastructure and tools that empower our product teams to rapidly and securely iterate, emphasizing reliability and automation.
Influence the strategic direction of our infrastructure and operational practices, ensuring that we are well-positioned to scale and support our growing organization.
Take a proactive role in the resolution of production issues, ensuring that we are well-prepared to handle incidents and that we learn from them in a blameless manner.

SSV Labs is the core team behind the SSV Network - pioneering decentralized infrastructure for Ethereum staking. They are building tools, protocols, and standards to make staking more secure, scalable, and trustless.

View details Similar jobs

Site Reliability Engineer

Ooma 26 days ago

$110,000–$175,000/yr

US

Become a subject matter expert in applications supporting Ooma customers.
Collaborate with Development, QA and other SREs to evaluate, deploy, and debug applications.
Improve observability by implementing, refining, and adjusting application monitoring and thresholds.

Ooma empowers people to connect in smarter ways by creating powerful communication experiences through their cloud-based platform. They help small business owners stay connected, provide customized unified communications solutions, and offer smart home security solutions.

View details Similar jobs

Cloud Operations Engineer

Lumin Digital 29 days ago

$110,000–$125,000/yr

US

Monitor cloud infrastructure and application health using observability tools; respond to alerts.
Perform Tier 1 incident triage, document findings, and escalate appropriately to Development or SRE teams.
Monitor and support CI/CD pipelines to ensure successful builds and deployments.

Lumin Digital empowers credit unions and banks by creating cutting-edge digital experiences. They are a trailblazer in digital banking solutions with a culture that fosters trust, respect, and boldness, encouraging team members to explore and experiment with new ideas.

View details Similar jobs

Senior Platform Engineer

Lillio 16 days ago

$103,174–$117,720/yr

Canada

Lead efforts to scale and improve our infrastructure.
Develop and support internal team tooling.
Troubleshoot, debug and resolve issues as part of a shared on-call rotation.

Lillio, formerly HiMama, empowers early childhood educators through innovative tools. They are a Series B, private-equity backed company recognized as an industry leader and selected in 2025 by Time Magazine as one of the world's top EdTech companies.

View details Similar jobs