Jobs Similar to Reliability Engineering VP

Senior Site Reliability Engineer

Transcend 26 days ago

$150,000–$167,000/yr

US

Lead reliability-focused design and readiness reviews.
Build, operate, and continuously improve our observability stack.
Own and evolve incident management practices.

Transcend is building the privacy platform that easily embeds privacy into your entire tech stack. They are growing quickly, backed by top-tier investors and are proud to serve some of the world's most iconic brands.

Senior DevOps & Platform Engineer

About Us 10 days ago

Maximize the velocity of our product engineering team.
Ensure platform scalability, reliability, and security.
Champion best practices and shape the engineering culture.

They are building a robust, scalable trading platform to serve high-traffic, latency-sensitive applications. They leverage state-of-the-art technologies to support real-time trading while providing unparalleled reliability and performance.

Senior SRE DevOps Engineer

Jobgether 7 days ago

Canada

Designing and implementing SLI/SLO frameworks with error budgets to guide reliability and performance decisions.
Building and maintaining AWS-based production infrastructure using Infrastructure as Code (Terraform, CloudFormation), including ECS, EKS/Kubernetes, and microservices orchestration.
Developing internal tools, automation frameworks, and reliability services in TypeScript, Python, or similar languages to enhance operational efficiency.

Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. They identify the top-fitting candidates, and this shortlist is then shared directly with the hiring company.

Senior Site Reliability Engineer

Fixify 4 days ago

Europe

Design and maintain scalable, fault-tolerant infrastructure that supports our SaaS platform and keeps pace with business growth.
Define, document, and maintain SLIs, SLOs, and SLAs in partnership with product engineering, translating business commitments into technical guardrails.
Lead incident response with steady judgment, facilitate blameless postmortems, and drive remediation efforts that prevent recurrence.

Fixify is on a mission to reimagine how IT teams support companies. They need a Senior Site Reliability Engineer who finds joy in building systems that fade into the background, empowering product engineers to ship with confidence and their customers to work without interruption.

Site Reliability Engineer II

InvestorFlow 6 days ago

Global

Design and implement comprehensive monitoring strategies.
Take ownership of production incident response, lead incident handling, and drive remediation.
Continuously improve operational processes, reliability practices, and team readiness.

InvestorFlow delivers industry specialized CRM and digital portals to help alternative asset firms find opportunities, create and manage relationships, and turn relationship insights into action. They serve over 175 clients, including 25 of the top 50 alternative asset managers, managing more than $6 trillion in assets.

Senior Site Reliability Engineer

Pismo 2 days ago

Global

Own the end-to-end lifecycle (design, provisioning, upgrades, and decommissioning) of core platform components.
Lead the design and implementation of infrastructure bootstrap orchestration, including: Automated cluster and environment provisioning.
Apply and promote SRE practices across the platform, including: Clear ownership and runbooks for platform components.

Pismo provides a comprehensive processing platform for banking, card issuing and financial market infrastructure and helps customers innovate and build the next generation of banking and payment solutions. Pismo’s 500+ employees are located in more than 10 countries around the world.

Sr Site Reliability Engineer

Pismo 3 days ago

South America

Own the end‑to‑end lifecycle of core platform components, including cloud infrastructure primitives and Kubernetes clusters.
Design platform components to be resilient by default, applying SRE principles like fault isolation and capacity planning.
Drive Infrastructure‑as‑Code and GitOps‑first practices to ensure platform components are reproducible and auditable.

Pismo, founded in 2016, provides a comprehensive processing platform for banking, card issuing, and financial market infrastructure, helping customers innovate in banking and payments. With over 500 employees across 10+ countries, Pismo joined Visa in 2024, leveraging Visa’s solutions to advance financial technology.

Senior Infrastructure Engineer

Flex 2 days ago

$146,200–$212,000/yr

US Unlimited PTO

Collaborate with service engineering teams to design, implement, and maintain scalable and resilient infrastructure solutions.
Implement SRE principles to improve system reliability and reduce downtime.
Improve developer workflows by creating self-service tools, optimizing CI/CD pipelines, and enhancing deployment processes.

Flex is a growth-stage FinTech company creating the best rent payment experience. They empower renters with flexibility over their most significant recurring expense and are growing quickly with a focus on building an inclusive culture.

Infrastructure Engineer IV

HackerOne 23 days ago

$165,000–$200,000/yr

US Unlimited PTO

Contribute to building and operating the infrastructure that supports the HackerOne platform.
Improve the reliability, security, and scalability of our systems.
Design and operate highly available cloud systems and apply best practices for reliability, observability, and security.

HackerOne is a global leader in Continuous Threat Exposure Management (CTEM). The HackerOne Platform unites agentic AI solutions with the ingenuity of the world’s largest community of security researchers to continuously discover, validate, prioritize, and remediate exposures across code, cloud, and AI systems. They are trusted by the world’s top organizations.

Infrastructure Engineer

Attune 1 day ago

$120,000–$140,000/yr

US Unlimited PTO

Architect and manage scalable cloud infrastructure within AWS.
Implement and maintain infrastructure using Terraform.
Develop automation scripts to improve operational efficiency.

Attune empowers insurance agents with its technology solutions. They foster a remote-first culture and value employee development.

Site Reliability Engineering Manager

Customer.io 29 days ago

$175,000–$195,000/yr

Americas Unlimited PTO 16w maternity

Lead effective squad rituals and ensure production readiness.
Partner with engineers to ensure solutions are scalable, architecturally sound, flexible, and secure.
Provide timely, specific coaching and development opportunities for your direct reports.

Customer.io's platform allows over 8,000 companies to send messages using real-time behavioral data. Their team uses Go, React, Ember, and AI to ship fast and scale with confidence, and they value ownership, leadership, and healthy skepticism.

Sr. DevOps Engineer

Jobgether 8 days ago

Europe

Implement SLI/SLO frameworks with error budgets to drive reliability decisions
Design release strategies including blue/green deployments and version tracking
Lead incident response and develop automated runbooks to reduce MTTR

Jobgether is a company that helps connect individuals with jobs through an AI-powered matching process. They ensure applications are reviewed quickly, objectively, and fairly against roles' core requirements.

Senior Software Engineer, Core Services SRE

StackAdapt 26 days ago

US Canada

Maintain tooling, libraries, and infrastructure leveraged by core service teams
Develop and maintain infrastructure services that enable engineers to manage, deploy, and scale systems
Act as a technical leader, guiding core service teams to design robust and reliable software

StackAdapt is a technology company that empowers marketers to reach, engage, and convert audiences with precision. They are an AI-powered platform connecting brand and performance marketing, recognized for their diverse workplace and high-performing campaigns.

Site Reliability Engineering Manager II

Flywire 1 day ago

$160,000–$200,000/yr

US

Help drive reliability, automation and performance within our cloud-based infrastructure.
Become embedded within an Engineering team helping them navigate production excellence and advocate for best practices.
Debug production issues across services and levels of the stack as well as practice incident response and blameless postmortems.

Flywire is a global payments enablement and software company that was founded over a decade ago. They have over 1,200 global FlyMates, representing more than 40 nationalities, in 12 offices worldwide, and are looking for people to join the next stage of their journey as they continue to grow.

Staff Infrastructure Engineer

Fieldguide 8 days ago

Global Unlimited PTO

Lead infrastructure initiatives across the engineering organization.
Design technical quality bar and architectural standards.
Build platforms and AI-enabled systems for multiple teams.

Fieldguide is automating and streamlining the work of assurance and audit practitioners specifically within cybersecurity, privacy, and financial audit, building software for the people who enable trust between businesses. They are based in San Francisco, CA, but built as a remote-first company with an inclusive, driven, humble and supportive team.

Senior DevOps Engineer

CodeRoad 3 days ago

US

Design and maintain scalable cloud environments using tools like Terraform, CloudFormation, or Ansible.
Build and optimize automated deployment pipelines to ensure rapid and reliable software delivery.
Implement robust monitoring, logging, and alerting frameworks to ensure 24/7 system health.

CodeRoad offers end-to-end software development services, helping businesses scale with infrastructure solutions. They provide staff augmentation, dedicated IT teams, and software engineering to empower businesses in a digital landscape.

DevOps Manager

BlastPoint 12 days ago

$140,000–$170,000/yr

US 3w PTO

Ensure high availability, fault tolerance, and scalability of cloud services.
Optimize performance and cost efficiency across AWS environments.
Implement security best practices and SOC 2 compliance monitoring.

BlastPoint is a B2B data analytics startup located in Pittsburgh. They empower companies to engage with customers more effectively by discovering the humans in their data and understanding customer journeys; they are a tight-knit, forward-thinking team.

Sr Site Reliability Engineer

Dataiku 24 days ago

Europe Middle East Africa

Design, deploy, and maintain cloud infrastructure to support the Dataiku SaaS offering, mainly on AWS, Azure, and GCP.
Continuously improve the infrastructure, deployment, and configuration to deliver more reliable, resilient, scalable, and secure services.
Automate technical operations as much as possible.

Dataiku is The Universal AI Platform™, giving organizations control over their AI talent, processes, and technologies to unleash the creation of analytics, models, and agents. They connect many data science technologies and integrate the best of data and AI tech.

Director, Software Engineering (Site Reliability Engineering)

Affirm 17 days ago

$267,000–$360,000/yr

US

Set the vision and drive execution for Reliability Engineering at Affirm
Build software and program management structure to perform continual risk management across the entire Affirm system and Engineering organization
Hire and build a global team of SREs, system engineers, and full stack engineers

Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest. They appear to be a remote-first company, with competitive benefits anchored to their core value that "people come first."

Director, Software Engineering (Site Reliability Engineering)

Affirm 17 days ago

$174,411–$218,450/yr

Canada

Set the vision and drive execution for Reliability Engineering.
Build software and program management structure to perform continual risk management.
Hire and build a global team of SREs, system engineers, and full stack engineers.

Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest. They are a remote-first company that values learning, experimentation, and accountability.
