As a Site Reliability Engineer, you will play a key role in ensuring systems remain reliable, available, and performant for both customers and internal teams. Your expertise will directly impact users' experience and the success of the business. You'll collaborate closely with product development and platform engineering teams to build scalable systems and create robust automation that supports the company's goals.
Job listings
Kunai is partnering with a major player in global finance to modernize their systems and architecture; the Senior SRE will play a critical role in supporting the platformβs operational stability, observability, automation, and performance. Youβll work closely with the SRE architect and other engineers to implement tooling, improve system reliability, and ensure a seamless developer and platform experience.
As a Senior Infrastructure Engineer at CompScience, you'll be the architect and guardian of our critical AWS infrastructure. You'll design and implement highly reliable and scalable systems, automate deployments with IaC, and ensure optimal performance and cost efficiency. Your expertise in security and disaster recovery will be crucial in maintaining our compliance and business continuity.
Architect and scale the systems that support billions in venture assets and redefine how founders and investors connect. Implement modern CI/CD pipelines, container orchestration, and automation that enable rapid iteration across all engineering teams. Design resilient, secure, and observable systems using Terraform, Pulumi, ArgoCD, and modern SRE practices.
Looking for a Lead DevSecOps Engineer to manage and mentor a team of DevSecOps engineers while driving the development and implementation of secure, scalable, and automated solutions for the DoDβs Air Force integration platform powered by Apigee and Google Cloud (GCP). Individuals must be willing to work East Coast working hours with some flexibility afforded.
Collabora is looking for a technically capable, enthusiastic and passionate Linux Kernel Software Developer to join its engineering team. As a member of the Kernel domain team, you will participate in the development, integration, validation and deployment of Linux board support packages and kernel device drivers, merging code upstream, working with the mainline Linux community, configuring kernels, troubleshooting functional and performance problems for different customersβ projects and products.
As the Director of Engineering for Reliability, Infrastructure, and Quality, you will be responsible for defining, evangelizing and executing on the technical and business strategy for (1) all things quality across the CIQ product and infrastructure (2) developer productivity (3) cost and efficiency and (4) cloud infrastructure. The ideal candidate will have a strong operational background and is passionate about enabling internal developers.
We are seeking a DevOps & Site Reliability Engineer to join a growing AI-focused SaaS startup. In this role, youβll be responsible for maintaining, optimizing, and scaling the infrastructure that supports our platform, ensuring high availability, performance, and reliability. Youβll work closely with development and product teams to improve deployment processes, monitor systems, and respond to incidents proactively.
Participate in the development and operation of DevOps-based development and operation processes β from CI/CD to application monitoring. Automate build, deployment and infrastructure processes in Azure environment. Supervise and configure Azure resources through the Azure Portal. Work with Azure DevOps, Azure Pipelines, Terraform and Ansible. You will work with the team according to agile principles, supporting the operation of continuous development and integration.
As a Senior Site Reliability Engineer, you will help enhance the stability, performance, and observability of platforms, focusing on maintaining and optimizing the current infrastructure and ensuring strong monitoring coverage. You will also support compliance and security practices and collaborate closely with development teams to supervise the platforms, optimize system behavior, and drive improvements in security and documentation practices.