In this role, youβll build and lead a talented team responsible for monitoring, scaling, and improving our infrastructure, while evolving our incident response capabilities. Your leadership will ensure we meet strict SLAs, deliver a seamless user experience, and continually improve the way we operate at scale. Your work will be central to empowering customers, protecting data, and enabling our teams to innovate without compromise.
Job listings
Automate, manage, and maintain ClickHouse as PostHog grows towards capturing trillions of events per year and having one of the worldβs largest clusters. This includes ClickHouse operations and scaling infrastructure, as well as node and instance-level performance optimization. Ensure that the right hardware is deployed at the right time for each workload on ClickHouse.
Be an integral member of the team implementing our platform in a DoD IL4 cloud environment. Maintain infrastructure from conception to completion within AWS. Build upon the operational availability, security, scalability, efficiency, monitoring, instrumentation, and overall service reliability of Everbridge's solutions.
Looking for a Mid-Level SecDevOps Engineer to help secure and streamline delivery pipelines for cloud-native, containerized applications. You'll work across engineering and security teams to embed best practices into GitLab CI/CD workflows, harden AWS infrastructure, and automate Kubernetes deployments - all with security built in from day one.
As the primary owner for the development and maintenance of the cloud services that the Open Home Foundation projects rely on, you will design and implement scalable, high-performance, and reliable Node.js applications. You will work in close collaboration with the Home Assistant and Ecosystem teams to ensure the performance, quality, and responsiveness of the cloud services that the OHF projects use.
Play a key role in scaling and supporting H1's cloud infrastructure. Work closely with engineering and data teams to improve the reliability, visibility, and efficiency of our systems and deployment pipelines. This is a hands-on role focused on automation, enablement, and operational excellence in a fast-paced AWS-based environment.
This opportunity involves designing and driving robust, automated solutions that optimize CI/CD pipelines and cloud infrastructure utilizing tools like GitLab, AWS CloudFormation, SAM templates, CDK, and Terraform. This role helps provide teams with tools that enable consistent, high-quality software delivery through reliable and secure infrastructure management. You will lead the execution of infrastructure strategies.
The Senior DevOps Engineer will take complete ownership of architecting, deploying, and managing our sophisticated AWS -based infrastructure, including services like EC2, EKS, S3, IAM, Fargate, Lambda, and SQS. Independently drive and own the entire infrastructure as code strategy, expertly using Terraform and Terragrunt to build and maintain a fully automated environment.
As a Senior DevOps Engineer, you will continuously improve our development operations and support the reliability and availability of all our applications and services deployed to the cloud. Partner with various engineering teams to own and manage availability, latency, performance, reliability and scalability of all services to maintain SLAs that our customers expect from us. Provide strong technical leadership and people management to the team.
In this role, you will address critical challenges such as efficiently operating and managing over 200 Kafka clusters, supporting and monitoring over 1 million Kafka topics, and handling over 1PB of data daily, ensuring minimal latency and high reliability.