Site Reliability Engineers enhance the reliability, scalability, and performance of production systems across the organization by bridging the gap between development and operations teams. They collaborate with stakeholders to design and implement robust infrastructure solutions, applying software engineering principles to create automated solutions to ensure system availability, latency, performance, and capacity.
Job listings
As a Senior DevEx Engineer, you'll be a key member of the Developer Experience team building and maintaining internal tools that enable our product engineers to be efficient and self-reliant. You will maintain and improve platform tooling systems and guide less-experienced team members. The role involves delivering well-tested software and infrastructure, collaborating with stakeholders, and participating in on-call rotation and incident reviews.
As a Site Reliability Engineer, you will play a key role in ensuring systems remain reliable, available, and performant for both customers and internal teams. Your expertise will directly impact users' experience and the success of the business. You'll collaborate closely with product development and platform engineering teams to build scalable systems and create robust automation that supports the company's goals.
Kunai is partnering with a major player in global finance to modernize their systems and architecture; the Senior SRE will play a critical role in supporting the platformβs operational stability, observability, automation, and performance. Youβll work closely with the SRE architect and other engineers to implement tooling, improve system reliability, and ensure a seamless developer and platform experience.
As a Senior Infrastructure Engineer at CompScience, you'll be the architect and guardian of our critical AWS infrastructure. You'll design and implement highly reliable and scalable systems, automate deployments with IaC, and ensure optimal performance and cost efficiency. Your expertise in security and disaster recovery will be crucial in maintaining our compliance and business continuity.
Looking for a Lead DevSecOps Engineer to manage and mentor a team of DevSecOps engineers while driving the development and implementation of secure, scalable, and automated solutions for the DoDβs Air Force integration platform powered by Apigee and Google Cloud (GCP). Individuals must be willing to work East Coast working hours with some flexibility afforded.
Weβre building our talent pool of outstanding DevOps Engineers for upcoming client projects. This is not an active position tied to a current project, but a proactive opportunity to become part of our expert network. Solve complex problems and be considered for impactful future roles. Design, build, and maintain containerized environments, develop and manage CI/CD pipelines, and automate configuration management.
We are seeking an experienced and motivated Engineering Manager to lead our platform engineering team responsible for designing, building, and maintaining our cloud infrastructure. This role involves overseeing the development of solutions for BYOC deployments, building multi-tenant SaaS platforms, and developing custom Kubernetes operators to automate and enhance our cloud capabilities. The ideal candidate will have a strong background in cloud technologies, Kubernetes.
You will be part of a team designing, automating, and deploying services on behalf of our customers to the cloud in a way that allows these services to automatically heal themselves if things go south. Every week is different and the problems you will be challenged to solve are constantly evolving. Rackspace build solutions using infrastructure-as-code so our customers can refine and reuse these processes again and again.
As the Senior Site Reliability Engineer you'll ensure services and systems are reliable by focusing on scalability, latency, performance, availability, efficiency, and observability. The company wants to enhance system reliability while decoupling system size from operational toil and complexity via training and mentoring. You'll use relevant development languages and knowledge of systems, services, and tools appropriate for the business area to build software applications.