A versatile and experienced DevOps Engineer is needed to take ownership of build systems, pipelines, and development tooling. Add efficiency, robustness and security to release processes by standing up automated tools or processes as necessary. Integrate security and compliance workflows, including tools like Grype and SBOM generation for ATO/IATT processes.
Job listings
As a member of the Compute Reliability and Efficiency team, youβll be solving some of the worldβs largest at-scale infrastructure problems using software we create along with integrating services from the cloud native ecosystem. Your work will directly impact hundreds of millions of users around the world. You will work collaboratively with a team of software engineers to create and maintain the foundational platform for running Redditβs infrastructure. You will also design, write (Golang), and deliver software to improve the availability, scalability, latency, and efficiency of Redditβs Compute Platform.
Ensure the reliability, scalability, and performance of our systems. You will monitor infrastructure, troubleshoot issues, implement automation, and collaborate with development teams to maintain smooth production environments.
This Network Development role is for someone looking for a fulfilling challenge working with advanced equipment and managing a large, scalable, modern network. You'll provide hands-on technical design and engineering for network-related tools and systems, develop/maintain our network automation and monitoring platform, SDN solution and integration with the GPU network, and contribute to open source projects.
As an engineer, you will drive the mitigation and resolution of incidents, and improve toilsome availability-related processes. You will identify opportunities for improvement, communicate incident status clearly to customers, and embed directly with service teams. This role requires communicating internally with many engineers globally and responding to Slack messages.