As a member of the Compute Reliability and Efficiency team, you’ll solve some of the world’s largest at-scale infrastructure problems, using software we build ourselves and services we integrate from the cloud-native ecosystem. Your work will directly impact hundreds of millions of users around the world. Join us and help build the future of Reddit!
In your day-to-day, you can expect to:
Work collaboratively with a team of software engineers to create and maintain the foundational platform for running Reddit’s infrastructure.
Execute performance and reliability analysis on our Linux-based Kubernetes fleet.
Design, write (Golang), and deliver software to improve the availability, scalability, latency, and efficiency of Reddit’s Compute Platform.
Contribute feedback to the technical and strategic direction of the compute platform.
Automate critical aspects of the development process such as service creation and management, as well as critical infrastructure operations.
Share on-call responsibilities with the Compute team.