Remote Devops Jobs · Automation

Job listings

The Senior Site Reliability Engineer role within the Cloud Compute team is pivotal in ensuring the robust and scalable foundation of Affirm's platform. This role manages all of Affirm's Kubernetes clusters. The mission is to provide a highly reliable and available cloud environment that empowers all of Affirm's engineering teams to build and deploy innovative solutions seamlessly. The engineer will drive initiatives to enhance observability capabilities, fortify the reliability of critical infrastructure, and automate key operational workflows.

Manage cloud infrastructure including configuration, backups, scaling, costing. Configure and manage orchestration and containerization tools such as Kubernetes, Helm, Terraform and GitOps tools. Roll out fixes and upgrades to the software. Develop and maintain automated build, deployment, and testing processes using Jenkins and GitOps tools. Ensure high availability and disaster recovery capabilities for critical systems.