This role involves driving self-service and automation at Loop by designing workflows that enable product teams to provision infrastructure, integrate monitoring, and release safely. The manager will lead the strategy and execution for scaling to multi-region, implementing active-active/active-standby architectures and global traffic management. Championing the evolution of deployment patterns, including blue/green, canary, and feature-flag releases, is crucial to minimize risk. You will implement Site Reliability Objectives (SLOs), error budgets, chaos testing, and auto-remediation playbooks to raise the reliability bar, and own the infrastructure on-call rotation culture. Furthermore, this role includes mentoring and developing a diverse team of DevOps engineers, DBAs, and MLOps Engineers, partnering with Product Engineering Teams and stakeholders to align roadmaps and unlock velocity, andcontributing hands-on through writing Terraform modules, optimizing Helm charts, and supporting the team with reactive work.