Ditto is at an inflection point: as it scales to meet the growing demands of enterprise customers, it needs experienced SRE leads to drive and mature its Site Reliability Engineering practice. As a Senior Engineering Manager of Site Reliability Engineering, you will lead a multi-layered team of SREs, including other SRE managers, to shape and scale reliability practices across the platform. You will drive strategy, execution, and people development across regions while embedding a culture that values high availability, resiliency, and operational excellence.
Lead and scale a globally distributed SRE organization, including managers and ICs, setting the long-term vision and execution plan for reliability at scale.
Develop engineering leaders and senior talent, coaching on both technical depth and leadership maturity to create a high-trust, high-performance organization.
Drive adoption of SRE best practices, and establish and evolve an incident management practice.
Lead the architecture and execution of observability systems that offer real-time visibility into system health and customer experience.
Partner with platform, infrastructure, and security teams to build scalable, self-service reliability tooling.
Guide teams to define, implement, and iterate on SLIs, SLOs, and SLAs that are meaningful to the end-user experience.
Establish best-in-class documentation and operational hygiene.
Lead strategic programs to transform engineering culture toward reliability and resilience of Ditto's mission-critical software.