Similar Jobs
See allPrincipal Site Reliability Engineer (AI-first SRE)
Groupon
South America
GCP
Kubernetes
Terraform
Senior DevOps Engineer/SRE
Cision
India
Kubernetes
Ansible
Terraform
Site Reliability Engineer
66degrees
US
Linux
Windows
K8s
Engineering Manager, Infrastructure Platforms
GitLab
US
Kubernetes
Ruby
Go
Intermediate Site Reliability Engineer, Tenant Scale
GitLab
Americas
Kubernetes
GCP
AWS
About The Role:
- You’ll manage and grow a team of SREs while remaining deeply engaged with production systems.
- You’ll use a blend of Site Reliability Engineering and Platform Engineering to reduce operational toil, improve safety, and enable product teams to ship reliably at scale.
- You’ll work with cutting-edge technologies, design resilient systems, and build automation and paved paths so customers can rely on AuthZed for their most critical workloads.
What You'll Own:
- Lead a global team of Site Reliability Engineers delivering infrastructure automation, observability, and operational scalability across multi-cloud and multi-region kubernetes based architectures.
- Drive automation and platform engineering: safer deploys, progressive delivery, guardrails, and paved paths that reduce toil.
- Collaborate with product and engineering to ship features like self-service workflows and infra-as-code expectations with reliability baked in.
What You Bring:
- Strong grasp of SRE fundamentals: SLOs/SLIs, error budgets, incident management, capacity planning, and operational excellence.
- Extensive experience with AWS, GCP and Azure managed services.
- Proven ability to translate operational pain points into engineering deliverables.
AuthZed
AuthZed creates and maintains SpiceDB and the authorization infrastructure. They are a Series A company with a fully remote team across the US, Canada, and Europe and a hardworking, close-knit group with a software-driven culture that values integrity, collaboration, and open-mindedness.