The Opportunity:
- You'll be part of a newly formed squad within the Databases department, owning and operating shared, production-critical infrastructure.
- You will work closely with high-volume analytical and storage systems that power query-heavy and aggregation-heavy workloads.
- You'll be working directly with distributed systems behavior, Kubernetes scheduling dynamics, storage engines, and compression trade-offs.
What Makes You a Great Fit:
- You'll need experience with high-throughput streaming systems, analytical or storage backends, or large-scale database infrastructure.
- You will be expected to lead design discussions and review PRs with a focus on reducing operational risk and increasing system resilience.
- You will raise the bar for engineering practices across teams by mentoring engineers and sharing distributed systems knowledge.
Requirements:
- You'll need 8+ years of engineering experience in SRE, platform engineering, infrastructure engineering, or distributed systems.
- You will need strong Kubernetes experience in AWS, GCP, or Azure, and familiarity with infrastructure-as-code tooling.
- You should have a strong understanding of distributed systems failure modes in multi-cloud environments and proficiency in a systems-oriented language.