You'll be part of a newly formed squad within the Databases department, owning and operating shared, production-critical infrastructure.
You will work closely with high-volume analytical and storage systems that power query-heavy and aggregation-heavy workloads.
You'll be working directly with distributed systems behavior, Kubernetes scheduling dynamics, storage engines, and compression trade-offs.

What Makes You a Great Fit:

You'll need experience with high-throughput streaming systems, analytical or storage backends, or large-scale database infrastructure.
You will be expected to lead design discussions and review PRs with a focus on reducing operational risk and increasing system resilience.
You will raise the bar for practices across teams by mentoring engineers and sharing distributed systems knowledge.

Requirements:

You'll need 8+ years of engineering experience in SRE, platform engineering, infrastructure engineering, or distributed systems.
You will need strong Kubernetes experience in AWS, GCP, or Azure, and familiarity with infrastructure-as-code tooling.
You should have a strong understanding of distributed systems failure modes in multi-cloud environments and proficiency in a systems-oriented language.

Grafana Labs

Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, which features scalable metrics, logs, and traces. They thrive in an innovation-driven environment where transparency, autonomy, and trust fuel everything.
