Earning the trust of our large-scale operator customers to further Grafana's "big tent" philosophy of data accessibility and to meet clear business objectives.
Designing and leading the development of backend services, distributed systems, and enterprise features at scale.
Driving continuous improvement of our engineering culture through words and actions.
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana, the open source visualization tool, around the globe. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or self-managed with the Grafana Enterprise Stack. The Grafana team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.
Take an active role in influencing our roadmap and your own career objectives.
Design, build, operate, and maintain critical systems, owning the reliability, performance, and availability.
Mentor and support other team members, participate in design discussions and collaborate with the team.
Grafana Labs is a remote-first, open-source powerhouse that provides visualization tools. They help companies manage their observability strategies. Grafana Labs has a global collaborative culture, and a passion for meaningful work.
Develop and maintain features as part of Observability solutions in Grafana Cloud.
Contribute to the design and implementation of high-quality, scalable integrations for various infrastructure components, databases, and applications
Build prototypes and present your ideas as part of a cross-functional team
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, and thrive in an innovation-driven environment with a global collaborative culture.
Design, build, and operate reconciliation systems to track desired stack state, detect and repair drift across stack templates, grafana.com state, Hosted Grafana, and actual customer stack configuration.
Collaborate across SSS, grafana.com, and deployment configurations to ensure stack lifecycle workflows remain reliable, observable, and resilient.
Improve operational efficiency by reducing deployment complexity and contributing to the Stack Config Reconciliation project.
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana around the globe. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack. Their team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.
Manage, hire, and develop a team of engineers, providing regular feedback.
Act as project manager and work with product owners to ensure the product roadmap is up-to-date.
Engage in technical conversations and challenge teams to arrive at strong technical decisions.
Grafana Labs is a remote-first, open-source powerhouse that provides visualization tools and helps companies manage their observability strategies. We value transparency, autonomy, and trust.
Design, build, and operate reconciliation systems to track desired stack state, detect and repair drift across stack templates, grafana.com state, Hosted Grafana, and actual customer stack configuration.
Collaborate across SSS, grafana.com, and deployment configurations to ensure stack lifecycle workflows remain reliable, observable, and resilient.
Improve operational efficiency by reducing deployment complexity and contributing to the Stack Config Reconciliation project.
Grafana Labs is a remote-first, open-source powerhouse with over 20M users of Grafana. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, featuring scalable metrics (Grafana Mimir), logs (Grafana Loki), and traces (Grafana Tempo).
Build and scale a strong culture of operational excellence by defining standards and coaching teams to own reliability and availability.
Drive mature DevOps/SRE practices, including incident response and PIRs, on-call readiness, runbooks, alerting, observability, and release/change management.
Guide teams in the design, development, evolution, and operation of large-scale, distributed cloud systems.
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana around the globe. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, and their team thrives in an innovation-driven environment.
Build out Rerun's commercial offering for ingesting, indexing and querying multimodal data at scale.
Grow your ownership of design and architecture.
Turn rough ideas into solid systems.
Rerun is building a new multimodal data stack for physical AI, aiming to transform the physical-world economy. They focus on data infrastructure and tools for extraction, ingestion, storage, querying, streaming, and visualization of temporal multimodal data.
Manage and grow a distributed team of engineers, providing feedback and supporting career development.
Partner with product management to shape the Usage squad's roadmap, ensuring alignment with company mission and customer impact.
Guide the team through the full project lifecycle, ensuring high-quality and timely outcomes within the Usage domain.
Grafana Labs is a remote-first, open-source powerhouse with over 20M users globally. Their team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.
Own and evolve a scalable observability platform spanning metrics, logs, traces, and events.
Design telemetry pipelines ingesting data from GPUs, CPUs, networking, containers, APIs, and BMC/Redfish.
Design and implement noise-resistant alerting systems to improve signal quality and reduce operational load.
Lightning AI builds an end-to-end platform for developing, training, and deploying AI systems, designed to take ideas from research to production with less friction. They combine developer-first software with cost-efficient, large-scale compute, serving solo researchers, startups, and large enterprises.
Help define architecture to support millions of daily API requests, build and scale infrastructure for sending tens of millions of emails daily, and improve high availability across distributed applications.
Scale databases like Postgres and Clickhouse for performance, enhance observability using tools like Datadog, and refine disaster recovery plans for quick and reliable service recovery.
Build infrastructure with IaC frameworks like CDK and TF, work with Typescript and Golang, and design and operate async pipelines handling tens of millions of messages daily with on-call rotation for critical services.
Resend is building the modern email sending platform for developers, focusing on quality, craft, and developer experience. It is a fully remote team of about 40 people spanning 11 countries, backed by investors like a16z and Y Combinator, and values honesty, low ego, and autonomy.
Design, build, and maintain scalable backend services and APIs that power Chattermill’s core analytics platform.
Architect reliable, maintainable distributed systems and contribute to the evolution of backend service design and infrastructure.
Own end-to-end delivery of backend engineering workstreams, from technical scoping and architecture through to implementation, testing, observability, and production support.
Chattermill helps large successful brands like Uber, Amazon, and Wise put their customers at the centre of everything they do. Using best-in-class tech in a fast-evolving AI space, their Customer Experience Intelligence platform continuously analyses feedback to help clients identify what to do next.
Build delightful interactive learning inside Grafana and ship features that make learning experiences feel obvious, smooth, and scalable.
Enable contribution and authoring by creating workflows and product features that let many contributors safely create, iterate on, and improve learning content.
Build fast feedback loops (metrics/logs/traces + user journey visibility) so issues stay shallow by making it easy to understand what’s happening in production and in real user experiences.
Grafana Labs is a remote-first, open-source powerhouse that provides the Grafana LGTM Stack for managing observability strategies. They have over 20M users worldwide and help over 3,000 companies manage their observability strategies with the Grafana LGTM Stack. Their team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.
Drive projects from idea to operations, influencing the roadmap and mentoring team members.
Build and evolve the query experience, data manipulation engine, and transformation components using TypeScript and React.
Own system performance and reliability at a global scale, participating in on-call rotations and focusing on maintainable code.
Grafana Labs is a remote-first, open-source powerhouse building the Grafana LGTM Stack for observability. It is a high-growth, global company with a culture focused on innovation, transparency, autonomy, and trust, united by a passion for meaningful work.
Assess and improve visibility by identifying gaps in dashboards, metrics, and logs.
Refine alerts and dashboards for critical services to catch issues earlier.
Automate routine checks and monitoring tasks to free up engineers.
PlayOn is where high school sports come to life through platforms like GoFan, NFHS Network, and MaxPreps. As a growth-stage company backed by KKR, we build the technology that powers high school athletics from ticketing and streaming to fundraising and merchandise.
Design and build backend systems, APIs, infrastructure, and platform capabilities that improve developer workflows across Reddit.
Build scalable and reliable systems across both AI-powered developer workflows and the core non-AI systems engineers rely on every day.
Lead high-impact projects across Reddit’s developer tooling ecosystem by writing and reviewing code and design docs, aligning stakeholders, and making pragmatic technical tradeoffs.
Reddit is a community-based platform built on shared interests, passion, and trust, facilitating open and authentic conversations. With over 100,000 active communities and approximately 126 million daily active unique visitors, it serves as one of the internet’s largest sources of information.