Provide and own the automation that provisions CSP resources, including networking, Kubernetes clusters, and the specific CSP resources required by our application teams.
Work with users (Grafana Cloud application teams) to help understand their needs and ensure investment in the right capabilities.
Participate in the Platform department Infrastructure wing on-call rotation.
Helping internal engineers release software securely and measurably.
Leading automation of release processes using ‘golden path’ techniques.
Supporting diverse internal teams from application development to security.
Grafana Labs is a remote-first, open-source powerhouse with over 20M users globally. They help more than 3,000 companies manage their observability strategies, and their team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.
Operate and evolve multi-cloud streaming clusters and related database infrastructure, diagnosing and eliminating cross-layer failure modes.
Design safe upgrade and rollout strategies at scale, improving observability, automation, and operational ergonomics.
Partner closely with database and platform teams to ensure safe scaling, partitioning, consumer fan-out, and query performance.
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or self-managed with the Grafana Enterprise Stack.
Design and implement high-quality, scalable services to be consumed by multiple Grafana Cloud products.
Support the technical direction and vision of the team, contributing to strategic discussions and future development of observability solutions
Be a part of your team’s follow-the-sun on-call rotations and take ownership of the services you’re running
Grafana Labs is a remote-first, open-source powerhouse that provides the leading open source visualization tool. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or self-managed with the Grafana Enterprise Stack. The team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.
Partner closely with product engineering squads (embedded model)
Own production reliability for high-SLA and complex customer environments
Design and implement automation to scale our reliability practices
Grafana Labs is a remote-first, open-source powerhouse that helps more than 3,000 companies manage their observability strategies. They are scaling fast and staying true to what makes them different: an open-source legacy, a global collaborative culture, and a passion for meaningful work.
Make deployments boring (in the best way possible)
Own CI/CD pipelines: optimize build times, improve caching, reduce flakiness
Evolve our Kubernetes (EKS) deployment strategy for reliability and speed
Obvious is building an AI-native workspace, an operating system for work that puts co-intelligence at the center. They are a small and talent-dense team with world-class builders, former founders, and leaders from companies like Netflix, Google, and Meta.
Extend and automate the existing container orchestration platform, ensuring its scalability, reliability, and performance
Work closely with SREs from different teams to reduce their cognitive load related to the orchestration platform
Implement and maintain security best practices for the orchestration platform, ensuring the security and availability of our systems
Kraken is a mission-focused company rooted in crypto values. They aim to accelerate the global adoption of crypto, so that everyone can achieve financial freedom and inclusion. As a fully remote company, Kraken has employees in 70+ countries who speak over 50 languages.
Design and implement production-grade Kubernetes architectures aligned to security, reliability, and scalability best practices.
Lead technical assessments of Kubernetes and cloud-native environments, identifying risks, maturity gaps, and automation opportunities.
Serve as a trusted advisor for clients on Kubernetes strategy, platform engineering, and automation maturity.
GuidePoint Security provides trusted cybersecurity expertise, solutions, and services that help organizations make better decisions and minimize risk. Since its inception in 2011, GuidePoint has grown to over 1000 employees, and firmly defined core values drive all aspects of the business.
Contribute to the core product, working across the stack on services that power their applications.
Design and refine technical systems, sharing ownership of customer use cases and the systems that power them.
Work directly with customers, providing pointers to documentation and debugging issues.
Humanitec is reshaping how enterprises build and run their cloud-native setups, helping teams build Internal Developer Platforms (IDPs) that unlock true developer self-service. They value humility, drive, and intelligence in their fully remote team.
Partner with engineers to build dev tools that empower developer workflows and deployment infrastructure.
Ensure reliability of multi-cloud Kubernetes clusters and pipelines.
Implement metrics, logging, analytics, and alerting for performance and security across all endpoints and applications.
Cresta is on a mission to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center. Their platform combines the best of AI and human intelligence to help contact centers discover customer insights and behavioral best practices.
Design, provision, and manage cloud infrastructure using Infrastructure as Code
Operate and support Kubernetes clusters in production environments
Build and maintain GitOps-based deployment and configuration workflows
BETSOL is a cloud-first digital transformation and data management company offering products and IT services to enterprises in over 40 countries. They are an employee-centric organization, offering comprehensive health insurance, competitive salaries, 401K, volunteer programs, and scholarship opportunities.
Lead end-to-end delivery of large, cross-functional projects.
Own architecture, reliability, performance and cost for critical systems.
Grafana Labs provides an open source observability platform that integrates metrics, logs, traces, and profiles with Grafana. They have a global collaborative culture and a passion for meaningful work. Their team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.
Conception, implementation, operation and documentation for cloud projects, solutions and products
Perform health checks, optimize monitoring, and execute deployments in cloud environments
Consult customers on cloud architecture and collaborate closely with their developers and internal departments on agile DevOps and operational processes
Deutsche Telekom IT Solutions is a subsidiary of the Deutsche Telekom Group and was Hungary’s most attractive employer in 2025, according to Randstad’s representative survey. They provide a wide portfolio of IT and telecommunications services with more than 5300 employees.
Work with your team to build and roll out new features, then use the results to iterate and improve.
Drive projects from initial ideation all the way to operations once they are in the hands of customers.
Maintain critical systems, and own their reliability, performance, and availability.
Grafana Labs is a remote-first, open-source powerhouse with over 20M users. They provide observability strategies for over 3,000 companies, featuring scalable metrics, logs, and traces, and thrive in an innovation-driven environment with transparency, autonomy, and trust.
Define and execute a technical vision for Onebrief’s infrastructure.
Design and evolve a deployment strategy focused on AWS and on-prem.
Build security and compliance directly into the infrastructure lifecycle.
Onebrief provides collaboration and AI-powered workflow software designed specifically for military staffs, making them faster, smarter, and more efficient. They have raised $320m+ from top-tier investors and are valued at $2.15B, with a team spanning veterans and technologists.
Extend and improve our container-based infrastructure running in Kubernetes, and continue to automate away manual processes.
Partner with your colleagues from engineering to embrace the idea of self-service tooling, making the best way the easiest way.
Contribute to resilient CI/CD pipelines and processes so that our engineering team can deploy features faster while staying compliant with regulations.
Ada envisions a world where everyone gets the healthcare they need, using AI to help people get answers faster. With a team of physicians and clinical scientists, Ada identifies those at risk and guides them to the right care to transform healthcare and ensure no one goes undiagnosed.
Maintain the Field Engineering infrastructure, including the pre-sales Demo Kit application and infrastructure.
Design, develop, and deliver compelling product demos to add to the demo kit library.
Create and deliver training materials and product workshops to the SEs, customers, and the community.
Grafana Labs is a remote-first, open-source powerhouse whose open source visualization tool has more than 20M users. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack and thrive in an innovation-driven environment.
Design and deploy customer infrastructure on different cloud providers and bare metal environments
Design and manage Kubernetes clusters for applications with microservices architecture
Develop and optimize CI/CD pipelines for seamless software delivery
Mirantis is the Kubernetes-native AI infrastructure company, enabling organizations to build and operate scalable, secure, and sovereign infrastructure for modern AI, machine learning, and data-intensive applications. They empower platform engineering teams to deliver composable, production-ready developer platforms across any environment.
Contribute to building and operating the infrastructure that supports the HackerOne platform.
Improve the reliability, security, and scalability of our systems.
Design and operate highly available cloud systems and apply best practices for reliability, observability, and security.
HackerOne is a global leader in Continuous Threat Exposure Management (CTEM). The HackerOne Platform unites agentic AI solutions with the ingenuity of the world’s largest community of security researchers to continuously discover, validate, prioritize, and remediate exposures across code, cloud, and AI systems. The platform is trusted by the world’s top organizations.
Design, implement, and manage cloud infrastructure using Infrastructure as Code (IaC) tools.
Design, build, and maintain scalable CI/CD pipelines using tools like CircleCI or GitHub Actions.
Implement and maintain observability tooling (Prometheus, Grafana, Datadog), and lead incident response to ensure system reliability.
Engine is transforming business travel into something personalized, rewarding, and simple. More than 20,000 companies already rely on Engine to support over 1 million travelers and billions in annual bookings.
Take an active role in influencing our roadmap and your own career objectives.
Design, build, operate, and maintain critical systems, owning the reliability, performance, and availability.
Support other team members, participate in design discussions and collaborate with the team.
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana around the globe. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack. Their team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.