Operate and evolve multi-cloud streaming clusters and related database infrastructure, diagnosing and eliminating cross-layer failure modes.
Design safe upgrade and rollout strategies at scale, improving observability, automation, and operational ergonomics.
Partner closely with database and platform teams to ensure safe scaling, partitioning, consumer fan-out, and query performance.
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or self-managed with the Grafana Enterprise Stack.
Work with your team to build and roll out new features, then use the results to iterate and improve.
Drive projects from initial ideation all the way to operations once they are in the hands of customers.
Maintain critical systems, and own their reliability, performance, and availability.
Grafana Labs is a remote-first, open-source powerhouse with over 20M users. They provide observability strategies for over 3,000 companies, featuring scalable metrics, logs, and traces, and thrive in an innovation-driven environment with transparency, autonomy, and trust.
Design and implement high-quality, scalable services to be consumed by multiple Grafana Cloud products.
Support the technical direction and vision of the team, contributing to strategic discussions and future development of observability solutions.
Be a part of your team’s follow-the-sun on-call rotations and take ownership of the services you’re running.
Grafana Labs is a remote-first, open-source powerhouse that provides the leading open source visualization tool. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or self-managed with the Grafana Enterprise Stack. The team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.
Provide and own the automated provisioning of CSP resources, including networking, Kubernetes clusters, and the specific CSP resources required by our application teams.
Work with users (Grafana Cloud application teams) to help understand their needs and ensure investment in the right capabilities.
Participate in the Platform department Infrastructure wing on-call rotation.
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana around the globe. The team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything that they do.
Develop automation to eliminate manual and repetitive operational tasks.
Investigate and resolve customer complaints escalated beyond L1 and L2 support.
Moniepoint is an all-in-one financial services platform for emerging markets. Since 2019, Moniepoint’s technology has powered over 3 million people, offering personal and business banking, payment, credit and business management tools to help them succeed.
Automate the provisioning of all of Juniper Square’s infrastructure in code.
Partner with our Platform Engineering team on building developer tooling and improving developer experience through joint initiatives and enhancements.
Partner with our Data Engineering team on improving our data posture and driving operational excellence.
Juniper Square's mission is to unlock the full potential of private markets by digitizing them to bring efficiency, transparency, and access. They are a values-driven organization with a hybrid workplace strategy, allowing employees to collaborate effectively across multiple countries and offering physical offices in several major cities.
Maintain the Field Engineering infrastructure, including the pre-sales Demo Kit application and infrastructure.
Design, develop, and deliver compelling product demos to add to the demo kit library.
Create and deliver training materials and product workshops for SEs, customers, and the community.
Grafana Labs is a remote-first, open-source powerhouse whose open source visualization tool has more than 20M users. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack and thrive in an innovation-driven environment.
Be a key contributor on an Agile development team, collaboratively realizing business value through an iterative software development lifecycle.
Build and execute the monitoring strategy for ScienceLogic SaaS infrastructure.
Define, deploy, and maintain system and service monitors.
ScienceLogic is a leader in IT Operations Management, giving modern IT operations actionable insights for faster problem resolution and prediction. They see everything across cloud and distributed architectures, contextualize data through relationship mapping, and act on this insight through integration and automation.
Design, deploy, and maintain a cloud infrastructure to support a Dataiku SaaS offering, mainly on AWS, Azure, and GCP.
Continuously improve the infrastructure, deployment, and configuration to deliver more reliable, resilient, scalable, and secure services.
Automate all technical operations as much as possible.
Dataiku is The Universal AI Platform™, giving organizations control over their AI talent, processes, and technologies to unleash the creation of analytics, models, and agents. They connect many data science technologies and integrate the best of data and AI tech.
Helping internal engineers release software securely and measurably.
Leading automation of release processes using ‘golden path’ techniques.
Supporting diverse internal teams from application development to security.
Grafana Labs is a remote-first, open-source powerhouse with over 20M users globally. It helps more than 3,000 companies manage their observability strategies, and its team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything.
Take an active role in influencing our roadmap and your own career objectives.
Design, build, operate, and maintain critical systems, owning their reliability, performance, and availability.
Support other team members, participate in design discussions and collaborate with the team.
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana around the globe. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack. Their team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.
Take an active role in influencing our roadmap and your own career objectives
Help your team drive projects from initial idea all the way to operations once they are in the hands of customers
Embrace our open-source culture and contribute to other projects that may not directly fall within your team’s scope
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana, the open source visualization tool, around the globe. Grafana Labs also helps more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack. Our team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything we do.
Partner with engineers to build dev tools that empower developer workflows and deployment infrastructure.
Ensure reliability of multi-cloud Kubernetes clusters and pipelines.
Provide metrics, logging, analytics, and alerting for performance and security across all endpoints and applications.
Cresta is on a mission to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center. Their platform combines the best of AI and human intelligence to help contact centers discover customer insights and behavioral best practices.
Lead end-to-end delivery of large, cross-functional projects.
Own architecture, reliability, performance and cost for critical systems.
Grafana Labs provides an open source observability platform that integrates metrics, logs, traces, and profiles with Grafana. They have a global, collaborative culture and a passion for meaningful work. Their team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.
Work directly with customers to ensure successful Teleport deployments.
Meet regularly with customers, understand pain points blocking deployments and remove roadblocks.
Work with customers to articulate the problem they are trying to solve, gather requirements, and make the business case to the product and engineering teams to invest in resolving the issue.
Teleport is the Infrastructure Identity Company, modernizing identity, access, and policy for infrastructure, improving engineering velocity and resiliency of critical infrastructure against human factors and/or compromise. They are a fast-growing, well-funded Y Combinator company that values craft, strongly supports work/life balance, and embraces a culture of humility, honesty, and transparency.
Leading and coaching a team of software engineers building and evolving Canva’s core data and storage platforms.
Designing, implementing, and continuously improving database and storage infrastructure with scalability, reliability, security, compliance, and cost efficiency in mind.
Analysing fleet-wide performance and efficiency across Canva’s storage systems, using data-driven insights to uplift reliability and reduce operational overhead.
Canva is a design platform redefining how the world experiences design. Our flagship campus is in Sydney, with additional spaces in Melbourne, Brisbane, Perth, and Adelaide.
Develop and maintain features as part of Observability solutions in Grafana Cloud.
Contribute to the design and implementation of high-quality, scalable integrations for various infrastructure components, databases, and applications.
Build prototypes and present your ideas as part of a cross-functional team.
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana. It helps more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack.
Lead reliability-focused design and readiness reviews.
Build, operate, and continuously improve our observability stack.
Own and evolve incident management practices.
Transcend is building the privacy platform that easily embeds privacy into your entire tech stack. They are growing quickly, are backed by top-tier investors, and are proud to serve some of the world's most iconic brands.
Contribute to building and operating the infrastructure that supports the HackerOne platform.
Improve the reliability, security, and scalability of our systems.
Design and operate highly available cloud systems and apply best practices for reliability, observability, and security.
HackerOne is a global leader in Continuous Threat Exposure Management (CTEM). The HackerOne Platform unites agentic AI solutions with the ingenuity of the world’s largest community of security researchers to continuously discover, validate, prioritize, and remediate exposures across code, cloud, and AI systems. It is trusted by the world’s top organizations.
Support teammates with goal-setting, professional development, and mentoring.
Ensure delivery of maintainable, high-quality platform systems.
Build and sustain a healthy team culture where ownership and collaboration are the norm.
onX is a pioneer in digital outdoor navigation solutions through its suite of apps. With over 400 employees, they foster a fast-paced, tech-forward environment valuing ownership, accountability, and teamwork.