Operating and evolving 100+ multi-cloud streaming clusters and related database infrastructure.
Diagnosing and eliminating cross-layer failure modes.
Designing safe upgrade and rollout strategies at scale.
Grafana Labs is a remote-first, open-source powerhouse with over 20M users of Grafana, its open source visualization tool. Grafana Labs helps more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, and its team thrives in an innovation-driven environment.
Maintain, optimize, and enhance on-premises and cloud computing environments.
Execute technical aspects of implementation projects, ensuring seamless software integration and customization.
Automate Infrastructure-as-Code (IaC) to manage virtual machines and deploy containers, services, and other infrastructure.
Striveworks helps organizations harness AI to solve national security and business challenges, acting as a command center for data and models. Founded by data scientists and engineers, they aim to simplify the deployment and optimization of AI systems, ensuring reliability and scalability.
Work as part of a small, cross-functional XP team installing Imogen into client cloud environments.
Pair program with other engineers and collaborate closely with product managers and designers.
Work with client infrastructure, security, and network teams to deploy Imogen to their cloud infrastructure.
Mechanical Orchard builds Imogen, a modernization platform for rewriting business applications. Their Delivery team operates in complex environments and values strong fundamentals, AI, and collaboration.
Design, build, and maintain Kubernetes-based infrastructure and cloud environments.
Build and optimize CI/CD pipelines that enable fast, safe, and repeatable deployments.
Leverage AI coding tools and agentic workflows as a core part of your work.
Intrahealth, a subsidiary of HEALWELL AI Inc., is an enterprise-class EMR provider supporting approximately 20,000 providers and the care delivery of tens of millions of patients and clients across Canada, Australia, and New Zealand. Intrahealth provides a suite of flexible software solutions to a wide variety of customers, including health authorities, public health, community health, home care, and primary care professionals.
Own the architecture and evolution of P2P.org's internal developer platform—Kubernetes, monitoring, secrets management, and delivery infrastructure.
Design and build scalable, fault-tolerant platform components—including capacity planning, multi-tenancy, networking topology, and storage architecture.
Use AI tooling as a core part of how you work and champion its adoption across the infrastructure team and wider engineering organization.
P2P.org is the largest institutional staking provider with a TVL of over $10B and a market share exceeding 20% in restaking. They unite talented individuals globally and prioritize customer satisfaction, developing innovative solutions.
Contribute to the design and implementation of the Libraries Platform, spanning the services, pipelines, and package index.
Design and maintain automation for artifact creation, updates, and verification, including vulnerability scanning and remediation workflows.
Build and operate shared platform services such as package indexes, registry mirrors, and orchestration tooling that serve both external customers and internal ecosystem teams.
Chainguard is the trusted source for open source, delivering hardened, secure, and production-ready builds of open source software. They help organizations build faster, stay compliant, and eliminate risk, and are venture-backed by leading investors.
Lead the team covering existing areas including corporate systems, employee device lifecycles, helpdesk queues, and internal tooling development.
Handle corporate security initiatives, deploying security and compliance checks in an employee-enabling way in daily workflows.
Help define the wider internal AI rollout, enablement, and security strategy across all teams, not just developers.
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana. They help more than 3,000 companies manage their observability strategies and thrive in an innovation-driven environment where transparency, autonomy, and trust fuel everything.
Evolve ArgoCD GitOps standards across environments
Build reusable Terraform modules and practices for safe, repeatable cloud infrastructure provisioning and drift detection
Lead the operation and evolution of production-grade Kubernetes clusters across cloud environments
GitLab is the intelligent orchestration platform for DevSecOps. More than 50 million registered users and more than 50% of the Fortune 100 trust GitLab to ship better, more secure software faster.
Work with other Engineering teams to design sustainable infrastructure and microservice solutions.
Automate tools and infrastructure to reduce manual work.
Monitor applications and participate in an on-call rotation as required.
Bloomreach is building the world’s premier agentic platform for personalization, revolutionizing how businesses connect with their customers by building and deploying AI agents to personalize the entire customer journey. They power personalization for more than 1,400 global brands.
Design and implement infrastructure and tools that empower our product teams to rapidly and securely iterate, emphasizing reliability and automation.
Influence the strategic direction of our infrastructure and operational practices, ensuring that we are well-positioned to scale and support our growing organization.
Take a proactive role in the resolution of production issues, ensuring that we are well-prepared to handle incidents and that we learn from them in a blameless manner.
SSV Labs is the core team behind the SSV Network - pioneering decentralized infrastructure for Ethereum staking. They are building tools, protocols, and standards to make staking more secure, scalable, and trustless.
Design, implement, and operate cloud-native infrastructure for production workloads.
PointClickCare's mission is to help providers deliver exceptional care. They are a leading health tech company, founder-led and privately held, that empowers their employees to push boundaries, innovate, and shape the future of healthcare. They have the largest long-term and post-acute care dataset and a Marketplace of 400+ integrated partners, and their platform serves over 30,000 provider organizations.
Build the foundational, reusable services that every other JumpCloud product relies on to function securely and efficiently.
Deepen your expertise in Go, AWS, and Kubernetes while gaining broad architectural exposure by adapting to different teams and tech challenges.
Perfect for a versatile engineer who loves solving core infrastructure problems and building common frameworks, and who thrives in a dynamic, flexible environment.
JumpCloud delivers a unified open directory platform that makes it easy to securely manage identities, devices, and access across your organization. With JumpCloud, IT teams and MSPs enable users to work securely from anywhere and manage their Windows, Apple, Linux, and Android devices from a single platform.
Collaborate with stakeholders to drive best practices for monitoring and CI/CD pipelines
Troubleshoot deployment issues in our CI pipeline
Identify areas for automation and embrace the codification of all things
Weedmaps is a global leader in the cannabis industry. They are dedicated to transparency, education, and community, serving cannabis consumers and businesses in the U.S. and worldwide.
Managing two small teams of software engineers who design and implement software to reduce risk.
Owning the strategy and roadmap for both teams, balancing security and developer experience.
Coaching and developing engineers by providing regular, practical feedback to help them reach their personal growth goals.
Canva is a design platform redefining how the world experiences design. The company has campuses in Sydney and Melbourne, and co-working spaces in other Australian cities; they trust their employees to choose a work arrangement that empowers them.
Develop and maintain observability solutions using platforms like Datadog, Prometheus and Grafana
Take a leading role in incident management, including coordinating response efforts, troubleshooting issues, and identifying follow-up actions
Partner with product engineering teams to architect reliable systems, recover from incidents, and learn from mistakes
Ditto is redefining how data moves at the edge, aiming to make resilient, real-time applications seamless for developers, regardless of network conditions. It's a globally distributed and fast-growing startup with over $145 million in funding that is committed to building a diverse and inclusive team.
Work with the head of DevOps to help develop and implement the DevOps strategy.
Build tooling that helps us automate and comply with security requirements / certifications.
Play a leading role in discussions / code review so the wider team “levels up” together.
Ivanti provides security and service management software, offering solutions primarily for IT departments within medium to large organizations. They help clients securely manage IT infrastructure and services, and develop and maintain dozens of products.
Work on automated CI/CD processes for building, testing, and publishing our container images
Write tools and tests for assessing security compliance and cloud-native compatibility
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. As the company that publishes Ubuntu, with 1100+ colleagues in 75+ countries, they are changing the world on a daily basis.
Contribute to the Infrastructure Security team’s vision and strategic roadmap.
Manage an existing high-performing team of infrastructure security professionals and hire new members as appropriate.
Establish and implement security policies, procedures, standards, and guidelines in support of infrastructure security.
GitLab is the intelligent orchestration platform for DevSecOps. They enable organizations to increase developer productivity, improve operational efficiency, reduce security and compliance risk, and accelerate digital transformation. GitLab has more than 50 million registered users, and its high-performance culture is driven by its values and continuous knowledge exchange.
Enhancing the querying experience through improving how queries are transformed, routed, and delivered at scale
Influencing the design of our microservice architecture
Supporting the reliability, scalability, and operations of data sources in Grafana Cloud
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana around the globe. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, with a global collaborative culture, and a passion for meaningful work.
Learn platform infrastructure, developer tooling, and deployment patterns.
Own and drive at least one architecture decision that improves platform reliability.
Ship infrastructure improvements that measurably improve developer experience or platform stability.
Homebot is a homeownership platform for lenders and real estate, title & insurance agents that drives client retention and partner referrals. They maintain a clear focus on culture, engagement, and creating an environment where people are valued and can thrive.