Guide the technical vision for GitLab’s cloud-native, self-managed deployments and upgrade workflows.
Design and maintain Kubernetes Operators, Helm charts, and upgrade orchestration tooling for self-managed GitLab deployments.
Drive observability, testing, performance, and resilience practices for self-managed deployments.
GitLab is the intelligent orchestration platform for DevSecOps, enabling organizations to increase developer productivity, improve operational efficiency, reduce security and compliance risk, and accelerate digital transformation. With over 50 million registered users, GitLab emphasizes AI integration and a high-performance culture driven by values and continuous knowledge exchange.
Represent the IS team to stakeholders, customers, and internal teams.
Mentor engineers to improve their skills.
Canonical is a leading provider of open-source software and operating systems for global enterprise and technology markets. It is a pioneer of global distributed collaboration, with 1200+ colleagues in more than 80 countries and very few office-based roles.
Build and lead the team responsible for the reliability, security, and scalability of Gensyn’s production infrastructure and developer platform.
Own the availability, scalability, and security posture of production systems: SLOs/SLIs, incident response, postmortems, reliability improvements, and hardening.
Drive delivery across ambiguous, high-stakes initiatives: roadmap planning, prioritization, and execution against tight timelines.
Gensyn is building a protocol that networks together the core resources required for machine intelligence to flourish alongside human intelligence. They value autonomy, independence, direct feedback and an extreme learning rate, and strive to reject mediocrity and waste.
Hire, manage, and develop a team of engineers, providing regular feedback and supporting each person’s growth through career conversations.
Collaborate closely with product, design, and engineering leadership to define goals that move the squad forward and align with broader business objectives.
Foster a psychologically safe environment where engineers can learn, experiment, and iterate quickly, encouraging innovation and a culture of continuous improvement.
Grafana Labs is a remote-first, open-source powerhouse with more than 20M users of Grafana. They help more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack. Their team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything they do.
Work directly with enterprise customers to deploy and configure OpenTelemetry instrumentation across their environments.
Build custom integrations, dashboards, and tooling to help customers realize the full value of Dash0.
Troubleshoot complex issues in distributed systems, Kubernetes clusters, and observability pipelines.
Dash0 is building an AI-centric, OpenTelemetry-native platform that eliminates vendor lock-in and meaningless toil. They are backed by top-tier investors including Balderton Capital, Accel, and Cherry Ventures, and are led by a founding team with decades of experience in observability.
Build, maintain, and release C8 distributions for our self-managed customers.
Improve reliability and usability of our deployment artifacts.
Collaborate cross-functionally with Development Teams, Product Management, Quality Assurance, Documentation, and Support.
Camunda is the leader in enterprise agentic automation, orchestrating complex business processes across agents, people, and systems. They are a fully remote, global company named a Great Place to Work, growing fast and looking for top talent to join their team.
Build, mentor, and lead high-performing engineering teams.
Champion technical excellence by setting high standards.
Foster a culture focused on customer satisfaction.
Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. The system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.
Lead, mentor, and grow the team of Platform Engineers.
Partner with Cybersecurity, Product, and Engineering teams.
Drive standards and frameworks for Infrastructure as Code.
Onebrief provides collaboration and AI-powered workflow software specifically for military staffs. The company aims to make staff faster, smarter, and more efficient. Founded in 2019, Onebrief's team spans veterans from all forces and global organizations, and technologists from leading-edge software companies.
Make GitLab easier to deploy and more secure by default.
Improve installation, upgrade, and day-to-day operations.
GitLab is the intelligent orchestration platform for DevSecOps. They enable organizations to increase developer productivity, improve operational efficiency, reduce security and compliance risk, and accelerate digital transformation. More than 50 million registered users trust them.
Lead, mentor, and develop a high-performing team of Software and SRE engineers.
Drive the ongoing development and improvement of the Operations Platform.
Define clear operational standards, guidelines, and best practices.
Chainlink is the industry-standard oracle platform bringing capital markets onchain and powering decentralized finance (DeFi). The Chainlink stack provides essential data, interoperability, compliance, and privacy standards to power advanced blockchain applications.
Spearhead development and implementation of observability tools.
Drive performance and ensure resilient systems.
Provide technical guidance and improve operational efficiencies.
Jobgether is a platform connecting job seekers with employers using AI-powered matching. They aim to ensure applications are reviewed quickly and fairly, focusing on core role requirements.
Own and drive the architectural direction for critical infrastructure platforms that support GitLab at global scale.
GitLab is the intelligent orchestration platform for DevSecOps. They enable organizations to increase developer productivity, improve operational efficiency, reduce security and compliance risk, and accelerate digital transformation. GitLab has a high-performance culture driven by their values.
Own delivery for the Developer Tools team, ensuring timely, high-quality releases aligned with Elasticsearch schedules and key milestones.
Lead and mentor a globally distributed team of language-specialist engineers.
Foster a collaborative, inclusive team culture with strong ownership and accountability.
Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale — unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500, brings together the precision of search and the intelligence of AI to enable everyone to accelerate the results that matter.
Collaborate with application engineering teams on platform infrastructure.
Enhance observability and spearhead the adoption of SRE best practices.
Build and maintain reliable CI/CD pipelines, tooling, and infrastructure.
Rula strives to provide quality, evidence-based, compassionate mental healthcare and aims to create a world where mental health is no longer stigmatized. They are a remote-first company operating in most U.S. states, and are dedicated to having a culture of inclusion that supports their employees.
You will lead the teams responsible for the foundation that every engineering team builds on: infrastructure, developer experience, shared libraries, CI/CD, observability, and governance.
You will own delivery across developer experience, core platform tooling, and platform infrastructure, ensuring that domain teams building on the platform have an opinionated, supported path.
This is a hands-on engineering management role where you review technical designs, pair with engineers on hard problems, and make informed tradeoff calls across infrastructure, tooling, and developer experience.
EzCater is the leading food-for-work technology company in the US, connecting anyone who needs food for their workplace to over 100,000 restaurants nationwide. They are backed by top investors including Insight, Iconiq, Lightspeed, GIC, SoftBank, and Quadrille, and have engaged and passionate colleagues.
Serve as a hands-on technical leader accountable for delivery quality, system health, and team effectiveness.
Drive technical strategy, optimize software development lifecycle (SDLC) processes, and translate business objectives into actionable engineering roadmaps.
Guide architectural decisions, remove delivery roadblocks, and foster a high-trust, high-performance culture across time zones.
iSeatz drives enduring brand loyalty through exceptional, connected experiences. Their digital commerce and technology solutions enable travel and lifestyle bookings for the world’s leading brands. They have a history of long-term trusted relationships and innovation that drives tangible value to their customers through a customizable, scalable, and secure platform.
Own SLI/SLO/SLA definitions for the Akuity SaaS platform and drive continuous improvement.
Participate in an on-call rotation and act as incident commander for high-severity production events.
Partner with engineering teams to build reliability into new features before they ship to production.
Akuity helps enterprises ship software faster and more reliably with modern GitOps best practices. The Akuity Platform enables teams to manage the development and deployment across hundreds – if not thousands – of Kubernetes clusters from a single control plane.
Build, lead, and develop a remote engineering team of ~5 engineers.
Act as a hands-on technical leader—participating in code reviews, architecture decisions, and occasional implementation work.
Improve engineering processes, quality, and productivity as the company scales.
EdSights builds technology that helps colleges and universities better understand and support their students at scale. They partner with 250+ colleges and universities, are backed by an $80M investment, and are in a phase of rapid execution and expansion.
Manage, mentor, and hire for a high-performing team of engineers.
Instill best practices for monitoring, alerting, and incident response.
Streamline how we build and maintain integrations with external systems.
Inspiren, founded by Michael Wang, offers a complete and connected ecosystem in senior living. Their integrated platform connects smart hardware, embedded software, and cloud infrastructure to deliver real-time insights that improve safety, operational efficiency, and care outcomes.
Help deploy and configure Dynatrace OneAgent and ActiveGates with automated tooling.
Define and instrument user‑centric metrics and objectives in Dynatrace.
Combine Davis® AI with Copilot/Claude to identify root causes and reduce MTTR.
AWP Safety's IT Internship Program is a hands‑on learning experience for early‑career professionals who want to build a future in IT Site Reliability Engineering. They operate at the intersection of Software Engineering and Systems Operations, using Dynatrace to diagnose performance bottlenecks and automate "toil" out of existence.