Operating and evolving 100+ multi-cloud streaming clusters and related database infrastructure.
Diagnosing and eliminating cross-layer failure modes.
Designing safe upgrade and rollout strategies at scale.
Grafana Labs is a remote-first, open-source powerhouse with over 20M users of Grafana, its open source visualization tool. Grafana Labs helps more than 3,000 companies manage their observability strategies with the Grafana LGTM Stack, and its team thrives in an innovation-driven environment.
Leading a team focused on designing, building, and evolving cloud-native, containerized infrastructure.
Driving complex technical initiatives and ensuring the availability, security, scalability, and reliability of our data ecosystem.
Guiding and developing engineering talent, setting priorities, driving execution, and partnering across teams.
Pismo, founded in 2016, provides a comprehensive processing platform for banking, card issuing and financial market infrastructure. Pismo has 500+ employees located in more than 10 countries around the world and was acquired by Visa in 2024.
Lead the Infrastructure Engineering team, taking full ownership of cloud infrastructure, Kubernetes platforms, DevOps tooling, and CI/CD pipelines.
Drive reliability, scalability, and security across the production environment while maintaining a sharp focus on developer velocity and business impact.
Mentor and guide engineers across SRE, DevOps, and Database Reliability functions, fostering a culture of operational excellence and pragmatic problem-solving.
Finom is a European tech startup headquartered in Amsterdam, revolutionizing financial services for entrepreneurs with an all-in-one B2B platform. They have raised $346 million, are expanding across key EU markets, and foster innovation, prioritizing research and solutions that benefit users, employees, partners, and the business.
Deliver a scalable internal infrastructure platform on public cloud environments.
Establish and evolve Kubernetes-based platform capabilities to support high-availability, production-grade workloads at scale.
Build a secure and reliable foundation that supports CI/CD pipelines and minimizes operational risk across engineering teams.
Chainlink is the industry-standard oracle platform bringing the capital markets onchain and powering the majority of decentralized finance (DeFi). Since inventing decentralized oracle networks, Chainlink has enabled tens of trillions in transaction value and now secures the vast majority of DeFi.
Design and build the core data infrastructure powering Vantage's platform.
Own architecture decisions for systems built on ClickHouse, Temporal, Kubernetes, and Postgres.
Drive reliability, performance, and scalability initiatives across the platform as data volume and customer load grow.
Vantage is the FinOps platform built for modern engineering teams. They are a high-output team of ~50 employees based in New York City with a remote-friendly culture.
Design, build, and maintain scalable, highly available, and fault-tolerant infrastructure.
Implement and improve monitoring, alerting, and incident response systems to ensure optimal system performance and minimize downtime.
Drive continuous improvement in infrastructure automation, deployment, and orchestration.
Mistral AI is dedicated to democratizing AI through high-performance, optimized, open-source models, products, and solutions designed to integrate seamlessly into daily working life. They are a dynamic, collaborative team dedicated to innovation and passionate about AI and its potential to transform society.
Own SLI/SLO/SLA definitions for the Akuity SaaS platform and drive continuous improvement.
Participate in an on-call rotation and act as incident commander for high-severity production events.
Partner with engineering teams to build reliability into new features before they ship to production.
Akuity helps enterprises ship software faster and more reliably with modern GitOps best practices. The Akuity Platform enables teams to manage development and deployment across hundreds – if not thousands – of Kubernetes clusters from a single control plane.
Lead, mentor, and grow a team of 8-10 skilled and globally distributed engineers, supporting their technical success, career development, and personal growth
Plan and deliver high-quality solutions that meet business and technical goals
Collaborate with Product, the Senior Director of Engineering - Cloud, and other Engineering teams to align database capabilities with business needs
Ditto is redefining how data moves at the edge, aiming to make building resilient, real-time applications seamless regardless of network conditions. A globally distributed, fast-growing startup with over $145 million in funding, Ditto is committed to building a diverse and inclusive team to solve connectivity problems.
Design, build, and operate core cloud infrastructure across compute, storage, databases, and networking layers.
Own and improve the reliability, scalability, and security of Valon’s production systems as we scale to support major enterprise deployments.
Evaluate, adopt, and operationalize new infrastructure technologies (e.g., Vitess, ClickHouse, Redis) to meet evolving product and scale requirements.
Valon is building the AI-native operating system for regulated finance, starting with mortgage servicing. They are a Series C company backed by a16z, transforming industries that others have written off as too complex to innovate.
Take technical ownership of our cloud infrastructure and DevOps practices.
Help us design resilient systems, scale infrastructure, mentor engineers, and collaborate across teams.
Own and improve CI/CD workflows, enabling fast and reliable deployments across teams.
Halcyon is the industry’s first dedicated, adaptive security platform that combines multiple proprietary advanced prevention engines along with AI models focused specifically on stopping ransomware. Formed in 2021 by a team of cyber industry veterans after battling ransomware for years, Halcyon is focused on solutions for mid-market and enterprise customers.
Manage and support infrastructure for Growth teams, including Nomad, Hashistack, databases, and any other underlying systems
Maintain and troubleshoot GitLab CI pipelines, ensuring reliable and fast build, test, and deployment cycles
Provide operational support across Onboarding, Acquire, and Engage teams, helping debug issues in staging and production environments
Kraken is a mission-focused company rooted in crypto values, aiming to accelerate the global adoption of crypto, so that everyone can achieve financial freedom and inclusion. As a fully remote company, they have Krakenites in 70+ countries who speak over 50 languages.
Help define and drive the technical direction of our Cloud Infrastructure team within Platform Engineering.
Work across Valon’s production systems—compute, databases, storage, and networking—shaping the infrastructure foundations that every product and team depends on.
Set the technical direction for how we meet those challenges.
Valon is building the AI-native operating system for regulated finance, starting with mortgage servicing. We're a Series C company backed by a16z, transforming industries that others have written off as too complex to innovate.
Build and maintain infrastructure-as-code for our AWS EKS and GCP GKE clusters, plus on-premises deployments.
Own CI/CD pipelines and drive GitOps adoption.
Deploy, scale, and optimize ML/NLP inference workloads.
Vectara is the Enterprise Agent Platform that enables businesses to build and deploy governed, grounded, auditable AI agents across SaaS, VPC, and on-prem. We’re a passionate team that’s hyper-focused on solving enterprise-level technology and business problems with AI.
Build platforms that scale: design and operate foundational infrastructure that handles billions of events and enables the company to grow with minimal friction.
Enable product velocity: create tooling that lets engineers ship faster and more reliably without becoming infrastructure experts themselves.
Drive technical direction: shape Metronome's infrastructure strategy, make platform-level architectural decisions, and mentor engineers across the organization.
Metronome is the leading usage-based billing platform built for modern software companies. They compute millions of invoices per billing period and are scaling rapidly to accommodate new customers, saving them hours of development time and manual invoicing. They've raised over $128M from leading investors including NEA, Andreessen Horowitz, General Catalyst, Elad Gil, and Workday Ventures.
Design and implement the complex distributed infrastructure that powers our core AI engine and distributed analysis systems.
Tune and optimize cloud services across compute, storage, networking, and observability to drive performance and reliability.
Develop our core services, written in TypeScript, Kotlin, and Go, to support our unique deployment and infrastructure requirements.
XBOW is building the future of offensive security. They create the platform that puts security ahead in the arms race, using AI to autonomously discover, validate, and exploit vulnerabilities. Founded by Oege de Moor, the company is backed by Sequoia, Altimeter, and other leading investors.
Lead, mentor, and foster a healthy, high-performing globally distributed engineering team.
Own the execution and delivery of highly critical, complex yearly roadmap items centered around large-scale foundational infrastructure upgrades, high availability, and platform resilience.
Own and drive the change management processes across engineering and product domains.
Alpaca is a US-headquartered self-clearing broker-dealer and brokerage infrastructure provider for stocks, ETFs, options, crypto, fixed income, 24/5 trading, and more. Their global team of 230+ members is a diverse group of experienced engineers, traders, and brokerage professionals fostering a vibrant community.
Collaborate with application engineering teams on platform infrastructure.
Enhance observability and spearhead the adoption of SRE best practices.
Build and maintain reliable CI/CD pipelines, tooling, and infrastructure.
Rula strives to provide quality, evidence-based, compassionate mental healthcare and aims to create a world where mental health is no longer stigmatized. They are a remote-first company operating in most U.S. states, and are dedicated to having a culture of inclusion that supports their employees.
Develop and maintain observability solutions using platforms like Datadog, Prometheus and Grafana
Take a leading role in incident management, including coordinating response efforts, troubleshooting issues, and identifying follow-up actions
Partner with product engineering teams to architect reliable systems, recover from incidents, and learn from mistakes
Ditto is redefining how data moves at the edge, aiming to make resilient, real-time applications seamless for developers, regardless of network conditions. It's a globally distributed and fast-growing startup with over $145 million in funding that is committed to building a diverse and inclusive team.
Design, implement, and operate cloud-native infrastructure across GCP, AWS, or Azure using Terraform.
Take full ownership of MongoDB Atlas in production, including cluster architecture, scaling, and security.
Build and maintain CI/CD pipelines with a strong focus on automation, reliability, and scalability.
Smart Working Solutions believes jobs should feel right every day and connects skilled professionals with global teams for full-time, long-term roles. They offer a supportive community and value growth in a remote-first environment, priding themselves on being a top-rated workplace.
Design and build reusable platform solutions empowering engineering and SRE teams across AWS, Azure, and GCP.
Spearhead the evolution of our Packer-driven VM image pipelines, establishing standardized, maintainable processes.
Lead application migrations into GCP while rapidly mastering our complex, multi-cloud infrastructure footprint.
TELUS Agriculture and Consumer Goods (TAC) is committed to disrupting the status quo with state-of-the-art applications that leverage data to reimagine the way we approach food. TAC is composed of inspired individuals united in passion and purpose, working collaboratively to bring extraordinary opportunities to life.