Design and implement infrastructure and tools that empower our product teams to rapidly and securely iterate, emphasizing reliability and automation.
Influence the strategic direction of our infrastructure and operational practices, ensuring that we are well-positioned to scale and support our growing organization.
Take a proactive role in the resolution of production issues, ensuring that we are well-prepared to handle incidents and that we learn from them in a blameless manner.
Support and operate Legion’s AWS-based cloud platform and Kubernetes (EKS) environments.
Build and maintain infrastructure-as-code using Terraform.
Improve CI/CD pipelines to increase deployment safety and velocity.
Legion Technologies delivers the industry’s most innovative workforce management platform. The AI-driven Legion WFM platform maximizes labor efficiency and employee engagement. They are a remote, mission-driven team that embraces a collaborative, fast-paced, and entrepreneurial culture.
You will plan and execute infrastructure deployments, using automation to ensure a stable platform.
You will manage operations, troubleshoot, and optimize workflows to maintain high availability.
You will own backend features supporting our platforms and interface with users for feedback.
Trust Wallet is the leading non-custodial cryptocurrency wallet, trusted by over 200 million people worldwide to securely manage and grow their digital assets. They aim to give individuals the opportunity to own their assets and participate in the future economy.
Design, implement, and operate cloud-native infrastructure for production workloads.
PointClickCare's mission is to help providers deliver exceptional care. They are a leading health tech company, founder-led and privately held, that empowers their employees to push boundaries, innovate, and shape the future of healthcare. With the largest long-term and post-acute care dataset and a Marketplace of 400+ integrated partners, their platform serves over 30,000 provider organizations.
Work with other Engineering teams to design sustainable infrastructure and microservice solutions.
Automate tools and infrastructure to reduce manual work.
Monitor applications and participate in an on-call rotation as required.
Bloomreach is building the world’s premier agentic platform for personalization, revolutionizing how businesses connect with their customers by building and deploying AI agents to personalize the entire customer journey. They power personalization for more than 1,400 global brands.
Own SLI/SLO/SLA definitions for the Akuity SaaS platform and drive continuous improvement.
Participate in an on-call rotation and act as incident commander for high-severity production events.
Partner with engineering teams to build reliability into new features before they ship to production.
Akuity helps enterprises ship software faster and more reliably with modern GitOps best practices. The Akuity Platform enables teams to manage the development and deployment across hundreds – if not thousands – of Kubernetes clusters from a single control plane.
Deploy, configure, and manage blockchain networks (e.g., Bitcoin, Ethereum, Solana).
Design and implement cloud infrastructure on AWS in line with best practices.
Administer and scale Kubernetes clusters (EKS) for deploying blockchain nodes and related services.
Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. Trusted by 300+ million people in 100+ countries, they offer trading, finance, education, research, payments, institutional services, Web3 features, and more.
Design, build, and optimize cloud platform capabilities.
Tackle complex infrastructure challenges and raise engineering quality.
Apply AI and AIOps to make the platform smarter and more resilient.
PerfectServe offers Best in KLAS clinical communication and physician scheduling solutions and is a Leader in the Gartner Magic Quadrant for Clinical Communication and Collaboration. They focus on optimizing provider schedules and dynamically routing messages to advance patient care and clinical workflows, valuing growth, transparency, and innovation.
Define and evolve reliability standards for the SmarterDx platform.
Enhance observability systems (metrics, logs, traces, alerting) to provide actionable insights and reduce mean time to detect (MTTD) and mean time to resolve (MTTR).
Reduce operational toil through automation, self-healing systems, and improved deployment and rollback mechanisms.
SmarterDx, a Smarter Technologies company, builds clinical AI that is transforming how hospitals translate care into payment. Founded by physicians in 2020, their platform connects clinical context with revenue intelligence, helping health systems recover millions in missed revenue, improve quality scores, and appeal every denial.
Propel builds technology that strengthens the social safety net. They are a passionate team of ~100 Propellers who envision a future where every American has the tools and resources they need to thrive, offering a remote-first working environment with headquarters in Brooklyn.
Collaborate with engineering teams to design and implement scalable, secure systems.
Establish and manage service level objectives (SLOs) and service level agreements (SLAs).
Enhance incident response processes and post-mortem analysis for outages.
ClickHouse, recognized on the 2025 Forbes Cloud 100 list, is one of the most innovative and fast-growing private cloud companies. With more than 3,000 customers and ARR that has grown over 250 percent year over year, ClickHouse leads the market in real-time analytics, data warehousing, observability, and AI workloads.
Build and deploy computing services and infrastructure in customer environments.
Clarify and surface requirements from ambiguous use cases defined by cross-functional stakeholders.
Improve reliability and scalability by resolving edge cases, studying failure modes, and writing tests.
Planet designs, builds, and operates the largest constellation of imaging satellites in history. They deliver an unprecedented dataset of empirical information via a revolutionary cloud-based platform to authoritative figures in commercial, environmental, and humanitarian sectors. Planet takes a people-centric approach to culture and community, striving to iterate in a way that puts its team members first and prepares the company for growth.
Lead the Infrastructure Engineering team, taking full ownership of cloud infrastructure, Kubernetes platforms, DevOps tooling, and CI/CD pipelines.
Drive reliability, scalability, and security across the production environment while maintaining a sharp focus on developer velocity and business impact.
Mentor and guide engineers across SRE, DevOps, and Database Reliability functions, fostering a culture of operational excellence and pragmatic problem-solving.
Finom is a European tech startup headquartered in Amsterdam, revolutionizing financial services for entrepreneurs with an all-in-one B2B platform. They have raised $346 million, are expanding across key EU markets, and foster innovation, prioritizing research and solutions that benefit users, employees, partners, and the business.
Design, build, and maintain Kubernetes-based infrastructure and cloud environments.
Build and optimize CI/CD pipelines that enable fast, safe, and repeatable deployments.
Leverage AI coding tools and agentic workflows as a core part of your work.
Intrahealth, a subsidiary of HEALWELL AI Inc., is an enterprise class EMR provider supporting approximately 20,000 providers and the care delivery of tens of millions of patients and clients across Canada, Australia and New Zealand. Intrahealth provides a suite of flexible software solutions to a wide variety of customers including health authorities, public health, community health, home care, and primary care professionals.
Designing, building, and operating Kubernetes infrastructure across multiple cloud providers.
Building and maintaining automation for cluster lifecycle management, node provisioning, and provider onboarding.
Developing platform tooling and abstractions that enable other Canva engineers to deploy and scale workloads.
Canva is a design platform redefining how the world experiences design. They have campuses in Sydney and Melbourne, along with co-working spaces in Brisbane, Perth and Adelaide, offering a flexible and inclusive work environment.
Design infrastructure, networking, and software platform architecture.
Build and maintain automation of Continuous Integration and Continuous Deployment pipelines.
Troubleshoot infrastructure, internal applications, networking, and security issues.
Loadsmart is a technology company focused on the logistics and supply chain industry. They leverage data and technology to automate and optimize freight transportation, connecting shippers and carriers to streamline the shipping process. They are a mid-sized company passionate about transforming the future of freight.
Building tools and applications to extend Calendly’s infrastructure platform
Evaluating and deploying cloud native open source tools
Exercising expertise in cloud infrastructure concepts and patterns
Calendly's product powers connections for millions through impactful innovation. They are in the midst of exciting growth and are looking for people who want to learn, grow, and do their best work.
Lead the push toward a modern, cloud-native organization by designing and managing scalable, resilient systems on AWS.
Own the Infrastructure as Code (IaC) strategy using Terraform, ensuring environments are repeatable, versioned, and stable.
Build and optimize high-velocity deployment pipelines using GitHub Actions, ArgoCD, and Helm to get code from "commit" to "production" seamlessly.
TrueML is undergoing a major platform rearchitecture, moving toward a fully cloud-native, modernized infrastructure. They seem to be a medium-sized company with a focus on innovation and providing engineers with the tools and data they need to make smart, impactful choices.
Design, develop, and maintain core cloud platform services including compute orchestration, resource management, multi-tenancy, and API gateway components using Go, Java, Python, or Rust.
Build and optimize RESTful/gRPC APIs and microservices that support cloud resource provisioning, lifecycle management, and monitoring on Bitdeer AI Cloud.
Develop scalable, fault-tolerant distributed systems that handle high-throughput workloads across multi-region deployments.
Bitdeer is a world-leading technology company for Bitcoin mining and AI cloud. They are committed to providing comprehensive Bitcoin mining solutions for their customers, designing industry-leading ASIC chips, and manufacturing mining rigs, with operations globally and a diversified 3 GW energy portfolio.
Working with engineers across Yelp in supporting new features and services.
Integrating tools to monitor platform stability and performance.
Helping scale our Kubernetes clusters and AWS-based infrastructure while maintaining our platform's SLOs.
Yelp's engineering culture values individual authenticity and encourages creative solutions. They focus on helping users, growing as engineers, and having fun in a collaborative environment.
Design and implement the complex distributed infrastructure that powers our core AI engine and distributed analysis systems.
Tune and optimize cloud services across compute, storage, networking, and observability to drive performance and reliability.
Develop our core services, written in TypeScript, Kotlin, and Go, to support our unique deployment and infrastructure requirements.
XBOW is building the future of offensive security. They create the platform that puts security ahead in the arms race, using AI to autonomously discover, validate, and exploit vulnerabilities. Founded by Oege de Moor, the company is backed by Sequoia, Altimeter, and other leading investors.