Partner with engineering leadership, EMs, and Product Managers to define and deliver AI products.
Architect scalable, high-performance systems that support a growing number of AI-powered products.
Drive technical strategy and make architectural decisions that compound, enabling the team to ship more AI experiences faster.
Webflow is building the world’s leading AI-native Digital Experience Platform as a remote-first company built on trust, transparency, and a whole lot of creativity. They empower teams to design, launch, and optimize for the web without barriers, from entrepreneurs launching their first idea to global enterprises scaling their digital presence.
Contribute to building and operating the infrastructure that supports the HackerOne platform.
Improve the reliability, security, and scalability of our systems.
Design and operate highly available cloud systems and apply best practices for reliability, observability, and security.
HackerOne is a global leader in Continuous Threat Exposure Management (CTEM). The HackerOne Platform unites agentic AI solutions with the ingenuity of the world’s largest community of security researchers to continuously discover, validate, prioritize, and remediate exposures across code, cloud, and AI systems. The platform is trusted by the world’s top organizations.
Collaborate with service engineering teams to design, implement, and maintain scalable and resilient infrastructure solutions.
Implement SRE principles to improve system reliability and reduce downtime.
Improve developer workflows by creating self-service tools, optimizing CI/CD pipelines, and enhancing deployment processes.
Flex is a growth-stage FinTech company creating the best rent payment experience. They empower renters with flexibility over their most significant recurring expense and are growing quickly with a focus on building an inclusive culture.
Own and drive the architectural direction for critical infrastructure platforms that support GitLab at global scale.
GitLab is the intelligent orchestration platform for DevSecOps. They enable organizations to increase developer productivity, improve operational efficiency, reduce security and compliance risk, and accelerate digital transformation. GitLab has a high-performance culture driven by their values.
Develop automation code to provision and operate infrastructure at scale.
Build resilient, scalable, secure, observable, and cost-optimized services.
Proactively identify and address security concerns across systems and infrastructure.
Globality uses AI to transform enterprise spending into a more efficient and inclusive process. They aim to revolutionize enterprise procurement with AI and have a culture built on trust, collaboration, and innovation, fostering an environment where every individual feels valued and included.
Build and scale infrastructure to support billions of messages per day and real-time events
Automate deployments, alerting, and incident response
Tune MySQL and other datastore performance and improve reliability across distributed systems
Customer.io's platform enables over 8,000 companies, from scrappy startups to global brands, to send billions of automated emails, push notifications, in-app messages, and SMS every day. They foster a culture that values empathy, transparency, and responsibility.
Build and lead the team responsible for the reliability, security, and scalability of Gensyn’s production infrastructure and developer platform.
Own the availability, scalability, and security posture of production systems: SLOs/SLIs, incident response, postmortems, reliability improvements, and hardening.
Drive delivery across ambiguous, high-stakes initiatives: roadmap planning, prioritization, and execution against tight timelines.
Gensyn is building a protocol that networks together the core resources required for machine intelligence to flourish alongside human intelligence. They value autonomy, independence, direct feedback and an extreme learning rate, and strive to reject mediocrity and waste.
Lead infrastructure initiatives across the engineering organization.
Set the technical quality bar and architectural standards.
Build platforms and AI-enabled systems for multiple teams.
Fieldguide is automating and streamlining the work of assurance and audit practitioners specifically within cybersecurity, privacy, and financial audit, building software for the people who enable trust between businesses. They are based in San Francisco, CA, but built as a remote-first company with an inclusive, driven, humble and supportive team.
Maximize the velocity of our product engineering team.
Ensure platform scalability, reliability, and security.
Champion best practices and shape the engineering culture.
They are building a robust, scalable trading platform to serve high-traffic, latency-sensitive applications. They leverage state-of-the-art technologies to support real-time trading while providing unparalleled reliability and performance.
Perform infrastructure security reviews across cloud services, network design, IAM, and platform components.
Design and build internal security services, APIs, and tools that automate infrastructure vulnerability detection, triage, reporting, and remediation.
Develop security automation that integrates with CI/CD, cloud control planes, and developer workflows to shift detection and remediation earlier in the lifecycle.
Webflow is building the world’s leading AI-native Digital Experience Platform as a remote-first company. They empower teams to design, launch, and optimize for the web without barriers, from entrepreneurs to global enterprises, and believe the future of the web, and work, is more open, more creative, and more equitable.
Build and lead the Platform Architecture organization.
Own production readiness as a company capability.
Drive operational excellence and business outcomes.
Temporal is an open source programming model that simplifies code, makes applications reliable, and helps developers focus on delivering features faster. They aim to be the reliable foundation of every developer’s toolbox, with a team that embraces curiosity, drive, collaboration, and humility.
Design and implement a multi-region AWS architecture.
Architect self-healing infrastructure using advanced cloud load balancing and auto-scaling patterns.
Modernize CI/CD pipelines and implement Blue/Green and Canary deployment strategies.
Zscaler is a pioneer and global leader in zero trust security. They secure users, branches, applications, data & devices, and accelerate digital transformation initiatives. Zscaler is distributed across more than 160 data centers globally. They believe in transparency and value constructive, honest debate, and they champion an “AI Forward, People First” philosophy to help them accelerate and innovate, empowering their people to embrace their potential.
Design, deploy and maintain cloud infrastructure to support a Dataiku SaaS offering, mainly on AWS, Azure, and GCP
Continuously improve the infrastructure, deployment and configuration to deliver more reliable, resilient, scalable and secure services
Automate all technical operations as much as possible
Dataiku is The Universal AI Platform™, giving organizations control over their AI talent, processes, and technologies to unleash the creation of analytics, models, and agents. They connect many data science technologies and integrate the best of data and AI tech.
Build and operate cutting-edge cloud infrastructure to support Diagrid's core products
Define standards, deliver tools, processes, and frameworks to make our products secure, reliable, efficient, and highly available
Build and maintain CI/CD pipelines that enable delivering software quickly and securely across clouds
Diagrid believes that open-source software, open standards and APIs are the greatest transformational tools for organizations. They provide developers with APIs and tools that help them focus on their code rather than on infrastructure. The company was founded by the creators of the Dapr and KEDA open-source projects.
Develop and maintain resilient, cost-efficient infrastructure using AWS and other cloud services to meet evolving business needs.
Use IaC solutions to enable automated provisioning and ensure consistency across all environments.
Design, develop, and maintain advanced pipelines, ensuring automated testing integration and deployment efficiency at scale.
Pagefreezer's vision is to make the Internet a safer place by delivering solutions that transform how people protect integrity online, ensure accountability, and enable the pursuit of justice. They simplify compliance and litigation by automatically archiving websites, social media, mobile text messages, and enterprise collaboration platforms. They have been named Canada’s Most Admired Culture for 2023, 2024, and 2025, one of BC’s Top Employers for 2024, and one of Canada’s Top Small & Medium Employers for 2024.
Manage and mentor Systems Engineers across multiple product initiatives.
Provide clear performance expectations, coaching, and career development.
Foster accountability, ownership, and technical rigor while building a resilient, high-trust engineering culture.
Dragos is dedicated to arming customers with technology, threat intelligence, and services to protect their systems. They have a remote-first culture with operations in North America, Europe, the Middle East, and APAC, and are looking for mission-oriented teammates.
Own the end-to-end lifecycle (design, provisioning, upgrades, and decommissioning) of core platform components.
Lead the design and implementation of infrastructure bootstrap orchestration, including automated cluster and environment provisioning.
Apply and promote SRE practices across the platform, including clear ownership and runbooks for platform components.
Pismo provides a comprehensive processing platform for banking, card issuing and financial market infrastructure and helps customers innovate and build the next generation of banking and payment solutions. Pismo’s 500+ employees are located in more than 10 countries around the world.
Advise and architect scalable infrastructure solutions for high-growth crypto and FinTech applications.
Design and implement robust data pipelines and distributed systems.
Build and optimize cloud infrastructure with comprehensive observability solutions.
Lazer is a digital product studio with over 180 senior engineers and designers. They partner with clients ranging from early-stage startups to recognizable retail brands, helping them design, engineer, and grow their products.
Designing and implementing SLI/SLO frameworks with error budgets to guide reliability and performance decisions.
Building and maintaining AWS-based production infrastructure using Infrastructure as Code (Terraform, CloudFormation), including ECS, EKS/Kubernetes, and microservices orchestration.
Developing internal tools, automation frameworks, and reliability services in TypeScript, Python, or similar languages to enhance operational efficiency.
Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. They identify the top-fitting candidates, and this shortlist is then shared directly with the hiring company.