Collaborate with exceptional engineers on building systems and services for the world's largest companies.
Lead architecture for distributed services at scale that synchronize shared state across clients.
Drive cross-team technical alignment via design docs and decision records; unblock execution across org boundaries.
Webflow is building the world’s leading AI-native Digital Experience Platform. They are a remote-first company built on trust, transparency, and creativity, empowering teams to design, launch, and optimize for the web without barriers.
Define and plan the long-term strategy for the Data Platform.
Design and develop scalable distributed systems for data management.
Improve and add features to the ETL framework while maintaining SLAs.
Jobgether is a platform that connects job seekers with companies using an AI-powered matching process. It's a platform that ensures applications are reviewed quickly, objectively, and fairly against the role's core requirements.
Design and implement distributed scheduling and workflow systems.
Build scalable, reliable platform services and storage abstractions.
Improve system reliability, observability, and operational performance.
Voleon is a technology company that applies state-of-the-art AI and machine learning techniques to real-world problems in finance. They have become a multibillion-dollar asset manager, and they have ambitious goals for the future.
Architect and optimize distributed training and inference systems for large-scale AI models
Design and deliver customer-focused solutions that maximize performance and business value
Lead the transition of ML pipelines from POC to scalable production systems
The company offers an AI-centric cloud platform reshaping the landscape of artificial intelligence. They provide infrastructure, tools, and services for developers to service the explosive growth of the global AI industry, catering to Fortune 1000 companies, startups, and AI researchers.
Build distributed systems that support reliability, resiliency, and safe operation at scale.
Design and operate traffic control mechanisms: circuit breakers, rate limiting, admission control, backpressure, and graceful degradation.
Develop tooling that improves incident detection, response, and automated mitigation.
Whatnot is the largest live shopping platform in North America and Europe to buy, sell, and discover the things you love. They are a remote co-located team, inspired by innovation and anchored in their values.
Build and scale distributed data pipelines for large-scale time series, log data, and high-volume event streams.
Design and maintain reliable, high-performance Spark and Python workflows to support model training datasets.
Analyze and resolve performance bottlenecks related to latency, memory utilization, data skew, and throughput.
ItD blends diversity, innovation, and integrity with real business results as a consulting and software development company. Their structure rejects any strong hierarchy, empowering them to deliver excellent results as a woman- and minority-led firm.
Define long-term architectural strategy for multi-cloud compute and traffic platforms.
Provide mentorship to engineers through design reviews and code contributions.
Partner with Security to build ‘secure by default’ systems.
Temporal Technologies develops an open-source programming model that simplifies code and enhances application reliability. With a focus on developer experience and open-source software, they foster a culture of curiosity, collaboration, and genuine impact.
Work directly with CV researchers to understand their goals, review their code, and engineer it for reliability and performance at scale.
Profile and optimize performance-sensitive code across both training and real-time inference.
Identify patterns across research efforts and propose standardized, composable abstractions.
GameChanger believes in the life changing impact youth sports have on and off the field. By building the first and best place to experience the youth sports moments important to their community, they are helping families elevate the next generation through youth sports. They are a remote first, dynamic tech company based in New York City, and they are solving some of the biggest challenges in youth sports today.
Read, understand, and write code and unit tests (primarily in Java )
Investigate, diagnose, and implement improvements for performance bottlenecks and cost inefficiencies
Implement, test, and deploy architecture and library changes which enable new insights and understanding, including cost modeling/reporting and data patterns
Airship helps brands drive revenue growth and customer loyalty with exceptional cross-channel customer experiences. Airship's platform empowers growth-focused teams to create, test, and orchestrate hyper-personalized experiences across all channels.
Take an active role in influencing our roadmap and your own career objectives
Work with your team to deliver new features, then use the results to iterate and improve.
Drive projects from initial idea all the way to operations once it is in the hands of customers
Grafana Labs is a remote-first, open-source powerhouse with over 20M Grafana users globally. With a global collaborative culture, Grafana Labs fosters transparency, autonomy, and trust in an innovation-driven environment.
Own Technical Excellence: Define and drive the architecture, design patterns, and engineering standards for the feature store platform.
V2 Implementation: Assist and execute the next generation of our feature store—building for scale, low-latency serving, and enterprise-grade reliability.
Guide Product Roadmap: Partner with Product and leadership to help shape the technical roadmap.
Redis created the product that runs the fast apps our world runs on. At Redis, people work with technology and build it, tell its story, and sell it to 10,000+ worldwide customers creating a faster world with simpler experiences.
Architect end-to-end software solutions using modern frameworks aligned with scalability.
Lead complex, cross-functional projects from concept to delivery, aligning engineering solutions with business needs.
Build and maintain distributed systems using Spring Boot microservices, Docker, and Kubernetes.
SupplyHouse.com is an industry-leading e-commerce company specializing in HVAC, plumbing, heating, and electrical supplies since 2004. They value every individual team member and cultivate a community where people come first.
Own the reliability, performance, and operational health of production AI services
Refactor and harden existing systems to improve resilience, clarity, and maintainability
Diagnose and resolve issues across distributed services, data pipelines, and storage layers
MixMode is a leading provider of AI-powered cybersecurity solutions at scale, pioneering a patented third-wave, context-aware AI approach. They are backed by PSG and Entrada Ventures and headquartered in Santa Barbara, California.
Design and build the infrastructure layer powering AI agent systems in production
Develop high-performance Rust services that handle model inference, orchestration, and execution
Architect scalable systems capable of supporting millions of users and high request throughput
Kraken is a mission-focused company rooted in crypto values, aiming to accelerate the global adoption of crypto so that everyone can achieve financial freedom and inclusion. As a fully remote company, Kraken has employees in 70+ countries and is committed to industry-leading security, crypto education, and client support.
Create robust pipelines to process massive daily volumes of data.
Build and support scalable pipelines as part of Torc’s Data Factory.
Scale Torc’s data lake through a distributed storage system.
Torc has been a leader in autonomous driving since 2007 and is now part of the Daimler family, focused on developing software for automated trucks. Their culture is collaborative, energetic, and team-focused, offering flexibility and valuing work/life balance.
Ontrac Solutions partners with elite engineering organizations to build systems that operate at planetary scale. We operate at the intersection of control, clarity, velocity, and institutional trust.
Collaborate with engineering teams to design and implement scalable, secure systems.
Establish and manage service level objectives (SLOs) and service level agreements (SLAs).
Enhance incident response processes and post-mortem analysis for outages.
ClickHouse, recognized on the 2025 Forbes Cloud 100 list, is one of the most innovative and fast-growing private cloud companies. With more than 3,000 customers and ARR that has grown over 250 percent year over year, ClickHouse leads the market in real-time analytics, data warehousing, observability, and AI workloads.
Own reliability outcomes for operating Temporal Cloud end to end.
Define, implement, and evolve reliability targets and associated practices.
Lead load testing and performance testing efforts, including test design, tooling, and analysis of bottlenecks.
Temporal is an open source programming model that simplifies code and makes applications more reliable. They are a growing company looking for those who share their values, challenge 'standard' thinking, and want to influence their future.
Build and Lead the Platform Architecture Organization.
Own Production Readiness as a Company Capability.
Drive Operational Excellence and Business Outcomes.
Temporal is an open source programming model that simplifies code, makes applications reliable, and helps developers focus on delivering features faster. They aim to be the reliable foundation of every developer’s toolbox, with a team that embraces curiosity, drive, collaboration, and humility.