Build and optimize data pipelines and backend services to process device and behavioral data in real time.
Develop and deploy ML models for fraud detection, ensuring they run reliably and efficiently in production.
Turn raw data into production-ready features that feed our fraud detection systems.
Sardine is a leader in fraud prevention and AML compliance. Their platform uses device intelligence, behavior biometrics, machine learning, and AI to stop fraud before it happens. Over 300 banks, retailers, and fintechs worldwide use Sardine; they have a remote-first work culture, valuing performance over hours and hiring self-motivated individuals.
Design and build robust, highly scalable data pipelines and lakehouse infrastructure with PySpark, Databricks, and Airflow on AWS.
Improve the data platform development experience for Engineering, Data Science, and Product by creating intuitive abstractions, self‑service tooling, and clear documentation.
Own and maintain core data pipelines and models that power internal dashboards, ML models, and customer-facing products.
Parafin aims to grow small businesses by providing them with the financial tools they need through the platforms they already sell on. They are a Series C company backed by prominent venture capitalists, with a tight-knit team of innovators from companies like Stripe, Square, and Coinbase.
Design, develop, and deploy LLM- and RAG-powered applications that enhance analyst and hacker productivity across offensive security use cases.
Architect and maintain large-scale, high-performance data pipelines to process vulnerability, asset, and activity datasets from multiple sources.
Collaborate with security researchers and engineers to translate offensive security workflows into data-driven automation.
Bugcrowd empowers organizations to take back control and stay ahead of threat actors. With a network of hackers, Bugcrowd brings diverse expertise to uncover hidden weaknesses and adapts swiftly to evolving threats.
Develop and maintain SentiLink’s fraud detection models through the full model development lifespan.
Build foundational modeling to drive SentiLink’s expanding suite of Fraud and Financial Risk products.
Research new types of fraud and develop new SentiLink products around identity verification.
SentiLink provides identity and risk solutions, empowering institutions and individuals to transact with confidence. They are backed by world-class investors and have been named to the Forbes Fintech 50 list every year since 2023.
Own and evolve our data infrastructure, including pipelines into our data warehouse
Manage and improve cloud infrastructure and DevOps workflows
Ensure platform reliability so product and design teams aren’t pulled into backend or operational firefighting
Meridio is a remote-first company on a mission to make health benefits for small businesses simple, affordable, and accessible. As they scale smart, they’re focused on building systems that reduce complexity instead of adding it.
Design, develop, and deploy machine learning models to solve complex business problems in the AdTech domain
Analyze large datasets to generate actionable insights and improve product performance
Build and maintain scalable data pipelines using big data tools and frameworks
Sigma Software brings together talented professionals to deliver high-load, data-driven platforms for global clients. They value expertise, listen to their employees' voices, and empower career growth.
Design, build, maintain, and operate scalable streaming and batch data pipelines.
Work with AWS services, including Redshift, EMR, and ECS, to support data processing and analytics workloads.
Develop and maintain data workflows using Python and SQL.
Southworks helps companies with software development and digital transformation. They focus on solving complex problems and delivering innovative solutions.
Architect our AWS-based data warehouse and ingestion pipelines.
Transform high-volume simulation outputs into clean, trusted datasets.
Establish schema standards and data contracts with engineering.
Onebrief provides collaboration and AI-powered workflow software designed for military staffs, making them faster, smarter, and more efficient. The company, founded in 2019, values ownership and excellence, with a team spanning veterans and technologists; it has raised $320m+ from investors and is valued at $2.15B.
Architect, build, and operate data infrastructure that powers Tebra’s intelligent features.
Translate business requirements into software solutions that accelerate our ability to deploy AI.
Monitor data pipelines, detect anomalies, and implement automated recovery systems.
Tebra unites Kareo and PatientPop, providing a digital backbone for practice well-being, supporting both products with a shared vision for modernized care. Over 100,000 providers trust Tebra to elevate patient experience and grow their practice, building the future of well-being with compassion and humanity.
Define and evolve the technical vision for AI and agentic systems across products.
Design orchestration, data, and serving patterns that handle global scale with reliability.
Collaborate with AI Research to turn prototypes into extensible, governed production frameworks.
KnowBe4 is a cybersecurity company that puts security first, empowering over 70,000 organizations worldwide to strengthen their security culture. They value radical transparency, extreme ownership, and continuous professional development in a welcoming workplace that encourages all employees to be themselves.
Architect and maintain central storage and cloud environment.
Design and automate scalable ELT/ETL pipelines for data.
Support scientists and operational teams by designing data models.
Funga is a public benefit corporation using forest fungal networks to address climate change. They combine DNA sequencing and machine learning with forest microbiome research to improve wood creation, carbon sequestration, and forest resilience. They are a team of scientists and builders aiming to remove three gigatons of carbon dioxide from the atmosphere by 2050.
Design and implement robust, production-grade pipelines using Python, Spark SQL, and Airflow.
Lead efforts to canonicalize raw healthcare data into internal models.
Onboard new customers by integrating their raw data into internal pipelines and canonical models.
Machinify is a healthcare intelligence company delivering value, transparency, and efficiency to health plan clients. They serve over 85 health plans, including many of the top 20, representing more than 270 million lives, with an AI-powered platform and expertise.
Own SentiLink’s real-time ML model monitoring domain.
Own our ML experimentation, model tracking, and versioning infrastructure.
Drive improvements to the model development process.
SentiLink provides identity and risk solutions for secure transactions. They are backed by investors like Craft Ventures and Andreessen Horowitz, recognized by Forbes Fintech 50, and have offices across the U.S. and India.
Build and operate backend services and automation for the Snowflake data platform.
Support data ingestion pipelines (RDS/Oracle → Snowflake) and reverse ETL (Snowflake → RDS).
Develop and maintain Airflow (AWS MWAA) workflows for ingestion, data quality, and ops automation.
Upwork is the world’s work marketplace, serving everyone from one-person startups to over 30% of the Fortune 100. They provide a powerful, trust-driven platform that enables companies and talent to work together in new ways that unlock their potential. Last year, more than $3.8 billion of work was done through Upwork.
Data integrity assurance, monitoring and resolution.
Query Performance optimization.
Baubap's mission is to provide exceptional microloan services. They aim to be the most inclusive digital bank in LATAM, with a multicultural and highly driven team of professionals.
Partner with stakeholders to tackle technical problems at scale, building framework agnostic services.
Establish roadmap and architecture for Wealthsimple’s Machine Learning platform.
Build highly performant scalable systems, contributing to our ML platform on Kubernetes, Bedrock and Sagemaker.
Wealthsimple aims to provide financial freedom by making financial services transparent and low-cost. As the largest fintech company in Canada, with over 1,500 employees, they manage over $100 billion in assets and foster a collaborative and quality-focused culture.
Design, develop, and maintain a core Python ETL framework.
Develop and optimize an automated refresh pipeline orchestrated through AWS Batch, Lambda, Step Functions, and EventBridge.
Build Python integrations with external systems that are robust, testable, and reusable.
BlastPoint is a B2B data analytics startup that helps companies engage with customers more effectively by discovering insights in their data. Founded in 2016 by Carnegie Mellon Alumni, they are a tight-knit, forward-thinking team that serves diverse industries including energy, finance, retail, and transportation.
Designing, deploying, and optimizing data-driven machine learning solutions on AWS.
Creating secure and scalable ML systems, enabling effective data management and model deployment.
Leading the enhancement of best practices within the data and ML lifecycle, making a substantial impact across projects and teams.
Jobgether uses an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company.
Build and optimize scalable, efficient ETL and data lake processes.
Own the ingestion, modeling, and transformation of structured and unstructured data.
Maintain and enhance database monitoring, anomaly detection, and quality assurance workflows.
Launch Potato is a digital media company that connects consumers with brands through data-driven content and technology. They have a remote-first team spanning over 15 countries and have built a high-growth, high-performance culture.
Build and scale distributed data pipelines for large-scale time series, log data, and high-volume event streams.
Design and maintain reliable, high-performance Spark and Python workflows to support model training datasets.
Analyze and resolve performance bottlenecks related to latency, memory utilization, data skew, and throughput.
ItD blends diversity, innovation, and integrity with real business results as a consulting and software development company. Their structure rejects any strong hierarchy, empowering them to deliver excellent results as a woman- and minority-led firm.