Architect and implement scalable ETL and data pipelines for real-time risk management and advanced analytics.
Design, develop, and optimize distributed data storage solutions for high performance and reliability at scale.
Drive schema evolution, data modeling, and pipeline orchestration with ownership of end-to-end data flow.
Oscilar builds the most advanced AI Risk Decisioning™ Platform for banks, fintechs, and digitally native organizations to manage fraud, credit, and compliance risk. The company is mission-driven with a remote-first culture and team members from Meta, Uber, Citi, and Confluent.
Design and implement data-driven solutions on GCP including BigQuery, Cloud Storage, Dataflow, Pub/Sub, and Looker/BI.
Build and optimize ETL pipelines using SQL and Python to extract, clean, and transform structured and unstructured data from ERP, procurement, logistics, and facility management systems.
Ensure data governance, lineage, and compliance across supply chain datasets while continuously optimizing query performance and pipeline reliability.
Innodata is a global data engineering company that enables the responsible advancement of artificial intelligence by providing data, evaluation frameworks, and human expertise. With over 36 years of legacy, Innodata delivers high-quality data and outstanding outcomes for generative AI builders and adopters.
Lead and manage a global data engineering team building large-scale data pipelines and production datasets for the Public Investor business.
Collaborate with product, research, and operations teams to translate roadmap priorities into scalable technical plans and customer-facing data feeds.
Drive operational excellence through data quality frameworks, observability, and AI-assisted development practices.
YipitData is the leading market research and analytics firm for the disruptive economy, providing actionable insights from alternative data. With over $475M raised and offices globally, it has a people-centric culture recognized as a Best Workplace for three consecutive years.
Design, build, and maintain scalable data lake solutions and processing pipelines handling large volumes of data.
Develop distributed data processing applications using Apache Spark on Databricks and build real-time streaming pipelines with Apache Kafka.
Apply software engineering best practices to data pipelines including CI/CD, automated testing, and peer code review.
InPost is an e-commerce parcel delivery company that operates a network of Automated Parcel Machines (APMs) and pick-up points across nine European countries. Founded in 1999, the company employs thousands and fosters a diverse, international, and cross-functional culture with opportunities for growth and training.
Play a crucial role in helping client organizations transform raw data into reliable, well-modeled assets that drive business decisions.
Design, build, and maintain scalable data pipelines and ELT workflows, with Databricks as the primary platform.
Collaborate with data engineers, analysts, and clients on end-to-end data requirements and project delivery.
Velir is an established mid-sized agency with a top-tier portfolio of clients, ranging from the world’s largest non-profits to Fortune 500 brands. Our culture is built on a foundation of trust, collaboration, and continued improvement, and we are a remote first company that offers competitive pay and excellent benefits.
Design and maintain scalable data architectures, including OLTP systems and data pipelines.
Optimize database performance through indexing, query tuning, and data quality remediation.
Implement production processes including monitoring, alerting, backup/recovery, and incident response.
Mitek Systems provides identity verification solutions for financial services and other industries. The company is a global leader in mobile capture and identity verification software, employing a dedicated team focused on innovation and data-driven decision-making.
Design and implement scalable, high-performance data pipelines to ingest and transform data from a variety of sources.
Build and maintain APIs that enable flexible, secure, and tenant-aware data integrations with external systems.
Implement observability, monitoring, and alerting to track data freshness, failures, and performance issues.
Northbeam is building the world's most advanced marketing intelligence platform for top eCommerce brands, providing powerful attribution modeling and customizable dashboards. The company is experiencing rapid growth with a strong product-market fit and a remote-friendly culture.
Design, build, and maintain ETL pipelines moving data between application databases, cloud warehouses, third-party APIs, and object stores.
Partner with product managers, research scientists, and engineers to translate ML requirements into scalable data solutions.
Investigate and resolve data integrity issues including missing data, incorrect mappings, duplicates, and schema mismatches.
Welo Global is a leader in multilingual AI, technology, and content solutions serving over 2,000 clients in 300 languages. The company has a network of over 500,000 linguists and domain experts with seven ISO certifications.
Design and deliver scalable, low-latency streaming data solutions for real-time customer analytics.
Analyze business needs, optimize data models, and write clean code using Scala, Python, and SQL.
Mentor team members and optimize performance of data platforms like AWS Kinesis, Kafka, and Redshift.
Aircall is an AI-powered customer communications platform used by 22,000+ companies worldwide, unifying voice, SMS, WhatsApp, and AI. The company is a unicorn backed by world-class investors, with 45+ nationalities and a strong, collaborative culture.
Design and implement modern data platforms and scalable data pipelines to enable better data-driven decisions.
Develop and maintain ETL/ELT pipelines using SQL, Spark/PySpark, and Microsoft Fabric or Databricks.
Work closely with data architects, BI developers, and customer stakeholders in an Agile environment.
Tieto, through MentorMate, creates durable technical solutions that deliver digital transformation at scale by blending strategic insights and thoughtful design with brilliant engineering. The company provides its people with the opportunity to work on impactful, global projects for recognizable brands.
Build and operate production-grade ingestion pipelines from clinical, operational, and third-party systems into a Databricks lakehouse.
Develop and maintain dbt models to transform raw data into clean, documented, analytics-ready datasets.
Establish data quality, testing, and monitoring practices to ensure pipeline reliability and performance.
Zócalo Health is a tech-enabled, community-oriented primary care organization serving underserved populations with culturally competent care. Founded in 2021, the company is backed by leading healthcare investors and is scaling rapidly with a focus on value-based care.
Work with large data sets and implement sophisticated data pipelines with both structured and semi-structured data.
Collaborate with stakeholders to design scalable solutions and manage internal data pipelines.
Define data governance policies and leverage AI tools to streamline data pipeline development.
For over four decades, PAR Technology Corporation has been a leader in restaurant technology, empowering brands worldwide to create lasting connections with their guests. With over 100,000 restaurants in more than 110 countries, we embrace a 'Better Together' ethos and offer comprehensive software and hardware solutions.
Build, maintain, and scale data pipelines integrating internal and external data into the warehouse.
Partner with internal stakeholders and engineering teams to understand analysis needs and improve data logging.
Participate in architectural decisions and evangelize data engineering best practices.
OXIO is the world’s first telecom-as-a-service platform, democratizing telecom for brands and enterprises to own proprietary mobile networks. The company is a rapidly growing startup with a diverse and inclusive team.
Architect and maintain cloud-native data platforms (AWS, Snowflake, Databricks) supporting batch and streaming use cases.
Design and automate ETL/ELT workflows, optimize data models, and enable self-serve analytics and AI.
Manage end-to-end data lifecycles including ingestion, storage, processing, and delivery of structured and unstructured data.
Trustonic makes smartphones affordable for the many, enabling global access to devices and digital finance through secure smartphone locking technology. They partner with mobile carriers, retailers, and financiers across 30+ countries, and pride themselves on a diverse, inclusive culture that values doing the right thing for each other, the community, and the planet.
Design and maintain production-grade ETL/ELT pipelines in a multi-hundred terabyte Snowflake environment.
Translate client loyalty program requirements into dimensional models and platform tables.
Build reliable, event-driven data architecture to support AI-powered loyalty products.
Kobie is a leader in loyalty solutions, helping brands build lasting emotional connections with consumers. Named a Top Workplace in the USA, the company fosters a collaborative, growth-focused culture with a diverse suite of benefits and flexible work arrangements.
Lead the design and evolution of the data platform architecture, establishing patterns and standards the team builds on.
Build and operate production-grade data pipelines that ingest and transform high-variance, real-world clinical data reliably and at scale.
Contribute to quarterly data product releases, working closely with product, clinical, and customer success teams to meet commitments.
Verantos is the market leader in high-accuracy real-world evidence (RWE) generation. The Verantos RWE platform integrates heterogeneous real-world data sources and generates evidence with the accuracy necessary for regulatory and reimbursement use, serving some of the largest biopharma companies globally.
Partner with the Director of Data to translate platform vision into engineering execution.
Lead the design and implementation of Kimball-style dimensional models and star schema architectures.
Design, build, and maintain production-grade data pipelines with a focus on reliability and scalability.
BlueLabs is an analytics services and technology company that helps organizations use data for social and commercial impact. With over 400 clients and a diverse team of analysts, scientists, engineers, and strategists, they foster a supportive, collaborative, and mission-driven culture.
Architect Spark-driven workflows at scale and design data platforms as products for internal teams.
Develop and maintain end-to-end data pipelines and backend ingestion workflows across multiple sources.
Champion Samsara's cultural principles and mentor junior team members to drive data-driven decisions.
Samsara is the pioneer of the Connected Operations Cloud, enabling organizations to harness IoT data for actionable insights to improve safety, efficiency, and sustainability. As a recently public company, it fosters a culture of rapid career development, ownership, and high performance.
Design and build scalable data pipelines and architectures using Databricks, Azure Data Factory, and ADLS to support analytics and AI use cases.
Integrate structured and unstructured data from multiple enterprise sources into robust cloud data platforms for financial domains like credit analysis and document intelligence.
Apply DevOps practices and collaborate with stakeholders to modernize legacy reporting systems and enable real-time data-powered decision-making.
This role is listed on behalf of a partner company that focuses on data-driven transformation initiatives, designing scalable data pipelines for advanced analytics and AI use cases. They offer a collaborative technical environment and invest in continuous learning and cutting-edge technologies.
Design, build, and optimize large-scale data and analytics platforms on the Databricks Lakehouse.
Architect and maintain scalable ETL/ELT pipelines using PySpark, Spark SQL, and Delta Lake.
Implement medallion data architectures, enforce data quality, and manage Unity Catalog for governance.
Bounteous is a premier end-to-end digital transformation consultancy that partners with ambitious brands to create digital solutions. With over 4,000 expert team members across the Americas, APAC, and EMEA, they deliver innovative strategies and technical expertise.