Develop long-term technical vision and design scalable data systems.
Build and maintain production data pipelines using Python and integrate external APIs.
Mentor engineers and uphold standards for engineering excellence.
Correlation One is the largest provider of AI and data workforce development programs globally, having trained over 500,000 professionals across 11 countries. They work with Fortune 500 enterprises and government agencies to close skills gaps, and foster a culture of empowerment and diversity.
Design and implement scalable, high-performance data pipelines to ingest and transform data from a variety of sources.
Build and maintain APIs that enable flexible, secure, and tenant-aware data integrations with external systems.
Implement observability, monitoring, and alerting to track data freshness, failures, and performance issues.
Northbeam is building the world's most advanced marketing intelligence platform for top eCommerce brands, providing powerful attribution modeling and customizable dashboards. The company is experiencing rapid growth with a strong product-market fit and a remote-friendly culture.
Build and scale data infrastructure powering targeting, identity, and measurement capabilities.
Optimize core ETL/ELT pipelines and ensure operational reliability with documented SLAs.
Implement privacy-compliant data methodologies meeting GDPR/CCPA standards.
Kargo creates powerful moments of connection between brands and consumers to build businesses. With 600+ employees and offices across the US, UK, Australia, and Ireland, they take a creative science approach to deliver unique ad experiences across premium platforms.
Develop tools and applications in Python for data processing, analysis, and forecasting.
Design and optimize Python/Snowflake-based data pipelines for ingesting, cleaning, and transforming data.
Collaborate with global technical and analyst teams to gather requirements and build innovative analytical solutions.
IDC provides trusted technology intelligence for business and technology leaders. With over 1,000 analysts globally, the company fosters a culture of rigor, integrity, and shared success.
Design and build infrastructure for optimal extraction, transformation, and loading of data using cloud services and SQL.
Develop, maintain, and optimize mission-critical data pipelines to ensure continuous flow of high-quality data.
Collaborate with cross-functional teams to understand requirements and deliver integrated end-to-end data solutions.
This company specializes in data engineering and AI/ML, building robust systems for data flow and accessibility. The team is dedicated to designing and maintaining scalable infrastructure to drive data-driven innovation.
Design and build scalable data pipelines using Python and SQL to ingest, transform, and curate data from internal and external sources.
Implement schema validation, data quality checks, and job monitoring to ensure trustworthy data outputs.
Collaborate with analytics engineers, architects, and product partners to define technical requirements and deliver data products.
Cohere Health provides a clinical intelligence platform that uses AI to connect health plans and providers, improving care quality and reducing costs. We are a growing company recognized as a top LinkedIn startup and backed by leading investors, fostering a supportive, growth-oriented culture.
Architect and implement scalable ETL and data pipelines for real-time risk management and advanced analytics.
Design, develop, and optimize distributed data storage solutions for high performance and reliability at scale.
Drive schema evolution, data modeling, and pipeline orchestration with ownership of end-to-end data flow.
Oscilar builds the most advanced AI Risk Decisioning™ Platform for banks, fintechs, and digitally native organizations to manage fraud, credit, and compliance risk. The company is mission-driven with a remote-first culture and team members from Meta, Uber, Citi, and Confluent.
Design, build, and maintain robust data pipelines and data lake architecture for both batch and real-time streaming use cases.
Optimize ETL/ELT workflows for performance, scalability, and fault tolerance, and develop dbt workflows for partner analysis.
Collaborate with analytics, product, and ML engineers to develop and deploy reliable data products and support machine learning initiatives.
NinjaTrader is an industry-leading trading platform and futures broker that empowers traders with cutting-edge products and services. Since 2003, the company has grown to over 2 million users and is the number one rated futures brokerage worldwide, fostering a dynamic culture focused on social connection, professional development, and employee recognition.
Design, build, and maintain scalable data pipelines using AWS Glue (PySpark), or equivalent orchestration and transformation tools.
Engineer and optimise the ClickHouse warehouse for sub-second query performance across all back-offices.
Implement data contracts between back-office and the platform.
Block Labs is a premier technology studio operating at the bleeding edge of Web3, Artificial Intelligence, and iGaming. We are a collective of senior engineers, product strategists, and builders who refuse to compromise on architecture.
Architect and maintain cloud-native data platforms (AWS, Snowflake, Databricks) supporting batch and streaming use cases.
Design and automate ETL/ELT workflows, optimize data models, and enable self-serve analytics and AI.
Manage end-to-end data lifecycles including ingestion, storage, processing, and delivery of structured and unstructured data.
Trustonic makes smartphones affordable for the many, enabling global access to devices and digital finance through secure smartphone locking technology. They partner with mobile carriers, retailers, and financiers across 30+ countries, and pride themselves on a diverse, inclusive culture that values doing the right thing for each other, the community, and the planet.
Build, maintain, and scale data pipelines integrating internal and external data into the warehouse.
Partner with internal stakeholders and engineering teams to understand analysis needs and improve data logging.
Participate in architectural decisions and evangelize data engineering best practices.
OXIO is the world’s first telecom-as-a-service platform, democratizing telecom for brands and enterprises to own proprietary mobile networks. The company is a rapidly growing startup with a diverse and inclusive team.
Design, build, and maintain scalable data lake solutions and processing pipelines handling large volumes of data.
Develop distributed data processing applications using Apache Spark on Databricks and build real-time streaming pipelines with Apache Kafka.
Apply software engineering best practices to data pipelines including CI/CD, automated testing, and peer code review.
InPost is an e-commerce parcel delivery company that operates a network of Automated Parcel Machines (APMs) and pick-up points across nine European countries. Founded in 1999, the company employs thousands and fosters a diverse, international, and cross-functional culture with opportunities for growth and training.
Lead and manage a global data engineering team building large-scale data pipelines and production datasets for the Public Investor business.
Collaborate with product, research, and operations teams to translate roadmap priorities into scalable technical plans and customer-facing data feeds.
Drive operational excellence through data quality frameworks, observability, and AI-assisted development practices.
YipitData is the leading market research and analytics firm for the disruptive economy, providing actionable insights from alternative data. With over $475M raised and offices globally, it has a people-centric culture recognized as a Best Workplace for three consecutive years.
Design, build, and maintain ETL pipelines moving data between application databases, cloud warehouses, third-party APIs, and object stores.
Partner with product managers, research scientists, and engineers to translate ML requirements into scalable data solutions.
Investigate and resolve data integrity issues including missing data, incorrect mappings, duplicates, and schema mismatches.
Welo Global is a leader in multilingual AI, technology, and content solutions serving over 2,000 clients in 300 languages. The company has a network of over 500,000 linguists and domain experts with seven ISO certifications.
Build and operate production-grade ingestion pipelines from clinical, operational, and third-party systems into a Databricks lakehouse.
Develop and maintain dbt models to transform raw data into clean, documented, analytics-ready datasets.
Establish data quality, testing, and monitoring practices to ensure pipeline reliability and performance.
Zócalo Health is a tech-enabled, community-oriented primary care organization serving underserved populations with culturally competent care. Founded in 2021, the company is backed by leading healthcare investors and is scaling rapidly with a focus on value-based care.
Architect Spark-driven workflows at scale and design data platforms as products for internal teams.
Develop and maintain end-to-end data pipelines and backend ingestion workflows across multiple sources.
Champion Samsara's cultural principles and mentor junior team members to drive data-driven decisions.
Samsara is the pioneer of the Connected Operations Cloud, enabling organizations to harness IoT data for actionable insights to improve safety, efficiency, and sustainability. As a recently public company, it fosters a culture of rapid career development, ownership, and high performance.
Partner with analytics, marketing, and product teams to understand data needs and build systems for AI applications to access trusted data at scale.
Define standards, infrastructure, and governance for AI-driven experiences, ensuring data reliability, security, and usability.
Drive projects from design through production deployment, implementing data security and governance practices for sensitive data.
Rula is a mental healthcare company dedicated to treating the whole person with evidence-based, compassionate care. They are a remote-first organization with a focus on diversity, equity, and inclusion, hiring in most US states.
Play a crucial role in helping client organizations transform raw data into reliable, well-modeled assets that drive business decisions.
Design, build, and maintain scalable data pipelines and ELT workflows, with Databricks as the primary platform.
Collaborate with data engineers, analysts, and clients on end-to-end data requirements and project delivery.
Velir is an established mid-sized agency with a top-tier portfolio of clients, ranging from the world’s largest non-profits to Fortune 500 brands. Our culture is built on a foundation of trust, collaboration, and continued improvement, and we are a remote first company that offers competitive pay and excellent benefits.
Design and deliver scalable, low-latency streaming data solutions for real-time customer analytics.
Analyze business needs, optimize data models, and write clean code using Scala, Python, and SQL.
Mentor team members and optimize performance of data platforms like AWS Kinesis, Kafka, and Redshift.
Aircall is an AI-powered customer communications platform used by 22,000+ companies worldwide, unifying voice, SMS, WhatsApp, and AI. The company is a unicorn backed by world-class investors, with 45+ nationalities and a strong, collaborative culture.
Build scalable Python-based data pipelines and backend services for analytics workflows.
Design software systems using object-oriented programming and sound engineering practices.
Create and support platforms for analytics development, model training, and model deployment.
Experian is a global data and technology company that powers opportunities for people and businesses worldwide across markets like financial services, healthcare, and automotive. With a team of 25,200 people in 32 countries, Experian invests in advanced technologies and its people to unlock the power of data.