Design and build end-to-end data pipelines across the RAW, Silver, and Gold layers of the Medallion Architecture.
Architect data ingestion, transformation, standardization, and serving processes, that structure data flows from diverse and heterogeneous sources into a coherent analytical foundation.
Model data for analytical consumption following Data Warehouse best practices, including Star Schema design and dimensional modeling suited for business intelligence and AI-readiness.
Design and implement batch and real time ingestion pipelines from internal and external sources.
Implement automated data quality checks, observability, and SLA monitoring.
Optimise datasets and pipelines for analytics, ML training, and API consumption.
Software Mind develops solutions that make an impact for companies around the globe. They build cross-functional engineering teams that take ownership and crave more, always on the lookout for talented people who bring passion and creativity to every project.
Design and build ETL processes in collaboration with software and model development teams.
Create and maintain scalable data infrastructure.
Own full pipeline and infrastructure lifecycle including performance monitoring and optimization.
OpenTeams builds AI that empowers, with models that are energy-efficient, cost-effective, and fully yours. They are proponents of open source, reinvesting 3% of profits back into the open-source community and value freedom, teamwork, accountability, and uncompromising quality.
Design, build, and own scalable data pipelines and systems that power analytics, machine learning, and business operations.
Drive system design for data architecture, owning data models and storage solutions to create scalable foundations for the team.
Collaborate with engineering, product, and data teams to translate business needs into technical solutions, ensuring data quality and performance standards.
Goodway Group is a remote-first, data-driven, and technology-enabled digital media and marketing services firm with a 90+ year history, offering the security of an established company with a start-up feel. It is a diverse team of strategists, practitioners, technologists, and data scientists that is recognized as a top workplace and a certified partner to The Trade Desk.
Design, develop, and maintain scalable ETL/ELT data pipelines using Python.
Process and integrate data from multiple formats and sources (JSON, CSV, XML).
Build and manage data transformations and orchestration workflows using dbt and orchestration tools such as Airflow, Prefect, or Dagster.
I lack information about the company from the job posting. Please provide information about what the company does, size/employees, and culture, and I will fill this section out.
Build and Maintain Bronze/Silver Layer Pipelines: You will ensure core data sources lands accurately, on time, and with full lineage.
Lead Data Ingestion, Transformation, and Enrichment: You will own the end-to-end pipeline from raw file landing through cleansed, conformed staging tables, including deduplication, standardization, code mapping, and entity resolution.
Develop Automated Ingestion Pipelines: You will use Snowpipe, Matillion, or custom solutions with reliability, observability, and minimal manual intervention in mind.
Precision AQ is building a centralized Data Hub to consolidate fragmented data infrastructure, establish enterprise-wide data governance, and enable AI-ready analytics across our life sciences portfolio. This is a foundational initiative, not a maintenance role.
Develop and maintain data models for core package application and reporting databases.
Monitor execution and performance of daily pipelines and escalate issues.
Collaborate with analytics and business teams to improve data models and pipelines.
Bluelight Consulting is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. With a presence across the United States and Central/South America, Bluelight is in an exciting phase of expansion, continually seeking exceptional talent to join its dynamic and diverse community.
Contribute to the design and implementation of scalable data solutions.
Build and optimize batch and streaming ingestion pipelines.
Ensure data quality, reliability, and performance across pipelines and datasets.
Blend is an AI services provider that co-creates impact for clients through data science, AI, technology, and people. They aim to fuel bold visions by aligning human expertise with artificial intelligence, fostering innovation, and unlocking value for their clients.
Build infrastructure and data automation pipelines to ingest, process, and load data from various sources.
Collaborate with stakeholders and data science teams to develop data products aligned with organizational goals.
Develop data analysis tools to provide insights and capture key metrics.
Columbia General is seeking a Senior Data Engineer to help transform data into actionable insights that drive decision-making. The company fosters a dynamic, collaborative environment that supports growth and innovation.
Design and administer cloud-native data systems using AWS services like Glue, Lambda, Redshift, and S3 to build scalable data architectures.
Develop and maintain reliable ETL processes using Python and SQL to ingest, clean, and transform complex healthcare data, optimizing pipeline performance.
Implement data security, governance, and compliance measures while collaborating with product and analytics teams to translate business needs into technical solutions.
Evio is a pharmacy solutions company founded by and working with health plans to implement transformative specialty medication initiatives. It is a lean, independent entity with six owner health plans serving over 20 million members, investing heavily in a strong, intentional team culture and values.
Design, build, and maintain scalable data pipelines
Develop and optimize ETL processes to support data products
Work with structured and unstructured data across SQL and NoSQL systems
They are seeking a Data Engineer to support the development of data products that power critical business functions. They seem to have a collaborative, cross-functional Agile environment where you'll partner closely with technical and business teams to deliver high-quality data solutions.
Design, build, and maintain scalable data pipelines using AWS Glue (PySpark), or equivalent orchestration and transformation tools.
Engineer and optimise the ClickHouse warehouse for sub-second query performance across all back-offices.
Implement data contracts between back-office and the platform.
Block Labs is a premier technology studio operating at the bleeding edge of Web3, Artificial Intelligence, and iGaming. We are a collective of senior engineers, product strategists, and builders who refuse to compromise on architecture.
Design, develop, and maintain robust, scalable ETL/ELT data pipelines using Python, SQL, and data processing frameworks.
Implement data quality checks, monitoring, and alerting across all data pipelines to ensure data integrity and reliability.
Work closely with data analysts, data scientists, and business intelligence engineers to understand their data requirements and deliver reliable, high-quality data access.
InStride Health delivers specialty anxiety and OCD care. They focus on expanding access to insurance-based care, increasing engagement, and improving treatment outcomes by combining clinical care and innovative technology. They are a mission-driven company.
Architect and evolve scalable data ingestion and egress frameworks and pipelines that are well tested and offer strong data quality monitoring.
Architect and evolve our CI/CD processes - enhancing the testing environment and observability.
Enhance our Claude Code / LLM development support capabilities - creating tools / skills / agents that give our LLMs more context and help us continually improve their abilities to debug, create code, and maintain systems.
Life360’s mission is to keep people close to the ones they love. They have a mobile app, tracking devices, and a pet GPS tracker. Life360 has more than 500 (and growing!) remote-first employees and delivers peace of mind and enhances everyday family life.
Build and own end-to-end data pipelines in Snowflake — from raw ingestion through transformation to serving layers for AI products.
Partner with ML engineers and data scientists to build and maintain AI-specific data infrastructure.
Consolidate fragmented data sources across the organization into reliable, automated pipelines.
Power Digital is a tech-enabled growth firm at the intersection of marketing, consulting, and data intelligence. They ignite revenue and brand recognition for leading and emerging companies. They are a people-first firm with a focus on diversity and have a dynamic team of consultative marketers, creatives, analysts and technologists.
Design and build an integrated data platform, unifying existing tools and pipelines into a cohesive, scalable architecture.
Own data pipelines and SLAs end to end, ensuring reliable data movement between systems with clear expectations.
Shape the data strategy and platform roadmap, researching new technologies and introducing tools as the platform evolves.
Wrapbook is a vertical fintech platform that enables companies to seamlessly onboard, pay, and insure their workforces, operating in the entertainment industry. They are at an exciting stage of growth, having raised over 30M from investors like Andreessen Horowitz.
Design, develop, and maintain data pipelines using Azure Databricks.
Build and optimize data transformations using PySpark and SQL in Databricks.
Implement and maintain Lakehouse architectures using Delta Lake.
Miratech helps visionaries change the world with enterprise and start-up innovation, supporting digital transformation for some of the world's largest enterprises. They are a values-driven organization with nearly 1000 full-time professionals and an annual growth rate exceeding 25%.
Design, build, and operate scheduled and event-driven data pipelines for simulation outputs, telemetry, logs, dashboards, and scenario metadata
Build and operate data storage systems (structured and semi-structured) optimized for scale, versioning, and replay
Support analytics, reporting, and ML workflows by exposing clean, well-documented datasets and APIs
Onebrief provides collaboration and AI-powered workflow software designed specifically for military staffs, valued at $2.15B. They operate as an all-remote company with a team spanning veterans from all forces and global organizations, and technologists from leading-edge software companies.
Build and optimize scalable data pipelines using Python and dbt.
Design and maintain Snowflake warehouse structures, database tables, and performant data models.
Develop reliable ETL/ELT workflows for extracting, transforming, loading, and validating data from multiple sources.
We are seeking a Senior Data Engineer to support core marketplace analytics data products and platform work. Enterprise experience is strongly preferred.