Source Job

Global Unlimited PTO

  • Design, build, and maintain production data pipelines for multi-phase algorithmic workflows using Python and Prefect, Airflow, Jenkins, or any other orchestration framework.
  • Build and optimize advanced SQL transformations in Snowflake, including window functions, CTEs, stored procedures, UDFs, and semi-structured data processing.
  • Build and maintain dbt models for data transformation, identity resolution, and slowly changing dimension (SCD Type 2) tracking across 80+ models and multiple pipeline stages.

Python SQL Dbt PySpark Snowflake

20 jobs similar to Machine Learning Data Engineer

Jobs ranked by similarity.

$127,000–$175,000/yr
US

  • Partner closely with business stakeholders to understand their challenges and design end-to-end architecture.
  • Design, develop, and own robust, efficient, and scalable data models in Snowflake and Iceberg using dbt and advanced SQL.
  • Build and manage reliable data pipelines and CI/CD workflows using tools like Airflow, Python, and Terraform.

Motive empowers people who run physical operations with tools to make their work safer, more productive, and more profitable. Motive serves nearly 100,000 customers and provides complete visibility and control across a wide range of industries.

$180,000–$220,000/yr
US Unlimited PTO 14w maternity

  • Design, build, and maintain databases that power Hologram's operations.
  • Build and maintain ETL pipelines that move and transform data reliably.
  • Audit existing pipelines and data models, identify complexity, and refactor bad decisions.

Hologram is building the future of IoT connectivity, delivering internet access to millions of connected devices worldwide. They process over 5 billion transactions per month across their global infrastructure and value a fun, upbeat, remote-first team united by their mission.

US

  • Collaborate closely with business stakeholders to design comprehensive data solutions.
  • Design, develop, and manage robust data models in Snowflake and Iceberg utilizing dbt and advanced SQL.
  • Build and maintain data pipelines and CI/CD workflows using Airflow, Python, and Terraform.

The company pursues AI-driven initiatives and values innovation and a dynamic environment.

$151,000–$205,000/yr
US Unlimited PTO

  • Extend, optimize, and maintain core data models for reports, machine learning, and generative AI.
  • Implement automation and operationalize ML models to streamline operational processes and improve efficiency.
  • Partner with engineering, product, and analytics teams to deliver seamless integrations and customer-facing data products.

Boulevard provides a client experience platform for appointment-based, self-care businesses, helping customers enhance client experiences. They value diversity and inclusivity, offering equal opportunities and aiming to create a supportive work environment.

$110,000–$130,000/yr
US

  • Build and maintain data pipelines, and transform raw data into reliable models.
  • Develop Tableau dashboards that put insights in front of clients.
  • Work directly with clients and shape how their platform evolves.

DataDrive is a fast-growing managed analytics service provider. They support ongoing training, adoption, and growth of their clients’ data cultures and offer a unique team-oriented environment.

Global

  • Design and implement scalable data models in Snowflake
  • Build and maintain transformation pipelines using dbt
  • Develop optimized star/snowflake schemas for analytics and reporting

We are looking for a highly skilled Snowflake Data Engineer. We work closely with business stakeholders and deliver high-quality data models and insights.

$179,469–$242,811/yr
US

  • Lead and grow a team of data engineers, providing mentorship and technical guidance.
  • Own execution of customer integrations across multiple product lines, ensuring on-time delivery.
  • Improve data quality and pipeline reliability by investing in better alerting and resilience.

Afresh is the leading AI company in fresh food, partnering with grocers to order billions of dollars of fresh food. They are on a mission to eliminate food waste and make fresh food accessible to all, and have saved 200M lbs of food waste in 2025 alone.

Europe Asia

  • Create innovative solutions for handling petabytes of data with billions of rows & joins.
  • Create real-time and offline feature generation pipelines, and manage our data infrastructure so that it is reliable and fast.
  • Develop and productionize data pipelines for our ML models in both bare-metal and cloud environments.

Kayzen is a mobile demand-side platform (DSP) dedicated to democratizing programmatic advertising. They enable leading apps, agencies, media buyers, and brands to run programmatic customer acquisition, retargeting, and brand performance campaigns through their self-serve and managed service options.

$170,000–$193,363/yr
US

  • Design fault-tolerant dbt models to synthesize data from multiple sources into mart tables
  • Design and implement Sigma dashboards and Streamlit apps to provide clear insights into performance
  • Automate regular reporting workflows to reduce manual effort and increase data consistency

Weedmaps is a global leader in the cannabis industry. They are dedicated to transparency, education, and community, and serve cannabis consumers and businesses in the U.S. and worldwide.

$118,000–$148,000/yr
US

  • Design, build, and maintain scalable batch and real-time data pipelines that power analytics, experimentation, and machine learning
  • Partner cross-functionally with analytics, product, engineering and operations to deliver high-quality data solutions that drive measurable business impact
  • Champion data quality, reliability, and observability by implementing best practices in testing, monitoring, lineage, and incident response

Gopuff is reimagining how people purchase everyday essentials, from snacks to household goods to alcohol, all delivered in minutes. They are assembling a team of thinkers, dreamers and risk-takers who know the value of peace of mind in an unpredictable world.

Data Engineer

YLD
Europe

  • Build core infrastructure software (pipelines, APIs, data modelling) as part of our client's data platform team.
  • Coach & mentor other engineers to support the growth of their technical expertise.
  • Implement the appropriate technologies for scaling data access patterns, batch processing, and data streaming for soft real-time consumption.

YLD is a software engineering and design consultancy that creates digital capabilities for their clients. The company has offices in London, Lisbon, and Porto and aims to attract, inspire, develop, and retain extraordinary people.

North America 5w PTO

  • Design, build, and maintain scalable data pipelines.
  • Act as a strategic partner in designing scalable data solutions.
  • Develop reliable data models.

Optro is the leading audit, risk, ESG, and InfoSec platform on the market, surpassing $300M ARR and continuing to grow. More than 50% of the Fortune 500 leverage their award-winning technology. They innovate and are proud of what they are producing, assisting each other and breaking through barriers.

US Unlimited PTO

  • Work cross-functionally with Product and subject matter experts to conceptualize, prototype, and build data solutions
  • Connect disparate datasets (e.g. claims, contract rates, demographics data) to empower internal and external stakeholders
  • Build and maintain data engineering systems that support AI use cases, including scalable ingestion pipelines, feature generation, and downstream products

Turquoise Health aims to make healthcare pricing simpler, more transparent, and lower cost. They are a Series B startup backed by top VCs with an accomplished group of folks with a passion for improving healthcare.

$135,500–$200,000/yr
US

  • Architect, design, implement, and operate end-to-end data engineering solutions using Agile methodology.
  • Develop and manage robust data integrations with external vendors and organizations (including complex API integrations).
  • Collaborate closely with Data Analysts, Data Scientists, DBAs, and cross-functional teams to understand requirements and deliver high-impact data solutions.

SmartAsset is an online destination for consumer-focused financial information and advice, whose mission is helping people make smart financial decisions, reaching over an estimated 59 million people each month. A successful $110 million Series D funding round in 2021 valued the company at over $1 billion.

$219,625–$235,675/yr
US Unlimited PTO

  • Define and work within our data governance practices, including a catalog/dictionary and management of data quality.
  • Manage lights-out data operations of our ETL/ELT pipelines ranging from streaming inputs to batch file loads, to support customer reporting, development, and operations.
  • Untangle, normalize, and synthesize data as needed to permit joins and comparisons across disparate sources, as well as further analysis including ML processing.

Evermore is a technology company that administers Smart Benefits to connect people to products and services. They are backed by leading investors including General Catalyst, Define Ventures, Lightspeed Venture Partners, Pinegrove Capital Partners, and Qiming Venture Partners.

Europe

  • Evolve and scale Nuitée’s data platform.
  • Own high-volume data pipelines that ingest, normalize, and serve global hotel inventory, pricing, availability, and transaction data.
  • Strengthen data modeling & data quality foundations to establish scalable patterns for reusable data products across business domains.

Nuitée is building the API backbone for the global travel industry with a mission to transform a fragmented travel ecosystem. They are a global infrastructure provider trusted by industry leaders like Hopper, Expedia, Priceline, Google, and Uber with teams across the globe and hubs in London, New York, San Francisco, Palma de Mallorca and Casablanca.

US

  • Manage client data gateways and SharePoint file security.
  • Automate file movements and data cleanup using Python and PowerShell.
  • Map data formats into the Data Warehouse and develop dbt models.

AffirmedRx aims to improve health care outcomes by bringing clarity, integrity, and trust to pharmacy benefit management. They are committed to making pharmacy benefits easy to understand and straightforward to access, leading with clinical approaches and utilizing state-of-the-art technology.

$100,000–$140,000/yr
US

  • Design, build, and maintain scalable data pipelines for clients across industries.
  • Architect and optimize cloud data warehouse solutions, adapting to each client's stack.
  • Collaborate with analysts and data scientists to ensure data is clean, reliable, and well-modeled.

NuView Analytics helps companies accelerate the time to insights from their data through data analytics, diligence, and fractional data science. They are a growth-stage company looking to drive additional value from the data they are sitting on and value humility, intellectual rigor, and stewardship.

Europe

  • Build pipelines to load data from various systems into Dataiku via S3 or Snowflake.
  • Increase the robustness of existing production pipelines, identify bottlenecks, and set up robust monitoring, testing processes, and documentation templates.
  • Build custom applications and integrations to automate manual tasks related to customer operations to help Product Operations / Support / SRE in their day-to-day activities

Dataiku is the Platform for AI Success, the enterprise orchestration layer for building, deploying, and governing AI. The world’s leading companies rely on Dataiku to operationalize AI and run it as a true business performance engine delivering measurable value.

$140,400–$224,250/yr
US

  • Lead the implementation of a resilient, privacy-first data platform architecture.
  • Lead the design, infrastructure, and tooling decisions for platform optimization.
  • Develop AI-ready architecture by creating semantic layers that define and standardize business logic.

Headspace provides access to lifelong mental health support. They combine evidence-based content, clinical care, and innovative technology to help millions of members around the world get support that’s effective and personalized. They value connecting with courage, ownership, and iterating to great.